AutoStudio: Crafting Consistent Subjects in Multi-turn Interactive Image Generation Paper • 2406.01388 • Published Jun 3, 2024
VLMs are Good Teachers for Video Reasoning via Adaptive Test-Time Optimization Paper • 2606.02564 • Published 11 days ago • 29
VLMs are Good Teachers for Video Reasoning via Adaptive Test-Time Optimization Paper • 2606.02564 • Published 11 days ago • 29
PUSA V1.0: Surpassing Wan-I2V with $500 Training Cost by Vectorized Timestep Adaptation Paper • 2507.16116 • Published Jul 22, 2025 • 13
GSFixer: Improving 3D Gaussian Splatting with Reference-Guided Video Diffusion Priors Paper • 2508.09667 • Published Aug 13, 2025 • 6
MV-Performer: Taming Video Diffusion Model for Faithful and Synchronized Multi-view Performer Synthesis Paper • 2510.07190 • Published Oct 8, 2025 • 1
SKEL-CF: Coarse-to-Fine Biomechanical Skeleton and Surface Mesh Recovery Paper • 2511.20157 • Published Nov 25, 2025 • 3
EmoCAST: Emotional Talking Portrait via Emotive Text Description Paper • 2508.20615 • Published Aug 28, 2025
MLLM-4D: Towards Visual-based Spatial-Temporal Intelligence Paper • 2603.00515 • Published Feb 28 • 2
CutClaw: Agentic Hours-Long Video Editing via Music Synchronization Paper • 2603.29664 • Published Mar 31 • 50
CutClaw: Agentic Hours-Long Video Editing via Music Synchronization Paper • 2603.29664 • Published Mar 31 • 50
ShotStream: Streaming Multi-Shot Video Generation for Interactive Storytelling Paper • 2603.25746 • Published Mar 26 • 155
PersonaLive! Expressive Portrait Image Animation for Live Streaming Paper • 2512.11253 • Published Dec 12, 2025 • 41
Video-as-Answer: Predict and Generate Next Video Event with Joint-GRPO Paper • 2511.16669 • Published Nov 20, 2025 • 31
GenCompositor: Generative Video Compositing with Diffusion Transformer Paper • 2509.02460 • Published Sep 2, 2025 • 26
LivelySpeaker: Towards Semantic-Aware Co-Speech Gesture Generation Paper • 2309.09294 • Published Sep 17, 2023
Follow Your Pose: Pose-Guided Text-to-Video Generation using Pose-Free Videos Paper • 2304.01186 • Published Apr 3, 2023
ScaleCrafter: Tuning-free Higher-Resolution Visual Generation with Diffusion Models Paper • 2310.07702 • Published Oct 11, 2023