AnchorWorld: Embodied Egocentric World Simulation with View-based Evolution Customization Paper • 2606.07326 • Published 4 days ago • 23
Qwen-Image-Bench: From Generation to Creation in Text-to-Image Evaluation Paper • 2605.28091 • Published 13 days ago • 5
Qwen-VLA: Unifying Vision-Language-Action Modeling across Tasks, Environments, and Robot Embodiments Paper • 2605.30280 • Published 12 days ago • 139
Qwen-Image-Bench: From Generation to Creation in Text-to-Image Evaluation Paper • 2605.28091 • Published 13 days ago • 5
LongLive-2.0: An NVFP4 Parallel Infrastructure for Long Video Generation Paper • 2605.18739 • Published 22 days ago • 112
Beyond Length Scaling: Synergizing Breadth and Depth for Generative Reward Models Paper • 2603.01571 • Published Mar 2 • 33
DP$^2$O-SR: Direct Perceptual Preference Optimization for Real-World Image Super-Resolution Paper • 2510.18851 • Published Oct 21, 2025
Long-Horizon Streaming Video Generation via Hybrid Attention with Decoupled Distillation Paper • 2604.10103 • Published Apr 28
MultiWorld: Scalable Multi-Agent Multi-View Video World Models Paper • 2604.18564 • Published Apr 20 • 46
OpenSpatial: A Principled Data Engine for Empowering Spatial Intelligence Paper • 2604.07296 • Published Apr 8 • 40