OmniDirector: General Multi-Shot Camera Cloning without Cross-Paired Data • Paper • 2606.13432 • Published 7 days ago • 96
Geometric Latent Reasoning Induces Shorter Generations in LLMs • Paper • 2606.02248 • Published 17 days ago • 1
Gamma-World: Generative Multi-Agent World Modeling Beyond Two Players • Paper • 2605.28816 • Published 22 days ago • 423
terriblekiwi/whisper-large-v3-yuriyvnv-mixed-cv-nl-whispercpp-q8_0 • Automatic Speech Recognition • Updated 25 days ago • 1
Perception or Prejudice: Can MLLMs Go Beyond First Impressions of Personality? • Paper • 2605.22109 • Published 28 days ago • 169
KVServe: Service-Aware KV Cache Compression for Communication-Efficient Disaggregated LLM Serving • Paper • 2605.13734 • Published May 13 • 12
WildRelight: A Real-World Benchmark and Physics-Guided Adaptation for Single-Image Relighting • Paper • 2605.11696 • Published May 12 • 3