MechVQA: Benchmarking and Enhancing Multimodal LLMs on Comprehensive Mechanical Drawing Understanding Paper • 2605.30794 • Published 10 days ago • 3
RAFT: Data Refinement and Adaptive Distillation for Domain Fine-Tuning with Alleviated Forgetting Paper • 2606.00147 • Published 10 days ago
Rethinking Supervised Fine-Tuning: Emphasizing Key Answer Tokens for Improved LLM Accuracy Paper • 2512.21017 • Published Dec 24, 2025 • 1
MechVQA: Benchmarking and Enhancing Multimodal LLMs on Comprehensive Mechanical Drawing Understanding Paper • 2605.30794 • Published 10 days ago • 3
MathSticks: A Benchmark for Visual Symbolic Compositional Reasoning with Matchstick Puzzles Paper • 2510.00483 • Published Oct 1, 2025
OneVision-Encoder: Codec-Aligned Sparsity as a Foundational Principle for Multimodal Intelligence Paper • 2602.08683 • Published Feb 9 • 52
PRM-as-a-Judge: A Dense Evaluation Paradigm for Fine-Grained Robotic Auditing Paper • 2603.21669 • Published Mar 23 • 1
LLaVA-OneVision-2: Towards Next-Generation Perceptual Intelligence Paper • 2605.25979 • Published 14 days ago • 27
From Pixels to Words -- Towards Native One-Vision Models at Scale Paper • 2605.28820 • Published 12 days ago • 73
From Pixels to Words -- Towards Native One-Vision Models at Scale Paper • 2605.28820 • Published 12 days ago • 73
Turning the TIDE: Cross-Architecture Distillation for Diffusion Large Language Models Paper • 2604.26951 • Published Apr 29 • 48
Sat3DGen: Comprehensive Street-Level 3D Scene Generation from Single Satellite Image Paper • 2605.14984 • Published 25 days ago • 5
DynamicVLA: A Vision-Language-Action Model for Dynamic Object Manipulation Paper • 2601.22153 • Published Jan 29 • 75
VISTA-Bench: Do Vision-Language Models Really Understand Visualized Text as Well as Pure Text? Paper • 2602.04802 • Published Feb 4 • 2
SenseNova-U1: Unifying Multimodal Understanding and Generation with NEO-unify Architecture Paper • 2605.12500 • Published 27 days ago • 191
SenseNova-U1: Unifying Multimodal Understanding and Generation with NEO-unify Architecture Paper • 2605.12500 • Published 27 days ago • 191