Running on CPU Upgrade Agents 28 RISEBench Gallery π 28 A Gallery of Generation Results on RISEBench
Running Agents 6 Open LMM Spatial Leaderboard π₯ 6 A Leaderboard for LMM spatial understanding capabilities
Running Agents 44 Open LMM Reasoning Leaderboard π₯ 44 A Leaderboard that demonstrates LMM reasoning capabilities
Sleeping Agents 4 CompassJudger Subjective Evaluation Learderboard π 4 CompassJudger Subjective Evaluation Learderboard
Running Agents Featured 135 Open VLM Video Leaderboard π 135 VLMEvalKit Eval Results in video understanding benchmark