VLMs are Blind!
  Vision language models are blind • Paper • 2407.06581 • Published Jul 9, 2024 • 84
  XAI/vlmsareblind • Viewer • Updated Nov 22, 2024 • 8.02k • 912 • 28
  VLMsAreBlind ResultsReview 📚 • Review model results on visual tasks • Runtime error • 4
ImageNet-Hard
  The Hardest Images Remaining from a Study of the Power of Zoom and Spatial Biases in Image Classification
  Zoom is what you need: An empirical study of the power of zoom and spatial biases in image classification • Paper • 2304.05538 • Published Apr 11, 2023 • 2
  ImageNet-Hard Browser 🔍 • Browse and filter ImageNet-Hard dataset images • Running • 5
  taesiri/imagenet-hard • Viewer • Updated Nov 12, 2025 • 10.5k • 289 • 12
  taesiri/imagenet-hard-4K • Viewer • Updated Nov 12, 2025 • 9.6k • 1.08k • 7
ZeroBench
  jonathan-roberts1/zerobench • Viewer • Updated Apr 6 • 434 • 303 • 32
  ZeroBench: An Impossible Visual Benchmark for Contemporary Large Multimodal Models • Paper • 2502.09696 • Published Feb 13, 2025 • 43
ArXiv QA
  Automated ArXiv Question Answering with LLMs
  ArXiv Daily Papers 📚 • Browse daily arXiv paper summaries with filters • Running • 16
  taesiri/ArXiv • Viewer • Updated Nov 12, 2025 • 19.2k • 138
  Claude Reads Arxiv 📖 • Paused • 63
  taesiri/arxiv_qa • Viewer • Updated Apr 15, 2024 • 211k • 566 • 138
VideoGameBunny
  VideoGameBunny: Towards vision assistants for video games • Paper • 2407.15295 • Published Jul 21, 2024 • 23
  asgaardlab/VideoGameBunny-v1_0-8B • Text Generation • 8B • Updated Aug 26, 2024 • 12 • 1
  asgaardlab/VideoGameBunny-v1_0-4B • Text Generation • 4B • Updated Aug 26, 2024 • 6 • 1
  asgaardlab/VideoGameBunnyCheckpoints • Updated Jul 26, 2024