Datasets and models for ACL 2026 paper: Rethinking Reasoning-Intensive Retrieval: Evaluating and Advancing Retrievers in Agentic Search Systems.
AI & ML interests
Natural Language Processing at Yale
Recent Activity
View all activity
Papers
OpenComputer: Verifiable Software Worlds for Computer-Use Agents
Step-level Optimization for Efficient Computer-use Agents
models 95
yale-nlp/RTriever-4B
Feature Extraction • 4B • Updated • 9 • 2
yale-nlp/AgentTrek-1.0-32B_webarena-verified_milestone-bert
0.1B • Updated • 2
yale-nlp/gpt-oss-20b_webarena-verified_stuck-bert
0.1B • Updated • 2
yale-nlp/AgentTrek-1.0-32B_webarena-verified_stuck-bert
0.1B • Updated • 3
yale-nlp/gpt-oss-20b_webarena-verified_milestone-bert
0.1B • Updated • 3
yale-nlp/modernbert-evocua-milestone-detector
0.1B • Updated • 6
yale-nlp/modernbert-evocua-stuck-detector
0.1B • Updated • 6
yale-nlp/modernbert-qwen-milestone-detector
0.1B • Updated • 4
yale-nlp/modernbert-qwen-stuck-detector
0.1B • Updated • 3
yale-nlp/Qwen3-VL-8B-Anchor-Windows
770k • Updated
datasets 29
yale-nlp/Bright-Pro
Viewer • Updated • 530k • 608 • 1
yale-nlp/Anchor
Viewer • Updated • 30.6k • 1.01k
yale-nlp/MedTutor
Updated • 120 • 2
yale-nlp/SciArena
Viewer • Updated • 13.2k • 43 • 25
yale-nlp/SciReas-Pro
Viewer • Updated • 1.36k • 111 • 1
yale-nlp/MSRS
Viewer • Updated • 2.44k • 272 • 2
yale-nlp/SciArena-Eval
Viewer • Updated • 2k • 53
yale-nlp/SciArena-with-paperbank
Viewer • Updated • 15.2k • 74 • 1
yale-nlp/SciDQA
Viewer • Updated • 2.94k • 336 • 2
yale-nlp/AbGen
Viewer • Updated • 3.3k • 211 • 3