A collection of benchmarks for evaluating LMs or VLMs under multi-turn interaction
Young-Jun Lee PRO
passing2961
AI & ML interests
Scientific Discovery, Autoresearch
Recent Activity
upvoted a paper about 6 hours ago
Evolution Fine-Tuning: Learning to Discover Across 371 Optimization Tasks updated a collection about 6 hours ago
Evolution Fine-Tuning submitted a paper about 6 hours ago
Evolution Fine-Tuning: Learning to Discover Across 371 Optimization Tasks