6 22 4

Haoran Zhang

zzzhr97

AI & ML interests

Lange Language Models, Large Reasoning Models

Recent Activity

updated a dataset 6 days ago

Simplified-Reasoning/ComBench

published a dataset 6 days ago

Simplified-Reasoning/ComBench

authored a paper 9 days ago

Characterizing, Evaluating, and Optimizing Complex Reasoning

View all activity

Organizations

updated a dataset 6 days ago

Simplified-Reasoning/ComBench

Updated 6 days ago • 32

published a dataset 6 days ago

Simplified-Reasoning/ComBench

Updated 6 days ago • 32

authored 2 papers 9 days ago

Characterizing, Evaluating, and Optimizing Complex Reasoning

Paper • 2602.08498 • Published 17 days ago

ComBench: A Benchmark for Rigorous Proof Reasoning and Constructive Realization in Olympiad-Level Combinatorics

Paper • 2606.10479 • Published 11 days ago • 19

submitted a paper to Daily Papers 9 days ago

ComBench: A Benchmark for Rigorous Proof Reasoning and Constructive Realization in Olympiad-Level Combinatorics

Paper • 2606.10479 • Published 11 days ago • 19

upvoted a paper 10 days ago

ComBench: A Benchmark for Rigorous Proof Reasoning and Constructive Realization in Olympiad-Level Combinatorics

Paper • 2606.10479 • Published 11 days ago • 19

upvoted 2 papers 12 days ago

SubtleMemory: A Benchmark for Fine-Grained Relational Memory Discrimination in Long-Horizon AI Agents

Paper • 2606.05761 • Published 16 days ago • 19

SceneCode: Executable World Programs for Editable Indoor Scenes with Articulated Objects

Paper • 2605.19587 • Published May 19 • 10

updated a dataset 14 days ago

zzzhr97/WebInstruct-Verified-Processed

Viewer • Updated 14 days ago • 233k • 36

New activity in zzzhr97/WebInstruct-Verified-Processed 14 days ago

Add dataset card, link to paper and GitHub repository

#2 opened 15 days ago by

nielsr

New activity in zzzhr97/TRM-8B 14 days ago

Improve model card: add paper link, metadata, and sample usage

#1 opened 15 days ago by

nielsr

New activity in zzzhr97/TRM-Preference 14 days ago

Add dataset card and paper/code links

#1 opened 15 days ago by

nielsr

upvoted a paper 18 days ago

Draft-OPD: On-Policy Distillation for Speculative Draft Models

Paper • 2605.29343 • Published 23 days ago • 35

updated a dataset 27 days ago

zzzhr97/Pi-Bench

Viewer • Updated 26 days ago • 100 • 146 • 1

published a dataset 27 days ago

zzzhr97/Pi-Bench

Viewer • Updated 26 days ago • 100 • 146 • 1

upvoted a paper 29 days ago

π-Bench: Evaluating Proactive Personal Assistant Agents in Long-Horizon Workflows

Paper • 2605.14678 • Published May 19 • 106

submitted a paper to Daily Papers 29 days ago

π-Bench: Evaluating Proactive Personal Assistant Agents in Long-Horizon Workflows

Paper • 2605.14678 • Published May 19 • 106

authored a paper 29 days ago

$π$-Bench: Evaluating Proactive Personal Assistant Agents in Long-Horizon Workflows

Paper • 2605.14678 • Published May 19 • 106

authored a paper about 1 month ago

Achieving Gold-Medal-Level Olympiad Reasoning via Simple and Unified Scaling

Paper • 2605.13301 • Published May 13 • 163

upvoted a paper about 1 month ago

Achieving Gold-Medal-Level Olympiad Reasoning via Simple and Unified Scaling

Paper • 2605.13301 • Published May 13 • 163

Haoran Zhang

AI & ML interests

Recent Activity

Organizations

zzzhr97's activity

Add dataset card, link to paper and GitHub repository

Improve model card: add paper link, metadata, and sample usage

Add dataset card and paper/code links