GLM-5.1 / .eval_results /claw-eval.yaml
ZHANGYUXUAN-zR's picture
Add Claw-Eval evaluation results (#33)
26e1bd6
- dataset:
id: claw-eval/Claw-Eval
task_id: general
value: 62.7
date: '2026-04-23'
notes: "Pass³% | N=3 | 161 tasks"
source:
url: https://claw-eval.github.io
name: Claw-Eval Leaderboard
user: tobiaslee
- dataset:
id: claw-eval/Claw-Eval
task_id: multi_turn
value: 60.5
date: '2026-04-23'
notes: "Pass³% | N=3 | 38 tasks"
source:
url: https://claw-eval.github.io
name: Claw-Eval Leaderboard
user: tobiaslee