Yukai Wang's picture

Yukai Wang

defu2596

·

wonderNefelibata

AI & ML interests

None yet

Recent Activity

upvoted a paper about 1 month ago

Skill1: Unified Evolution of Skill-Augmented Agents via Reinforcement Learning

upvoted a paper about 1 month ago

Rubric-based On-policy Distillation

liked a dataset 8 months ago

View all activity

Organizations

None yet

upvoted 2 papers about 1 month ago

Skill1: Unified Evolution of Skill-Augmented Agents via Reinforcement Learning

Paper • 2605.06130 • Published May 7 • 112

Rubric-based On-policy Distillation

Paper • 2605.07396 • Published May 8 • 41

liked a dataset 8 months ago

cais/hle

Benchmark • Updated Jan 20 • 2.5k • 35.9k • 833

upvoted a paper 9 months ago

Quantile Advantage Estimation for Entropy-Safe Reasoning

Paper • 2509.22611 • Published Sep 26, 2025 • 119

liked a dataset 9 months ago

data-is-better-together/10k_prompts_ranked

Viewer • Updated Mar 7, 2024 • 10.3k • 1.11k • 168

New activity in meta-llama/Llama-3.2-11B-Vision-Instruct about 1 year ago

Request rejected

#109 opened about 1 year ago by

New activity in Fancy-MLLM/R1-Onevision-7B-RL about 1 year ago

Model Selection

#1 opened about 1 year ago by