Runpeng Dai
Leo-Dai
AI & ML interests
None yet
Recent Activity
authored a paper about 8 hours ago
Small RL Controller, Large Language Model: RL-Guided Adaptive Sampling for Test-Time Scaling authored a paper 24 days ago
DeltaRubric: Generative Multimodal Reward Modeling via Joint Planning and Verification