arxiv:2510.00492
Jiongdao Jin
jiongdao
AI & ML interests
None yet
Recent Activity
upvoted a paper 3 days ago
Zone of Proximal Policy Optimization: Teacher in Prompts, Not Gradients updated a model 7 days ago
jiongdao/grpo-outputs updated a dataset 7 days ago
jiongdao/grpo-results