arxiv:2606.26790
Jinyang Wu
Jinyang23
AI & ML interests
large language models, reasoning, agentic RL
Recent Activity
authored a paper about 18 hours ago
SKILL0: In-Context Agentic Reinforcement Learning for Skill Internalization
authored a paper about 18 hours ago
Self-Distilled Agentic Reinforcement Learning
authored a paper about 18 hours ago
OPID: On-Policy Skill Distillation for Agentic Reinforcement Learning
Organizations
None yet