Yifan Zeng

yokey

https://xhmy.github.io/

AI & ML interests

Large Language Model, Agentic AI, Deep Learning

Organizations

None yet

Recent Activity

upvoted a paper about 1 month ago

General Agentic Memory Via Deep Research

Paper • 2511.18423 • Published Nov 23, 2025 • 161

upvoted a paper 3 months ago

Vision-Zero: Scalable VLM Self-Improvement via Strategic Gamified Self-Play

Paper • 2509.25541 • Published Sep 29, 2025 • 140

liked a dataset 4 months ago

nvidia/Nemotron-CC-v2

Viewer • Updated 11 days ago • 8.79B • 49.2k • 96

upvoted a paper 4 months ago

AgentFly: Fine-tuning LLM Agents without Fine-tuning LLMs

Paper • 2508.16153 • Published Aug 22, 2025 • 160

upvoted 3 papers 5 months ago

upvoted a paper 7 months ago

Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9, 2025 • 263

updated a collection 7 months ago

LLM

Collection

21 items • Updated Jun 11, 2025

upvoted 2 papers 7 months ago

WHEN TO ACT, WHEN TO WAIT: Modeling Structural Trajectories for Intent Triggerability in Task-Oriented Dialogue

Paper • 2506.01881 • Published Jun 2, 2025 • 6

Table-R1: Inference-Time Scaling for Table Reasoning

Paper • 2505.23621 • Published May 29, 2025 • 93

upvoted a paper 9 months ago

VAPO: Efficient and Reliable Reinforcement Learning for Advanced Reasoning Tasks

Paper • 2504.05118 • Published Apr 7, 2025 • 26

updated a collection 10 months ago

LLM

Collection

21 items • Updated Jun 11, 2025

upvoted 3 papers 10 months ago

R1-Zero's "Aha Moment" in Visual Reasoning on a 2B Non-SFT Model

Paper • 2503.05132 • Published Mar 7, 2025 • 57

Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models

Paper • 2503.09573 • Published Mar 12, 2025 • 74

SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution

Paper • 2502.18449 • Published Feb 25, 2025 • 75

upvoted an article 10 months ago

Article

Fine-tune Deepseek-R1 with a Synthetic Reasoning Dataset

Feb 10, 2025

upvoted a paper 12 months ago

LlamaV-o1: Rethinking Step-by-step Visual Reasoning in LLMs

Paper • 2501.06186 • Published Jan 10, 2025 • 65

liked a model about 1 year ago

sfairXC/FsfairX-LLaMA3-RM-v0.1

Text Classification • 8B • Updated Oct 14, 2024 • 720 • 60

upvoted a paper about 1 year ago

Token-Budget-Aware LLM Reasoning

Paper • 2412.18547 • Published Dec 24, 2024 • 46
