Yonggan Fu PRO

YongganFu

10 18 1

https://www.yongganfu.com/

AI & ML interests

None yet

Recent Activity

upvoted a paper 25 days ago

Zone of Proximal Policy Optimization: Teacher in Prompts, Not Gradients

updated a model about 1 month ago

nvidia/Nemotron-Labs-Diffusion-VLM-8B

updated a model about 1 month ago

nvidia/Nemotron-Labs-Diffusion-3B-Base

View all activity

Organizations

upvoted a paper 25 days ago

Zone of Proximal Policy Optimization: Teacher in Prompts, Not Gradients

Paper • 2606.18216 • Published 29 days ago • 63

upvoted a collection about 2 months ago

Efficient-DLM

Collection

2 items • Updated Jun 12 • 4

upvoted a paper about 2 months ago

Agent Explorative Policy Optimization for Multimodal Agentic Reasoning

Paper • 2605.28774 • Published May 27 • 93

upvoted an article about 2 months ago

Article

Towards Speed-of-Light Text Generation with Nemotron-Labs Diffusion Language Models

nvidia

•

May 23

• 35

upvoted a collection about 2 months ago

Nemotron-Labs-Diffusion

Collection

A Tri-Mode Language Model Family Unifying Autoregressive, Diffusion, and Self-Speculation Decoding • 7 items • Updated Jun 12 • 52

upvoted 2 papers 6 months ago

PACEvolve: Enabling Long-Horizon Progress-Aware Consistent Evolution

Paper • 2601.10657 • Published Jan 15 • 20

GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

Paper • 2601.05242 • Published Jan 8 • 235

upvoted 2 papers 7 months ago

ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration

Paper • 2511.21689 • Published Nov 26, 2025 • 129

Nemotron-Flash: Towards Latency-Optimal Hybrid Small Language Models

Paper • 2511.18890 • Published Nov 24, 2025 • 37

upvoted an article about 1 year ago

Article

SmolLM3: smol, multilingual, long-context reasoner

eliebak, cmpatino, anton-l, edbeeching, m-ric, nouamanetazi, akseljoonas, guipenedo, hynky, clefourrier, SaylorTwift, kashif, qgallouedec, hlarcher, glutamatt, Xenova, reach-vb, ngxson, craffel, lewtun, loubnabnl, lvwerra, thomwolf

•

Jul 8, 2025

• 782

upvoted a paper about 1 year ago

CLIMB: CLustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training

Paper • 2504.13161 • Published Apr 17, 2025 • 98

upvoted 3 papers over 1 year ago

Scaling Vision Pre-Training to 4K Resolution

Paper • 2503.19903 • Published Mar 25, 2025 • 42

SANA 1.5: Efficient Scaling of Training-Time and Inference-Time Compute in Linear Diffusion Transformer

Paper • 2501.18427 • Published Jan 30, 2025 • 27

NVILA: Efficient Frontier Visual Language Models

Paper • 2412.04468 • Published Dec 5, 2024 • 62

upvoted a collection over 1 year ago

Hymba

Collection

A series of Hybrid Small Language Models. • 3 items • Updated Jun 12 • 34

upvoted a paper over 1 year ago

Hymba: A Hybrid-head Architecture for Small Language Models

Paper • 2411.13676 • Published Nov 20, 2024 • 50

Yonggan Fu PRO

AI & ML interests

Recent Activity

Organizations

YongganFu's activity

Towards Speed-of-Light Text Generation with Nemotron-Labs Diffusion Language Models

SmolLM3: smol, multilingual, long-context reasoner