Learning from Next-Frame Prediction: Autoregressive Video Modeling Encodes Effective Representations Paper • 2512.21004 • Published 4 days ago • 11
Next-Embedding Prediction Makes Strong Vision Learners Paper • 2512.16922 • Published 10 days ago • 79
Towards Scalable Pre-training of Visual Tokenizers for Generation Paper • 2512.13687 • Published 13 days ago • 96
Reward Forcing: Efficient Streaming Video Generation with Rewarded Distribution Matching Distillation Paper • 2512.04678 • Published 24 days ago • 40
AvatarVerse: High-quality & Stable 3D Avatar Creation from Text and Pose Paper • 2308.03610 • Published Aug 7, 2023 • 24