Article Tensor Parallelism (TP) in Transformers: 5 Minutes to Understand • 6 days ago • 59
Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate Paper • 2501.17703 • Published Jan 29 • 59
Kandinsky 5.0 Video Lite Collection Kandinsky 5.0 Video Lite is a lightweight 2B model that generates up to 10-second SD videos from English and Russian prompts with high visual quality. • 9 items • Updated 16 days ago • 8
Kandinsky 5.0 Video Lite Diffusers Collection Kandinsky 5.0 Video Lite is a lightweight 2B model that generates up to 10-second SD videos from English and Russian prompts with high visual quality. • 8 items • Updated 16 days ago • 4
Kandinsky 5.0 Video Pro Diffusers Collection Kandinsky 5.0 Video Pro is a 19B model that generates high-quality HD videos from English and Russian prompts with controllable camera motion. • 4 items • Updated about 8 hours ago • 6
Kandinsky 5.0 Video Pro Collection Kandinsky 5.0 Video Pro is a 19B model that generates high-quality HD videos from English and Russian prompts with controllable camera motion. • 5 items • Updated 16 days ago • 15
Kandinsky 5.0 Image Lite Collection Kandinsky 5.0 Image Lite is a 6B DiT-based model that generates and edits HD images from English and Russian text prompts with high visual quality. • 4 items • Updated 16 days ago • 13
MarsRL: Advancing Multi-Agent Reasoning System via Reinforcement Learning with Agentic Pipeline Parallelism Paper • 2511.11373 • Published 26 days ago • 12
TiDAR: Think in Diffusion, Talk in Autoregression Paper • 2511.08923 • Published 29 days ago • 112
INT v.s. FP: A Comprehensive Study of Fine-Grained Low-bit Quantization Formats Paper • 2510.25602 • Published Oct 29 • 77
Latent Diffusion Model without Variational Autoencoder Paper • 2510.15301 • Published Oct 17 • 48
Learning to Reason as Action Abstractions with Scalable Mid-Training RL Paper • 2509.25810 • Published Sep 30 • 5