Running 115 Unlocking On-Policy Distillation for Any Model Family 📝 115 Explore on-policy distillation visualization for any model
Running 82 Maintain the unmaintainable 📚 82 Explore the complex relationships between 400+ machine learning models
Running Agents 80 Transformers Timeline 🤗 80 Interactive timeline to explore the 🤗Transformers models
Running 3.92k The Ultra-Scale Playbook 🌌 3.92k The ultimate guide to training LLM on large GPU Clusters
Running 601 Scaling test-time compute 📈 601 Boost LLM answers with flexible test‑time search strategies