Running 108 Unlocking On-Policy Distillation for Any Model Family 📝 108 Visualize on-policy distillation for any model family
Running 81 Maintain the unmaintainable 📚 81 Explore the complex relationships between 400+ machine learning models
Running Agents 80 Transformers Timeline 🤗 80 Interactive timeline to explore the 🤗Transformers models
Running 3.87k The Ultra-Scale Playbook 🌌 3.87k The ultimate guide to training LLM on large GPU Clusters
Running 600 Scaling test-time compute 📈 600 Boost LLM answers with flexible test‑time search strategies