view article Article SyGra: The One-Stop Framework for Building Data for LLMs and SLMs Sep 22, 2025 • 14
view article Article Multimodal Embedding & Reranker Models with Sentence Transformers 23 days ago • 53
view article Article Efficient LLM Pretraining: Packed Sequences and Masked Attention Oct 7, 2024 • 70
Running on CPU Upgrade Featured 3.14k The Smol Training Playbook 📚 3.14k The secrets to building world-class LLMs
Running 3.82k The Ultra-Scale Playbook 🌌 3.82k The ultimate guide to training LLM on large GPU Clusters