view article Article Tokenization in Transformers v5: Simpler, Clearer, and More Modular +4 28 days ago • 113
view article Article The Engineering Handbook for GRPO + LoRA with Verl: Training Qwen2.5 on Multi-GPU 12 days ago • 10
📝 Research & Long-Form Blog Posts Collection In-depth technical articles and research pieces published by Hugging Face • 9 items • Updated 7 days ago • 15
Nemotron 3 Nano: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning Paper • 2512.20848 • Published 22 days ago • 33
view article Article Transformers v5: Simple model definitions powering the AI ecosystem +2 Dec 1, 2025 • 271
view article Article Train 400x faster Static Embedding Models with Sentence Transformers Jan 15, 2025 • 222
view article Article Introducing Trackio: A Lightweight Experiment Tracking Library from Hugging Face +3 Jul 29, 2025 • 207
Deep Compression Autoencoder for Efficient High-Resolution Diffusion Models Paper • 2410.10733 • Published Oct 14, 2024 • 9
view article Article Fine-Tune Wav2Vec2 for English ASR in Hugging Face with 🤗 Transformers Mar 12, 2021 • 42
view article Article Fine-Tune XLSR-Wav2Vec2 for low-resource ASR with 🤗 Transformers Nov 15, 2021 • 39