view article Article KV Caching Explained: Optimizing Transformer Inference Efficiency Jan 30, 2025 • 269
view article Article Introducing AI Sheets: a tool to work with datasets using open AI models! +4 Aug 8, 2025 • 108
view article Article NVIDIA Releases 3 Million Sample Dataset for OCR, Visual Question Answering, and Captioning Tasks Aug 11, 2025 • 76
view article Article How To Build a News Agent with GPT-OSS, Hugging Face Inference & Gradio Aug 14, 2025 • 25
view article Article Accelerate ND-Parallel: A guide to Efficient Multi-GPU Training +3 Aug 8, 2025 • 95
view article Article Make your ZeroGPU Spaces go brrr with ahead-of-time compilation +2 Sep 2, 2025 • 77
view article Article Implementing MCP Servers in Python: An AI Shopping Assistant with Gradio Jul 31, 2025 • 60
view article Article Open Preference Dataset for Text-to-Image Generation by the 🤗 Community +5 Dec 9, 2024 • 70
view article Article From Zero to GPU: A Guide to Building and Scaling Production-Ready CUDA Kernels Aug 18, 2025 • 95
view article Article Building Effective Agents with Anthropic’s Best Practices and smolagents ❤️ Jan 4, 2025 • 9
GLM-4.5 Collection GLM-4.5: An open-source large language model designed for intelligent agents by Z.ai • 8 items • Updated 28 days ago • 252