hesamation/Qwen3.6-35B-A3B-Claude-4.6-Opus-Reasoning-Distilled-GGUF Text Generation • 35B • Updated 5 days ago • 63.1k • 165
Article: GGML and llama.cpp join HF to ensure the long-term progress of Local AI • Feb 20 • 503
bartowski/allura-forge_Llama-3.3-8B-Instruct-GGUF Text Generation • 8B • Updated Dec 30, 2025 • 2.76k • 27
Cerebras REAP Collection • Sparse MoE models compressed using the REAP (Router-weighted Expert Activation Pruning) method • 30 items • Updated Feb 25 • 137