unsloth/Qwen3-Coder-30B-A3B-Instruct-GGUF Text Generation • 31B • Updated about 22 hours ago • 107k • 427
view post Post 3095 You can now run Kimi K2.5 locally! 🔥We shrank the 1T model to 240GB (-60%) via Dynamic 1-bit.Get >40 tok/s on 242GB or 622GB VRAM/RAM for near full precision.GGUF: unsloth/Kimi-K2.5-GGUFGuide: https://unsloth.ai/docs/models/kimi-k2.5 See translation 7 replies · 🚀 15 15 😎 3 3 🔥 2 2 👀 1 1 + Reply