Inference Providers
Active filters: draft
0xSero/NVIDIA-Nemotron-3-Super-120B-A12B-BF16-REAP-50pct-draft
Text Generation
• 64B • Updated • 69
• 5
0xSero/NVIDIA-Nemotron-3-Super-120B-A12B-BF16-REAP-50pct-AutoRound-W4A16-draft
Text Generation
• 6B • Updated • 68
• 3
0xSero/NVIDIA-Nemotron-3-Super-120B-A12B-BF16-REAP-25pct-AutoRound-W4A16-draft
Text Generation
• 6B • Updated • 55
• 2
0xSero/NVIDIA-Nemotron-3-Super-120B-A12B-BF16-REAP-25pct-draft
Text Generation
• 92B • Updated • 47
• 1
0xSero/NVIDIA-Nemotron-3-Super-120B-A12B-BF16-AutoRound-W4A16-draft
Text Generation
• Updated • 1
mradermacher/DeepSeek-R1-DRAFT-0.5B-v1.0-GGUF
0.5B • Updated • 68
mradermacher/DeepSeek-V3-0324-DRAFT-0.5B-v1.0-GGUF
0.5B • Updated • 48
Gapeleon/DeepSeek-R1-0528-CODER-DRAFT-0.6B-v1.0-Q4_K_M-GGUF
0.6B • Updated • 8
mradermacher/DeepSeek-V3-0324-CODER-DRAFT-0.6B-v1.0-GGUF
0.6B • Updated • 31
mradermacher/DeepSeek-R1-0528-CODER-DRAFT-0.6B-v1.0-GGUF
0.6B • Updated • 78
0.8B • Updated • 3
mradermacher/DeepSeek-R1-0528-CODER-DRAFT-0.6B-v1.1-GGUF
0.6B • Updated • 165
mradermacher/DeepSeek-V3-0324-CODER-DRAFT-0.6B-v1.1-GGUF
0.6B • Updated • 113
mradermacher/DeepSeek-R1-DRAFT-0.6B-v2.0-GGUF
0.6B • Updated • 19
mradermacher/DeepSeek-V3-DRAFT-0.6B-v2.0-GGUF
0.6B • Updated • 65
• 1
jukofyork/GLM-4.5-DRAFT-0.6B-v3.0
0.6B • Updated • 11
• 5
jukofyork/GLM-4.5-DRAFT-0.6B-v3.0-GGUF
0.6B • Updated • 239
• 19
mradermacher/GLM-4.5-DRAFT-0.6B-v3.0-GGUF
0.6B • Updated • 73
mradermacher/GLM-4.5-DRAFT-0.6B-v3.0-i1-GGUF
0.6B • Updated • 97
• 1
jukofyork/DeepSeek-R1-DRAFT-0.6B-v3.0
0.6B • Updated • 7
• 1
jukofyork/DeepSeek-R1-DRAFT-0.6B-v3.0-GGUF
0.6B • Updated • 28
mradermacher/DeepSeek-R1-DRAFT-0.6B-v3.0-GGUF
0.6B • Updated • 22
mradermacher/DeepSeek-R1-DRAFT-0.6B-v3.0-i1-GGUF
0.6B • Updated • 99
jukofyork/DeepSeek-V3-DRAFT-0.6B-v3.0
0.6B • Updated • 9
• 2
jukofyork/DeepSeek-V3-DRAFT-0.6B-v3.0-GGUF
0.6B • Updated • 60
jukofyork/Qwen3-0.6B-YaRN-GGUF
0.8B • Updated • 676
• 4
jukofyork/Kimi-K2-Instruct-DRAFT-0.6B-v3.0
0.7B • Updated • 4
• 1
jukofyork/Kimi-K2-Instruct-DRAFT-0.6B-v3.0-GGUF
0.7B • Updated • 56
jukofyork/Qwen3-Coder-Instruct-DRAFT-0.75B-GGUF
0.8B • Updated • 589
• 7
mradermacher/DeepSeek-V3-DRAFT-0.6B-v3.0-GGUF
0.6B • Updated • 55