-
-
-
-
-
-
Inference Providers
Active filters:
8-bit
Text Generation
•
22B
•
Updated
•
5.84M
•
•
4.32k
Text Generation
•
120B
•
Updated
•
3.26M
•
•
4.46k
nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-NVFP4
Text Generation
•
18B
•
Updated
•
110k
•
86
microsoft/bitnet-b1.58-2B-4T
Text Generation
•
0.8B
•
Updated
•
6.21k
•
1.29k
mlx-community/Qwen3-Coder-Next-8bit
Text Generation
•
80B
•
Updated
•
1.24k
•
8
GadflyII/GLM-4.7-Flash-NVFP4
Text Generation
•
18B
•
Updated
•
295k
•
59
openai/gpt-oss-safeguard-20b
Text Generation
•
22B
•
Updated
•
41.9k
•
•
191
GadflyII/GLM-4.7-Flash-MXFP4
Text Generation
•
18B
•
Updated
•
11k
•
8
unsloth/NVIDIA-Nemotron-3-Nano-30B-A3B-NVFP4
Text Generation
•
18B
•
Updated
•
339
•
8
inferencerlabs/Qwen3-Coder-Next-MLX-9bit
Text Generation
•
80B
•
Updated
•
1.07k
•
3
nvidia/Llama-3.3-70B-Instruct-NVFP4
41B
•
Updated
•
11.4k
•
34
MaziyarPanahi/Qwen3-14B-GGUF
Text Generation
•
15B
•
Updated
•
272k
•
7
mlx-community/LFM2-350M-8bit
Text Generation
•
99.7M
•
Updated
•
253
•
4
nvidia/DeepSeek-R1-0528-NVFP4-v2
Text Generation
•
394B
•
Updated
•
102k
•
13
Text Generation
•
22B
•
Updated
•
29.4k
•
41
nvidia/NVIDIA-Nemotron-Nano-12B-v2-VL-NVFP4-QAD
Image-Text-to-Text
•
8B
•
Updated
•
36.2k
•
19
kldzj/gpt-oss-120b-heretic-v2
Text Generation
•
117B
•
Updated
•
428
•
20
mlx-community/mistralai_Devstral-Small-2-24B-Instruct-2512-MLX-8Bit
Text Generation
•
24B
•
Updated
•
779
•
6
lmstudio-community/functiongemma-270m-it-MLX-8bit
Text Generation
•
75.4M
•
Updated
•
598
•
5
Text Generation
•
177B
•
Updated
•
4.22k
•
14
lukealonso/MiniMax-M2.1-NVFP4
115B
•
Updated
•
27.6k
•
23
nvidia/Qwen3-235B-A22B-Thinking-2507-NVFP4
Text Generation
•
120B
•
Updated
•
137
•
3
MultiverseComputingCAI/HyperNova-60B
Text Generation
•
60B
•
Updated
•
1.24k
•
52
Image-Text-to-Text
•
62B
•
Updated
•
7.72k
•
5
mlx-community/translategemma-12b-it-8bit
Text Generation
•
12B
•
Updated
•
1.32k
•
5
mlx-community/Qwen3-ASR-1.7B-8bit
0.8B
•
Updated
•
1.23k
•
7
CalamitousFelicitousness/HunyuanImage-3.0-Instruct-Distil-SDNQ-4bit-dynamic
Image-to-Image
•
45B
•
Updated
•
86
•
2
mlx-community/GLM-OCR-8bit
Image-to-Text
•
0.6B
•
Updated
•
773
•
2
EpistemeAI/rsi-gpt-oss-120bv2-8bit
Text Generation
•
120B
•
Updated
•
136
•
2
MuXodious/gpt-oss-20b-RichardErkhov-heresy
Text Generation
•
22B
•
Updated
•
72
•
2