-
-
-
-
-
-
Inference Providers
Active filters:
fp8
Qwen/Qwen3-4B-Thinking-2507-FP8
Text Generation
•
4B
•
Updated
•
160k
•
56
XiaomiMiMo/MiMo-V2-Flash-Base
Text Generation
•
310B
•
Updated
•
238
•
42
RamManavalan/Qwen3-VL-Embedding-8B-FP8
Feature Extraction
•
9B
•
Updated
•
121k
•
2
RamonGuthrie/z_image_base-nvfp8-mixed
Text-to-Image
•
Updated
•
730
•
10
RedHatAI/Llama-3.3-70B-Instruct-FP8-dynamic
Text Generation
•
71B
•
Updated
•
48.3k
•
14
RedHatAI/DeepSeek-V2.5-1210-FP8
Text Generation
•
236B
•
Updated
•
40.8k
•
4
Text Generation
•
8B
•
Updated
•
318k
•
52
RedHatAI/gemma-3-12b-it-FP8-dynamic
Image-to-Text
•
12B
•
Updated
•
2.34k
•
7
RedHatAI/gemma-3-27b-it-FP8-dynamic
Image-to-Text
•
27B
•
Updated
•
19.9k
•
12
Text Generation
•
31B
•
Updated
•
25.5k
•
80
unsloth/DeepSeek-R1-0528-GGUF
Text Generation
•
671B
•
Updated
•
3.51k
•
194
LGAI-EXAONE/EXAONE-4.0-32B-FP8
Text Generation
•
32B
•
Updated
•
15.4k
•
15
unsloth/Kimi-K2-Instruct-GGUF
Text Generation
•
1T
•
Updated
•
40.1k
•
219
Text Generation
•
358B
•
Updated
•
3.13k
•
77
Qwen/Qwen3-Coder-480B-A35B-Instruct-FP8
Text Generation
•
480B
•
Updated
•
239k
•
•
146
unsloth/Qwen3-Coder-480B-A35B-Instruct-FP8
Text Generation
•
480B
•
Updated
•
22
•
8
phazei/HunyuanVideo-Foley
Qwen/Qwen3-Next-80B-A3B-Thinking-FP8
Text Generation
•
81B
•
Updated
•
142k
•
48
deepseek-ai/DeepSeek-V3.1-Terminus
Text Generation
•
685B
•
Updated
•
5.84k
•
•
360
Qwen/Qwen3-VL-235B-A22B-Instruct-FP8
Image-Text-to-Text
•
236B
•
Updated
•
168k
•
38
Qwen/Qwen3-VL-30B-A3B-Thinking-FP8
Image-Text-to-Text
•
31B
•
Updated
•
159k
•
50
Qwen/Qwen3-VL-4B-Instruct-FP8
Image-Text-to-Text
•
5B
•
Updated
•
45k
•
46
Qwen/Qwen3-VL-32B-Instruct-FP8
Image-Text-to-Text
•
33B
•
Updated
•
152k
•
37
6chan/krea-realtime-video-fp8
Text-to-Video
•
Updated
•
390
•
6
nex-agi/DeepSeek-V3.1-Nex-N1
Text Generation
•
671B
•
Updated
•
76
•
42
Text-to-Image
•
Updated
•
8.43k
•
36
Text Generation
•
685B
•
Updated
•
63
•
9
mlx-community/Ministral-3-14B-Instruct-2512
Text Generation
•
Updated
•
220
•
1
Doradus-AI/MiroThinker-v1.0-30B-FP8
Text Generation
•
31B
•
Updated
•
12
•
4
RedHatAI/Qwen3-VL-32B-Instruct-NVFP4
Text Generation
•
20B
•
Updated
•
13.3k
•
2