Edit Models filters

Apps

Docker Model Runner

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

Inference Endpoints

text-generation-inference

Eval Results (legacy)

text-embeddings-inference

4-bit precision

8-bit precision

Mixture of Experts

Carbon Emissions

Models

1,304

Full-text search

Active filters: fp8

Qwen/Qwen3-4B-Thinking-2507-FP8

Text Generation • 4B • Updated Aug 6, 2025 • 160k • 56

XiaomiMiMo/MiMo-V2-Flash-Base

Text Generation • 310B • Updated Dec 17, 2025 • 238 • 42

RamManavalan/Qwen3-VL-Embedding-8B-FP8

Feature Extraction • 9B • Updated 17 days ago • 121k • 2

RamonGuthrie/z_image_base-nvfp8-mixed

Text-to-Image • Updated 3 days ago • 730 • 10

RedHatAI/Llama-3.3-70B-Instruct-FP8-dynamic

Text Generation • 71B • Updated Dec 12, 2025 • 48.3k • 14

RedHatAI/DeepSeek-V2.5-1210-FP8

Text Generation • 236B • Updated Jan 4, 2025 • 40.8k • 4

Qwen/Qwen3-8B-FP8

Text Generation • 8B • Updated Jul 26, 2025 • 318k • 52

RedHatAI/gemma-3-12b-it-FP8-dynamic

Image-to-Text • 12B • Updated Jun 9, 2025 • 2.34k • 7

RedHatAI/gemma-3-27b-it-FP8-dynamic

Image-to-Text • 27B • Updated Jun 9, 2025 • 19.9k • 12

Qwen/Qwen3-30B-A3B-FP8

Text Generation • 31B • Updated Jul 26, 2025 • 25.5k • 80

unsloth/DeepSeek-R1-0528-GGUF

Text Generation • 671B • Updated Jun 15, 2025 • 3.51k • 194

LGAI-EXAONE/EXAONE-4.0-32B-FP8

Text Generation • 32B • Updated Aug 1, 2025 • 15.4k • 15

unsloth/Kimi-K2-Instruct-GGUF

Text Generation • 1T • Updated Dec 29, 2025 • 40.1k • 219

zai-org/GLM-4.5-FP8

Text Generation • 358B • Updated Aug 12, 2025 • 3.13k • 77

Qwen/Qwen3-Coder-480B-A35B-Instruct-FP8

Text Generation • 480B • Updated Aug 21, 2025 • 239k • • 146

unsloth/Qwen3-Coder-480B-A35B-Instruct-FP8

Text Generation • 480B • Updated Jul 22, 2025 • 22 • 8

phazei/HunyuanVideo-Foley

Updated Oct 3, 2025 • 14

Qwen/Qwen3-Next-80B-A3B-Thinking-FP8

Text Generation • 81B • Updated Sep 22, 2025 • 142k • 48

deepseek-ai/DeepSeek-V3.1-Terminus

Text Generation • 685B • Updated Sep 29, 2025 • 5.84k • • 360

Qwen/Qwen3-VL-235B-A22B-Instruct-FP8

Image-Text-to-Text • 236B • Updated Nov 26, 2025 • 168k • 38

Qwen/Qwen3-VL-30B-A3B-Thinking-FP8

Image-Text-to-Text • 31B • Updated Nov 26, 2025 • 159k • 50

Qwen/Qwen3-VL-4B-Instruct-FP8

Image-Text-to-Text • 5B • Updated Oct 15, 2025 • 45k • 46

Qwen/Qwen3-VL-32B-Instruct-FP8

Image-Text-to-Text • 33B • Updated Oct 22, 2025 • 152k • 37

6chan/krea-realtime-video-fp8

Text-to-Video • Updated Oct 23, 2025 • 390 • 6

nex-agi/DeepSeek-V3.1-Nex-N1

Text Generation • 671B • Updated Dec 10, 2025 • 76 • 42

drbaph/Z-Image-Turbo-FP8

Text-to-Image • Updated Nov 27, 2025 • 8.43k • 36

unsloth/DeepSeek-V3.2

Text Generation • 685B • Updated 11 days ago • 63 • 9

mlx-community/Ministral-3-14B-Instruct-2512

Text Generation • Updated Dec 3, 2025 • 220 • 1

Doradus-AI/MiroThinker-v1.0-30B-FP8

Text Generation • 31B • Updated Dec 5, 2025 • 12 • 4

RedHatAI/Qwen3-VL-32B-Instruct-NVFP4

Text Generation • 20B • Updated Dec 10, 2025 • 13.3k • 2