nm-testing's Collections
FP8-Block Quantized Models
Updated Nov 17
A collection of state-of-the-art FP8 block-quantized models.
RedHatAI/Qwen3-8B-FP8-block • Text Generation • 8B • Updated Nov 7 • 105
RedHatAI/Qwen3-32B-FP8-block • Text Generation • 33B • Updated Oct 24 • 29
RedHatAI/Qwen3-14B-FP8-block • Text Generation • 15B • Updated Oct 24 • 13
RedHatAI/Llama-3.1-8B-Instruct-FP8-block • Text Generation • 8B • Updated Oct 29 • 97
nm-testing/Qwen3-VL-235B-A22B-Instruct-FP8-BLOCK • Text Generation • Updated Oct 27
nm-testing/Llama-4-Scout-17B-16E-Instruct-BLOCK-FP8 • Text Generation • 109B • Updated Oct 27 • 6
RedHatAI/Llama-3.3-70B-Instruct-FP8-block • Text Generation • 71B • Updated Oct 24 • 8
nm-testing/Llama-4-Maverick-17B-128E-Instruct-block-FP8 • Text Generation • Updated Oct 27 • 12
nm-testing/Qwen3-30B-A3B-FP8-block • Text Generation • 3B • Updated Oct 27 • 10
nm-testing/granite-4.0-h-small-FP8-block • Text Generation • 32B • Updated Nov 17 • 10