Hy3-preview FP8_BLOCK
This is a checkpoint-only FP8_BLOCK quantization of tencent/Hy3-preview, produced with llmcompressor.entrypoints.model_free.model_free_ptq.
- Base model:
tencent/Hy3-preview - Quantization scheme:
FP8_BLOCK - Ignored modules/patterns:
lm_head, model.embed_tokens, re:.*router.gate$, re:.*expert_bias$ - Source snapshot: recorded in
QUANTIZATION_MANIFEST.json - License: inherits Tencent Hy Community License Agreement from the base model; original
LICENSEis included.
Notes
This release quantizes safetensors weights without importing the custom HYV3 model class. Router gates, expert bias tensors, embeddings, and lm_head are preserved unquantized for compatibility/conservatism.
- Downloads last month
- 30
Model tree for 0xSero/Hy3-preview-FP8
Base model
tencent/Hy3-preview