Hy3-preview NVFP4A16
This is a checkpoint-only NVFP4A16 quantization of tencent/Hy3-preview, produced with llmcompressor.entrypoints.model_free.model_free_ptq.
- Base model:
tencent/Hy3-preview - Quantization scheme:
NVFP4A16 - Ignored modules/patterns:
lm_head, model.embed_tokens, re:.*router.gate$, re:.*expert_bias$ - Source snapshot: recorded in
QUANTIZATION_MANIFEST.json - License: inherits Tencent Hy Community License Agreement from the base model; original
LICENSEis included.
Notes
This release quantizes safetensors weights without importing the custom HYV3 model class. Router gates, expert bias tensors, embeddings, and lm_head are preserved unquantized for compatibility/conservatism.
- Downloads last month
- 37
Model tree for 0xSero/Hy3-preview-NVFP4
Base model
tencent/Hy3-preview