nm-testing/TinyLlama-1.1B-Chat-v1.0-W4A16-G128-Asym-Updated-ActOrder • 0.3B • Updated • 1.83k
nm-testing/TinyLlama-1.1B-Chat-v1.0-awq-group128-asym256 • 0.3B • Updated • 1
nm-testing/TinyLlama-1.1B-Chat-v1.0-W4A16-G128-Asym-Updated-Channel • 0.3B • Updated • 1
nm-testing/TinyLlama-1.1B-Chat-v1.0-W4A16-G128-Asym-Updated • 0.3B • Updated • 1
nm-testing/Llama-2-7b-hf-gsm8k-quant_w4a16_sym-uncompressed • 7B • Updated • 1
nm-testing/Llama-2-7b-hf-gsm8k-quant_w4a16_sym-compressed • 1B • Updated • 1
nm-testing/Llama-2-7b-hf-gsm8k-gptq_w4a16_sym-uncompressed • 7B • Updated • 2
nm-testing/Llama-2-7b-hf-gsm8k-gptq_w4a16_sym-compressed • 1B • Updated • 1
nm-testing/Llama-2-7b-hf-gsm8k-awq_w4a16_sym-uncompressed • 7B • Updated • 1
nm-testing/Llama-2-7b-hf-gsm8k-awq_w4a16_sym-compressed • 1B • Updated • 2
nm-testing/Llama-2-7b-hf-gsm8k-awq_gptq_sym-uncompressed • 7B • Updated • 1
nm-testing/Llama-2-7b-hf-gsm8k-awq_gptq_sym-compressed • 1B • Updated • 1
nm-testing/Mixtral-8x7B-Instruct-v0.1-FP8-Dynamic • 47B • Updated • 1
nm-testing/Llama-3.1-8B-Instruct-W4A16-G128-shared-pipeline • 2B • Updated • 1
nm-testing/Qwen2-VL-2B-Instruct-FP8-dynamic-cli • 2B • Updated • 1
nm-testing/Qwen2-VL-2B-Instruct-FP8_DYNAMIC • Image-to-Text • 2B • Updated • 1
nm-testing/whisper-large-v3-quantized.w4a16 • 0.3B • Updated • 1
nm-testing/whisper-large-v3-quantized.w8a8_sq • 2B • Updated • 1
nm-testing/whisper-large-v3-quantized.w8a8 • 2B • Updated • 3
nm-testing/llama2.c-stories110M-gsm8k-fp8_dynamic-compressed • 0.1B • Updated • 603
nm-testing/llama2.c-stories110M-gsm8k-recipe_w4a16_actorder_weight-compressed • 60.5M • Updated • 636
nm-testing/Llama-3.2-1B-Instruct-W4A16-uncompressed-mse-hadamard • 5B • Updated
nm-testing/llama2.c-stories15M • Text Generation • 24.4M • Updated • 1.85k
nm-testing/Meta-Llama-3-8B-Instruct-FP8-channel-output-activation-kv_cache-qkv_proj • 8B • Updated • 1
nm-testing/Meta-Llama-3-8B-Instruct-FP8-channel-output-activation-q_proj • 8B • Updated • 1
nm-testing/Meta-Llama-3-8B-Instruct-FP8-channel-output-activation • 8B • Updated • 1
nm-testing/Llama-3.2-1B-W4A16-Transforms • 4B • Updated • 1
nm-testing/Ministral-8B-Instruct-2410-FP8-dynamic • 8B • Updated • 3
nm-testing/TinyLlama-1.1B-Chat-v1.0-W4A16-G128-asym • 0.3B • Updated • 1
nm-testing/Phi-4-mini-instruct-quantized.w4a16.asymmetric • 2B • Updated • 1