nm-testing/llama2.c-stories42M-gsm8k-quantized-only-uncompressed 58.2M • Updated Feb 12, 2025 • 2.11k
nm-testing/DeepSeek-R1-Distill-Llama-70B-FP8-dynamic Text Generation • 71B • Updated Feb 1, 2025 • 1 • 3
nm-testing/TinyLlama-1.1B-Chat-v1.0-gsm8k-partial-24-remaining-fp8-compressed 1B • Updated Jan 29, 2025 • 5
nm-testing/TinyLlama-1.1B-Chat-v1.0-gsm8k-partial-24-entire-fp8-compressed 1B • Updated Jan 29, 2025 • 8