Running 12 Defeating the trainer-generator precision mismatch in TRL 🎯 12 Download research PDF (Pro access required)
nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-BF16 Text Generation • 124B • Updated 11 days ago • 590k • 335
nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-Base-BF16 Text Generation • 124B • Updated Mar 14 • 14.9k • 27