QwenLong-L1.5: Post-Training Recipe for Long-Context Reasoning and Memory Management Paper • 2512.12967 • Published 17 days ago • 103
Intel/Qwen3-Next-80B-A3B-Instruct-int4-mixed-AutoRound Text Generation • Updated Sep 18, 2025 • 18.9k • 23
nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-FP8 Text Generation • 32B • Updated about 17 hours ago • 464k • 214
view article Article AprielGuard: A Guardrail for Safety and Adversarial Robustness in Modern LLM Systems 9 days ago • 37
view article Article Tokenization in Transformers v5: Simpler, Clearer, and More Modular +4 15 days ago • 90