Running on CPU Upgrade 180 The Synthetic Data Playbook: Generating Trillions of the Finest Tokens 📝 180 Explore synthetic data experiments in a bookshelf view
mcp-server-bench Collection This is a collection of Benchmarking results between Gradio and FastMCP • 4 items • Updated 11 days ago
Qwen3.5 Dense-to-MoE Weight Transfer Collection Qwen3.5 MoE models from dual-source weight transfer (dense backbone + 35B-A3B experts). Hybrid DeltaNet + GQA attention. • 6 items • Updated 11 days ago