Learning from the Self-future: On-policy Self-distillation for dLLMs Paper • 2606.18195 • Published 4 days ago • 70
DCAgent3/medagentbench_g1_diverse_tezos_top4_316_8b_20260602_100929 Viewer • Updated 17 days ago • 1.64k • 28 • 1
CroCo: Cross-Lingual Contrastive Preference Tuning on Self-Generations Paper • 2605.26293 • Published 26 days ago • 6
Gamma-World: Generative Multi-Agent World Modeling Beyond Two Players Paper • 2605.28816 • Published 24 days ago • 427
sashaboguraev/pythia-160m-ppt-shuffle_dyck_steps500-seed208-keep_layernorm Text Generation • 0.2B • Updated 28 days ago • 29 • 1
trjxter/Qwimi3.5-9B-Kimik2.6-Opus-Distill-MTP-BF16 Text Generation • 10B • Updated 29 days ago • 211 • 1
IndusAgent: Reinforcing Open-Vocabulary Industrial Anomaly Detection with Agentic Tools Paper • 2605.20682 • Published about 1 month ago • 84
Anti-Self-Distillation for Reasoning RL via Pointwise Mutual Information Paper • 2605.11609 • Published May 12 • 196
OpenSearch-VL: An Open Recipe for Frontier Multimodal Search Agents Paper • 2605.05185 • Published May 6 • 105
The Geometric Canary: Predicting Steerability and Detecting Drift via Representational Stability Paper • 2604.17698 • Published Apr 20 • 4