TRACE: A Unified Rollout Budget Allocation Framework for Efficient Agentic Reinforcement Learning Paper • 2606.11119 • Published 22 days ago • 18
FastKernels: Benchmarking GPU Kernel Generation in Production Paper • 2605.23215 • Published May 22 • 8
LiveBrowseComp: Are Search Agents Searching, or Just Verifying What They Already Know? Paper • 2605.28721 • Published May 27 • 18
Skill0.5: Joint Skill Internalization and Utilization for Out-of-Distribution Generalization in Agentic Reinforcement Learning Paper • 2605.28424 • Published May 27 • 32
DelTA: Discriminative Token Credit Assignment for Reinforcement Learning from Verifiable Rewards Paper • 2605.21467 • Published May 20 • 207
When Numbers Speak: Aligning Textual Numerals and Visual Instances in Text-to-Video Diffusion Models Paper • 2604.08546 • Published Apr 9 • 116
Adam's Law: Textual Frequency Law on Large Language Models Paper • 2604.02176 • Published Apr 2 • 509
GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning Paper • 2604.02721 • Published Apr 3 • 639
CARLA-Air: Fly Drones Inside a CARLA World -- A Unified Infrastructure for Air-Ground Embodied Intelligence Paper • 2603.28032 • Published Mar 30 • 344