stefanocarrera/autophagycode_D_mercury_Qwen3-8B_lr0.0001_c142_trust_t1_g6_run2 Viewer • Updated 6 days ago • 142 • 29 • 1
OpenSearch-VL: An Open Recipe for Frontier Multimodal Search Agents Paper • 2605.05185 • Published 11 days ago • 97
From Context to Skills: Can Language Models Learn from Context Skillfully? Paper • 2604.27660 • Published 14 days ago • 155
Leveraging Verifier-Based Reinforcement Learning in Image Editing Paper • 2604.27505 • Published 17 days ago • 57
felixwangg/prime_vul_minus_splitted_line_diff_mask_skip_indent_ctx3_chat_v2 Viewer • Updated Apr 12 • 4.05k • 45
AgentHazard: A Benchmark for Evaluating Harmful Behavior in Computer-Use Agents Paper • 2604.02947 • Published Apr 3 • 19
GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning Paper • 2604.02721 • Published Apr 3 • 628
DataFlex: A Unified Framework for Data-Centric Dynamic Training of Large Language Models Paper • 2603.26164 • Published Mar 27 • 364
SKILL0: In-Context Agentic Reinforcement Learning for Skill Internalization Paper • 2604.02268 • Published Apr 2 • 101
The Geometric Alignment Tax: Tokenization vs. Continuous Geometry in Scientific Foundation Models Paper • 2604.04155 • Published Apr 5 • 12
Adam's Law: Textual Frequency Law on Large Language Models Paper • 2604.02176 • Published Apr 2 • 503
CARLA-Air: Fly Drones Inside a CARLA World -- A Unified Infrastructure for Air-Ground Embodied Intelligence Paper • 2603.28032 • Published Mar 30 • 342
MOOZY: A Patient-First Foundation Model for Computational Pathology Paper • 2603.27048 • Published Mar 27 • 6