ArenaRL: Scaling RL for Open-Ended Agents via Tournament-based Relative Ranking • Paper • 2601.06487 • Published 5 days ago • 38
Incentivizing Tool-augmented Thinking with Images for Medical Image Analysis • Paper • 2512.14157 • Published 30 days ago • 10
Enhancing Step-by-Step and Verifiable Medical Reasoning in MLLMs • Paper • 2506.16962 • Published Jun 20, 2025 • 10