dodojorid

yizhuoli

12 3

Dodojordi

AI & ML interests

None yet

Recent Activity

upvoted a paper 3 days ago

The Verification Horizon: No Silver Bullet for Coding Agent Rewards

published a dataset 7 days ago

yizhuoli/mllm

updated a dataset 7 days ago

yizhuoli/mllm

View all activity

Organizations

None yet

upvoted a paper 3 days ago

The Verification Horizon: No Silver Bullet for Coding Agent Rewards

Paper • 2606.26300 • Published 7 days ago • 46

published a dataset 7 days ago

yizhuoli/mllm

Updated 7 days ago • 32

updated a dataset 7 days ago

yizhuoli/mllm

Updated 7 days ago • 32

upvoted 2 papers 16 days ago

SetCon: Towards Open-Ended Referring Segmentation via Set-Level Concept Prediction

Paper • 2605.20110 • Published May 19 • 4

ComBench: A Benchmark for Rigorous Proof Reasoning and Constructive Realization in Olympiad-Level Combinatorics

Paper • 2606.10479 • Published 22 days ago • 19

upvoted a paper 23 days ago

SubtleMemory: A Benchmark for Fine-Grained Relational Memory Discrimination in Long-Horizon AI Agents

Paper • 2606.05761 • Published 27 days ago • 19

upvoted a paper 29 days ago

Draft-OPD: On-Policy Distillation for Speculative Draft Models

Paper • 2605.29343 • Published May 28 • 36

upvoted 3 papers about 1 month ago

π-Bench: Evaluating Proactive Personal Assistant Agents in Long-Horizon Workflows

Paper • 2605.14678 • Published May 19 • 108

Teaching Thinking Models to Reason with Tools: A Full-Pipeline Recipe for Tool-Integrated Reasoning

Paper • 2605.06326 • Published May 7 • 26

P1: Mastering Physics Olympiads with Reinforcement Learning

Paper • 2511.13612 • Published Nov 17, 2025 • 135

authored 3 papers about 1 month ago

liked a model about 1 month ago

rookiexiong/SetCon-8B

Image Segmentation • Updated May 20 • 8 • 4

liked a dataset about 1 month ago

rookiexiong/setcon_training_datasets

Preview • Updated 15 days ago • 95 • 3

published a model about 2 months ago

yizhuoli/qwen3-8b-base

Updated May 16

liked a model about 2 months ago

Simplified-Reasoning/SU-01

Text Generation • 31B • Updated May 20 • 295 • 27

upvoted a paper about 2 months ago

Achieving Gold-Medal-Level Olympiad Reasoning via Simple and Unified Scaling

Paper • 2605.13301 • Published May 13 • 165

upvoted 2 papers 2 months ago

TEMPO: Scaling Test-time Training for Large Reasoning Models

Paper • 2604.19295 • Published Apr 21 • 35

Think Longer to Explore Deeper: Learn to Explore In-Context via Length-Incentivized Reinforcement Learning

Paper • 2602.11748 • Published Feb 12 • 38

dodojorid

AI & ML interests

Recent Activity

Organizations

yizhuoli's activity