Zhuoran Jin

jinzhuoran

·

jinzhuoran

AI & ML interests

NLP

Recent Activity

authored a paper 1 day ago

Look Light, Think Heavy: What Multimodal Chain-of-Thought Reasoning Can and Cannot Do

authored a paper 1 day ago

Why Multi-Step Tool-Use Reinforcement Learning Collapses and How Supervisory Signals Fix It

upvoted a paper 4 days ago

Critique of Agent Model

View all activity

Organizations

None yet

Collections 1

Papers 18

arxiv:2606.26027

arxiv:2606.22565

arxiv:2606.12191

arxiv:2603.11896

spaces 1

RWKU

models 4

jinzhuoran/Qwen3-4B-Instruct-16Env

4B • Updated Jan 22 • 1

jinzhuoran/Qwen3-4B-Instruct-32Env

4B • Updated Jan 21 • 1

jinzhuoran/OmniRewardModel

Any-to-Any • 8B • Updated Oct 29, 2025 • 11 • 5

jinzhuoran/OmniRewardModel2

8B • Updated May 19, 2025 • 4

datasets 45

jinzhuoran/MMR-VIP

Viewer • Updated Nov 24, 2025 • 1.68k • 11 • 3

jinzhuoran/OmniRewardData

Viewer • Updated Oct 29, 2025 • 778k • 156 • 2

jinzhuoran/AOKVQA-200

Viewer • Updated May 5, 2025 • 200 • 10

jinzhuoran/MMMU_Pro-200

Viewer • Updated Apr 30, 2025 • 200 • 7

jinzhuoran/MathVista-200

Viewer • Updated Apr 29, 2025 • 200 • 7

jinzhuoran/MathVerse-400

Viewer • Updated Apr 25, 2025 • 400 • 8

jinzhuoran/VisualSimpleQA-200

Viewer • Updated Apr 25, 2025 • 200 • 7

jinzhuoran/MMMU-500

Viewer • Updated Apr 25, 2025 • 500 • 10

jinzhuoran/MathVision-200

Viewer • Updated Apr 25, 2025 • 200 • 11

jinzhuoran/MathVerse-200

Viewer • Updated Apr 25, 2025 • 200 • 9

View 45 datasets