3 29 5

JGC

Nothing2Say

jiangguochaoGG

AI & ML interests

None yet

Recent Activity

upvoted a paper about 2 months ago

The End of Manual Decoding: Towards Truly End-to-End Language Models

upvoted a paper about 2 months ago

The Principles of Diffusion Models

upvoted a paper about 2 months ago

Tongyi DeepResearch Technical Report

View all activity

Organizations

None yet

upvoted 3 papers about 2 months ago

upvoted 2 papers 3 months ago

Training-Free Group Relative Policy Optimization

Paper • 2510.08191 • Published Oct 9 • 44

Tree Search for LLM Agent Reinforcement Learning

Paper • 2509.21240 • Published Sep 25 • 89

authored a paper 3 months ago

VCRL: Variance-based Curriculum Reinforcement Learning for Large Language Models

Paper • 2509.19803 • Published Sep 24 • 120

upvoted a paper 3 months ago

VCRL: Variance-based Curriculum Reinforcement Learning for Large Language Models

Paper • 2509.19803 • Published Sep 24 • 120

commented a paper 3 months ago

VCRL: Variance-based Curriculum Reinforcement Learning for Large Language Models

Paper • 2509.19803 • Published Sep 24 • 120 •

upvoted 2 papers 3 months ago

The Landscape of Agentic Reinforcement Learning for LLMs: A Survey

Paper • 2509.02547 • Published Sep 2 • 225

A Survey of Reinforcement Learning for Large Reasoning Models

Paper • 2509.08827 • Published Sep 10 • 189

upvoted a paper 4 months ago

Why Language Models Hallucinate

Paper • 2509.04664 • Published Sep 4 • 194

commented a paper 4 months ago

PVPO: Pre-Estimated Value-Based Policy Optimization for Agentic Reasoning

Paper • 2508.21104 • Published Aug 28 • 35 •

authored a paper 4 months ago

PVPO: Pre-Estimated Value-Based Policy Optimization for Agentic Reasoning

Paper • 2508.21104 • Published Aug 28 • 35

upvoted 2 papers 4 months ago

PVPO: Pre-Estimated Value-Based Policy Optimization for Agentic Reasoning

Paper • 2508.21104 • Published Aug 28 • 35

On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification

Paper • 2508.05629 • Published Aug 7 • 180

upvoted a paper 5 months ago

A Survey of Self-Evolving Agents: On Path to Artificial Super Intelligence

Paper • 2507.21046 • Published Jul 28 • 82

commented a paper 6 months ago

FlashThink: An Early Exit Method For Efficient Reasoning

Paper • 2505.13949 • Published May 20 • 1 •

upvoted a paper 6 months ago

ARIA: Training Language Agents with Intention-Driven Reward Aggregation

Paper • 2506.00539 • Published May 31 • 30

upvoted 2 papers 7 months ago

Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9 • 263

BookWorld: From Novels to Interactive Agent Societies for Creative Story Generation

Paper • 2504.14538 • Published Apr 20 • 30

JGC

AI & ML interests

Recent Activity

Organizations

Nothing2Say's activity