Open to Collab

Yucheng Wang

Echoandland

AI & ML interests

None yet

Recent Activity

upvoted 2 papers 4 days ago

Rewarding the Rare: Uniqueness-Aware RL for Creative Problem Solving in LLMs

Paper • 2601.08763 • Published 6 days ago • 130

Collaborative Multi-Agent Test-Time Reinforcement Learning for Reasoning

Paper • 2601.09667 • Published 5 days ago • 76

updated 2 models 25 days ago

Echoandland/olmo3-7b-physics-grpo-purerl-step9

Reinforcement Learning • 7B • Updated 25 days ago • 6

Echoandland/olmo3-7b-physics-grpo-purerl-step7

Reinforcement Learning • 7B • Updated 25 days ago • 9

published 2 models 25 days ago

Echoandland/olmo3-7b-physics-grpo-purerl-step7

Reinforcement Learning • 7B • Updated 25 days ago • 9

Echoandland/olmo3-7b-physics-grpo-purerl-step9

Reinforcement Learning • 7B • Updated 25 days ago • 6

updated a model 27 days ago

Echoandland/qwen3-8b-dapo-high-entropy-step2

Reinforcement Learning • 8B • Updated 27 days ago • 10

published a model 27 days ago

Echoandland/qwen3-8b-dapo-high-entropy-step2

Reinforcement Learning • 8B • Updated 27 days ago • 10

updated a model 27 days ago

Echoandland/qwen3-8b-dapo-high-entropy-step8

Reinforcement Learning • 8B • Updated 27 days ago • 22

published a model 27 days ago

Echoandland/qwen3-8b-dapo-high-entropy-step8

Reinforcement Learning • 8B • Updated 27 days ago • 22

updated a model 28 days ago

Echoandland/olmo3-7b-grpo-weighted-mul-creativity-step6

Reinforcement Learning • 7B • Updated 28 days ago • 16

published a model 28 days ago

Echoandland/olmo3-7b-grpo-weighted-mul-creativity-step6

Reinforcement Learning • 7B • Updated 28 days ago • 16

updated a model 28 days ago

Echoandland/olmo3-7b-grpo-weighted-mul-creativity-step7

Reinforcement Learning • 7B • Updated 28 days ago • 11

published a model 28 days ago

Echoandland/olmo3-7b-grpo-weighted-mul-creativity-step7

Reinforcement Learning • 7B • Updated 28 days ago • 11

updated a model 28 days ago

Echoandland/olmo3-7b-grpo-purerl-creativity-step28

Reinforcement Learning • 7B • Updated 28 days ago • 13

published a model 28 days ago

Echoandland/olmo3-7b-grpo-purerl-creativity-step28

Reinforcement Learning • 7B • Updated 28 days ago • 13

updated a model 28 days ago

Echoandland/olmo3-7b-grpo-purerl-creativity-step5

Reinforcement Learning • 7B • Updated 28 days ago • 14

published a model 28 days ago

Echoandland/olmo3-7b-grpo-purerl-creativity-step5

Reinforcement Learning • 7B • Updated 28 days ago • 14

updated a model 28 days ago

Echoandland/qwen3-8b-grpo-purerl-creativity-step21

Reinforcement Learning • 8B • Updated 28 days ago • 11

published a model 28 days ago

Echoandland/qwen3-8b-grpo-purerl-creativity-step21

Reinforcement Learning • 8B • Updated 28 days ago • 11
