-
E-GRPO: High Entropy Steps Drive Effective Reinforcement Learning for Flow Models
Paper • 2601.00423 • Published • 8 -
GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization
Paper • 2601.05242 • Published • 190 -
Motion Attribution for Video Generation
Paper • 2601.08828 • Published • 65
Jongmin Kim
jmkim0309
AI & ML interests
None yet
Recent Activity
liked
a model
about 10 hours ago
kakaocorp/kanana-2-30b-a3b-thinking-2601
updated
a collection
2 days ago
paper_seminar_260121
updated
a collection
4 days ago
paper_seminar_260121
Organizations
None yet