Omni-Reward: Towards Generalist Omni-Modal Reward Modeling with Free-Form Preferences
Zhuoran Jin
jinzhuoran
AI & ML interests
NLP
Recent Activity
authored a paper 1 day ago
Look Light, Think Heavy: What Multimodal Chain-of-Thought Reasoning Can and Cannot Do authored a paper 1 day ago
Why Multi-Step Tool-Use Reinforcement Learning Collapses and How Supervisory Signals Fix It upvoted a paper 4 days ago
Critique of Agent ModelOrganizations
None yet