- ARM-Thinker: Reinforcing Multimodal Generative Reward Models with Agentic Tool Use and Visual Reasoning • arXiv:2512.05111 • Dec 2025
- MM-ACT: Learn from Multimodal Parallel Generation to Act • arXiv:2512.00975 • Dec 2025
- InternVLA-M1: A Spatially Guided Vision-Language-Action Framework for Generalist Robot Policy • arXiv:2510.13778 • Oct 15, 2025
- InternSVG: Towards Unified SVG Tasks with Multimodal Large Language Models • arXiv:2510.11341 • Oct 13, 2025
- MCPMark: A Benchmark for Stress-Testing Realistic and Comprehensive MCP Use • arXiv:2509.24002 • Sep 28, 2025
- F1: A Vision-Language-Action Model Bridging Understanding and Generation to Actions • arXiv:2509.06951 • Sep 8, 2025
- The Imitation Game: Turing Machine Imitator is Length Generalizable Reasoner • arXiv:2507.13332 • Jul 17, 2025
- SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics • arXiv:2506.01844 • Jun 2, 2025
- MM-Eureka: Exploring Visual Aha Moment with Rule-based Large-scale Reinforcement Learning • arXiv:2503.07365 • Mar 10, 2025
- OmniAlign-V: Towards Enhanced Alignment of MLLMs with Human Preference • arXiv:2502.18411 • Feb 25, 2025