HopChain: Multi-Hop Data Synthesis for Generalizable Vision-Language Reasoning Paper • 2603.17024 • Published 5 days ago • 46
MetaphorStar: Image Metaphor Understanding and Reasoning with End-to-End Visual Reinforcement Learning Paper • 2602.10575 • Published Feb 11 • 4
Scaling Text-to-Image Diffusion Transformers with Representation Autoencoders Paper • 2601.16208 • Published Jan 22 • 55
Image Implication (Metaphor) Series Works Collection 1.II-Bench 2.CII-Bench 3.Let Andriods Dream(LAD) • 6 items • Updated about 1 month ago • 1
CPsyCoun Collection CPsyCoun: A Report-based Multi-turn Dialogue Reconstruction and Evaluation Framework for Chinese Psychological Counseling • 3 items • Updated Jan 18 • 1
MetaphorStar Collection MetaphorStar: Image Metaphor Understanding and Reasoning with End-to-End Visual Reinforcement Learning • 8 items • Updated Feb 13 • 2
Diffusion Transformers with Representation Autoencoders Paper • 2510.11690 • Published Oct 13, 2025 • 170
Gemini 2.5: Pushing the Frontier with Advanced Reasoning, Multimodality, Long Context, and Next Generation Agentic Capabilities Paper • 2507.06261 • Published Jul 7, 2025 • 67
Let Androids Dream of Electric Sheep: A Human-like Image Implication Understanding and Reasoning Framework Paper • 2505.17019 • Published May 22, 2025 • 4
Token Assorted: Mixing Latent and Text Tokens for Improved Language Model Reasoning Paper • 2502.03275 • Published Feb 5, 2025 • 18
SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training Paper • 2501.17161 • Published Jan 28, 2025 • 125
Kimi k1.5: Scaling Reinforcement Learning with LLMs Paper • 2501.12599 • Published Jan 22, 2025 • 128
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper • 2501.12948 • Published Jan 22, 2025 • 443
Can MLLMs Understand the Deep Implication Behind Chinese Images? Paper • 2410.13854 • Published Oct 17, 2024 • 12