MindGPT-4ov: An Enhanced MLLM via a Multi-Stage Post-Training Paradigm Paper β’ 2512.02895 β’ Published Dec 2, 2025 β’ 5
Breaking the Exploration Bottleneck: Rubric-Scaffolded Reinforcement Learning for General LLM Reasoning Paper β’ 2508.16949 β’ Published Aug 23, 2025 β’ 24