Urban Socio-Semantic Segmentation with Vision-Language Reasoning • Paper • 2601.10477 • Published 5 days ago • 151
Taming Hallucinations: Boosting MLLMs' Video Understanding via Counterfactual Video Generation • Paper • 2512.24271 • Published 21 days ago • 59
Thinking with Map: Reinforced Parallel Map-Augmented Agent for Geolocalization • Paper • 2601.05432 • Published 11 days ago • 159