Evaluating Gemini Robotics Policies in a Veo World Simulator Paper β’ 2512.10675 β’ Published 19 days ago β’ 16
Dyna-Mind: Learning to Simulate from Experience for Better AI Agents Paper β’ 2510.09577 β’ Published Oct 10 β’ 7
SIMA 2: A Generalist Embodied Agent for Virtual Worlds Paper β’ 2512.04797 β’ Published 26 days ago β’ 24
Agent0: Unleashing Self-Evolving Agents from Zero Data via Tool-Integrated Reasoning Paper β’ 2511.16043 β’ Published Nov 20 β’ 108
LLMs Can't Handle Peer Pressure: Crumbling under Multi-Agent Social Interactions Paper β’ 2508.18321 β’ Published Aug 24 β’ 2
Running on CPU Upgrade Featured 2.74k The Smol Training Playbook π 2.74k The secrets to building world-class LLMs
Demystifying deep search: a holistic evaluation with hint-free multi-hop questions and factorised metrics Paper β’ 2510.05137 β’ Published Oct 1 β’ 5
WebWatcher: Breaking New Frontier of Vision-Language Deep Research Agent Paper β’ 2508.05748 β’ Published Aug 7 β’ 141
view article Article π¦Έπ»#1: Open-endedness and AI Agents β A Path from Generative to Creative AI? Dec 25, 2024 β’ 16
Generate, but Verify: Reducing Hallucination in Vision-Language Models with Retrospective Resampling Paper β’ 2504.13169 β’ Published Apr 17 β’ 39