Article Introducing Daggr: Chain apps programmatically, inspect visually +3 9 days ago • 86
MAI-UI Technical Report: Real-World Centric Foundation GUI Agents Paper • 2512.22047 • Published Dec 26, 2025 • 29
InsertAnywhere: Bridging 4D Scene Geometry and Diffusion Models for Realistic Video Object Insertion Paper • 2512.17504 • Published Dec 19, 2025 • 97
Cosmos-Reason1 Collection ⚠️ The latest version of Cosmos Reason is now live! 👉 https://huggingface.co/collections/nvidia/cosmos-reason2 • 8 items • Updated 1 day ago • 40
Article SmolVLA: Efficient Vision-Language-Action Model trained on Lerobot Community Data +7 Jun 3, 2025 • 319
Article LeRobot Community Datasets: The “ImageNet” of Robotics — When and How? +5 May 11, 2025 • 92
Article π0 and π0-FAST: Vision-Language-Action Models for General Robot Control +2 Feb 4, 2025 • 189
Android in the Wild: A Large-Scale Dataset for Android Device Control Paper • 2307.10088 • Published Jul 19, 2023 • 11
OpenMask3D: Open-Vocabulary 3D Instance Segmentation Paper • 2306.13631 • Published Jun 23, 2023 • 10