MAmmoTH: Building Math Generalist Models through Hybrid Instruction Tuning Paper • 2309.05653 • Published Sep 11, 2023 • 11
2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining Paper • 2501.00958 • Published Jan 1, 2025 • 110
BirdSet: A Multi-Task Benchmark for Classification in Avian Bioacoustics Paper • 2403.10380 • Published Mar 15, 2024 • 4
VersaViT: Enhancing MLLM Vision Backbones via Task-Guided Optimization Paper • 2602.09934 • Published Feb 10 • 1
Rethinking the Inception Architecture for Computer Vision Paper • 1512.00567 • Published Dec 2, 2015 • 1
SWE-rebench-V2 Collection SWE-rebench-V2 is a curated dataset of software-engineering tasks derived from real GitHub issues and pull requests. • 3 items • Updated 27 days ago • 8
OpenVLA: An Open-Source Vision-Language-Action Model Paper • 2406.09246 • Published Jun 13, 2024 • 46
GRM2 Collection Powerful reasoning-focused models for general reasoning and agentic tasks. • 2 items • Updated about 17 hours ago • 2
MatAnyone 2: Scaling Video Matting via a Learned Quality Evaluator Paper • 2512.11782 • Published Dec 12, 2025 • 3
Alpamayo Collection A collection related to the Alpamayo ecosystem, containing Reasoning VLA models, Physical AI data, simulation frameworks, training utilities, and more • 4 items • Updated 6 days ago • 13
Puzzle: Distillation-Based NAS for Inference-Optimized LLMs Paper • 2411.19146 • Published Nov 28, 2024 • 20
Extending Puzzle for Mixture-of-Experts Reasoning Models with Application to GPT-OSS Acceleration Paper • 2602.11937 • Published Feb 12 • 3
UI-Vision: A Desktop-centric GUI Benchmark for Visual Perception and Interaction Paper • 2503.15661 • Published Mar 19, 2025 • 3
Thinking in Space: How Multimodal Large Language Models See, Remember, and Recall Spaces Paper • 2412.14171 • Published Dec 18, 2024 • 25
Benchmark Designers Should "Train on the Test Set" to Expose Exploitable Non-Visual Shortcuts Paper • 2511.04655 • Published Nov 6, 2025 • 10
Libriheavy: a 50,000 hours ASR corpus with punctuation casing and context Paper • 2309.08105 • Published Sep 15, 2023 • 1