Running Featured 152 DINOv3 Web 🦖 152 Visualize rich, dense image features locally in your browser
The Prism Hypothesis: Harmonizing Semantic and Pixel Representations via Unified Autoencoding Paper • 2512.19693 • Published 24 days ago • 63
Next-Embedding Prediction Makes Strong Vision Learners Paper • 2512.16922 • Published 28 days ago • 83
Fast and Accurate Causal Parallel Decoding using Jacobi Forcing Paper • 2512.14681 • Published about 1 month ago • 39
LongVie 2: Multimodal Controllable Ultra-Long Video World Model Paper • 2512.13604 • Published Dec 15, 2025 • 73
sensenova/SenseNova-SI-1.1-Qwen2.5-VL-7B Image-Text-to-Text • 8B • Updated Dec 9, 2025 • 1.27k • 4
sensenova/SenseNova-SI-1.1-Qwen2.5-VL-3B Image-Text-to-Text • 4B • Updated Dec 9, 2025 • 1.3k • 3
sensenova/SenseNova-SI-1.2-InternVL3-8B Image-Text-to-Text • 8B • Updated Dec 10, 2025 • 3.63k • 10
sensenova/SenseNova-SI-1.1-Qwen3-VL-8B Image-Text-to-Text • 9B • Updated Dec 9, 2025 • 1.45k • 5
sensenova/SenseNova-SI-1.2-InternVL3-8B Image-Text-to-Text • 8B • Updated Dec 10, 2025 • 3.63k • 10