vision - a shobbs Collection

vision

updated 22 days ago

google/paligemma2-28b-pt-896

Image-Text-to-Text • 28B • Updated Dec 5, 2024 • 278 • 51
lmstudio-community/olmOCR-7B-0225-preview-GGUF

Image-Text-to-Text • 8B • Updated Feb 25, 2025 • 220 • 12
vidore/colqwen2.5-v0.2

Visual Document Retrieval • Updated Jun 16, 2025 • 16.7k • 93
vidore/colpali-v1.3

Visual Document Retrieval • Updated Mar 14, 2025 • 31.7k • 84
vidore/colSmol-500M

Visual Document Retrieval • Updated Mar 14, 2025 • 1.54k • 21
deepseek-ai/deepseek-vl2

Image-Text-to-Text • 27B • Updated Dec 18, 2024 • 12.9k • 377
gen2seg: Generative Models Enable Generalizable Instance Segmentation

Sleeping • 5 • A demo of our gen2seg SD and MAE-H models.
nvidia/NitroGen

Robotics • Updated 8 days ago • 478
naver-hyperclovax/HyperCLOVAX-SEED-Omni-8B

Text Generation • 11B • Updated 15 days ago • 4.91k • 180