Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
shobbs 's Collections
Mobile use aka smart phone actions dataset
papers
storytime
embed RAG
think and learn
small and fast
NSFW
bio
vision
video llm llava
image art
arm

vision

updated 22 days ago
Upvote
-

  • google/paligemma2-28b-pt-896

    Image-Text-to-Text • 28B • Updated Dec 5, 2024 • 278 • 51

  • lmstudio-community/olmOCR-7B-0225-preview-GGUF

    Image-Text-to-Text • 8B • Updated Feb 25, 2025 • 220 • 12

  • vidore/colqwen2.5-v0.2

    Visual Document Retrieval • Updated Jun 16, 2025 • 16.7k • 93

  • vidore/colpali-v1.3

    Visual Document Retrieval • Updated Mar 14, 2025 • 31.7k • 84

  • vidore/colSmol-500M

    Visual Document Retrieval • Updated Mar 14, 2025 • 1.54k • 21

  • deepseek-ai/deepseek-vl2

    Image-Text-to-Text • 27B • Updated Dec 18, 2024 • 12.9k • 377

  • Sleeping
    5

    gen2seg: Generative Models Enable Generalizable Instance Segmentation

    🚀
    5

    A demo of our gen2seg SD and MAE-H models.


  • nvidia/NitroGen

    Robotics • Updated 8 days ago • 478

  • naver-hyperclovax/HyperCLOVAX-SEED-Omni-8B

    Text Generation • 11B • Updated 15 days ago • 4.91k • 180
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs