video/image - a dbest111 Collection

Models
Datasets
Spaces
Buckets new
Docs
Enterprise
Pricing
Log In
Sign Up

dbest111 's Collections

video/image

updated Jul 24, 2025

google/vit-base-patch16-224

Image Classification • 86.6M • Updated Sep 5, 2023 • 4.79M • • 957
OpenGVLab/internimage_g_jointto22k_384

Image Classification • 3B • Updated Mar 25, 2025 • 12 • 1
chancharikm/qwen2.5-vl-72b-cam-motion

Video-Text-to-Text • 73B • Updated Sep 19, 2025 • 94 • 1
lmms-lab/Aero-1-Audio

Text Generation • 2B • Updated Jun 7, 2025 • 845 • 90
mipal/AVATAR

Updated Nov 3, 2025 • 44 • 1
zl2048/FAVOR

Viewer • Updated Aug 1, 2025 • 27.1k • 1k • 2
lmms-lab/VideoMMMU

Viewer • Updated May 5, 2025 • 900 • 3.31k • 13
moonshotai/Kimi-VL-A3B-Thinking-2506

Image-Text-to-Text • 16B • Updated Jan 30 • 6.35k • 357
lmms-lab/llava-critic-113k

Viewer • Updated Oct 5, 2024 • 113k • 403 • 28
lmms-lab/M4-Instruct-Data

Updated Jul 21, 2024 • 1.47k • 78
lmms-lab/llava-next-interleave-qwen-7b

Text Generation • 8B • Updated Jul 24, 2024 • 297 • 27
lmms-lab/LLaVA-OneVision-Data

Viewer • Updated May 24, 2025 • 3.94M • 19.8k • 235
avalab/syndicom

Viewer • Updated May 10, 2024 • 19.2k • 24
avalab/iTBLS

Viewer • Updated Jan 17, 2025 • 12.5k • 27
Multimodal Attention Merging for Improved Speech Recognition and Audio Event Classification

Paper • 2312.14378 • Published Dec 22, 2023
avalab/cTBLS_knowledge_retriever

Updated Jan 12, 2024
avalab/cTBLS_encoder

Updated Apr 27, 2023
CraftJarvis/minecraft-vla-sft

Viewer • Updated Mar 21, 2025 • 3.78M • 345 • 10

Collection guide
Browse collections

Company

TOS Privacy About Careers

Website

Models Datasets Spaces Pricing Docs