zhanghang

hangzhang-nlp

AI & ML interests

None yet

Recent Activity

upvoted a paper 14 days ago

Qwen3-VL Technical Report

Paper • 2511.21631 • Published 21 days ago • 126

liked 13 models about 2 months ago

Qwen/Qwen3-VL-2B-Thinking

Image-Text-to-Text • 2B • Updated Oct 20 • 41k • 91

Qwen/Qwen3-VL-2B-Instruct

Image-Text-to-Text • 2B • Updated Oct 23 • 556k • 234

Qwen/Qwen3-VL-4B-Instruct

Image-Text-to-Text • 4B • Updated Oct 15 • 837k • 270

Qwen/Qwen3-VL-4B-Thinking

Image-Text-to-Text • 4B • Updated Oct 15 • 51.6k • 89

Qwen/Qwen3-VL-8B-Instruct

Image-Text-to-Text • 9B • Updated Oct 15 • 2.69M • 557

Qwen/Qwen3-VL-8B-Thinking

Image-Text-to-Text • 9B • Updated 21 days ago • 183k • 155

Qwen/Qwen3-VL-30B-A3B-Instruct-FP8

Image-Text-to-Text • 31B • Updated 21 days ago • 128k • 91

Qwen/Qwen3-VL-30B-A3B-Instruct

Image-Text-to-Text • 31B • Updated 21 days ago • 1.38M • 446

Qwen/Qwen3-VL-30B-A3B-Thinking

Image-Text-to-Text • 31B • Updated 21 days ago • 55.7k • 165

Qwen/Qwen3-VL-235B-A22B-Instruct-FP8

Image-Text-to-Text • 236B • Updated 21 days ago • 319k • 32

Qwen/Qwen3-VL-235B-A22B-Thinking-FP8

Image-Text-to-Text • 236B • Updated 21 days ago • 8.46k • 24

Qwen/Qwen3-VL-235B-A22B-Instruct

Image-Text-to-Text • 236B • Updated 21 days ago • 156k • 335

Qwen/Qwen3-VL-235B-A22B-Thinking

Image-Text-to-Text • 236B • Updated 21 days ago • 10k • 346

liked a Space 6 months ago

VideoRefer x VideoLLaMA3

upvoted a paper 6 months ago

Lingshu: A Generalist Foundation Model for Unified Multimodal Medical Understanding and Reasoning

Paper • 2506.07044 • Published Jun 8 • 114

upvoted a paper 8 months ago

InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

Paper • 2504.10479 • Published Apr 14 • 304

upvoted a paper 9 months ago

VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs

Paper • 2406.07476 • Published Jun 11, 2024 • 37

upvoted 2 papers 10 months ago

Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published Feb 19 • 211

LongPO: Long Context Self-Evolution of Large Language Models through Short-to-Long Preference Optimization

Paper • 2502.13922 • Published Feb 19 • 28