Collections

Discover the best community collections!

Collections including paper arxiv:2503.19903
Multimodal LLM
Collection by
5 days ago
PS3: Scaling Vision Pre-Training to 4K Resolution
Enabling 4k resolution for VLMs, CVPR 2025, https://nvlabs.github.io/PS3/
readings
Collection by
about 5 hours ago
General Multimodal Learning
Collection by
Jul 18, 2025
Multimodal Pre-training
Exploring pre-training paradigms of large models across modalities towards Artificial General Intelligence (AGI).
Multimodal Language Model
What does matter besides data receipt when training a Multimodal language model?
Multimodal LLM
Collection by
5 days ago
Multimodal Pre-training
Exploring pre-training paradigms of large models across modalities towards Artificial General Intelligence (AGI).
PS3: Scaling Vision Pre-Training to 4K Resolution
Enabling 4k resolution for VLMs, CVPR 2025, https://nvlabs.github.io/PS3/
readings
Collection by
about 5 hours ago
Multimodal Language Model
What does matter besides data receipt when training a Multimodal language model?
General Multimodal Learning
Collection by
Jul 18, 2025