FastVLM: Efficient Vision Encoding for Vision Language Models
Paper • 2412.13303 • Published • 75
Efficient Vision Encoding for Vision Language Models
Real-time video captioning powered by FastVLM
Note MLX checkpoint
Note MLX checkpoint
Note MLX checkpoint