Audio-Omni: Extending Multi-modal Understanding to Versatile Audio Generation and Editing Paper • 2604.10708 • Published Apr 12 • 44
view article Article SmolVLM - small yet mighty Vision Language Model +3 andito, merve, mfarre, eliebak, pcuenq • Nov 26, 2024 • 418
view article Article Cosmopedia: how to create large-scale synthetic data for pre-training Large Language Models +1 loubnabnl, anton-l, davanstrien • Mar 20, 2024 • 114