Arabic Speech Datasets Collection Best Datasets for Arabic Speech Tasks • 16 items • Updated 3 days ago • 14
MoshiVis v0.1 Collection MoshiVis is a Vision Speech Model built as a perceptually-augmented version of Moshi v0.1 for conversing about image inputs • 9 items • Updated 11 days ago • 23
view article Article How to Choose the Best Open Source LLM for Your Project in 2025 Sep 9, 2025 • 74
view article Article Welcome GPT OSS, the new open-source model family from OpenAI! +10 Aug 5, 2025 • 508
Loopy: Taming Audio-Driven Portrait Avatar with Long-Term Motion Dependency Paper • 2409.02634 • Published Sep 4, 2024 • 97
view article Article PaliGemma – Google's Cutting-Edge Open Vision Language Model +1 May 14, 2024 • 278