Audio - a OpenTSLab Collection

OpenTSLab 's Collections

Audio

Scientific Time Series

Audio

updated Nov 12, 2025

rookie9/PicoAudio2

Updated Sep 29, 2025 • 12
Running on Zero

4

PicoAudio2

🐨

4

Online inference for PicoAudio2
PicoAudio2: Temporal Controllable Text-to-Audio Generation with Natural Language Description

Paper • 2509.00683 • Published Aug 31, 2025
wsntxxn/UniFlow-Audio-large

0.8B • Updated Dec 1, 2025 • 21
wsntxxn/UniFlow-Audio-medium

0.4B • Updated Dec 1, 2025 • 4
wsntxxn/UniFlow-Audio-small

0.2B • Updated Dec 1, 2025 • 7
Running on Zero

4

UniFlow-Audio

👁

4

Generate audio from omni-modalities in a single model.
UniFlow-Audio: Unified Flow Matching for Audio Generation from Omni-Modalities

Paper • 2509.24391 • Published Sep 29, 2025
Bayesian Speech synthesizers Can Learn from Multiple Teachers

Paper • 2510.24372 • Published Oct 28, 2025
marcoyang/spear-base-speech

93.3M • Updated Nov 3, 2025 • 6
marcoyang/spear-base-speech-audio

93.3M • Updated Nov 3, 2025 • 42
marcoyang/spear-large-speech

0.3B • Updated Nov 3, 2025 • 6
marcoyang/spear-large-speech-audio

0.3B • Updated Nov 3, 2025 • 59
marcoyang/spear-xlarge-speech-audio

0.6B • Updated Nov 3, 2025 • 5.86k • 1
SPEAR: A Unified SSL Framework for Learning Speech and Audio Representations

Paper • 2510.25955 • Published Oct 29, 2025