MuseControlLite / README.md
manoskary's picture
Add audio utilities and track sample audio with LFS
7a421a5

A newer version of the Gradio SDK is available: 6.2.0

Upgrade
metadata
title: MuseControlLite
emoji: πŸ‘
colorFrom: red
colorTo: blue
sdk: gradio
sdk_version: 6.1.0
app_file: app.py
pinned: false
license: mit
short_description: Inference for Stable-Audio-Open with more controls

MuseControlLite (Space)

Gradio UI for MuseControlLite adapters on top of stabilityai/stable-audio-open-1.0.

Requirements

  • GPU Space is required for generation (fp16 by default).
  • A Hugging Face token with access to stabilityai/stable-audio-open-1.0 (set as a Space secret, e.g., HF_TOKEN).

What happens on startup

  1. Installs Python deps from requirements.txt (includes gradio, gdown, diffusers fork).
  2. Downloads MuseControlLite checkpoints with
    gdown 1Q9B333jcq1czA11JKTbM-DHANJ8YqGbP --folder
    into checkpoints/ if they are missing.

Using the Space

  1. Provide a text prompt.
  2. Upload a 47.5s (or longer) audio file when using MuseControlLite conditions.
  3. Select condition types (melody_stereo, melody_mono, dynamics, rhythm, audio) and adjust guidance/scales if needed.
  4. Click Generate. Output is a single 47.5s WAV plus a short status summary.

Tips

  • melody_stereo cannot be combined with dynamics, rhythm, or melody_mono.
  • For audio in/out-painting, use the audio condition with the masking sliders.
  • Default examples are preloaded in the UI for quick tests.

Local run (optional)

pip install -r requirements.txt
gdown 1Q9B333jcq1czA11JKTbM-DHANJ8YqGbP --folder
huggingface-cli login
python app.py

Acknowledgments