File size: 1,630 Bytes
a52e073
 
 
 
 
 
 
 
 
 
 
 
 
7a421a5
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
---
title: MuseControlLite
emoji: πŸ‘
colorFrom: red
colorTo: blue
sdk: gradio
sdk_version: 6.1.0
app_file: app.py
pinned: false
license: mit
short_description: Inference for Stable-Audio-Open with more controls
---

## MuseControlLite (Space)

Gradio UI for MuseControlLite adapters on top of `stabilityai/stable-audio-open-1.0`.

### Requirements
- **GPU Space** is required for generation (fp16 by default).
- A Hugging Face token with access to `stabilityai/stable-audio-open-1.0` (set as a Space secret, e.g., `HF_TOKEN`).

### What happens on startup
1) Installs Python deps from `requirements.txt` (includes `gradio`, `gdown`, `diffusers` fork).  
2) Downloads MuseControlLite checkpoints with  
   `gdown 1Q9B333jcq1czA11JKTbM-DHANJ8YqGbP --folder`  
   into `checkpoints/` if they are missing.

### Using the Space
1) Provide a text prompt.  
2) Upload a 47.5s (or longer) audio file when using MuseControlLite conditions.  
3) Select condition types (`melody_stereo`, `melody_mono`, `dynamics`, `rhythm`, `audio`) and adjust guidance/scales if needed.  
4) Click **Generate**. Output is a single 47.5s WAV plus a short status summary.

### Tips
- `melody_stereo` cannot be combined with `dynamics`, `rhythm`, or `melody_mono`.  
- For audio in/out-painting, use the audio condition with the masking sliders.  
- Default examples are preloaded in the UI for quick tests.

### Local run (optional)
```bash
pip install -r requirements.txt
gdown 1Q9B333jcq1czA11JKTbM-DHANJ8YqGbP --folder
huggingface-cli login
python app.py
```

### Acknowledgments
- Original repository: https://github.com/fundwotsai2001/MuseControlLite