ylacombe/expresso
Viewer โข Updated โข 11.6k โข 907 โข 90
How to use Tachyeon/orpheus-3b-sft-lora with PEFT:
from peft import PeftModel
from transformers import AutoModelForCausalLM
base_model = AutoModelForCausalLM.from_pretrained("canopylabs/orpheus-3b-0.1-ft")
model = PeftModel.from_pretrained(base_model, "Tachyeon/orpheus-3b-sft-lora")LoRA adapter trained via supervised fine-tuning on Orpheus 3B using the Expresso conversational speech dataset.
This SFT approach produced a model that closely mimics the Expresso speaker's voice characteristics but showed slight regression in UTMOS naturalness compared to the base Orpheus model. This led us to explore GRPO-based optimization as an alternative โ see orpheus-3b-conversational-grpo.
The lesson: Orpheus was pretrained on ~100k hours of diverse speech. Supervised fine-tuning on a small single-speaker dataset can overfit to that speaker's patterns at the cost of the model's general conversational ability.
from peft import PeftModel
from transformers import AutoModelForCausalLM
base = AutoModelForCausalLM.from_pretrained("canopylabs/orpheus-3b-0.1-ft")
model = PeftModel.from_pretrained(base, "Tachyeon/orpheus-3b-sft-lora")
model = model.merge_and_unload()
Base model
meta-llama/Llama-3.2-3B-Instruct