Behavior Uncloning โ GA (step 20)
VLA unlearning checkpoint: pi0.5 model with GA unlearning applied.
Results
| Metric | Value |
|---|---|
| Method | GA |
| Training Steps | 20 |
| Forget Task | "turn on the stove" (LIBERO-Goal T6) |
| Forget SR | 0% (baseline: 100%) |
| Retain SR | 44.4% (baseline: 97.8%) |
| HM | 0.61 |
Usage
# Serve with openpi
uv run scripts/serve_policy.py --env LIBERO policy:checkpoint \
--policy.config pi05_libero --policy.dir <path_to_checkpoint>
Method
Gradient Ascent: L = -L_flow(D_forget). Maximizes flow matching loss on forget data.
Base model: pi0.5 LIBERO
See full report: experiment_report.md