Behavior Uncloning โ€” GA (step 20)

VLA unlearning checkpoint: pi0.5 model with GA unlearning applied.

Results

Metric Value
Method GA
Training Steps 20
Forget Task "turn on the stove" (LIBERO-Goal T6)
Forget SR 0% (baseline: 100%)
Retain SR 44.4% (baseline: 97.8%)
HM 0.61

Usage

# Serve with openpi
uv run scripts/serve_policy.py --env LIBERO policy:checkpoint \
    --policy.config pi05_libero --policy.dir <path_to_checkpoint>

Method

Gradient Ascent: L = -L_flow(D_forget). Maximizes flow matching loss on forget data.

Base model: pi0.5 LIBERO

See full report: experiment_report.md

Downloads last month

-

Downloads are not tracked for this model. How to track
Video Preview
loading