Cached layer activations for steering vector experiments
Abdullah
amirali1985
AI & ML interests
Mechanistic interpretability, high dimensional geometry, persona role playing.
Recent Activity
updated a model about 1 hour ago
stride-influence/stride-applications-models updated a dataset about 5 hours ago
curveball-steering/conversations_wealth_seeking_llama3.2-1B-it_large updated a dataset about 5 hours ago
curveball-steering/conversations_sycophancy_llama3.2-1B-it_large