Apollo Deception Probes Datasets - a scale-safety-research Collection

Models
Datasets
Spaces
Buckets new
Docs
Enterprise
Pricing
Log In
Sign Up

scale-safety-research 's Collections

Open Source RM Sycophancy

Alignment Faking Datasets

Gemma 2 9b Emergent Misalignment

Apollo Deception Probes Datasets

Helpful-Only Synthetic Documents

Apollo Deception Probes Datasets

updated Mar 2

scale-safety-research/roleplaying

Viewer • Updated Mar 18, 2025 • 742 • 5
scale-safety-research/insider_trading

Viewer • Updated Mar 18, 2025 • 1.01k • 15 • 3

Collection guide
Browse collections

Company

TOS Privacy About Careers

Website

Models Datasets Spaces Pricing Docs