Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
scale-safety-research
's Collections
Open Source RM Sycophancy
Alignment Faking Datasets
Gemma 2 9b Emergent Misalignment
Apollo Deception Probes Datasets
Helpful-Only Synthetic Documents
Apollo Deception Probes Datasets
updated
Mar 2
Upvote
-
scale-safety-research/roleplaying
Viewer
•
Updated
Mar 18, 2025
•
742
•
5
scale-safety-research/insider_trading
Viewer
•
Updated
Mar 18, 2025
•
1.01k
•
15
•
3
Upvote
-
Share collection
View history
Collection guide
Browse collections