AI & ML interests

AI Safety

Recent Activity

sarenne-aisi  updated a collection 3 days ago
RealityTest benchmark
sarenne-aisi  updated a collection 3 days ago
RealityTest benchmark
alancooneydsit  updated a collection 20 days ago
Lie Confession
View all activity

ai-safety-institute 's collections 11

Lie Detection Model Organisms Merged
Merged adaptors into base model
Lie Detection
Datasets, model organisms and trained probes for lie detection research. Paper: Did you lie? Evaluating Lie Detection in Language Models
Lie Detection
Datasets, model organisms and trained probes for lie detection research. Paper: Did you lie? Evaluating Lie Detection in Language Models
Lie Detection Model Organisms Merged
Merged adaptors into base model