Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

UK AI Safety Institute

Team
https://www.aisi.gov.uk/
AISafetyInst
https://github.com/AI-Safety-Institute/
Activity Feed

AI & ML interests

AI Safety

Recent Activity

7vik-aisi  published a model 12 days ago
ai-safety-institute/em-olmo32b-insecure-seed42-chkpt-1425
7vik-aisi  published a model 12 days ago
ai-safety-institute/cc-olmo32b-vsutl-b0.01-s210
7vik-aisi  published a dataset 12 days ago
ai-safety-institute/reward-hacking-sdf-default
View all activity

Art O Cathain's profile pictureTom Catling's profile pictureWill's profile pictureJake Pencharz's profile pictureAlan Cooney's profile pictureJoe Skinner's profile pictureJess's profile pictureEd Saunders's profile pictureEric Winsor's profile pictureJ W's profile pictureJoseph Bloom's profile pictureRogan Inglis's profile pictureAlexandraSouly's profile pictureAlex Remedios's profile pictureJason Gwartz's profile pictureBen Millwood's profile pictureDishank Bansal's profile pictureIman Syed's profile pictureEkin Zorer's profile pictureJordan Taylor's profile pictureOliver's profile pictureJames Hawkes's profile pictureMario Giulianelli's profile pictureLennart Luettgau's profile pictureGiorgi Giglemiani's profile pictureArathi Mani's profile pictureSam's profile pictureSatvik Golechha's profile pictureRebecca Anselmetti's profile pictureJon hall's profile pictureKevin Wei's profile pictureOlli Järviniemi's profile pictureKeno Juchems's profile pictureGiles Harper-Donnelly's profile pictureAleksandr Bowkis's profile pictureMerlin's profile pictureVy Hong's profile pictureThomas Read's profile pictureBessie O'Dell's profile pictureThanushan's profile pictureDan Lenton's profile pictureStuart Jennings's profile pictureDavid Demitri Africa's profile pictureLuke Symes's profile pictureJames Walpole's profile pictureRoddy McNeill's profile pictureFinlay Wright's profile pictureTolga Hasan Dur's profile picture

ai-safety-institute 's datasets 3

ai-safety-institute/reward-hacking-sdf-default

Viewer • Updated 12 days ago • 68.4k • 12

ai-safety-institute/harmful-advice-dataset

Viewer • Updated Dec 17, 2025 • 3.65k • 69 • 6

ai-safety-institute/AgentHarm

Viewer • Updated Dec 19, 2024 • 468 • 5.31k • 54
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs