Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Lipeng (Tony) He's picture
3 5 25

Lipeng (Tony) He

ttttonyhe
6b4b86ec-928a-4b7e-9c1e-8d5f009e3272's profile picture Ruben148's profile picture tfrere's profile picture
·
https://lipeng.ac
  • ttttonyhe

AI & ML interests

Trustworthy Machine Learning

Recent Activity

authored a paper 2 days ago
Safety at One Shot: Patching Fine-Tuned LLMs with A Single Instance
submitted a paper 2 days ago
Safety at One Shot: Patching Fine-Tuned LLMs with A Single Instance
updated a collection 3 days ago
Red-Teaming Models & Datasets
View all activity

Organizations

Zhejiang University's profile picture University of Waterloo's profile picture

commented a paper 3 months ago

Locket: Robust Feature-Locking Technique for Language Models

Paper • 2510.12117 • Published Oct 14, 2025 •
2
commented a paper 11 months ago

Activation Approximations Can Incur Safety Vulnerabilities Even in Aligned LLMs: Comprehensive Analysis and Defense

Paper • 2502.00840 • Published Feb 2, 2025 •
3
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs