
Zhexin Zhang

nonstopfor

AI & ML interests

None yet

Organizations

Conversational AI (CoAI) group from Tsinghua University

commented 2 papers 7 months ago

How Should We Enhance the Safety of Large Reasoning Models: An Empirical Study

Paper • 2505.15404 • Published May 21 • 13 • 2

Be Careful When Fine-tuning On Open-Source LLMs: Your Fine-tuning Data Could Be Secretly Stolen!

Paper • 2505.15656 • Published May 21 • 15 • 2
commented a paper 10 months ago

AISafetyLab: A Comprehensive Framework for AI Safety Evaluation and Improvement

Paper • 2502.16776 • Published Feb 24 • 6 • 2
New activity in thu-coai/AISafetyLab_Datasets about 1 year ago

Upload 6 files

#2 opened about 1 year ago by yangjunxiao2021
commented a paper about 1 year ago

Agent-SafetyBench: Evaluating the Safety of LLM Agents

Paper • 2412.14470 • Published Dec 19, 2024 • 13 • 2
commented a paper over 1 year ago

Safe Unlearning: A Surprisingly Effective and Generalizable Solution to Defend Against Jailbreak Attacks

Paper • 2407.02855 • Published Jul 3, 2024 • 12 • 1