Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
random's picture
16 28 56

random

fakerbaby
kramp's profile picture chriszhouwei's profile picture 21world's profile picture
·
  • fakerbaby

AI & ML interests

NLP, RL, VLM

Recent Activity

upvoted a paper 7 days ago
MiroThinker-1.7 & H1: Towards Heavy-Duty Research Agents via Verification
upvoted a collection 11 days ago
Qwen3.5
liked a dataset 11 days ago
Agent-Ark/Toucan-1.5M
View all activity

Organizations

Skywork's profile picture

fakerbaby 's collections 1

Alignment
  • Secrets of RLHF in Large Language Models Part I: PPO

    Paper • 2307.04964 • Published Jul 11, 2023 • 30
Alignment
  • Secrets of RLHF in Large Language Models Part I: PPO

    Paper • 2307.04964 • Published Jul 11, 2023 • 30
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs