Amaldev

amaldev024
AI & ML interests

None yet

Organizations

None yet

Collections 1

Llm
  • Training Language Models to Self-Correct via Reinforcement Learning

    Paper • 2409.12917 • Published Sep 19, 2024 • 140

Models 1

amaldev024/ppo-LunarLander-v2

Reinforcement Learning • Updated Mar 17, 2024 • 4

Datasets 0

None public yet