Hugging Face

Ethan Walker

Ethan2222

AI & ML interests

None yet

Recent Activity

liked a dataset 23 days ago
nvidia/Nemotron-Math-v2
upvoted a paper 3 months ago
Arbitrary Entropy Policy Optimization: Entropy Is Controllable in Reinforcement Fine-tuning
upvoted a paper 3 months ago
Agentic Entropy-Balanced Policy Optimization

Organizations

None yet

liked a dataset 23 days ago

nvidia/Nemotron-Math-v2

Preview • Updated 3 days ago • 7.18k • 111
upvoted 2 papers 3 months ago

Arbitrary Entropy Policy Optimization: Entropy Is Controllable in Reinforcement Fine-tuning

Paper • 2510.08141 • Published Oct 9, 2025 • 1

Agentic Entropy-Balanced Policy Optimization

Paper • 2510.14545 • Published Oct 16, 2025 • 104
upvoted 2 papers 8 months ago

Advancing Multimodal Reasoning via Reinforcement Learning with Cold Start

Paper • 2505.22334 • Published May 28, 2025 • 36

Unsupervised Post-Training for Multi-Modal LLM Reasoning via GRPO

Paper • 2505.22453 • Published May 28, 2025 • 46