Sergio Paniego's picture

Building on HF

Sergio Paniego PRO

sergiopaniego

huggingface

·

https://sergiopaniego.github.io/

AI & ML interests

None yet

Recent Activity

upvoted an article about 1 hour ago

Unlocking Agentic RL Training for GPT-OSS: A Practical Retrospective

updated a Space about 19 hours ago

sergiopaniego/wordle-grpo-Qwen3-1.7B-test

published a Space about 19 hours ago

sergiopaniego/wordle-grpo-Qwen3-1.7B-test

View all activity

Organizations

upvoted an article about 1 hour ago

Article

Unlocking Agentic RL Training for GPT-OSS: A Practical Retrospective

3 days ago

•

37

upvoted an article 10 days ago

Article

Scaling OpenEnv: From Free Usage to Thousands of Concurrent Environments

10 days ago

•

9

upvoted an article 14 days ago

Article

Open Responses: What you need to know

+2

15 days ago

•

101

upvoted a paper 15 days ago

GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

Paper • 2601.05242 • Published 22 days ago • 210

upvoted a paper 18 days ago

Recursive Language Models

Paper • 2512.24601 • Published about 1 month ago • 78

upvoted an article 22 days ago

Article

NVIDIA Cosmos Reason 2 Brings Advanced Reasoning To Physical AI

24 days ago

•

60

upvoted 2 articles about 1 month ago

Article

Tokenization in Transformers v5: Simpler, Clearer, and More Modular

+4

Dec 18, 2025

•

119

Article

Nemotron 3 Nano \- A new Standard for Efficient, Open, and Intelligent Agentic Models

Dec 15, 2025

•

106

upvoted a collection about 2 months ago

NVIDIA Nemotron v3

Open, Production-ready Enterprise Models • 7 items • Updated about 9 hours ago • 128

upvoted an article about 2 months ago

Article

Building Deep Research: How we Achieved State of the Art

Nov 24, 2025

•

34

upvoted a changelog about 2 months ago

Changelog

Team & Enterprise Articles Now Featured on the Hugging Face Blog

Dec 8, 2025

• 91

upvoted an article about 2 months ago

Article

20x Faster TRL Fine-tuning with RapidFire AI

Dec 9, 2025

•

1

upvoted a collection about 2 months ago

GLM-4.6V

3 items • Updated Dec 8, 2025 • 48

upvoted an article about 2 months ago

Article

We Got Claude to Fine-Tune an Open Source LLM

Dec 4, 2025

•

584

upvoted a collection about 2 months ago

Ministral 3 - Additional Checkpoints

Different formats and Quantized versions of our Ministral 3 family; 14B/8B/3B Instruct/Reasoning GGUF, 3B Instruct ONNX and 14B/8B/3B Instruct BF16. • 13 items • Updated Dec 2, 2025 • 18

upvoted 2 articles about 2 months ago

Article

Tensor Parallelism (TP) in Transformers: 5 Minutes to Understand

Dec 4, 2025

•

63

Article

Transformers v5: Simple model definitions powering the AI ecosystem

+2

Dec 1, 2025

•

287

upvoted 3 articles 2 months ago

Article

Diffusers welcomes FLUX-2

+6

Nov 25, 2025

•

176

Article

Continuous batching from first principles

+1

Nov 25, 2025

•

313

Article

20x Faster TRL Fine-tuning with RapidFire AI

+1

Nov 21, 2025

•

26