Open to Work

14 12

JP2

MJPT2

AI & ML interests

NLP Generative Multimodal Models

Recent Activity

liked a dataset 21 days ago

nyu-visionx/CV-Bench

liked a Space 21 days ago

HuggingFaceH4/on-policy-distillation

upvoted an article about 1 month ago

Training and Finetuning Multimodal Embedding & Reranker Models with Sentence Transformers

View all activity

Organizations

None yet

liked a dataset 21 days ago

nyu-visionx/CV-Bench

Viewer • Updated Jul 20, 2025 • 5.28k • 4.49k • 46

liked a Space 21 days ago

Unlocking On-Policy Distillation for Any Model Family

📝

113

Explore on-policy distillation visualization for any model

upvoted an article about 1 month ago

Article

Training and Finetuning Multimodal Embedding & Reranker Models with Sentence Transformers

tomaarsen

•

Apr 16

• 72

liked a dataset about 2 months ago

OX-PIXL/STVQA-7K

Viewer • Updated Nov 12, 2025 • 7.59k • 174 • 2

liked a model about 2 months ago

nyu-visionx/cambrian-8b

Text Generation • 8B • Updated Jun 28, 2024 • 803 • 64

liked a dataset about 2 months ago

ccvl/3DSRBench

Viewer • Updated Feb 3, 2025 • 5.16k • 1.22k • 9

updated a model 2 months ago

MJPT2/SmolGRPO-135M

Text Generation • 0.1B • Updated Apr 2 • 1

published a model 2 months ago

MJPT2/SmolGRPO-135M

Text Generation • 0.1B • Updated Apr 2 • 1

upvoted 2 articles 3 months ago

Article

Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries

aminediroHF, qgallouedec, kashif, lewtun, edbeeching, albertvillanova, nouamanetazi, lvwerra, sergiopaniego

•

Mar 10

• 162

Article

Mixture of Experts (MoEs) in Transformers

ariG23498, pcuenq, merve, IlyasMoutawwakil, ArthurZ, sergiopaniego, Molbap

•

Feb 26

• 168

liked a Space 3 months ago

The Smol Training Playbook

📚

3.2k

The secrets to building world-class LLMs

upvoted a collection 4 months ago

L1

Collection

L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning • 7 items • Updated Jul 13, 2025 • 9

upvoted 2 articles 4 months ago

Article

What is test-time compute and how to scale it?

Kseniase

•

Feb 6, 2025

• 122

Article

KV Caching Explained: Optimizing Transformer Inference Efficiency

not-lain

•

Jan 30, 2025

• 345

liked a Space 4 months ago

Scaling test-time compute

📈

600

Boost LLM answers with flexible test‑time search strategies

liked a Space 5 months ago

Model Family Tree

🌳

Generate a family tree of a given model

upvoted a collection 5 months ago

Scaling Test-Time Compute with Open Models

Collection

Models and datasets used in our blog post: https://huggingface.co/spaces/HuggingFaceH4/blogpost-scaling-test-time-compute • 10 items • Updated Jan 6, 2025 • 31

published a model 5 months ago

MJPT2/Qwen2.5-VL-3B-Instruct-Thinking

Updated Jan 6

upvoted a paper 5 months ago

Boosting Multimodal Reasoning with MCTS-Automated Structured Thinking

Paper • 2502.02339 • Published Feb 4, 2025 • 23

liked a dataset 5 months ago

vbdai/Ego3D-Bench

Viewer • Updated Jan 26 • 8.68k • 509 • 12

JP2

AI & ML interests

Recent Activity

Organizations

MJPT2's activity

Unlocking On-Policy Distillation for Any Model Family

Training and Finetuning Multimodal Embedding & Reranker Models with Sentence Transformers

Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries

Mixture of Experts (MoEs) in Transformers

The Smol Training Playbook

What is test-time compute and how to scale it?

KV Caching Explained: Optimizing Transformer Inference Efficiency

Scaling test-time compute

Model Family Tree