🔄 In a Training Loop

Lewis Tunstall PRO

lewtun

huggingface

https://lewtun.github.io/blog/

AI & ML interests

LLMs, LLMs, LLMs

Recent Activity

upvoted a paper about 9 hours ago

AsyncOPD: How Stale Can On-Policy Distillation Be?

liked a Space 2 days ago

rl-llm-wiki/rl-dashboard

published a bucket 4 days ago

lewtun/trl-internal-testing

Organizations

lewtun's collections (6)