Great experience yesterday at PyTorch Conf Europe in Paris 🇫🇷
We (w/ @kashif) talked about training LLMs through interaction, using trajectories across games, browsers, or simulators.
The room was packed, a clear sign of interest in where RL post-training is heading.
Sharing the slides! 🤓
https://drive.google.com/file/d/16k7YRnf5EJEo0XjXGlRJ_hVeLoFWKyNP/view?usp=sharing

Gemma 4 💎 is here and it’s strong!
To celebrate, we’re rolling out in TRL:
> support for multimodal tool responses for environments (OpenEnv)
> an example to train it in CARLA for autonomous driving with image-based tool calls
Go check it out 🏎️🏎️
blog: https://huggingface.co/blog/gemma4
script: https://github.com/huggingface/trl/blob/main/examples/scripts/openenv/carla_vlm_gemma.py

TRL is officially an adult 🥳
Excited to announce TRL v1.0❗️
Head to the blog to see how we got here and what’s next for this post-training library, designed to keep pace with the field.
https://huggingface.co/blog/trl-v1