Aloïs Thomas's picture

Aloïs Thomas

alothomas

·

AI & ML interests

None yet

Organizations

Collections 1

models 12

alothomas/radbert-rad-verifier-context

Text Classification • 0.1B • Updated Sep 30 • 16

alothomas/radbert-rad-verifier-single

Text Classification • 0.1B • Updated Sep 29 • 27

alothomas/deberta-rad-verifier-context

Text Classification • 0.2B • Updated Sep 28 • 17

alothomas/deberta-rad-verifier-single

Text Classification • 0.2B • Updated Sep 28 • 18

alothomas/Qwen2.5-0.5B-PRM-RAD-balanced-150k-LastStepOnly

Token Classification • 0.5B • Updated Sep 24 • 8

alothomas/Qwen2.5-0.5B-PRM-RAD-seq

alothomas/ppo-LunarLander-v2

Reinforcement Learning • Updated Jul 16 • 2

alothomas/Qwen2.5-3B-PRM-RAD-balanced-150k

Token Classification • 3B • Updated Mar 4 • 12

alothomas/Qwen2.5-0.5B-PRM-RAD-balanced-150k

Token Classification • 0.5B • Updated Mar 3 • 108

alothomas/Qwen2.5-0.5B-PRM-RAD-balanced-V4

Token Classification • 0.5B • Updated Feb 23 • 21

datasets 0

None public yet