Aloïs Thomas
alothomas
AI & ML interests
None yet
Organizations
models
12
alothomas/radbert-rad-verifier-context • Text Classification • 0.1B • Updated • 16
alothomas/radbert-rad-verifier-single • Text Classification • 0.1B • Updated • 27
alothomas/deberta-rad-verifier-context • Text Classification • 0.2B • Updated • 17
alothomas/deberta-rad-verifier-single • Text Classification • 0.2B • Updated • 18
alothomas/Qwen2.5-0.5B-PRM-RAD-balanced-150k-LastStepOnly • Token Classification • 0.5B • Updated • 8
alothomas/Qwen2.5-0.5B-PRM-RAD-seq • Updated
alothomas/ppo-LunarLander-v2 • Reinforcement Learning • Updated • 2
alothomas/Qwen2.5-3B-PRM-RAD-balanced-150k • Token Classification • 3B • Updated • 12
alothomas/Qwen2.5-0.5B-PRM-RAD-balanced-150k • Token Classification • 0.5B • Updated • 108
alothomas/Qwen2.5-0.5B-PRM-RAD-balanced-V4 • Token Classification • 0.5B • Updated • 21
datasets
0
None public yet