rlsamplingJF/Qwen2.5-7B-Instruct-finemath_part1-rm-lr1e-6-constant-warmup_0.05-bs8-gc1.0-step60 7B • Updated about 5 hours ago
rlsamplingJF/Llama-3.2-3B-finemath_part1-rm-lr1e-6-constant-warmup_0.05-bs32-gc1.0-cc0.01-ls0-initial 3B • Updated about 18 hours ago
rlsamplingJF/Llama-3.2-3B-finemath_part1-rm-lr1e-6-constant-warmup_0.05-bs32-gc1.0-cc0.01-ls0-step109 3B • Updated about 18 hours ago
rlsamplingJF/Llama-3.2-3B-finemath_part1-rm-lr1e-6-constant-warmup_0.05-bs16-gc1.0-initial 3B • Updated about 20 hours ago
rlsamplingJF/Llama-3.2-3B-finemath_part1-rm-lr1e-6-constant-warmup_0.05-bs16-gc1.0-step220 3B • Updated 1 day ago • 9
rlsamplingJF/Qwen2.5-3B-Instruct-finemath-highquality-part1-seed2028-initial 3B • Updated 26 days ago • 30
rlsamplingJF/myllama-1B-20BT-finemath-highquality-part1-seed2026-initial 0.9B • Updated 26 days ago • 29