Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
Jason Wei
JWei05
Follow
0 followers
·
1 following
AI & ML interests
RL, LLMs, DL Theory
Recent Activity
updated
a model
2 days ago
JWei05/gemma3-4b-pt-off-policy-distilled-from-27b-pt-base
published
a model
2 days ago
JWei05/gemma3-4b-pt-off-policy-distilled-from-27b-pt-base
updated
a model
3 days ago
JWei05/gemma3-12b-pt-off-policy-distilled-from-27bptw20-step80
View all activity
Organizations
models
14
Sort: Recently updated
JWei05/gemma3-4b-pt-off-policy-distilled-from-27b-pt-base
Updated
2 days ago
JWei05/gemma3-12b-pt-off-policy-distilled-from-27bptw20-step80
Updated
3 days ago
JWei05/gemma3-4b-pt-off-policy-distilled-from-27bptw20-step80
Updated
3 days ago
JWei05/dapo-gemma3-27b-pt-warmup20
Updated
3 days ago
JWei05/dapo-gemma3-27b-it-warmup20
Updated
5 days ago
JWei05/gemma3-12b-it-off-policy-distilled-from-gemma4-31b
Updated
5 days ago
JWei05/gemma3-4b-it-off-policy-distilled-from-gemma4-31b
Updated
5 days ago
JWei05/gemma3-12b-pt-off-policy-distilled-from-dapo27b
Updated
5 days ago
JWei05/gemma3-4b-pt-off-policy-distilled-from-dapo27b
Updated
5 days ago
JWei05/gemma3-12b-it-off-policy-distilled-from-dapo27b-correct
Updated
5 days ago
View 14 models
datasets
38
Sort: Recently updated
JWei05/DAPO-Gemma3-27B-PT-warmup20-step80-SFT-Data
Viewer
•
Updated
3 days ago
•
34.8k
•
24
JWei05/DAPO-Gemma4-31B-IT-SFT-Data
Viewer
•
Updated
5 days ago
•
34.8k
•
14
JWei05/DAPO-Gemma3-27B-IT-RL-SFT-Data-correct
Viewer
•
Updated
5 days ago
•
41.8k
•
24
JWei05/DAPO-Gemma3-27B-IT-RL-SFT-Data
Viewer
•
Updated
6 days ago
•
69.6k
•
25
JWei05/swe_smith_py_qwen3.5_35b_trajs_1952
Viewer
•
Updated
10 days ago
•
2k
•
50
JWei05/swe_smith_rs_qwen3.5_35b_trajs_2477
Viewer
•
Updated
10 days ago
•
5k
•
41
JWei05/swe_smith_go_qwen3.5_35b_trajs_1448
Viewer
•
Updated
10 days ago
•
1.63k
•
39
JWei05/swe_smith_js_qwen3.5_35b_trajs_4358
Viewer
•
Updated
10 days ago
•
5k
•
44
JWei05/swe_smith_java_qwen3.5_35b_trajs_4369
Viewer
•
Updated
10 days ago
•
5k
•
53
JWei05/swe_smith_js_5902_filtered
Viewer
•
Updated
20 days ago
•
5.9k
•
32
View 38 datasets