Youzhi Yu
PursuitOfDataScience
AI & ML interests: LLM, GPU Computing, PyTorch
Recent Activity
- Liked a dataset 23 days ago: GAIR/lima
- Updated a model 23 days ago: PursuitOfDataScience/llama3.2-1b-thinking
- Published a model 23 days ago: PursuitOfDataScience/llama3.2-1b-thinking
Organizations
None yet
Sandbox Models
Trial & Error models for various tasks.
- PursuitOfDataScience/roberta-large-ner
  Token Classification • 0.4B • Updated • 14
- PursuitOfDataScience/distilbert-base-cased-ner
  Token Classification • 65.2M • Updated • 13
- PursuitOfDataScience/bert-base-ner
  Token Classification • 0.1B • Updated • 13
- PursuitOfDataScience/t5-large-summary-model
  0.7B • Updated • 9
ArgonneAI
Pretrained LLMs from scratch.
Models (21)
- PursuitOfDataScience/llama3.2-1b-thinking
  Text Generation • 1B • Updated • 12
- PursuitOfDataScience/llama-3-2-1b-open-r1-mot-sft
  Text Generation • 1B • Updated • 21
- PursuitOfDataScience/qwen2.5-0.5b-r1-dpo
  Text Generation • 0.5B • Updated • 24
- PursuitOfDataScience/qwen2.5-0.5b-dpo
  Text Generation • 0.5B • Updated • 39
- PursuitOfDataScience/qwen2.5-0.5b-open-r1-mot-cot-sft
  Text Generation • 0.5B • Updated • 18
- PursuitOfDataScience/llama3.2-1b-dpo
  Text Generation • 1B • Updated • 16
- PursuitOfDataScience/qwen2.5-0.5b-ultrachat-sft-multi-turn
  0.5B • Updated • 43
- PursuitOfDataScience/finetuned-llama-3.2-3b-math-reasoning
  3B • Updated • 7
- PursuitOfDataScience/finetuned-llama-3.2-3b-dpo
  Text Generation • 3B • Updated • 6
- PursuitOfDataScience/Qwen2.5-1.5B-Instruct-Lora-Deepseek-R1
  2B • Updated • 21
Datasets (38)
- PursuitOfDataScience/bbc-news-llama4-maverick-summary
  Viewer • Updated • 174k • 50
- PursuitOfDataScience/govreport-llama4-maverick-summary
  Viewer • Updated • 19.5k • 41 • 1
- PursuitOfDataScience/arxiv-llama4-maverick-abstract
  Viewer • Updated • 198k • 125
- PursuitOfDataScience/xsum-llama4-maverick-summary
  Viewer • Updated • 227k • 39
- PursuitOfDataScience/cnn-dailymail-llama4-maverick-summary
  Viewer • Updated • 312k • 60
- PursuitOfDataScience/earnings-call-llama4-maverick-summary
  Viewer • Updated • 191k • 165
- PursuitOfDataScience/mistral-awesome-chatgpt-prompts
  Viewer • Updated • 203 • 15
- PursuitOfDataScience/s1k-magistral-small-2506
  Viewer • Updated • 1k • 16
- PursuitOfDataScience/llama-awesome-chatgpt-prompts
  Viewer • Updated • 203 • 20
- PursuitOfDataScience/gsm8k-Llama-4-Maverick-17B-128E-Instruct-FP8
  Viewer • Updated • 8.79k • 32