Suchir Salhan
suchirsalhan
14 followers · 32 following
https://www.suchirsalhan.com/
suchirsalhan
suchirsalhan
ssalhan
AI & ML interests
Multilinguality and Cognitively-Inspired AI. Tokenization, Pretraining, Interpretability & Alignment.
Recent Activity
Updated a model about 14 hours ago: Beetle-FineWeb-2B/beetle-bilingual-l2-80-late-b5-fineweb-2b-deu-eng
Published a model about 14 hours ago: Beetle-FineWeb-2B/beetle-bilingual-l2-80-late-b5-fineweb-2b-deu-eng
Updated a model about 15 hours ago: Beetle-FineWeb-2B/beetle-bilingual-l2-50-classroom-20-b4-fineweb-2b-deu-eng
Organizations
suchirsalhan's datasets (9)
suchirsalhan/kidalign-llama-filterable • Viewer • Updated 27 days ago • 97.6k • 38
suchirsalhan/kidalign-llama-3.1-8B-Instruct • Updated 27 days ago • 2.41k
suchirsalhan/babylm-detox • Viewer • Updated Apr 8 • 11.6M • 57
suchirsalhan/gptbert-tokenised • Updated Jul 24, 2025 • 2
suchirsalhan/Phonemized-UD • Viewer • Updated May 30, 2025 • 1.19M • 69
suchirsalhan/BabyLM-Pretokenised • Viewer • Updated Jan 31, 2025 • 1.64M • 11
suchirsalhan/MAO-CHILDES • Viewer • Updated Apr 11, 2024 • 3.81M • 14
suchirsalhan/CLiMP • Preview • Updated Apr 2, 2024 • 24 • 1
suchirsalhan/SLING • Viewer • Updated Apr 2, 2024 • 40k • 68