Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
hmBERT 64k
non-profit
stefan_it_
stefan-it
Activity Feed
Request to join this org
Follow
2
AI & ML interests
Pretraining Historical Multilingual Language Models
Recent Activity
stefan-it
submitted
a paper
1 day ago
Repetition over Diversity: High-Signal Data Filtering for Sample-Efficient German Language Modeling
stefan-it
submitted
a paper
3 months ago
FineInstructions: Scaling Synthetic Instructions to Pre-Training Scale
stefan-it
authored
a paper
6 months ago
SindBERT, the Sailor: Charting the Seas of Turkish NLP
View all activity
Team members
1
hmbert-64k
's datasets
None public yet