Bk9x's Collections
Data_Pretrain_NLP
Updated 24 days ago
aisingapore/SEA-PILE-v2 • Updated Apr 14, 2025 • 187M rows • 513 downloads • 4 likes
BlossomsAI/vietnamese-corpus • Updated Dec 17, 2024 • 29M rows • 173 downloads • 8 likes
uonlp/CulturaX • Updated Dec 16, 2024 • 7.18B rows • 40.4k downloads • 580 likes
bkai-foundation-models/BKAINewsCorpus • Updated Mar 5, 2024 • 16.8M rows • 103 downloads • 12 likes