Aletheia-ng/pidgin-corpus-synth
Viewer
•
Updated
•
48.5k
•
59
Aletheia-ng/nigerian-pidgin-corpus-synth
Aletheia-ng/pretrain_data10
Viewer
•
Updated
•
40.9M
•
42
Aletheia-ng/low_resource_languages_pretrain_data4
Viewer
•
Updated
•
469M
•
898
Aletheia-ng/pretrain_data11
Aletheia-ng/pretrain_data9
Viewer
•
Updated
•
79.1M
•
59
Aletheia-ng/pretrain_data5
Viewer
•
Updated
•
9.43M
•
188
Aletheia-ng/pretrain_data4
Viewer
•
Updated
•
124M
•
271
Aletheia-ng/pretrain_data7
Viewer
•
Updated
•
13M
•
45
Aletheia-ng/pretrain_data3
Viewer
•
Updated
•
143M
•
580
Viewer
•
Updated
•
136
•
6
Aletheia-ng/pretrain_data
Viewer
•
Updated
•
109M
•
358
Aletheia-ng/pretrain_data2
Viewer
•
Updated
•
18.2M
•
180
Aletheia-ng/low_resource_languages_pretrain
Viewer
•
Updated
•
202M
•
1.73k
•
1
Aletheia-ng/masakhaner_eval
Aletheia-ng/noisy_dataset
Viewer
•
Updated
•
84k
•
6
Viewer
•
Updated
•
84k
•
5
Aletheia-ng/personal_finance_v0.2
Viewer
•
Updated
•
56.6k
•
11
•
1
Aletheia-ng/bloomberg-news-articles-pretraining-dataset
Viewer
•
Updated
•
437k
•
41
•
5
Aletheia-ng/ChatML-aya_dataset
Viewer
•
Updated
•
202k
•
6
Aletheia-ng/yo_wiki_processed
Viewer
•
Updated
•
43.5k
•
3
Viewer
•
Updated
•
270k
•
6
Viewer
•
Updated
•
4.4k
•
2
Viewer
•
Updated
•
43.5k
•
4
Viewer
•
Updated
•
288
•
3
Viewer
•
Updated
•
1.01k
•
4
Viewer
•
Updated
•
3.67k
•
6