BEE-spoke-data's Collections
FineWeb Concept Datasets
Updated May 22, 2024
Concept datasets extracted from FineWeb.
BEE-spoke-data/SaunaWeb-50k • Updated Dec 29, 2025 • 50k • 7
BEE-spoke-data/FineMeme-100k • Updated Dec 29, 2025 • 100k • 40
BEE-spoke-data/beeweb-5k • Updated Dec 29, 2025 • 5k • 20
BEE-spoke-data/fineweb-synergy-20k • Updated Dec 29, 2025 • 20k • 21
BEE-spoke-data/MoistWeb-25k • Updated Dec 29, 2025 • 25k • 3 • 1
BEE-spoke-data/fineweb-cryptid-5k • Updated Dec 29, 2025 • 5k • 18
BEE-spoke-data/fineweb-literature-100k • Updated Dec 29, 2025 • 100k • 21 • 1
BEE-spoke-data/fineweb-cinema-100k • Updated Dec 29, 2025 • 100k • 22