Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
Zhayr1
/
BitMamba-2-0.25B
like
4
Text Generation
JAX
HuggingFaceFW/fineweb-edu
bigcode/the-stack-dedup
HuggingFaceTB/cosmopedia
English
bitmamba
bitnet
mamba
ssm
1.58-bit
ternary
efficient-inference
edge-computing
License:
mit
Model card
Files
Files and versions
xet
Community
main
BitMamba-2-0.25B
1.28 GB
Ctrl+K
Ctrl+K
1 contributor
History:
9 commits
Zhayr1
Update README.md
0b8d8c4
verified
about 2 months ago
bitmamba_cpp
Initial commit: Upload BitMamba-1B model, weights and benchmarks
2 months ago
jax_weights
Initial commit: Upload BitMamba-1B model, weights and benchmarks
2 months ago
.gitattributes
Safe
1.75 kB
Upload BitMamba-2: Efficient Scaling of 1.58-bit State Space Models.pdf
2 months ago
BitMamba-2: Efficient Scaling of 1.58-bit State Space Models.pdf
Safe
726 kB
xet
Upload BitMamba-2: Efficient Scaling of 1.58-bit State Space Models.pdf
2 months ago
README.md
Safe
3.38 kB
Update README.md
about 2 months ago
config.json
Safe
400 Bytes
Initial commit: Upload BitMamba-1B model, weights and benchmarks
2 months ago
scaling_comparisson.png
Safe
207 kB
xet
Initial commit: Upload BitMamba-1B model, weights and benchmarks
2 months ago
tokenizer.json
Safe
3.56 MB
Initial commit: Upload BitMamba-1B model, weights and benchmarks
2 months ago
tokenizer_config.json
Safe
286 Bytes
Initial commit: Upload BitMamba-1B model, weights and benchmarks
2 months ago