reaperdoesntknow/Qwen3-0.6B-Distilled-30B-A3B-Thinking-SFT Text Generation • 0.8B • Updated 1 day ago • 4.23k • 2
cs4248-nlp/paper-s10-bimga-dw100-aw10-tinybert-general-4l-312d-taco-hf-20260402-015143 14.4M • Updated about 8 hours ago • 1
sebastian-hofstaetter/distilbert-dot-margin_mse-T2-msmarco Feature Extraction • Updated Mar 16, 2021 • 66 • 2
sebastian-hofstaetter/distilbert-dot-tas_b-b256-msmarco Feature Extraction • Updated Apr 15, 2021 • 5.12k • • 26
LilaBoualili/colbert-distilbert-margin_mse-T2-msmarco-encoder-only Feature Extraction • Updated Apr 4, 2023 • 12