jina-embeddings-v5-text: Task-Targeted Embedding Distillation Paper • 2602.15547 • Published Feb 17 • 31
Article MTEB Leaderboard: From a slow demo to feature-rich leaderboard Samoed • 10 days ago • 21
Towards Retrieving Interaction Spaces for Agentic Search Paper • 2606.06880 • Published 18 days ago • 4
Is Position Bias in Dense Retrievers Built In-or Learned from Data? Paper • 2605.26578 • Published 28 days ago • 20
Seq vs Seq: An Open Suite of Paired Encoders and Decoders Paper • 2507.11412 • Published Jul 15, 2025 • 33
Article ModernVBERT: Towards Smaller Visual Document Retrievers paultltc • Oct 3, 2025 • 46
On the Challenges and Opportunities of Learned Sparse Retrieval for Code Paper • 2603.22008 • Published Mar 23 • 4
Article LateOn-Code & ColGrep: LightOn unveils state-of-the-art code retrieval models and code search tooling lightonai • Feb 12 • 57
Article How to Ground a Korean AI Agent in Real Demographics with Synthetic Personas nvidia • Apr 21 • 26
Article DenseOn with the LateOn: Open State-of-the-Art Single and Multi-Vector Models lightonai • Apr 21 • 40
Article Introducing RTEB: A New Standard for Retrieval Evaluation fzliu, KennethEnevoldsen, Samoed, isaacchung, tomaarsen, fzoll • Oct 1, 2025 • 146
Article Training and Finetuning Multimodal Embedding & Reranker Models with Sentence Transformers tomaarsen • Apr 16 • 72
KoViDoRe Benchmark (BEIR) v2 Collection Korean Vision Document Retrieval Benchmark • 4 items • Updated Mar 2 • 6
Beyond Hard Negatives: The Importance of Score Distribution in Knowledge Distillation for Dense Retrieval Paper • 2604.04734 • Published Apr 6 • 14
Article Multimodal Embedding & Reranker Models with Sentence Transformers tomaarsen • Apr 9 • 62
Article How I contributed a new model to the Transformers library using Codex nielsr • Mar 30 • 52