EvoEmbedding: Evolvable Representations for Long-Context Retrieval and Agentic Memory

πŸ”— GitHub Repository | πŸ“š Training Dataset | πŸ“‘ Paper (https://arxiv.org/abs/2606.21649)

EvoEmbedding is a novel embedding model designed for long-context and dynamic retrieval scenarios. Unlike static embedding models that chunk text in isolation, EvoEmbedding maintains a continuously updated Latent Memory Queue. This allows it to capture temporal dynamics and generate context-aware, evolvable embeddings for precise retrieval in agentic workflows and long-conversations.

πŸ“¦ Model Family

We provide EvoEmbedding in three sizes based on the Qwen architecture:

Model Parameters Base Model Hugging Face Link
EvoEmbedding-0.8B 0.8B Qwen3.5-0.8B MiG-NJU/EvoEmbedding-0.8B
EvoEmbedding-2B 2B Qwen3.5-2B MiG-NJU/EvoEmbedding-2B
EvoEmbedding-4B 4B Qwen3-4B MiG-NJU/EvoEmbedding-4B

πŸ“š Citation

If you find this model or our methodology useful, please cite our paper:

@article{nie2026evoembedding,
  title={EvoEmbedding: Evolvable Representations for Long-Context Retrieval and Agentic Memory},
  author={Nie, Chang and Fu, Chaoyou and Feng, Junlan and Shan, Caifeng},
  journal={arXiv preprint arXiv:2606.21649},
  year={2026}
}
Downloads last month
148
Safetensors
Model size
5B params
Tensor type
F32
Β·
BF16
Β·
Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support

Paper for ClareNie/EvoEmbedding-4B