AnglE📐-based Embeddings Collection This collection consists of popular embeddings trained with AnglE: https://github.com/SeanLee97/AnglE • 9 items • Updated Aug 1, 2024 • 4
Beyond the Grid: Layout-Informed Multi-Vector Retrieval with Parsed Visual Document Representations Paper • 2603.01666 • Published 15 days ago • 1
Unlocking Multimodal Document Intelligence: From Current Triumphs to Future Frontiers of Visual Document Retrieval Paper • 2602.19961 • Published 22 days ago • 1
Sculpting the Vector Space: Towards Efficient Multi-Vector Visual Document Retrieval via Prune-then-Merge Framework Paper • 2602.19549 • Published 22 days ago • 1
VisionDocumentRetrieval Datasets Collection Datasets for vision document retrieval (VDR) • 19 items • Updated 15 days ago • 10
TiViBench: Benchmarking Think-in-Video Reasoning for Video Generative Models Paper • 2511.13704 • Published Nov 17, 2025 • 43
DocPruner: A Storage-Efficient Framework for Multi-Vector Visual Document Retrieval via Adaptive Patch-Level Embedding Pruning Paper • 2509.23883 • Published Sep 28, 2025 • 1
Go with Your Gut: Scaling Confidence for Autoregressive Image Generation Paper • 2509.26376 • Published Sep 30, 2025 • 10
AudioTrust: Benchmarking the Multifaceted Trustworthiness of Audio Large Language Models Paper • 2505.16211 • Published May 22, 2025 • 18
Temporal Regularization Makes Your Video Generator Stronger Paper • 2503.15417 • Published Mar 19, 2025 • 22
OmniCreator: Self-Supervised Unified Generation with Universal Editing Paper • 2412.02114 • Published Dec 3, 2024 • 14