BlenderFusion: 3D-Grounded Visual Editing and Generative Compositing Paper • 2506.17450 • Published Jun 20, 2025 • 64
Infini-gram mini: Exact n-gram Search at the Internet Scale with FM-Index Paper • 2506.12229 • Published Jun 13, 2025 • 3
DocRAG Datasets Collection Processed ("Unified") datasets used in DocRAG for training or inference purposes. • 12 items • Updated Jun 14, 2025 • 1
Seeing from Another Perspective: Evaluating Multi-View Understanding in MLLMs Paper • 2504.15280 • Published Apr 21, 2025 • 25
Molmo Collection Artifacts for open multimodal language models. • 5 items • Updated 11 days ago • 309
Synthetic Object Compositions for Det / Seg / Grounding Collection Dataset collections for the paper: https://github.com/weikaih04/Synthetic-Detection-Segmentation-Grounding-Data • 10 items • Updated Nov 21, 2025 • 2
CoTA Datasets Collection This collection contains all versions of the CoTA (Chain-of-Thought-and-Action) datasets. • 5 items • Updated Oct 31, 2025 • 7
xGen-MM (BLIP-3): A Family of Open Large Multimodal Models Paper • 2408.08872 • Published Aug 16, 2024 • 101
TaskMeAnything Collection A collection of TaskMeAnything resources [https://github.com/JieyuZ2/TaskMeAnything] • 12 items • Updated Aug 4, 2024 • 3