HiStream: Efficient High-Resolution Video Generation via Redundancy-Eliminated Streaming Paper • 2512.21338 • Published 25 days ago • 21
OneStory: Coherent Multi-Shot Video Generation with Adaptive Memory Paper • 2512.07802 • Published Dec 8, 2025 • 44
TUNA: Taming Unified Visual Representations for Native Unified Multimodal Models Paper • 2512.02014 • Published Dec 1, 2025 • 72
MedVLThinker: Simple Baselines for Multimodal Medical Reasoning Paper • 2508.02669 • Published Aug 4, 2025
MedVLSynther: Synthesizing High-Quality Visual Question Answering from Medical Documents with Generator-Verifier LMMs Paper • 2510.25867 • Published Oct 29, 2025 • 6
MedVLSynther: Synthesizing High-Quality Visual Question Answering from Medical Documents with Generator-Verifier LMMs Paper • 2510.25867 • Published Oct 29, 2025 • 6
MedVLSynther: Synthesizing High-Quality Visual Question Answering from Medical Documents with Generator-Verifier LMMs Paper • 2510.25867 • Published Oct 29, 2025 • 6 • 1
VecGlypher/250903-alphanumeric-ref_img-b64_pil-ood_font_family_decon-dev Viewer • Updated Oct 29, 2025 • 20
Uniform Discrete Diffusion with Metric Path for Video Generation Paper • 2510.24717 • Published Oct 28, 2025 • 40
VecGlypher/250903-alphanumeric-ref_img-b64_pil-ood_font_family_decon-dev Viewer • Updated Oct 29, 2025 • 20
Llama 3.2 Collection This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 • 15 items • Updated Dec 6, 2024 • 649