Context-Picker: Dynamic context selection using multi-stage reinforcement learning Paper • 2512.14465 • Published Dec 16, 2025
IRG-MotionLLM: Interleaving Motion Generation, Assessment and Refinement for Text-to-Motion Generation Paper • 2512.10730 • Published Dec 11, 2025
LOVE-R1: Advancing Long Video Understanding with an Adaptive Zoom-in Mechanism via Multi-Step Reasoning Paper • 2509.24786 • Published Sep 29, 2025
Temporal Memory Attention for Video Semantic Segmentation Paper • 2102.08643 • Published Feb 17, 2021
OV-DINO: Unified Open-Vocabulary Detection with Language-Aware Selective Fusion Paper • 2407.07844 • Published Jul 10, 2024
WebNovelBench: Placing LLM Novelists on the Web Novel Distribution Paper • 2505.14818 • Published May 20, 2025
ViSpeak: Visual Instruction Feedback in Streaming Videos Paper • 2503.12769 • Published Mar 17, 2025
EgoExo-Fitness: Towards Egocentric and Exocentric Full-Body Action Understanding Paper • 2406.08877 • Published Jun 13, 2024