AnyTalker: Scaling Multi-Person Talking Video Generation with Interactivity Refinement Paper β’ 2511.23475 β’ Published 27 days ago β’ 41
GenCompositor: Generative Video Compositing with Diffusion Transformer Paper β’ 2509.02460 β’ Published Sep 2 β’ 25
Runtime error 216 IndexTTS: An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System π 216 Generate speech from text using a reference audio
Configuration error Featured 1.45k EasyControl Ghibli π¦ 1.45k New Ghibli EasyControl model is now released!!
Let Them Talk: Audio-Driven Multi-Person Conversational Video Generation Paper β’ 2505.22647 β’ Published May 28 β’ 3
Perception, Reason, Think, and Plan: A Survey on Large Multimodal Reasoning Models Paper β’ 2505.04921 β’ Published May 8 β’ 185
VerIPO: Cultivating Long Reasoning in Video-LLMs via Verifier-Gudied Iterative Policy Optimization Paper β’ 2505.19000 β’ Published May 25 β’ 42