PhyGDPO: Physics-Aware Groupwise Direct Preference Optimization for Physically Consistent Text-to-Video Generation Paper • 2512.24551 • Published 2 days ago • 14
Long-Context Autoregressive Video Modeling with Next-Frame Prediction Paper • 2503.19325 • Published Mar 25, 2025 • 73
Step-Video-T2V Technical Report: The Practice, Challenges, and Future of Video Foundation Model Paper • 2502.10248 • Published Feb 14, 2025 • 55
Pixel-Space Post-Training of Latent Diffusion Models Paper • 2409.17565 • Published Sep 26, 2024 • 20
Imagine yourself: Tuning-Free Personalized Image Generation Paper • 2409.13346 • Published Sep 20, 2024 • 69
MiraData: A Large-Scale Video Dataset with Long Durations and Structured Captions Paper • 2407.06358 • Published Jul 8, 2024 • 19
MotionClone: Training-Free Motion Cloning for Controllable Video Generation Paper • 2406.05338 • Published Jun 8, 2024 • 41
Autoregressive Model Beats Diffusion: Llama for Scalable Image Generation Paper • 2406.06525 • Published Jun 10, 2024 • 71
Improved Distribution Matching Distillation for Fast Image Synthesis Paper • 2405.14867 • Published May 23, 2024 • 15
Looking Backward: Streaming Video-to-Video Translation with Feature Banks Paper • 2405.15757 • Published May 24, 2024 • 15 • 2