Text to video
updated
EgoX: Egocentric Video Generation from a Single Exocentric Video
Paper
• 2512.08269
• Published • 122
Wan-Move: Motion-controllable Video Generation via Latent Trajectory Guidance
Paper
• 2512.08765
• Published • 134
StereoWorld: Geometry-Aware Monocular-to-Stereo Video Generation
Paper
• 2512.09363
• Published • 74
Visionary: The World Model Carrier Built on WebGPU-Powered Gaussian Splatting Platform
Paper
• 2512.08478
• Published • 77
OneStory: Coherent Multi-Shot Video Generation with Adaptive Memory
Paper
• 2512.07802
• Published • 46
OmniPSD: Layered PSD Generation with Diffusion Transformer
Paper
• 2512.09247
• Published • 49
Are We Ready for RL in Text-to-3D Generation? A Progressive Investigation
Paper
• 2512.10949
• Published • 47
EgoEdit: Dataset, Real-Time Streaming Model, and Benchmark for Egocentric Video Editing
Paper
• 2512.06065
• Published • 29
RealGen: Photorealistic Text-to-Image Generation via Detector-Guided Rewards
Paper
• 2512.00473
• Published • 26
X-Humanoid: Robotize Human Videos to Generate Humanoid Videos at Scale
Paper
• 2512.04537
• Published • 7
PersonaLive! Expressive Portrait Image Animation for Live Streaming
Paper
• 2512.11253
• Published • 39