-
Learning Flow Fields in Attention for Controllable Person Image Generation
Paper β’ 2412.08486 β’ Published β’ 36 -
franciszzj/Leffa
Image-to-Image β’ Updated β’ 340 -
TryOffDiff: Virtual-Try-Off via High-Fidelity Garment Reconstruction using Diffusion Models
Paper β’ 2411.18350 β’ Published β’ 28 -
TryOffDiff
π₯62Extract garment images from everyday images!
Collections
Discover the best community collections!
Collections including paper arxiv:2411.00225
-
HtmlRAG: HTML is Better Than Plain Text for Modeling Retrieved Knowledge in RAG Systems
Paper β’ 2411.02959 β’ Published β’ 71 -
GarVerseLOD: High-Fidelity 3D Garment Reconstruction from a Single In-the-Wild Image using a Dataset with Levels of Details
Paper β’ 2411.03047 β’ Published β’ 9 -
MVPaint: Synchronized Multi-View Diffusion for Painting Anything 3D
Paper β’ 2411.02336 β’ Published β’ 24 -
GenXD: Generating Any 3D and 4D Scenes
Paper β’ 2411.02319 β’ Published β’ 20
-
Fashion-VDM: Video Diffusion Model for Virtual Try-On
Paper β’ 2411.00225 β’ Published β’ 11 -
HelloMeme: Integrating Spatial Knitting Attentions to Embed High-Level and Fidelity-Rich Conditions in Diffusion Models
Paper β’ 2410.22901 β’ Published β’ 8 -
Vision as a Dialect: Unifying Visual Understanding and Generation via Text-Aligned Representations
Paper β’ 2506.18898 β’ Published β’ 34
-
Fashion-VDM: Video Diffusion Model for Virtual Try-On
Paper β’ 2411.00225 β’ Published β’ 11 -
GarVerseLOD: High-Fidelity 3D Garment Reconstruction from a Single In-the-Wild Image using a Dataset with Levels of Details
Paper β’ 2411.03047 β’ Published β’ 9 -
FitDiT: Advancing the Authentic Garment Details for High-fidelity Virtual Try-on
Paper β’ 2411.10499 β’ Published β’ 13 -
TryOffDiff: Virtual-Try-Off via High-Fidelity Garment Reconstruction using Diffusion Models
Paper β’ 2411.18350 β’ Published β’ 28
-
WorldDreamer: Towards General World Models for Video Generation via Predicting Masked Tokens
Paper β’ 2401.09985 β’ Published β’ 18 -
CustomVideo: Customizing Text-to-Video Generation with Multiple Subjects
Paper β’ 2401.09962 β’ Published β’ 9 -
Inflation with Diffusion: Efficient Temporal Adaptation for Text-to-Video Super-Resolution
Paper β’ 2401.10404 β’ Published β’ 10 -
ActAnywhere: Subject-Aware Video Background Generation
Paper β’ 2401.10822 β’ Published β’ 13
-
Learning Flow Fields in Attention for Controllable Person Image Generation
Paper β’ 2412.08486 β’ Published β’ 36 -
franciszzj/Leffa
Image-to-Image β’ Updated β’ 340 -
TryOffDiff: Virtual-Try-Off via High-Fidelity Garment Reconstruction using Diffusion Models
Paper β’ 2411.18350 β’ Published β’ 28 -
TryOffDiff
π₯62Extract garment images from everyday images!
-
HtmlRAG: HTML is Better Than Plain Text for Modeling Retrieved Knowledge in RAG Systems
Paper β’ 2411.02959 β’ Published β’ 71 -
GarVerseLOD: High-Fidelity 3D Garment Reconstruction from a Single In-the-Wild Image using a Dataset with Levels of Details
Paper β’ 2411.03047 β’ Published β’ 9 -
MVPaint: Synchronized Multi-View Diffusion for Painting Anything 3D
Paper β’ 2411.02336 β’ Published β’ 24 -
GenXD: Generating Any 3D and 4D Scenes
Paper β’ 2411.02319 β’ Published β’ 20
-
Fashion-VDM: Video Diffusion Model for Virtual Try-On
Paper β’ 2411.00225 β’ Published β’ 11 -
GarVerseLOD: High-Fidelity 3D Garment Reconstruction from a Single In-the-Wild Image using a Dataset with Levels of Details
Paper β’ 2411.03047 β’ Published β’ 9 -
FitDiT: Advancing the Authentic Garment Details for High-fidelity Virtual Try-on
Paper β’ 2411.10499 β’ Published β’ 13 -
TryOffDiff: Virtual-Try-Off via High-Fidelity Garment Reconstruction using Diffusion Models
Paper β’ 2411.18350 β’ Published β’ 28
-
Fashion-VDM: Video Diffusion Model for Virtual Try-On
Paper β’ 2411.00225 β’ Published β’ 11 -
HelloMeme: Integrating Spatial Knitting Attentions to Embed High-Level and Fidelity-Rich Conditions in Diffusion Models
Paper β’ 2410.22901 β’ Published β’ 8 -
Vision as a Dialect: Unifying Visual Understanding and Generation via Text-Aligned Representations
Paper β’ 2506.18898 β’ Published β’ 34
-
WorldDreamer: Towards General World Models for Video Generation via Predicting Masked Tokens
Paper β’ 2401.09985 β’ Published β’ 18 -
CustomVideo: Customizing Text-to-Video Generation with Multiple Subjects
Paper β’ 2401.09962 β’ Published β’ 9 -
Inflation with Diffusion: Efficient Temporal Adaptation for Text-to-Video Super-Resolution
Paper β’ 2401.10404 β’ Published β’ 10 -
ActAnywhere: Subject-Aware Video Background Generation
Paper β’ 2401.10822 β’ Published β’ 13