CAT-DM: Controllable Accelerated Virtual Try-on with Diffusion Model

Zeng, Jianhao; Song, Dan; Nie, Weizhi; Tian, Hongshuo; Wang, Tongtong; Liu, Anan

(Submitted on 30 Nov 2023 (this version), latest version 26 Apr 2024 (v2))

Abstract: Image-based virtual try-on enables users to virtually try on different garments by altering the original clothes in their photographs. Generative Adversarial Networks (GANs) dominate research on image-based virtual try-on, but have not resolved problems such as unnatural garment deformation and blurry generation quality. Recently, diffusion models have shown surprising performance across various image generation tasks. While the generative quality of diffusion models is impressive, achieving controllability poses a significant challenge when applying them to virtual try-on tasks, and the multiple denoising iterations they require limit their potential for real-time applications. In this paper, we propose a Controllable Accelerated virtual Try-on with Diffusion Model, called CAT-DM. To enhance controllability, we design a basic diffusion-based virtual try-on network that uses ControlNet to introduce additional control conditions and improves the feature extraction of garment images. For acceleration, CAT-DM initiates the reverse denoising process from an implicit distribution generated by a pre-trained GAN-based model. Compared with previous diffusion-based try-on methods, CAT-DM not only retains the pattern and texture details of the in-shop garment but also reduces the number of sampling steps without compromising generation quality. Extensive experiments demonstrate the superiority of CAT-DM over both GAN-based and diffusion-based methods in producing more realistic images and accurately reproducing garment patterns. Our code and models will be publicly released.
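The acceleration idea in the abstract, starting the reverse denoising process from a GAN-generated result rather than from pure noise, can be illustrated with a toy sketch. The abstract does not give the actual procedure, so everything below is an assumption made for illustration: the GAN output is perturbed to an intermediate step with the standard forward diffusion formula, the reverse pass uses a deterministic DDIM-style update, the noise schedule is linear, and "images" are flat lists of floats. The names `make_alphas_cumprod`, `noise_to_step`, `truncated_sample`, and `predict_noise` are hypothetical and not taken from the paper or its code.

```python
import math
import random


def make_alphas_cumprod(num_steps=1000, beta_start=1e-4, beta_end=0.02):
    """Cumulative products of (1 - beta_t) for a linear schedule.

    Index 0 holds 1.0 and stands for the clean image (assumed convention).
    """
    abar = [1.0]
    for i in range(num_steps):
        beta = beta_start + (beta_end - beta_start) * i / (num_steps - 1)
        abar.append(abar[-1] * (1.0 - beta))
    return abar


def noise_to_step(x0, t, abar, rng):
    """Forward-diffuse x0 to step t: sqrt(abar_t) * x0 + sqrt(1 - abar_t) * eps."""
    scale, sigma = math.sqrt(abar[t]), math.sqrt(1.0 - abar[t])
    return [scale * v + sigma * rng.gauss(0.0, 1.0) for v in x0]


def truncated_sample(x_gan, predict_noise, start_step, num_sampling_steps, abar, rng):
    """Denoise from an intermediate step, seeded by a coarse GAN output.

    predict_noise(x, t) is a placeholder for a trained noise-prediction network.
    """
    # Evenly spaced, strictly decreasing timesteps from start_step down to 0.
    steps = sorted(
        {round(start_step * k / num_sampling_steps) for k in range(num_sampling_steps + 1)},
        reverse=True,
    )
    # Assumption: the GAN result is noised to start_step instead of sampling pure noise.
    x = noise_to_step(x_gan, steps[0], abar, rng)
    for t, t_prev in zip(steps[:-1], steps[1:]):
        eps = predict_noise(x, t)
        a_t, a_prev = abar[t], abar[t_prev]
        # Estimate the clean image, then move deterministically to step t_prev.
        x0_pred = [(xi - math.sqrt(1.0 - a_t) * ei) / math.sqrt(a_t) for xi, ei in zip(x, eps)]
        x = [math.sqrt(a_prev) * p + math.sqrt(1.0 - a_prev) * ei for p, ei in zip(x0_pred, eps)]
    return x
```

In this sketch, choosing `start_step` well below the full schedule length is what shortens sampling: the denoiser only has to refine the coarse GAN result over a few steps instead of generating an image from scratch. How CAT-DM itself chooses the starting point and the number of steps is not stated in the abstract.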

Subjects: Computer Vision and Pattern Recognition (cs.CV)
Cite as: arXiv:2311.18405 [cs.CV] (or arXiv:2311.18405v1 [cs.CV] for this version)

Submission history

From: Jianhao Zeng
[v1] Thu, 30 Nov 2023 09:56:17 GMT (21044kb,D)
[v2] Fri, 26 Apr 2024 01:57:00 GMT (7797kb,D)
