Multimodal Latent Diffusion Model for Complex Sewing Pattern Generation

Shengqi Liu,Yuhao Cheng,Zhuo Chen,Xingyu Ren,Wenhan Zhu,Lincheng Li,Mengxiao Bi,Xiaokang Yang,Yichao Yan
2024-12-19
Abstract:Generating sewing patterns in garment design is receiving increasing attention due to its CG-friendly and flexible-editing nature. Previous sewing pattern generation methods have been able to produce exquisite clothing, but struggle to design complex garments with detailed control. To address these issues, we propose SewingLDM, a multi-modal generative model that generates sewing patterns controlled by text prompts, body shapes, and garment sketches. Initially, we extend the original vector of sewing patterns into a more comprehensive representation to cover more intricate details and then compress them into a compact latent space. To learn the sewing pattern distribution in the latent space, we design a two-step training strategy to inject the multi-modal conditions, \ie, body shapes, text prompts, and garment sketches, into a diffusion model, ensuring the generated garments are body-suited and detail-controlled. Comprehensive qualitative and quantitative experiments show the effectiveness of our proposed method, significantly surpassing previous approaches in terms of complex garment design and various body adaptability. Our project page: <a class="link-external link-https" href="https://shengqiliu1.github.io/SewingLDM" rel="external noopener nofollow">this https URL</a>.
Computer Vision and Pattern Recognition,Graphics,Machine Learning
What problem does this paper attempt to address?