Abstract:Deep learning based virtual try-on system has achieved some encouraging progress recently, but there still remain several big challenges that need to be solved, such as trying on arbitrary clothes of all types, trying on the clothes from one category to another and generating image-realistic results with few artifacts. To handle this issue, we in this paper first collect a new dataset with all types of clothes, \ie tops, bottoms, and whole clothes, each one has multiple categories with rich information of clothing characteristics such as patterns, logos, and other details. Based on this dataset, we then propose the Arbitrary Virtual Try-On Network (AVTON) that is utilized for all-type clothes, which can synthesize realistic try-on images by preserving and trading off characteristics of the target clothes and the reference person. Our approach includes three modules: 1) Limbs Prediction Module, which is utilized for predicting the human body parts by preserving the characteristics of the reference person. This is especially good for handling cross-category try-on task (\eg long sleeves \(\leftrightarrow\) short sleeves or long pants \(\leftrightarrow\) skirts, \etc), where the exposed arms or legs with the skin colors and details can be reasonably predicted; 2) Improved Geometric Matching Module, which is designed to warp clothes according to the geometry of the target person. We improve the TPS based warping method with a compactly supported radial function (Wendland's \(\Psi\)-function); 3) Trade-Off Fusion Module, which is to trade off the characteristics of the warped clothes and the reference person. This module is to make the generated try-on images look more natural and realistic based on a fine-tune symmetry of the network structure. Extensive simulations are conducted and our approach can achieve better performance compared with the state-of-the-art virtual try-on methods.

CS-VITON: a realistic virtual try-on network based on clothing region alignment and SPM

Toward Realistic Virtual Try-on Through Landmark Guided Shape Matching

VITON: An Image-based Virtual Try-on Network

VTNCT: an Image-Based Virtual Try-on Network by Combining Feature with Pixel Transformation

DP-VTON: Toward Detail-Preserving Image-Based Virtual Try-on Network

SP-VITON: shape-preserving image-based virtual try-on network

C-VTON: Context-Driven Image-Based Virtual Try-On Network

VTON-SCFA: A Virtual Try-On Network Based on the Semantic Constraints and Flow Alignment

High-Resolution Virtual Try-On with Misalignment and Occlusion-Handled Conditions

VTON-HF: High Fidelity Virtual Try-on Network Via Semantic Adaptation

Toward Characteristic-Preserving Image-based Virtual Try-On Network

StableVITON: Learning Semantic Correspondence with Latent Diffusion Model for Virtual Try-On

PG-VTON: A Novel Image-Based Virtual Try-On Method Via Progressive Inference Paradigm

VITON-HD: High-Resolution Virtual Try-On via Misalignment-Aware Normalization

SPG-VTON: Semantic Prediction Guidance for Multi-pose Virtual Try-on

Toward Detail-Oriented Image-Based Virtual Try-On with Arbitrary Poses

Arbitrary Virtual Try-On Network: Characteristics Preservation and Trade-off between Body and Clothing

SieveNet: A Unified Framework for Robust Image-Based Virtual Try-On

Three stages of 3D virtual try-on network with appearance flow and shape field

UF-VTON: Toward User-Friendly Virtual Try-On Network