Abstract:

Purpose: Existing clothing parsing methods make little use of dataset-level information. This paper proposes a novel clothing parsing method that uses higher-level knowledge of outfit combinatorial consistency, drawn from the whole clothing dataset, to improve the accuracy of segmenting clothing images.

Design/methodology/approach: The authors propose an Outfit Memory Net (OMNet) that augments the original features by aggregating dataset-level prior information about clothing combinations. Specifically, the authors design an Outfit Matrix (OM) to represent the clothing combination information of a single image, and an Outfit Memory Module (OMM) to store the clothing combination information of all images in the training set, i.e. dataset-level clothing combination information. In addition, the authors propose a Multi-scale Aggregation Module (MAM) that aggregates the clothing combination information at multiple scales to address the large variance in object scale in clothing images.

Findings: Experiments on the Colorful Fashion Parsing Dataset (CFPD) show that the authors' method achieves 93.15% pixel accuracy (PA) and 51.24% mean class-wise intersection over union (mIoU), which are satisfactory parsing results relative to existing methods such as PSPNet, DANet and DeepLabV3. Moreover, a per-category comparison of segmentation accuracy across methods indicates that MAM effectively improves the segmentation of small objects.

Originality/value: With the rise of online shopping platforms and the continuous development of deep learning technology, applications such as clothing recommendation, matching, classification and virtual try-on systems have emerged in the clothing field. Clothing parsing is a key technology for realizing these applications, so improving its accuracy is necessary.
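The two metrics reported in the findings, PA and mIoU, are standard in semantic segmentation. As a minimal sketch of how they are commonly computed from a confusion matrix (not the authors' evaluation code), the Python below uses NumPy. The function names, the toy label maps, and the choice to average IoU only over classes that appear in the prediction or the ground truth are assumptions made for illustration; the abstract does not state the exact evaluation protocol.

```python
import numpy as np

def confusion_matrix(pred, target, num_classes):
    """Count pixels; rows are ground-truth classes, columns are predicted classes."""
    pred = np.asarray(pred).ravel()
    target = np.asarray(target).ravel()
    # Encode each (target, pred) pair as a single index, then histogram it.
    idx = target * num_classes + pred
    counts = np.bincount(idx, minlength=num_classes ** 2)
    return counts.reshape(num_classes, num_classes)

def pixel_accuracy(cm):
    """Fraction of all pixels whose predicted class matches the ground truth."""
    return np.trace(cm) / cm.sum()

def mean_iou(cm):
    """Mean of class-wise intersection over union.

    Assumption: classes absent from both prediction and ground truth
    (union == 0) are excluded from the mean.
    """
    tp = np.diag(cm)
    union = cm.sum(axis=0) + cm.sum(axis=1) - tp
    valid = union > 0
    return (tp[valid] / union[valid]).mean()

# Hypothetical 2x3 label maps with three classes, for illustration only.
target = np.array([[0, 0, 1],
                   [1, 2, 2]])
pred = np.array([[0, 1, 1],
                 [1, 2, 0]])

cm = confusion_matrix(pred, target, num_classes=3)
pa = pixel_accuracy(cm)   # 4 of 6 pixels are correct
miou = mean_iou(cm)       # per-class IoU: 1/3, 2/3, 1/2
print(f"PA = {pa:.4f}, mIoU = {miou:.4f}")
```

Because PA is dominated by large regions such as background and skin, while mIoU weights every class equally, a method can score high on PA and much lower on mIoU when small classes are segmented poorly. This is consistent with the gap between the two figures reported above.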

Unabridged Adjacent Modulation for Clothing Parsing

OMNet: Outfit Memory Net for clothing parsing

Phase Contour Enhancement Network for Clothing Parsing

Looking at Outfit to Parse Clothing

Feature fusion network for clothing parsing

Clothing Retrieval with Visual Attention Model

Clothing Co-Parsing by Joint Image Segmentation and Labeling

DiffCloth: Diffusion Based Garment Synthesis and Manipulation via Structural Cross-modal Semantic Alignment

Attentive Fashion Grammar Network For Fashion Landmark Detection And Clothing Category Classification

Towards Better Understanding the Clothing Fashion Styles: A Multimodal Deep Learning Approach

Channel and Spatial Enhancement Network for human parsing

Clothes Grasping and Unfolding Based on RGB-D Semantic Segmentation

ClothSeg: semantic segmentation network with feature projection for clothing parsing

FFENet: Frequency-Spatial Feature Enhancement Network for Clothing Classification

DeepCloth: Neural Garment Representation for Shape and Style Editing

Unconstrained Fashion Landmark Detection via Hierarchical Recurrent Transformer Networks

Describe Fashion Products via Local Sparse Self-Attention Mechanism and Attribute-based Re-sampling Strategy

IMAGDressing-v1: Customizable Virtual Dressing

An End-to-End Framework for Clothing Collocation Based on Semantic Feature Fusion

Quality-Aware Network for Human Parsing

AnyDressing: Customizable Multi-Garment Virtual Dressing via Latent Diffusion Models