ClothSeg: semantic segmentation network with feature projection for clothing parsing

Guangyu Tang,Feng Yu,Huiyin Li,Yankang Shi,Li Liu,Tao Peng,Xinrong Hu,Minghua Jiang
DOI: https://doi.org/10.1016/j.jvcir.2023.103980
IF: 2.887
2023-01-01
Journal of Visual Communication and Image Representation
Abstract:Semantic segmentation of clothing presents a formidable challenge owing to the non-rigid geometric deforma-tion properties inherent in garments. In this paper, we use the Transformer as the encoder to better learn global information for clothing semantic segmentation. In addition, we propose a Feature Projection Fusion (FPF) module to better utilize local information. This module facilitates the integration of deep feature maps with shallow local details, thereby enabling the network to capture both high-level abstractions and fine-grained details of features. We also design a pixel distance loss in training to emphasize the impact of edge features. This loss calculates the mean of the shortest distances between all predicted clothing edges and the true clothing edges during the training process. We perform extensive experiments and our method achieves 56.30% and 74.97% mIoU on the public dataset CFPD and our self-made dataset LIC, respectively, demonstrating a competitive performance when compared to the state-of-the-art.
What problem does this paper attempt to address?