Improving CNN-based Semantic Segmentation on Structurally Similar Data Using Contrastive Graph Convolutional Networks

Ling Chen,Zedong Tang,Hao Li
DOI: https://doi.org/10.1016/j.patcog.2024.110622
IF: 8
2024-01-01
Pattern Recognition
Abstract:Structurally similar data exist in most practical semantic segmentation applications. For example, objects can appear identical or positionally similar in many images, such as video frames. Objects with structural similarity in data samples can confuse deep neural networks (DNNs) in semantic segmentation applications. These challenges often lead to lower pixel classification accuracy of natural object segmentation. This study proposes a novel approach (S2-GCN) that enhances CNN-based semantic segmentation for structurally similar data using a contrastive graph convolutional network (GCN). By selecting specific label pairs and developing a customized GCN branch parallel to an encoder-decoder backbone, our method significantly improves accuracy, IoU, and F1-score, by up to 8%, as demonstrated through an extensive evaluation of five datasets. Our findings show that our proposed method effectively addresses the structural similarity problem of CNN-based semantic segmentation and can be applied to a wide range of practical applications.
What problem does this paper attempt to address?