Class-related Graph Convolution for Weakly Supervised Semantic Segmentation

Jinkai Zhang,Hui Yan,Tao Chen
DOI: https://doi.org/10.1117/12.2601023
2021-01-01
Abstract:Semantic segmentation with deep learning has achieved remarkable progress in classifying the pixels in the image. Acquiring sufficient ground-truth supervision to train deep visual models has been a bottleneck over the years due to the data-hungry nature of deep learning. Image-level label-based weakly supervised semantic segmentation (WSSS) aims to adopt image-level labels to train semantic segmentation models, saving vast human labors for costly pixel-level annotations. A typical pipeline for this problem is to adopt Class Activation Maps (CAMs) with image-level labels to generate pseudo-masks (a.k.a. seeds) and then use them for training segmentation models. The main difficulty is that seeds are usually sparse and incomplete. In recent years, GCNs have made great strides in various fields. GCN can perform global modeling and reasoning on the relationship between regions, which is beneficial for many computer vision tasks. This is our motivation to combine these two aspects. Therefore we propose the module of class-related graph convolution. Because there are differences between classes, our GCN is parallel. Each GCN can learn the classrelated region extension strategy. To enable GCN to learn more authentic relationships, we also introduce the attention mechanisms. We conduct lots of experiments on the public PASCAL VOC dataset, and our model yields state-of-the-art performance.
What problem does this paper attempt to address?