Similarity-Guided and Multi-Layer Fusion Network for Few-shot Semantic Segmentation

Yemao Zhang,Wei Jia,Hai Min,Yingke Lei,Yang Zhao,Chunxiao Fan
DOI: https://doi.org/10.1109/ispds56360.2022.9874208
2022-01-01
Abstract:Few-shot semantic segmentation aims to tackle the problem that segmenting unseen object class using only a few support images with the same object class. At present, most related methods focus on prototype learning or feature similarity. However, these few-shot segmentation methods do not make good use of high-level features to enhance the prediction results. In this paper, we propose a lightweight Similarity-Guided and Multi-layer Fusion Network (SMNet) with two modules including Similarity-Guided Module (SGM) and Multi-Layer Fusion Module (MLFM). Specifically, the SGM utilizes cosine similarities in multiple high-level feature layers to augment the features in middle-level from query and support image, and then augmented features are refined via a residual attention module. In order to enhance the diversity of features, we reformulate the refined features as a spatiotemporal sequence problem. Then, we introduce the MLFM, which combines two ConvLSTMs to obtain fused feature from different scales. Finally, the decoder takes fused features to obtain predicted mask. Experiment results demonstrate that our model can achieve superior or competitive performances in several datasets.
What problem does this paper attempt to address?