Multiscale Prototype Contrast Network for High-Resolution Aerial Imagery Semantic Segmentation

Qixiong Wang,Xiaoyan Luo,Jiaqi Feng,Guangyun Zhang,Xiuping Jia,Jihao Yin
DOI: https://doi.org/10.1109/tgrs.2023.3292919
IF: 8.2
2023-07-26
IEEE Transactions on Geoscience and Remote Sensing
Abstract:Semantic segmentation of high-resolution aerial images is a challenging task on account of complex scene variations and large-scale differences. However, these two issues are inadequately addressed in general semantic segmentation methods. In this article, we propose a multiscale prototype contrast network (MPCNet) to improve the adaptive capability for different scenes and scales. Specifically, a novel multiscale prototype transformer decoder (MPTD) is designed to extract dynamic scene-specific prototypes as pixel classifiers by fusing information from feature maps and learnable class tokens. To exploit cross-scene context information and accommodate the large-scale difference in the aerial image, we build a multiscale prototype memory queue to store these multiscale prototypes during training. Upon the multiscale prototype memory queue, a novel multiscale prototype contrastive loss is proposed to increase object feature discriminability across multiple scales, which brings better consistency of intermediate features and boosts the convergence of the network. Extensive experimental results on three publicly available datasets demonstrate the effectiveness and efficiency of our MPCNet over other state-of-the-art methods. The code is available at https://github.com/qixiong-wang/mmsegmentation-mpcnet.
imaging science & photographic technology,remote sensing,engineering, electrical & electronic,geochemistry & geophysics
What problem does this paper attempt to address?