Dual-Path Geometry-Aware Network for Semantic Segmentation of High-Resolution Aerial Images

Zhenglin Xian, Jiaying Wang, Junli Yang, Bin Wang, Zideng Feng, Yifei Huang
DOI: https://doi.org/10.1109/icpr56361.2022.9956268
2022-01-01
Abstract: Semantic segmentation of high-resolution aerial images is a fundamental research topic with extensive applications. Unlike natural-scene datasets, high-resolution aerial datasets provide additional elevation data such as a Digital Surface Model (DSM). However, current semantic segmentation methods for high-resolution aerial images focus on improving feature extraction from the spectral images and fail to make full use of the DSM images. In addition, fusing features from these two disparate data types is a challenging problem. Moreover, the abundance of fine detail and the considerable variation in object scale limit the representation capacity of existing segmentation networks. To address these problems, we propose DPGANet, a new end-to-end dual-path geometry-aware network that consists of a Multi-scale Digital Surface Model Awareness (MDSMA) path and a Swin Transformer path. The MDSMA path is designed to extract multi-stage 3D geometry features from DSM images, and the Res2Net modules in this path enhance the multi-scale representation capability of the network. The Swin Transformer path is designed to capture multi-stage long-range dependencies from spectral images. Furthermore, to make full use of the feature maps produced by corresponding stages of the two paths, we design an Attention Fusion Module (AFM) that fuses features along both the spatial and channel dimensions in a memory-saving and computationally efficient way. Segmentation results on the ISPRS Potsdam dataset are competitive with other state-of-the-art methods.
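The abstract does not give the internals of the AFM, so the following is only a minimal illustrative sketch of what fusing two same-stage feature maps "from both spatial and channel dimensions" could look like. It is not the authors' module: the function name `attention_fuse`, the use of average pooling, and the parameter-free sigmoid gating are all assumptions made for illustration, and a real module would normally use learned weights.

```python
import numpy as np


def sigmoid(x):
    """Elementwise logistic function."""
    return 1.0 / (1.0 + np.exp(-x))


def attention_fuse(spectral, dsm):
    """Fuse two (C, H, W) feature maps with channel and spatial gating.

    Hypothetical, parameter-free stand-in for an attention fusion step;
    not the AFM described in the paper.
    """
    if spectral.shape != dsm.shape:
        raise ValueError("feature maps must share the same (C, H, W) shape")

    summed = spectral + dsm

    # Channel attention: global average pool over H and W gives one
    # weight per channel, shape (C, 1, 1).
    channel_gate = sigmoid(summed.mean(axis=(1, 2)))[:, None, None]

    # Spatial attention: average over channels gives one weight per
    # pixel, shape (1, H, W).
    spatial_gate = sigmoid(summed.mean(axis=0))[None, :, :]

    # Broadcasting combines both gates into a (C, H, W) mask in (0, 1).
    gate = channel_gate * spatial_gate

    # Convex combination: each output value lies between the two inputs.
    return gate * spectral + (1.0 - gate) * dsm


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    spectral_feat = rng.normal(size=(4, 8, 8))  # e.g. a transformer-path stage
    dsm_feat = rng.normal(size=(4, 8, 8))       # e.g. a DSM-path stage
    fused = attention_fuse(spectral_feat, dsm_feat)
    print(fused.shape)
```

Because the gate only reweights the two inputs, the fused map keeps the input shape and adds no parameters, which is one simple way such a step could stay cheap in memory and computation.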