MA-DBFAN: Multiple-Attention-based Dual Branch Feature Aggregation Network for Aerial Image Semantic Segmentation

Haoyu Yue, Junhong Yue, Xuejun Guo, Yizhen Wang, Liancheng Jiang
DOI: https://doi.org/10.1007/s11760-024-03106-3
IF: 1.583
2024-01-01
Signal Image and Video Processing
Abstract: Aerial image semantic segmentation has extensive applications in land resource management, ecological monitoring, and traffic management. Many convolutional neural networks have been developed for this task, but they do not fully exploit the long-range dependencies and multi-scale information in high-resolution images, which limits further gains in segmentation performance. To address this, a multiple-attention-based dual branch feature aggregation network is proposed to improve the segmentation accuracy of aerial images. The model consists of a contextual feature extraction branch (CFEB), a spatial information extraction branch (SIEB), and a feature aggregation module (FAM). In the CFEB, we design a SeMask-based dual category attention module to extract semantic category features and use an atrous spatial pyramid pooling (ASPP) module to extract multi-scale features, capturing global contextual information that is both category-aware and multi-scale. In the SIEB, a shallow CNN is employed to retain the fine-grained features of images. In the FAM, a dual attention interaction module combining spatial attention and channel attention fuses the global contextual information and the local spatial information extracted by the two branches. Extensive experiments on three freely accessible datasets (the UAVid dataset, the Landcover.ai dataset, and the Vaihingen dataset) demonstrate that our method outperforms other state-of-the-art models on aerial images.
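The abstract does not give the internals of the FAM, so the following is only a minimal sketch of the general idea of fusing two branch outputs with channel attention and spatial attention. It is not the authors' module. The function names (`channel_attention`, `spatial_attention`, `fuse`), the parameter-free sigmoid gates, and the way the two branches are combined are all assumptions made for illustration; a real implementation would use learned convolutional or fully connected layers.

```python
import numpy as np


def sigmoid(x):
    """Element-wise logistic function, mapping values into (0, 1)."""
    return 1.0 / (1.0 + np.exp(-x))


def channel_attention(feat):
    """Hypothetical channel gate for a (C, H, W) feature map.

    Uses global average pooling followed by a sigmoid. Learned layers are
    omitted, so this only illustrates the shape of the computation.
    Returns weights of shape (C, 1, 1).
    """
    pooled = feat.mean(axis=(1, 2))            # (C,)
    return sigmoid(pooled)[:, None, None]


def spatial_attention(feat):
    """Hypothetical spatial gate for a (C, H, W) feature map.

    Combines the channel-wise mean and max maps and applies a sigmoid.
    Returns weights of shape (1, H, W).
    """
    avg_map = feat.mean(axis=0)                # (H, W)
    max_map = feat.max(axis=0)                 # (H, W)
    return sigmoid(avg_map + max_map)[None, :, :]


def fuse(context_feat, spatial_feat):
    """Assumed fusion rule, not taken from the paper.

    Both gates are computed from the summed features. The contextual branch
    is reweighted per channel, the spatial branch per pixel, and the two
    reweighted maps are added.
    """
    if context_feat.shape != spatial_feat.shape:
        raise ValueError("branch outputs must have the same (C, H, W) shape")
    summed = context_feat + spatial_feat
    ca = channel_attention(summed)             # (C, 1, 1)
    sa = spatial_attention(summed)             # (1, H, W)
    return context_feat * ca + spatial_feat * sa


# Usage with random stand-ins for the CFEB and SIEB outputs.
rng = np.random.default_rng(0)
cfeb_out = rng.standard_normal((8, 16, 16))
sieb_out = rng.standard_normal((8, 16, 16))
fused = fuse(cfeb_out, sieb_out)
print(fused.shape)
```

In this sketch the broadcasting of the `(C, 1, 1)` and `(1, H, W)` gates against `(C, H, W)` features is what lets one gate select channels and the other select locations.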