Abstract. Semantic segmentation of high-resolution remote sensing images based on deep learning has become a hot research topic and has been widely applied. In current convolutional neural network architectures, extracting target features through multiple stacked convolutional layers tends to lose small-target features and blur the boundaries between ground object classes. Therefore, we propose P-Net, a remote sensing image semantic segmentation method that detects small targets and enhances edge features. The proposed network is based on an encoder-decoder structure. The decoder includes the following components: a progressive small target feature enhancement network (IFEN), a boundary refinement module (BRM), and a feature aggregation module (FIAM). First, the dense side-output features of the encoder network are used to learn small-target feature information and target edge features. Second, a pyramid segmentation attention module is introduced to effectively extract fine-grained, multi-scale spatial information; this module enhances the feature expression of small targets and obtains high-level semantic feature information. The boundary refinement module is designed to refine the low-level spatial feature information extracted by the encoder. Finally, to improve the accuracy of object segmentation boundaries in remote sensing images, skip connections are used to fuse high-level semantic information and low-level spatial information across layers. The features joined by these skip connections have the same spatial resolution but carry different semantic information. Six evaluation indices, namely mean intersection over union, frequency weighted intersection over union, pixel accuracy, F1, recall, and precision, were used to evaluate the method on two public datasets of high-resolution remote sensing images: the Gaofen Image Dataset (GID) and the Wuhan Dense Labeling Dataset (WHDLD).
On the GID dataset, the six indices reached 78.90%, 78.87%, 87.76%, 87.74%, 87.51%, and 88.04%, respectively; on the WHDLD dataset, they reached 63.21%, 75.20%, 84.67%, 75.79%, 76.56%, and 75.45%, respectively. The results show that the proposed method outperforms the DeepLabv3+, U-Net, PSPNet, and DUC_HDC methods. In particular, it recognizes small-target features better and produces clearer boundaries between object categories.
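For readers unfamiliar with the six indices named above, the following is a minimal sketch of how they are commonly computed from a per-class confusion matrix. It is not the authors' evaluation code: the function names `confusion_matrix` and `segmentation_metrics` are hypothetical, and the macro-averaging of precision, recall, and F1 over classes is an assumption, since the abstract does not state which averaging was used.

```python
def confusion_matrix(y_true, y_pred, num_classes):
    """Build conf[i][j] = number of pixels with true class i predicted as class j."""
    conf = [[0] * num_classes for _ in range(num_classes)]
    for t, p in zip(y_true, y_pred):
        conf[t][p] += 1
    return conf


def segmentation_metrics(conf):
    """Compute the six indices from a square confusion matrix (list of lists)."""
    n = len(conf)
    total = sum(sum(row) for row in conf)
    tp = [conf[i][i] for i in range(n)]                      # correctly labeled pixels per class
    true_count = [sum(conf[i]) for i in range(n)]            # ground-truth pixels per class
    pred_count = [sum(conf[i][j] for i in range(n)) for j in range(n)]  # predicted pixels per class

    def safe_div(a, b):
        # Classes absent from both truth and prediction contribute 0 (an assumption).
        return a / b if b else 0.0

    iou = [safe_div(tp[i], true_count[i] + pred_count[i] - tp[i]) for i in range(n)]
    precision = [safe_div(tp[i], pred_count[i]) for i in range(n)]
    recall = [safe_div(tp[i], true_count[i]) for i in range(n)]
    f1 = [safe_div(2 * p * r, p + r) for p, r in zip(precision, recall)]

    return {
        "pixel_accuracy": safe_div(sum(tp), total),
        "mean_iou": sum(iou) / n,
        # Each class IoU is weighted by that class's share of ground-truth pixels.
        "fw_iou": sum(safe_div(true_count[i], total) * iou[i] for i in range(n)),
        # Macro averages over classes (assumed averaging scheme).
        "precision": sum(precision) / n,
        "recall": sum(recall) / n,
        "f1": sum(f1) / n,
    }


if __name__ == "__main__":
    # Toy example with two classes and ten pixels (flattened label maps).
    truth = [0, 0, 0, 0, 1, 1, 1, 1, 1, 1]
    pred = [0, 0, 0, 1, 0, 1, 1, 1, 1, 1]
    for name, value in segmentation_metrics(confusion_matrix(truth, pred, 2)).items():
        print(f"{name}: {value:.4f}")
```

Because frequency weighted IoU weights each class by its pixel share, it can differ noticeably from mean IoU on class-imbalanced data, which is consistent with the gap between the two indices reported for WHDLD.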

A position-aware attention network with progressive detailing for land use semantic segmentation of Remote Sensing images

ACNET: Attention Based Network to Exploit Complementary Features for RGBD Semantic Segmentation

A Stage-Adaptive Selective Network with Position Awareness for Semantic Segmentation of LULC Remote Sensing Images

Multi-Attention-Based Semantic Segmentation Network for Land Cover Remote Sensing Images

RAANet: A Residual ASPP with Attention Framework for Semantic Segmentation of High-Resolution Remote Sensing Images

AANet: Adaptive Attention Networks for Semantic Segmentation of High-Resolution Remote Sensing Imagery

Multiattention Network for Semantic Segmentation of Fine-Resolution Remote Sensing Images

Multi-Attention-Network for Semantic Segmentation of Fine Resolution Remote Sensing Images

Remote sensing image semantic segmentation method based on small target and edge feature enhancement

Semantic Segmentation With Attention Mechanism for Remote Sensing Images

SRMA: a dual-branch parallel multi-scale attention network for remote sensing images sea-land segmentation

Laplacian Filter Embedding Based Attention Network for Land Segmentation from Remote Sensing Images

Semantic segmentation of urban land classes using a multi-scale dataset

High-resolution feature pyramid attention network for high spatial resolution images land-cover classification in arid oasis zones

CTMU-Net: an Improved U-Net for Semantic Segmentation of Remote-Sensing Images Based on the Combined Attention Mechanism

Adaptive Local Cross-Channel Vector Pooling Attention Module for Semantic Segmentation of Remote Sensing Imagery

Category attention guided network for semantic segmentation of Fine-Resolution remote sensing images

Semantic Segmentation of Remote Sensing Image Based on Regional Self-Attention Mechanism

SPANet: Successive Pooling Attention Network for Semantic Segmentation of Remote Sensing Images

Semantic Attention and Scale Complementary Network for Instance Segmentation in Remote Sensing Images

An Attention-Fused Network for Semantic Segmentation of Very-High-Resolution Remote Sensing Imagery