Abstract. Semantic segmentation of high-resolution remote sensing images based on deep learning has become an active research topic and has been widely applied. In current convolutional neural network architectures, extracting target features through multiple convolutional layers tends to lose small-target features and blur the boundaries between ground-object classes. We therefore propose P-Net, a semantic segmentation method for remote sensing images that detects small targets and enhances edge features. The proposed network was based on an encoder-decoder structure. The decoder included the following components: a progressive small target feature enhancement network (IFEN), a boundary refinement module (BRM), and a feature aggregation module (FIAM). First, the dense side-output features of the encoder network were used to learn small-target feature information and target edge features. Second, a pyramid segmentation attention module was introduced to extract fine-grained, multi-scale spatial information; this module enhanced the feature expression of small targets and yielded high-level semantic feature information. The boundary refinement module was designed to refine the low-level spatial feature information extracted by the encoder. Finally, to improve the accuracy of object segmentation boundaries in remote sensing images, skip connections were used to fuse high-level semantic information and low-level spatial information across layers; the features joined by these connections had the same spatial resolution but different semantic information. Six evaluation indices (mean intersection over union, frequency weighted intersection over union, pixel accuracy, F1, recall, and precision) were used to validate the method on two public datasets of high-resolution remote sensing images, the Gaofen Image Dataset (GID) and the Wuhan Dense Labeling Dataset (WHDLD).
On the GID dataset, the six indices reached 78.90%, 78.87%, 87.76%, 87.74%, 87.51%, and 88.04%, respectively; on the WHDLD dataset, they reached 63.21%, 75.20%, 84.67%, 75.79%, 76.56%, and 75.45%, respectively. The results show that the proposed method outperforms DeepLabv3+, U-Net, PSPNet, and DUC_HDC. In particular, it recognizes small-target features better and produces clearer boundaries between object categories.
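
For readers unfamiliar with the six indices named above, the following is a minimal sketch of how they are commonly derived from a per-class confusion matrix. It is an illustration under stated assumptions, not the authors' evaluation code: the function names `confusion_matrix` and `segmentation_metrics` are hypothetical, and precision, recall, and F1 are macro-averaged over classes here, which may differ from the averaging used in the paper.

```python
from typing import Dict, List, Sequence


def confusion_matrix(y_true: Sequence[int], y_pred: Sequence[int],
                     num_classes: int) -> List[List[int]]:
    """Count pixels; rows index the reference class, columns the predicted class."""
    cm = [[0] * num_classes for _ in range(num_classes)]
    for t, p in zip(y_true, y_pred):
        cm[t][p] += 1
    return cm


def _safe_div(a: float, b: float) -> float:
    """Return a / b, or 0.0 when the denominator is zero (class absent)."""
    return a / b if b else 0.0


def segmentation_metrics(cm: List[List[int]]) -> Dict[str, float]:
    """Compute the six indices from a square confusion matrix.

    Assumption: precision, recall, and F1 are macro-averaged over classes.
    """
    n = len(cm)
    total = sum(sum(row) for row in cm)
    tp = [cm[i][i] for i in range(n)]                      # correctly labelled pixels
    ref = [sum(cm[i]) for i in range(n)]                   # pixels per reference class
    pred = [sum(cm[i][j] for i in range(n)) for j in range(n)]  # pixels per predicted class

    iou = [_safe_div(tp[i], ref[i] + pred[i] - tp[i]) for i in range(n)]
    precision = [_safe_div(tp[i], pred[i]) for i in range(n)]
    recall = [_safe_div(tp[i], ref[i]) for i in range(n)]
    f1 = [_safe_div(2 * precision[i] * recall[i], precision[i] + recall[i])
          for i in range(n)]

    return {
        "pixel_accuracy": _safe_div(sum(tp), total),
        "mean_iou": sum(iou) / n,
        # Frequency weighted IoU: each class IoU weighted by its share of reference pixels.
        "fw_iou": sum(_safe_div(ref[i], total) * iou[i] for i in range(n)),
        "precision": sum(precision) / n,
        "recall": sum(recall) / n,
        "f1": sum(f1) / n,
    }


# Toy example with two classes and four pixels (flattened label maps).
cm = confusion_matrix([0, 0, 1, 1], [0, 1, 1, 1], num_classes=2)
print(segmentation_metrics(cm)["pixel_accuracy"])  # 0.75
```

In practice the label maps of all test images would be flattened and accumulated into a single confusion matrix before the indices are computed.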

An improved U-Net method for the semantic segmentation of remote sensing images

Semantic segmentation network of uav image based on improved U-net

Semantic Segmentation of High-Resolution Remote Sensing Images with Improved U-Net Based on Transfer Learning

High-Resolution Remote Sensing Image Semantic Segmentation Method Based on Improved Encoder-Decoder Convolutional Neural Network

CTMU-Net: an Improved U-Net for Semantic Segmentation of Remote-Sensing Images Based on the Combined Attention Mechanism

Remote Sensing Image Semantic Segmentation Method Based on a Deep Convolutional Neural Network and Multiscale Feature Fusion

A deep learning method for optimizing semantic segmentation accuracy of remote sensing images based on improved UNet

Research on Improved U-net Based Remote Sensing Image Segmentation Algorithm

Object-Enhanced Semantic Segmentation Model for High-Resolution Remote Sensing Images

Optimizing Spatial Relationships in GCN to Improve the Classification Accuracy of Remote Sensing Images

UNeXt: An Efficient Network for the Semantic Segmentation of High-Resolution Remote Sensing Images

Remote sensing image semantic segmentation method based on small target and edge feature enhancement

CM-Unet: A Novel Remote Sensing Image Segmentation Method Based on Improved U-Net

Densely Based Multi-Scale and Multi-Modal Fully Convolutional Networks for High-Resolution Remote-Sensing Image Semantic Segmentation

Efficient Multi-scale Network for Semantic Segmentation of fine-Resolution Remotely Sensed Images

Advancing high-resolution remote sensing: a compact and powerful approach to semantic segmentation

Transformer and CNN Hybrid Deep Neural Network for Semantic Segmentation of Very-High-Resolution Remote Sensing Imagery

Improvement of deep learning Method for water body segmentation of remote sensing images based on attention modules

MACU-Net for Semantic Segmentation of Fine-Resolution Remotely Sensed Images

An Attention-Fused Network for Semantic Segmentation of Very-High-Resolution Remote Sensing Imagery

Hybridizing Cross-Level Contextual and Attentive Representations for Remote Sensing Imagery Semantic Segmentation