Abstract. Semantic segmentation of high-resolution remote sensing images based on deep learning has become a hot research topic and has been widely applied. In current convolutional neural network architectures, extracting target features through multiple stacked convolutional layers tends to lose small-target features and blur the boundaries between ground object classes. Therefore, we propose P-Net, a remote sensing image semantic segmentation method that detects small targets and enhances edge features. The proposed network was based on an encoder-decoder structure. The decoder included the following components: a progressive small target feature enhancement network (IFEN), a boundary refinement module (BRM), and a feature aggregation module (FIAM). Firstly, the dense side output features of the encoder network were used to learn small target feature information and target edge features. Secondly, the pyramid segmentation attention module was introduced to effectively extract fine-grained, multi-scale spatial information. This module enhanced the feature expression of small targets and obtained high-level semantic feature information. The boundary refinement module was designed to refine the low-level spatial feature information extracted by the encoder. Finally, to improve the accuracy of object segmentation boundaries in remote sensing images, skip connections were used to fuse high-level semantic information and low-level spatial information across layers. The features joined by these skip connections had the same spatial resolution but different semantic information. Six evaluation indices (mean intersection over union, frequency weighted intersection over union, pixel accuracy, F1, recall, and precision) were used to validate the method on two public datasets of high-resolution remote sensing images: the Gaofen Image Dataset (GID) and the Wuhan Dense Labeling Dataset (WHDLD). 
On the GID dataset, the six indices reached 78.90%, 78.87%, 87.76%, 87.74%, 87.51%, and 88.04%, respectively; on the WHDLD dataset, they reached 63.21%, 75.20%, 84.67%, 75.79%, 76.56%, and 75.45%, respectively. The results show that the proposed method outperforms DeepLabv3+, U-Net, PSPNet, and DUC_HDC. In particular, it recognizes small target features better and produces clearer boundaries between object categories.
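
The six evaluation indices named in the abstract can all be derived from a per-class confusion matrix. The sketch below is a minimal illustration using the standard textbook definitions, not the authors' evaluation code. The function names (`confusion_matrix`, `segmentation_metrics`) are hypothetical, and macro-averaging precision, recall, and F1 over classes is an assumption, since the abstract does not state how these were averaged.

```python
def confusion_matrix(y_true, y_pred, num_classes):
    """cm[t][p] counts pixels whose true class is t and predicted class is p."""
    cm = [[0] * num_classes for _ in range(num_classes)]
    for t, p in zip(y_true, y_pred):
        cm[t][p] += 1
    return cm


def _safe_div(a, b):
    # Return 0.0 for classes that never occur, instead of dividing by zero.
    return a / b if b else 0.0


def segmentation_metrics(cm):
    """Compute the six indices from a square confusion matrix.

    Assumption: precision, recall, and F1 are macro-averaged over classes.
    """
    n = len(cm)
    total = sum(sum(row) for row in cm)
    tp = [cm[i][i] for i in range(n)]                            # correct pixels
    gt = [sum(cm[i]) for i in range(n)]                          # true pixels per class
    pred = [sum(cm[r][i] for r in range(n)) for i in range(n)]   # predicted pixels per class

    iou = [_safe_div(tp[i], gt[i] + pred[i] - tp[i]) for i in range(n)]
    precision = [_safe_div(tp[i], pred[i]) for i in range(n)]
    recall = [_safe_div(tp[i], gt[i]) for i in range(n)]
    f1 = [
        _safe_div(2 * precision[i] * recall[i], precision[i] + recall[i])
        for i in range(n)
    ]

    return {
        "pixel_accuracy": _safe_div(sum(tp), total),
        "miou": sum(iou) / n,
        # FWIoU weights each class IoU by that class's share of true pixels.
        "fwiou": sum(_safe_div(gt[i], total) * iou[i] for i in range(n)),
        "precision": sum(precision) / n,
        "recall": sum(recall) / n,
        "f1": sum(f1) / n,
    }


if __name__ == "__main__":
    # Toy example: four pixels, two classes, flattened label maps.
    y_true = [0, 0, 1, 1]
    y_pred = [0, 1, 1, 1]
    cm = confusion_matrix(y_true, y_pred, num_classes=2)
    for name, value in segmentation_metrics(cm).items():
        print(f"{name}: {value:.4f}")
```

Because frequency weighted intersection over union weights classes by pixel share, it can sit well above mean intersection over union when small, hard classes are rare, which is consistent with the gap between the two indices reported for WHDLD.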

DenseU-Net-Based Semantic Segmentation of Small Objects in Urban Remote Sensing Images

Remote sensing image semantic segmentation method based on small target and edge feature enhancement

Efficient Multi-scale Network for Semantic Segmentation of fine-Resolution Remotely Sensed Images

Object-Based Semi-Supervised Spatial Attention Residual UNet for Urban High-Resolution Remote Sensing Image Classification

Semantic Segmentation of Urban Buildings from VHR Remote Sensing Imagery Using a Deep Convolutional Neural Network

UrbanSegNet: An urban meshes semantic segmentation network using diffusion perceptron and vertex spatial attention

Full Semantic Constructed Network for Urban Use Classification from Very High-Resolution Optical Remote Sensing Imagery

Semantic segmentation of urban land classes using a multi-scale dataset

BFANet: Effective Segmentation Network for Low Altitude High-Resolution Urban Scene Image

Advancing high-resolution remote sensing: a compact and powerful approach to semantic segmentation

Pos-DANet: A dual-branch awareness network for small object segmentation within high-resolution remote sensing images

Unbalanced Class Learning Network With Scale-Adaptive Perception for Complicated Scene in Remote Sensing Images Segmentation

MACU-Net for Semantic Segmentation of Fine-Resolution Remotely Sensed Images

Semantic Segmentation for Urban-Scene Images

CTMU-Net: an Improved U-Net for Semantic Segmentation of Remote-Sensing Images Based on the Combined Attention Mechanism

Semantic segmentation of remote sensing image based on bilateral branch network

Semantic Segmentation of High-Resolution Remote Sensing Images with Improved U-Net Based on Transfer Learning

TARGET-GUIDED LEARNING FOR RARE CLASS SEGMENTATION IN LARGE-SCALE URBAN POINT CLOUDS

A Semantic Segmentation Network for Urban-Scale Building Footprint Extraction Using RGB Satellite Imagery

Fast-SegNet: fast semantic segmentation network for small objects

Object-Enhanced Semantic Segmentation Model for High-Resolution Remote Sensing Images