Abstract: Building contour extraction from high-resolution remote sensing images is a basic task for the rational planning of regional construction. Recently, building segmentation methods based on the U-Net network have become popular because their ‘skip connections’ combine high-level and low-level feature information more effectively, which substantially improves segmentation accuracy. Meanwhile, researchers have demonstrated that introducing an attention mechanism into U-Net can enhance local feature expression and improve the performance of building extraction in remote sensing images. In this paper, we explore the effectiveness of the original attention gate module for the building extraction task and propose a novel Attention Gate Module (AG), obtained by repositioning the ‘Resampler’ in the attention gate with respect to the Sigmoid function. Based on AG, we further propose a novel Attention Gates U network (AGs-Unet), which can automatically learn different forms of building structures in high-resolution remote sensing images and extract building contours efficiently. AGs-Unet integrates attention gates into a single U-Net network: a series of attention gate modules are added to the ‘skip connections’ to suppress irrelevant and noisy feature responses in the input image and to highlight the dominant features of the buildings. AGs-Unet improves the feature selection of the attention map to strengthen feature learning, and it also attends to the feature information of small-scale buildings. We conducted experiments on the WHU building dataset and the INRIA Aerial Image Labeling dataset, comparing the proposed AGs-Unet model with several classic models (FCN8s, SegNet, U-Net, and DANet) and two state-of-the-art models (PISANet and ARC-Net). The extraction accuracy of each model is evaluated using three indexes: overall accuracy, precision, and intersection over union. Experimental results show that the proposed AGs-Unet model effectively improves the quality of building extraction from high-resolution remote sensing images in terms of both prediction performance and result accuracy.
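
As an illustration of the three evaluation indexes named above, the following minimal sketch computes overall accuracy, precision, and intersection over union for a binary building mask. It assumes the standard pixel-wise definitions based on true/false positives and negatives; the abstract does not give the exact formulas used in the paper, and the function names `confusion_counts` and `building_metrics` are hypothetical, introduced only for this example.

```python
def confusion_counts(pred, truth):
    """Count pixel-wise TP, FP, FN, TN for two binary masks (nested lists of 0/1)."""
    tp = fp = fn = tn = 0
    for pred_row, truth_row in zip(pred, truth):
        for p, t in zip(pred_row, truth_row):
            if p and t:
                tp += 1      # building predicted, building in ground truth
            elif p and not t:
                fp += 1      # building predicted, background in ground truth
            elif t:
                fn += 1      # background predicted, building in ground truth
            else:
                tn += 1      # background in both
    return tp, fp, fn, tn


def building_metrics(pred, truth):
    """Return overall accuracy, precision, and IoU (assumed standard definitions)."""
    tp, fp, fn, tn = confusion_counts(pred, truth)
    total = tp + fp + fn + tn
    return {
        # Share of all pixels that are classified correctly.
        "overall_accuracy": (tp + tn) / total if total else 0.0,
        # Share of predicted building pixels that are truly building.
        "precision": tp / (tp + fp) if (tp + fp) else 0.0,
        # Overlap of predicted and true building areas over their union.
        "iou": tp / (tp + fp + fn) if (tp + fp + fn) else 0.0,
    }


if __name__ == "__main__":
    # Toy 2x3 masks, invented for illustration only (not data from the paper).
    predicted = [[1, 1, 0],
                 [0, 1, 0]]
    reference = [[1, 0, 0],
                 [0, 1, 1]]
    print(building_metrics(predicted, reference))
```

On the toy masks, two building pixels are matched, one is a false alarm, and one is missed, so intersection over union is 2 / 4 = 0.5 while overall accuracy is 4 / 6. Intersection over union ignores true-negative background pixels, which is why it is usually the stricter index when buildings cover only a small part of the image.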

CSA-UNet: Channel-Spatial Attention-Based Encoder–Decoder Network for Rural Blue-Roofed Building Extraction from UAV Imagery

CSA-Net: Complex Scenarios Adaptive Network for Building Extraction for Remote Sensing Images

Attention-Gate-Based Encoder–Decoder Network for Automatical Building Extraction

MAD-UNet: A Multi-Region UAV Remote Sensing Network for Rural Building Extraction

A Building Extraction Method for High-Resolution Remote Sensing Images with Multiple Attentions and Parallel Encoders Combining Enhanced Spectral Information

Building Extraction from Very High Resolution Aerial Imagery Using Joint Attention Deep Neural Network

ASCEND-UNet: An Improved UNet Configuration Optimized for Rural Settlements Mapping

CD-TransUNet: A Hybrid Transformer Network for the Change Detection of Urban Buildings Using L-Band SAR Images

EA U2-Net: An Efficient Building Extraction Algorithm Based on Complex Background Information

High-resolution building extraction based on the edge-aware network CEEAU_Net

SDSC-UNet: Dual Skip Connection ViT-Based U-Shaped Model for Building Extraction

SDSNet: Building Extraction in High-Resolution Remote Sensing Images Using a Deep Convolutional Network with Cross-Layer Feature Information Interaction Filtering

Extracting Buildings from Remote Sensing Images Using a Multitask Encoder-Decoder Network with Boundary Refinement

A method for extracting buildings from remote sensing images based on 3DJA-UNet3+

Building Extraction From Very High-Resolution Remote Sensing Images Using Refine-UNet

Enhancing building extraction from remote sensing images through UNet and transfer learning

Asymmetric Network Combining CNN and Transformer for Building Extraction from Remote Sensing Images

RSBuilding: Towards General Remote Sensing Image Building Extraction and Change Detection with Foundation Model

Building Extraction from Unmanned Aerial Vehicle (UAV) Data in a Landslide-Affected Scattered Mountainous Area Based on Res-Unet

AGs-Unet: Building Extraction Model for High Resolution Remote Sensing Images Based on Attention Gates U Network

Building Extraction from Remotely Sensed Images by Integrating Saliency Cue