Abstract: Building extraction from high-resolution images is a fundamental task in remote sensing. It supports natural-disaster monitoring and urban development. Encoder-decoder convolutional neural networks (CNNs) have provided a paradigm for automatic building extraction. However, extracting building information is difficult because of diverse scales, complex backgrounds, and the variety of building structures. Moreover, obtaining accurate boundary information remains challenging because of the various obstructions surrounding buildings. To address these challenges, we propose a dual-branch model. The first branch is the segmentation branch, an encoder-decoder framework (based on the Attention-ResUNet architecture) that combines residual units and an attention network to generate the segmentation mask. The residual units improve the ability to learn deep and complex building features, whereas the attention network focuses on informative spatial information. In addition, a dilated module is positioned at the end of the decoder of Attention-ResUNet to capture multiscale information. The second branch is the edge branch, which consists of Canny edge extraction, a morphological operation, and a squeeze-excitation network, and improves the boundary information. The Canny edge detection method extracts the edges of the buildings, which are further enhanced through the morphological operation. A squeeze-excitation network is then added for fine adjustment of the generated feature maps. Finally, the proposed model integrates the segmentation mask obtained from the segmentation branch with the boundary information generated by the edge branch to produce the refined segmentation mask. Experiments were performed on the Massachusetts building dataset and the WHU-I building dataset. The performance of the proposed model is compared with state-of-the-art models such as SegNet, DeepLabV3Plus, UNet, Attention-UNet, ResUNet, and Attention-ResUNet. The results demonstrate that the proposed approach improves performance on both datasets. Hence, we conclude that the proposed approach has great potential for extracting multiscale information and enhancing the boundary information of buildings.
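The squeeze-excitation step mentioned in the edge branch can be illustrated with a minimal sketch. The abstract does not give the layer sizes, reduction ratio, or weights used in the model, so the function name `squeeze_excite`, the toy feature maps, and the weight matrices `w1` and `w2` below are all hypothetical. The sketch shows only the generic squeeze-and-excitation pattern: global average pooling per channel, a small bottleneck with ReLU and sigmoid, and channel-wise rescaling.

```python
import math


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def squeeze_excite(feature_maps, w1, w2):
    """Recalibrate the channels of `feature_maps` (a list of C maps, each H x W).

    w1: (C/r) x C weights of the reduction layer (hypothetical values).
    w2: C x (C/r) weights of the expansion layer (hypothetical values).
    Biases are omitted for brevity.
    """
    # Squeeze: global average pooling yields one descriptor per channel.
    squeezed = [
        sum(sum(row) for row in fmap) / (len(fmap) * len(fmap[0]))
        for fmap in feature_maps
    ]
    # Excitation: bottleneck layer with ReLU, then expansion layer with sigmoid.
    hidden = [max(0.0, sum(w * s for w, s in zip(row, squeezed))) for row in w1]
    gates = [sigmoid(sum(w * h for w, h in zip(row, hidden))) for row in w2]
    # Scale: multiply every value in a channel by that channel's gate.
    scaled = [
        [[v * g for v in row] for row in fmap]
        for fmap, g in zip(feature_maps, gates)
    ]
    return scaled, gates


# Toy input: two 2 x 2 channels (illustrative values only).
features = [
    [[1.0, 1.0], [1.0, 1.0]],  # channel 0, mean 1.0
    [[0.0, 2.0], [2.0, 4.0]],  # channel 1, mean 2.0
]
w1 = [[0.5, 0.5]]     # 1 x 2 reduction layer (assumed reduction ratio r = 2)
w2 = [[1.0], [-1.0]]  # 2 x 1 expansion layer

scaled, gates = squeeze_excite(features, w1, w2)
```

With these toy weights the first channel receives a gate above 0.5 and the second a gate below 0.5, so the first channel is emphasized relative to the second. In a trained network the weights would be learned, and the gates would reflect which feature maps carry useful boundary information.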
