Abstract: Extracting building footprints from remotely sensed imagery has long been a challenging task and is not yet fully solved. Obstructions from nearby shadows or trees, varying rooftop shapes, omission of small buildings, and varying building scales prevent existing automated models from extracting sharp building boundaries. Different reasons account for these challenges. In convolutional neural network-based methods, the down-sampling operation loses spatial details of the input images, and small buildings are omitted from the high-level features. Occluding trees and shadows cast by adjacent objects may cause errors because semantic information cannot be effectively preserved. Moreover, the insufficient use of multi-scale building features causes blurry edges in the predictions for buildings with complex shapes. To address these challenges, we propose a novel coarse-to-fine boundary refinement network (CBR-Net) that accurately extracts building footprints from remote sensing imagery. Unlike existing semantic segmentation methods that directly generate building predictions at the highest level, we designed a module that progressively refines the building prediction in a coarse-to-fine manner. In this way, the advantages of both the high-level and low-level features can be retained. We also present a novel boundary refinement (BR) module that enhances the ability of the CBR-Net model to perceive and refine building edges. The BR module refines the building prediction by perceiving, for each pixel in a remotely sensed optical image, the direction to the center of the nearest object to which it might belong. The refined results are used as pseudo labels in a self-supervision process that increases model robustness to noisy labels or obstructions. Experimental results on three public building datasets (the WHU building dataset, the Massachusetts building dataset, and the Inria aerial image dataset) demonstrate the effectiveness of the proposed method. In evaluation tests, CBR-Net outperformed other state-of-the-art algorithms on the three datasets by maintaining both the continuous entities and accurate boundaries of buildings. The source code of the proposed CBR-Net is available at https://github.com/HaonanGuo/CBRNet.
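
The abstract describes the BR module as perceiving, for each pixel, the direction to the center of the nearest object. As a rough illustration only, the sketch below computes such a direction map with NumPy from a list of known object centers. The function name `direction_to_nearest_center`, the (row, col) center format, and the choice of unit vectors are assumptions made for this example; this is not the authors' implementation, and the paper's own definition of the direction map may differ.

```python
import numpy as np


def direction_to_nearest_center(height, width, centers):
    """Return an (H, W, 2) array of unit vectors (d_row, d_col).

    Each vector points from a pixel to the nearest center in `centers`,
    a sequence of (row, col) pairs. Hypothetical helper for illustration.
    """
    centers = np.asarray(centers, dtype=float)                # (K, 2)
    rows, cols = np.mgrid[0:height, 0:width]
    pixels = np.stack([rows, cols], axis=-1).astype(float)    # (H, W, 2)

    # Offset from every pixel to every center: shape (H, W, K, 2).
    offsets = centers[None, None, :, :] - pixels[:, :, None, :]
    dists = np.linalg.norm(offsets, axis=-1)                  # (H, W, K)

    # Index of the closest center for each pixel: shape (H, W).
    nearest = dists.argmin(axis=-1)

    # Select the offset that belongs to the closest center.
    chosen = np.take_along_axis(
        offsets, nearest[:, :, None, None], axis=2
    )[:, :, 0, :]                                             # (H, W, 2)

    # Normalize to unit length; a pixel lying on a center keeps (0, 0).
    norm = np.linalg.norm(chosen, axis=-1, keepdims=True)
    return np.divide(chosen, norm, out=np.zeros_like(chosen), where=norm > 0)


if __name__ == "__main__":
    field = direction_to_nearest_center(5, 5, [(2, 2)])
    print(field[0, 2])  # vector from the top-middle pixel toward the center
```

In a learning setting, a map of this kind would more plausibly serve as a training target derived from the label mask than be computed from the image itself; that interpretation is likewise an assumption, not something stated in the abstract.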

Improved Building Extraction from Remotely Sensed Images by Integration of Encode–Decoder and Edge Enhancement Models

Extracting Buildings from Remote Sensing Images Using a Multitask Encoder-Decoder Network with Boundary Refinement

Attention-Gate-Based Encoder–Decoder Network for Automatical Building Extraction

BRRNet: A Fully Convolutional Neural Network for Automatic Building Extraction From High-Resolution Remote Sensing Images

Building Extraction From High Spatial Resolution Remote Sensing Images of Complex Scenes by Combining Region-Line Feature Fusion and OCNN

SSDBN: A Single-Side Dual-Branch Network with Encoder–Decoder for Building Extraction

Asymmetric Network Combining CNN and Transformer for Building Extraction from Remote Sensing Images

HCRB-MSAN: Horizontally Connected Residual Blocks-Based Multiscale Attention Network for Semantic Segmentation of Buildings in HSR Remote Sensing Images

Performance Analysis of Various EfficientNet Based U-Net++ Architecture for Automatic Building Extraction from High Resolution Satellite Images

HA U-Net: Improved Model for Building Extraction From High Resolution Remote Sensing Imagery

A Multi-Scale Edge Constraint Network for the Fine Extraction of Buildings from Remote Sensing Images

Multi-Scale Feature Fusion Attention Network for Building Extraction in Remote Sensing Images

Building Extraction from Very High Resolution Aerial Imagery Using Joint Attention Deep Neural Network

A coarse-to-fine boundary refinement network for building footprint extraction from remote sensing imagery

EU-Net: an Efficient Fully Convolutional Network for Building Extraction from Optical Remote Sensing Images.

BOMSC-Net: Boundary Optimization and Multi-Scale Context Awareness Based Building Extraction From High-Resolution Remote Sensing Imagery

ACMFNet: Attention-Based Cross-Modal Fusion Network for Building Extraction of Remote Sensing Images

Building Extraction From Very High-Resolution Remote Sensing Images Using Refine-UNet

A Building Extraction Method for High-Resolution Remote Sensing Images with Multiple Attentions and Parallel Encoders Combining Enhanced Spectral Information

Effective Building Extraction From High-Resolution Remote Sensing Images With Multitask Driven Deep Neural Network

Hierarchical Disentangling Network for Building Extraction from Very High Resolution Optical Remote Sensing Imagery