Abstract: Buildings represent pivotal entities in remote sensing imagery for various applications such as urban planning and land resource management. Predominantly, methods for building footprint extraction in the literature focus on optical imagery with visual attributes that faithfully mirror the physical world. Nevertheless, the acquisition of high-quality optical images presents formidable challenges due to the susceptibility to illumination conditions and scene visibility. In contrast, synthetic aperture radar (SAR) images can be acquired in all-weather and all-time situations, unburdened by the aforementioned constraints. However, the coherent imaging mechanism engenders intricate complexities for building footprint extraction from SAR images. To address this issue, this article introduces the boundary information distillation network (BIDNet) to improve the prediction accuracy in SAR images by distilling knowledge from optical images. The proposed approach adopts a teacher–student framework, featuring two customized components: the explicit distillation module (EDM) and the latent distillation module (LDM). Unlike the conventional practice of directly aligning feature maps, BIDNet focuses on leveraging the more conspicuous boundary information in optical images. The EDM operates by simultaneously yielding a boundary map to emphasize the boundary area and assimilating the explicit low-level features of the two modalities. The LDM represents the structural attributes within the high-level latent feature space and aligns the representations of the two modalities. Within this module, intrinsic self-correlations among features originating from boundary regions are encoded, and so are the cross-correlations established between features from boundary regions and other areas.
The two modules also serve as the conduit for knowledge distillation (KD) from the teacher network to the student network, enabling the utilization of optical imagery for enhancing the building footprint extraction in SAR imagery. Extensive experiments demonstrate that our BIDNet achieves state-of-the-art performance on the Multi-Sensor All Weather Mapping (MSAW) dataset, outperforming the strong baseline by 4.3–7.2 points in F1-score and 4.9–8.0 points in IoU. The source code and trained models will be publicly available at https://github.com/wangyx-chn/BIDNet.
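
To make the correlation-alignment idea behind the LDM more concrete, the following is a minimal sketch and not the authors' implementation. It assumes that per-pixel features are flattened into a list of vectors, that a single boundary mask is shared by both modalities, that correlations are measured with cosine similarity, and that the teacher and student correlation maps are matched with a mean squared error. All function names here are hypothetical; the abstract does not specify any of these choices.

```python
import math


def cosine(u, v):
    """Cosine similarity between two feature vectors (assumed correlation measure)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v + 1e-8)


def correlation(rows, cols):
    """Pairwise similarity matrix between two sets of feature vectors."""
    return [[cosine(r, c) for c in cols] for r in rows]


def boundary_correlations(features, boundary_mask):
    """Return (self-correlation among boundary features,
    cross-correlation between boundary and non-boundary features)."""
    boundary = [f for f, m in zip(features, boundary_mask) if m]
    other = [f for f, m in zip(features, boundary_mask) if not m]
    return correlation(boundary, boundary), correlation(boundary, other)


def mse(a, b):
    """Mean squared error between two equally shaped matrices."""
    diffs = [(x - y) ** 2 for ra, rb in zip(a, b) for x, y in zip(ra, rb)]
    return sum(diffs) / len(diffs) if diffs else 0.0


def latent_distillation_loss(student_feats, teacher_feats, boundary_mask):
    """Hypothetical loss: align the student's (SAR) correlation structure
    with the teacher's (optical) one, instead of aligning raw feature maps."""
    s_self, s_cross = boundary_correlations(student_feats, boundary_mask)
    t_self, t_cross = boundary_correlations(teacher_feats, boundary_mask)
    return mse(s_self, t_self) + mse(s_cross, t_cross)


# Toy example: three "pixels" with 2-D features; the first two lie on a boundary.
mask = [True, True, False]
teacher = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
student = [[1.0, 0.0], [1.0, 0.0], [1.0, 1.0]]
loss = latent_distillation_loss(student, teacher, mask)
```

Because only relational structure is compared in this sketch, a student whose features differ from the teacher's by a positive scale factor incurs almost no loss, which is one plausible motivation for matching correlations rather than raw features across two very different modalities.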

Prompt-Driven Building Footprint Extraction in Aerial Images with Offset-Building Model

Extracting polygonal footprints in off-nadir images with Segment Anything Model

Learning to Extract Building Footprints from Off-Nadir Aerial Images

Extracting Rectified Building Footprints from Traditional Orthophotos: A New Workflow

3-D Building Instance Extraction From High-Resolution Remote Sensing Images and DSM With an End-to-End Deep Neural Network

POI Detection of High-Rise Buildings Using Remote Sensing Images: A Semantic Segmentation Method Based on Multitask Attention Res-U-Net

BOMSC-Net: Boundary Optimization and Multi-Scale Context Awareness Based Building Extraction From High-Resolution Remote Sensing Imagery

An Efficient Instance Segmentation Framework Using Segmentation Foundation Models with Oriented Bounding Box Prompts

A Lightweight Building Extraction Approach for Contour Recovery in Complex Urban Environments

Boundary Regularized Building Footprint Extraction From Satellite Images Using Deep Neural Network

A Boundary Regulated Network for Accurate Roof Segmentation and Outline Extraction

Extracting Building Footprints in SAR Images via Distilling Boundary Information From Optical Images

LOD2-Level+ Low-Rise Building Model Extraction Method for Oblique Photography Data Using U-NET and a Multi-Decision RANSAC Segmentation Algorithm

A Single Data Extraction Algorithm for Oblique Photographic Data Based on the U-Net

Boundary Shape-Preserving Model for Building Mapping From High-Resolution Remote Sensing Images

Interactive Occlusion Boundary Estimation through Exploitation of Synthetic Data

Automatic Building Roof Plane Extraction in Urban Environments for 3D City Modelling Using Remote Sensing Data

Multi-scale attention integrated hierarchical networks for high-resolution building footprint extraction

OEC-RNN: Object-Oriented Delineation of Rooftops With Edges and Corners Using the Recurrent Neural Network From the Aerial Images