Abstract: As a fundamental task for geographical information updating, 3D city modeling, and other critical applications, the automatic extraction of building footprints from high-resolution remote sensing images has been substantially explored and has received increasing attention in recent years. Among the different types of building extraction methods, polygonal segmentation methods produce vector building polygons, a more realistic format than the outputs of pixel-wise semantic labeling and contour-based methods. However, existing polygonal building segmentation methods usually either require a perfect segmentation map and a complex post-processing procedure to guarantee polygonization quality, or produce inaccurate vertex predictions that suffer from wrong vertex order, self-intersections, a fixed number of vertices, etc. In our previous work, we proposed a method for polygonal building segmentation from remote sensing images that addresses these limitations. In this paper, we propose PolyCity, which extends and improves our previous work in terms of application scenario, methodology design, and experimental results. PolyCity contains the following three components: (1) a pixel-wise multi-task network for learning semantic and geometric information via three tasks, i.e., building segmentation, boundary prediction, and edge orientation prediction; (2) a simple but effective vertex selection module (VSM), which bridges the gap between pixel-wise and graph-based models by transforming the segmentation map into valid polygon vertices; (3) a graph-based vertex refinement network (VRN) that automatically adjusts the coordinates of the VSM-generated polygon vertices, producing the final building polygons with more precise vertices. Results on three large-scale building extraction datasets demonstrate that PolyCity generates vector building footprints with more accurate vertices, edges, shapes, etc., achieving significant vertex score improvements while maintaining high segmentation and boundary scores compared with current state-of-the-art methods. The code of PolyCity will be released at https://github.com/liweijia/polycity.
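
To make the idea of turning a segmentation map into polygon vertices concrete, the following is a minimal sketch of one generic way to reduce a closed pixel contour to a few candidate vertices. It uses Douglas-Peucker simplification, which is swapped in here purely for illustration and is not claimed to be the VSM described in the abstract. The function names (`point_segment_distance`, `simplify_open`, `select_vertices`), the tolerance `tol`, and the toy contour are all assumptions.

```python
from math import hypot


def point_segment_distance(p, a, b):
    """Euclidean distance from point p to the segment a-b."""
    (px, py), (ax, ay), (bx, by) = p, a, b
    dx, dy = bx - ax, by - ay
    seg_len_sq = dx * dx + dy * dy
    if seg_len_sq == 0:  # degenerate segment: a and b coincide
        return hypot(px - ax, py - ay)
    # Projection parameter of p onto a-b, clamped to the segment.
    t = max(0.0, min(1.0, ((px - ax) * dx + (py - ay) * dy) / seg_len_sq))
    return hypot(px - (ax + t * dx), py - (ay + t * dy))


def simplify_open(points, tol):
    """Douglas-Peucker on an open polyline; both end points are kept."""
    if len(points) < 3:
        return list(points)
    dists = [point_segment_distance(p, points[0], points[-1])
             for p in points[1:-1]]
    idx = max(range(len(dists)), key=dists.__getitem__) + 1
    if dists[idx - 1] <= tol:
        return [points[0], points[-1]]
    left = simplify_open(points[:idx + 1], tol)
    right = simplify_open(points[idx:], tol)
    return left[:-1] + right  # drop the shared split point once


def select_vertices(contour, tol=1.0):
    """Reduce a closed contour (end point not repeated) to candidate vertices.

    Hypothetical helper: the ring is split at the point farthest from
    contour[0], and each half is simplified as an open polyline.
    """
    x0, y0 = contour[0]
    far = max(range(len(contour)),
              key=lambda i: hypot(contour[i][0] - x0, contour[i][1] - y0))
    first = simplify_open(contour[:far + 1], tol)
    second = simplify_open(contour[far:] + [contour[0]], tol)
    return first[:-1] + second[:-1]


# Toy example: the pixel boundary of a 4 x 3 axis-aligned rectangle.
rect = [(0, 0), (1, 0), (2, 0), (3, 0), (4, 0), (4, 1), (4, 2),
        (4, 3), (3, 3), (2, 3), (1, 3), (0, 3), (0, 2), (0, 1)]
corners = select_vertices(rect, tol=1.0)
print(corners)  # [(0, 0), (4, 0), (4, 3), (0, 3)]
```

A purely geometric rule such as this one depends on a hand-set tolerance and on a clean contour, which is the kind of sensitivity to segmentation quality and post-processing that the abstract attributes to existing pipelines.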

Joint Semantic-geometric Learning for Polygonal Building Segmentation

Joint semantic–geometric learning for polygonal building segmentation from high-resolution remote sensing images

Polygonal Building Segmentation by Frame Field Learning

An end-to-end shape modeling framework for vectorized building outline generation from aerial images

Iterative Polygon Deformation for Building Extraction

3D Reconstruction and Semantic Segmentation Method Combining PointNet and 3D-Lmnet from Single Image

Robust Extraction of Vectorized Buildings Via Bidirectional Tracing of Keypoints from Remotely Sensed Imagery

From lines to Polygons: Polygonal building contour extraction from High-Resolution remote sensing imagery

Enhancing Polygonal Building Segmentation via Oriented Corners

A Lightweight Building Extraction Approach for Contour Recovery in Complex Urban Environments

Polygonizer: An auto-regressive building delineator

Progressive fusion learning: A multimodal joint segmentation framework for building extraction from optical and SAR images

Adaptive Polygon Generation Algorithm for Automatic Building Extraction

GeoSegNet: Point Cloud Semantic Segmentation via Geometric Encoder-Decoder Modeling

Semantic Segmentation Based Building Extraction Method Using Multi-source GIS Map Datasets and Satellite Imagery

Semantic Segmentation-Based Building Footprint Extraction Using Very High-Resolution Satellite Images and Multi-Source GIS Data

Machine-learned Regularization and Polygonization of Building Segmentation Masks

Joint Semantic Segmentation using representations of LiDAR point clouds and camera images

PolyBuilding: Polygon transformer for building extraction

Segment Any Building