Asymmetric Cascade Fusion Network for Building Extraction

Sixian Chan,Yuan Wang,Yanjing Lei,Xu Cheng,Zhaomin Chen,Wei Wu
DOI: https://doi.org/10.1109/tgrs.2023.3306018
IF: 8.2
2023-09-13
IEEE Transactions on Geoscience and Remote Sensing
Abstract:The U-Net-like model has been widely studied in the field of building extraction. However, most of these models are based on locally sensed convolutional neural networks (CNNs) designed with symmetric structure and single feature processing, which cannot accurately identify buildings with different sizes, shapes, and colors in remote sensing images. To overcome these problems, we propose the asymmetric cascade fusion network (ACFN), based on the vision transformer (ViT), to design a novel asymmetric architecture to recognize buildings of different sizes and shapes by processing multigranularity features by different means. First, the asymmetric architecture obtains multigranularity features with global contextual information by embedding different types of attention in encoder–decoders of different sizes. This architecture can identify densely distributed and occluded buildings by semantic reasoning in remote sensing images with complex information. Second, we design a multibranch weighted pyramid pooling module (MWPPM), which sets different branch weights to offset the background noise introduced in introducing global contextual information. Our ACFN significantly improves the Beijing buildings, ISPRS-Vaihingen, and LoveDA datasets.
imaging science & photographic technology,remote sensing,engineering, electrical & electronic,geochemistry & geophysics
What problem does this paper attempt to address?