Progressive fusion learning: A multimodal joint segmentation framework for building extraction from optical and SAR images

Xue Li, Guo Zhang, Hao Cui, Shasha Hou, Yujia Chen, Zhijiang Li, Haifeng Li, Huabin Wang
DOI: https://doi.org/10.1016/j.isprsjprs.2022.11.015
IF: 12.7
2022-12-04
ISPRS Journal of Photogrammetry and Remote Sensing
Abstract: Automatic, high-precision extraction of buildings from remote sensing images is important for a wide range of applications. Optical and synthetic aperture radar (SAR) images are typical multimodal remote sensing data acquired with different imaging mechanisms. To bridge the large gap between them and achieve high-precision joint semantic segmentation, this study proposes a progressive fusion learning framework. The framework explicitly extracts the shared features (that is, modal invariants) of the multimodal images as an information medium and fuses information through multistage learning. Based on this framework, we design the multistage multimodal fusion network (MMFNet), which uses phase as a modal invariant to combine optical and SAR images for high-precision building extraction. We conducted experiments on the Multi-Sensor All-Weather Mapping aerial dataset and the WHU-OPT-SAR_WuHan satellite dataset. The results show that MMFNet extracts buildings effectively and recovers the edge details of buildings more accurately, with an improvement of 0.2% to 9.5% over other multimodal joint segmentation methods.
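The abstract does not say how MMFNet computes phase. As a rough illustration of why phase can serve as a modality-invariant cue, the following minimal sketch assumes a Fourier-transform notion of phase and uses a random array as a hypothetical stand-in for an image patch. It shows only that the phase-only spectrum is unchanged when pixel amplitudes are rescaled; it is not the authors' implementation, and the function name `phase_only` is invented for this example.

```python
import numpy as np

def phase_only(image, eps=1e-12):
    """Return the unit-magnitude Fourier spectrum (phase only) of a 2-D array.

    Assumption: "phase" is taken here to mean Fourier phase; the abstract
    does not specify the definition used by MMFNet.
    """
    spectrum = np.fft.fft2(image)
    # Divide out the magnitude so only the phase term exp(i * angle) remains.
    return spectrum / (np.abs(spectrum) + eps)

rng = np.random.default_rng(0)
patch = rng.random((8, 8))   # hypothetical stand-in for an image patch
rescaled = 3.0 * patch       # same structure, different amplitude response

# A positive amplitude scaling changes the spectrum magnitude but not its phase.
same_phase = np.allclose(phase_only(patch), phase_only(rescaled), atol=1e-6)
```

Real optical and SAR images differ by far more than a global gain, so this demonstrates only the simplest form of amplitude invariance that makes phase a plausible shared representation.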
imaging science & photographic technology; remote sensing; geography, physical; geosciences, multidisciplinary
What problem does this paper attempt to address?