Elevation Information-Guided Multimodal Fusion Robust Framework for Remote Sensing Image Segmentation
Junyu Fan, Jinjiang Li, Zhen Hua, Fan Zhang, Caiming Zhang
DOI: https://doi.org/10.1109/lgrs.2024.3350593
IF: 5.343
2024-02-02
IEEE Geoscience and Remote Sensing Letters
Abstract: Remote sensing image segmentation still faces challenges such as variations in illumination, shadows, and occlusions. In addition, different types of terrain features can resemble one another and be confused. In this letter, we explore how information exchange between multiple modalities can reduce the impact of these interfering factors. To fully exploit the complementary information between modalities, we establish an information exchange mechanism between optical image (visible light + infrared) features and digital surface model (DSM) features. This allows the two sets of features to interact and be expressed in a shared feature space, which facilitates the acquisition of complementary information from each modality. Furthermore, a transformer-based multimodal fusion encoder and decoder integrate the optical features and DSM features, enabling the learning of high-level semantic representations in different dimensions. Extensive subjective and objective comparative experiments, as well as ablation experiments, are conducted on the ISPRS Vaihingen and Potsdam datasets to evaluate the proposed method. The mean intersection over union (mIoU) on the Vaihingen and Potsdam datasets reached 85.06% and 87.6%, respectively, while the overall accuracy (OA) reached 92.01% and 91.92%, respectively. The source code will be available at https://github.com/JunyuFan/MIEFNet.
imaging science & photographic technology; remote sensing; engineering, electrical & electronic; geochemistry & geophysics
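The abstract reports results as mIoU and OA. As a reading aid, the sketch below shows how these two metrics are conventionally computed from a confusion matrix over per-pixel class labels. It is a minimal illustration, not the authors' evaluation code: the function names (`confusion_matrix`, `overall_accuracy`, `mean_iou`) and the toy labels are hypothetical, and the letter's exact protocol (for example, which classes are included in the average) is not stated in the abstract and is not assumed here.

```python
from typing import List, Sequence


def confusion_matrix(y_true: Sequence[int], y_pred: Sequence[int],
                     num_classes: int) -> List[List[int]]:
    """Build a num_classes x num_classes matrix; rows = ground truth, columns = prediction."""
    cm = [[0] * num_classes for _ in range(num_classes)]
    for t, p in zip(y_true, y_pred):
        cm[t][p] += 1
    return cm


def overall_accuracy(cm: List[List[int]]) -> float:
    """OA: correctly labelled pixels divided by all pixels."""
    total = sum(sum(row) for row in cm)
    correct = sum(cm[i][i] for i in range(len(cm)))
    return correct / total if total else 0.0


def mean_iou(cm: List[List[int]]) -> float:
    """mIoU: average over classes of TP / (TP + FP + FN).

    Classes that never occur in either the ground truth or the prediction
    are skipped (an assumption of this sketch).
    """
    n = len(cm)
    ious = []
    for c in range(n):
        tp = cm[c][c]
        fp = sum(cm[r][c] for r in range(n)) - tp  # predicted c, but truth differs
        fn = sum(cm[c]) - tp                       # truth is c, but predicted otherwise
        denom = tp + fp + fn
        if denom > 0:
            ious.append(tp / denom)
    return sum(ious) / len(ious) if ious else 0.0


# Toy example: six "pixels" and three classes (made-up labels, not dataset values).
y_true = [0, 0, 1, 1, 2, 2]
y_pred = [0, 1, 1, 1, 2, 0]
cm = confusion_matrix(y_true, y_pred, num_classes=3)
print(f"OA   = {overall_accuracy(cm):.4f}")
print(f"mIoU = {mean_iou(cm):.4f}")
```

In practice the label sequences would be the flattened ground-truth and predicted segmentation maps. Accumulating one confusion matrix over the whole test set before computing the metrics avoids averaging per-image scores.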