OPT-SAR-MS2Net: A Multi-Source Multi-Scale Siamese Network for Land Object Classification Using Remote Sensing Images

Wei Hu, Xinhui Wang, Feng Zhan, Lu Cao, Yong Liu, Weili Yang, Mingjiang Ji, Ling Meng, Pengyu Guo, Zhi Yang, Yuhang Liu
DOI: https://doi.org/10.3390/rs16111850
IF: 5
2024-05-23
Remote Sensing
Abstract: The utilization of optical and synthetic aperture radar (SAR) multi-source data to obtain better land classification results has received increasing research attention. However, there is a large property and distributional difference between optical and SAR data, resulting in an enormous challenge to fuse the inherent correlation information to better characterize land features. Additionally, scale differences in various features in remote sensing images also influence the classification results. To this end, an optical and SAR Siamese semantic segmentation network, OPT-SAR-MS2Net, is proposed. This network can intelligently learn effective multi-source features and realize end-to-end interpretation of multi-source data. Firstly, the Siamese network is used to extract features from optical and SAR images in different channels. In order to fuse the complementary information, the multi-source feature fusion module fuses the cross-modal heterogeneous remote sensing information from both high and low levels. To adapt to the multi-scale features of the land object, the multi-scale feature-sensing module generates multiple information perception fields. This enhances the network's capability to learn contextual information. The experimental results obtained using WHU-OPT-SAR demonstrate that our method outperforms the state of the art, with an mIoU of 45.2% and an OA of 84.3%. These values are 2.3% and 2.6% better than those achieved by the most recent method, MCANet, respectively.
environmental sciences; imaging science & photographic technology; remote sensing; geosciences, multidisciplinary
What problem does this paper attempt to address?
The paper aims to improve the accuracy of land object classification by combining optical and synthetic aperture radar (SAR) multi-source data. It focuses on three points:

1. **Challenges in multi-source data fusion**: Optical and SAR data differ greatly in their properties and distributions, which makes it difficult to fuse them and extract useful features. The paper proposes a method for fusing these multi-source data so that land features are better represented.
2. **Multi-scale feature processing**: Land objects appear at different scales in remote sensing images, and these scale differences also affect the classification results. The paper proposes a multi-scale feature perception module that adapts to objects at different scales and enhances the network's ability to learn contextual information.
3. **End-to-end multi-source data interpretation**: The paper proposes an end-to-end network structure that learns effective multi-source features and interprets multi-source data directly.

To address these challenges, the paper proposes a multi-source, multi-scale Siamese network named OPT-SAR-MS2Net. The network works as follows:

- **Dual-branch Siamese network**: Features are extracted from the optical and SAR images in separate branches and then fused by the multi-source feature fusion module (OPT-SAR-MFF), which reduces redundant information and enhances complementary information.
- **Multi-scale information perception module (OPT-SAR-MIP)**: Multiple information perception fields are generated to adapt to objects at different scales and to enhance the network's ability to learn contextual information.
- **End-to-end architecture**: The network is trained end to end, which avoids the complexity of multi-stage training strategies.
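The data flow described above can be sketched in PyTorch. This is only an illustration under stated assumptions, not the authors' implementation: the class name `TwoBranchSegNet`, the layer widths, the default channel and class counts, the concatenation-plus-1x1-convolution fusion, and the parallel dilated convolutions are hypothetical stand-ins for the OPT-SAR-MFF and OPT-SAR-MIP modules, whose internals are not given in this summary.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


def conv_block(in_ch, out_ch, stride=1):
    """3x3 convolution followed by batch norm and ReLU."""
    return nn.Sequential(
        nn.Conv2d(in_ch, out_ch, kernel_size=3, stride=stride, padding=1, bias=False),
        nn.BatchNorm2d(out_ch),
        nn.ReLU(inplace=True),
    )


class TwoBranchSegNet(nn.Module):
    """Illustrative two-branch optical/SAR segmentation network (hypothetical design)."""

    def __init__(self, opt_ch=4, sar_ch=1, width=16, num_classes=7, dilations=(1, 2, 4)):
        super().__init__()
        # One encoder per modality, each with a low-level and a high-level stage.
        # Channel and class counts are assumed defaults, not values from the summary.
        self.opt_low = conv_block(opt_ch, width)
        self.opt_high = conv_block(width, 2 * width, stride=2)
        self.sar_low = conv_block(sar_ch, width)
        self.sar_high = conv_block(width, 2 * width, stride=2)
        # Fusion stand-in: concatenate both modalities, then mix with a 1x1 conv.
        self.fuse_low = nn.Conv2d(2 * width, width, kernel_size=1)
        self.fuse_high = nn.Conv2d(4 * width, 2 * width, kernel_size=1)
        # Multi-scale stand-in: parallel dilated convs with different receptive fields.
        self.scales = nn.ModuleList(
            [nn.Conv2d(2 * width, width, kernel_size=3, padding=d, dilation=d) for d in dilations]
        )
        self.head = nn.Conv2d(len(dilations) * width + width, num_classes, kernel_size=1)

    def forward(self, opt, sar):
        o_low, s_low = self.opt_low(opt), self.sar_low(sar)
        o_high, s_high = self.opt_high(o_low), self.sar_high(s_low)
        # Fuse the two modalities at both the low and the high feature level.
        low = self.fuse_low(torch.cat([o_low, s_low], dim=1))
        high = self.fuse_high(torch.cat([o_high, s_high], dim=1))
        # Gather context at several scales, then restore the input resolution.
        multi = torch.cat([branch(high) for branch in self.scales], dim=1)
        multi = F.interpolate(multi, size=low.shape[-2:], mode="bilinear", align_corners=False)
        return self.head(torch.cat([multi, low], dim=1))  # per-pixel class logits


model = TwoBranchSegNet().eval()
with torch.no_grad():
    logits = model(torch.randn(2, 4, 32, 32), torch.randn(2, 1, 32, 32))
```

Because both inputs go through a single forward pass that ends in per-pixel logits, one segmentation loss can train the whole model at once, which is the sense in which such a design is end to end.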
Experimental results on the WHU-OPT-SAR dataset show that the method outperforms existing methods, reaching an mIoU of 45.2% and an OA of 84.3%. These values are 2.3% and 2.6% higher, respectively, than those of MCANet, the most recent method compared.
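For readers unfamiliar with the two metrics, the sketch below computes overall accuracy (OA) and mean intersection over union (mIoU) from a confusion matrix using their standard definitions. It assumes flattened integer labels and skips classes that never occur; the paper's exact evaluation protocol is not described in this summary, and the function names are illustrative.

```python
def confusion_matrix(y_true, y_pred, num_classes):
    """Count pixels: rows are true classes, columns are predicted classes."""
    cm = [[0] * num_classes for _ in range(num_classes)]
    for t, p in zip(y_true, y_pred):
        cm[t][p] += 1
    return cm


def overall_accuracy(cm):
    """Fraction of all pixels whose predicted class equals the true class."""
    total = sum(sum(row) for row in cm)
    correct = sum(cm[i][i] for i in range(len(cm)))
    return correct / total


def mean_iou(cm):
    """Average of per-class TP / (TP + FP + FN), skipping absent classes."""
    ious = []
    for c in range(len(cm)):
        tp = cm[c][c]
        fn = sum(cm[c]) - tp                  # true class c, predicted otherwise
        fp = sum(row[c] for row in cm) - tp   # predicted c, true class otherwise
        denom = tp + fp + fn
        if denom > 0:
            ious.append(tp / denom)
    return sum(ious) / len(ious)


# Toy example with three classes and six pixels.
y_true = [0, 0, 1, 1, 2, 2]
y_pred = [0, 1, 1, 1, 2, 0]
cm = confusion_matrix(y_true, y_pred, num_classes=3)
oa = overall_accuracy(cm)   # 4 of 6 pixels are correct
miou = mean_iou(cm)         # per-class IoUs are 1/3, 2/3 and 1/2
```

OA is dominated by the most frequent classes, whereas mIoU weights every class equally, which is one reason the two reported values can differ widely on a dataset with imbalanced classes.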