Unsupervised Multimodal Change Detection Based on Structural Relationship Graph Representation Learning

Hongruixuan Chen,Naoto Yokoya,Chen Wu,Bo Du
DOI: https://doi.org/10.1109/TGRS.2022.3229027
2022-10-03
Abstract:Unsupervised multimodal change detection is a practical and challenging topic that can play an important role in time-sensitive emergency applications. To address the challenge that multimodal remote sensing images cannot be directly compared due to their modal heterogeneity, we take advantage of two types of modality-independent structural relationships in multimodal images. In particular, we present a structural relationship graph representation learning framework for measuring the similarity of the two structural relationships. Firstly, structural graphs are generated from preprocessed multimodal image pairs by means of an object-based image analysis approach. Then, a structural relationship graph convolutional autoencoder (SR-GCAE) is proposed to learn robust and representative features from graphs. Two loss functions aiming at reconstructing vertex information and edge information are presented to make the learned representations applicable for structural relationship similarity measurement. Subsequently, the similarity levels of two structural relationships are calculated from learned graph representations and two difference images are generated based on the similarity levels. After obtaining the difference images, an adaptive fusion strategy is presented to fuse the two difference images. Finally, a morphological filtering-based postprocessing approach is employed to refine the detection results. Experimental results on five datasets with different modal combinations demonstrate the effectiveness of the proposed method.
Computer Vision and Pattern Recognition,Image and Video Processing,Signal Processing
What problem does this paper attempt to address?
The problem that this paper attempts to solve is to perform unsupervised change detection in multi - modal remote sensing images. Due to the modal heterogeneity between multi - modal images (such as optical images and SAR images), it is very difficult to directly compare these images to detect changes. The paper proposes a framework based on structural relationship graph representation learning, aiming to overcome this challenge. Specifically, this method utilizes two modality - independent structural relationships, represents the pre - processed multi - modal image pairs by generating structural graphs, and proposes a Structural Relationship Graph Convolutional Auto - Encoder (SR - GCAE) to learn robust and representative features from these graphs. These features are used to measure the similarity of the two structural relationships, thereby generating a difference image, and finally refining the change detection results through a fusion strategy and morphological filtering post - processing. The main contributions of the paper include: 1. Simultaneously exploring local and non - local structural relationships for unsupervised multi - modal change detection. 2. Designing a graph representation learning framework for the first time for unsupervised multi - modal change detection. The proposed network can learn robust high - level graph representations for measuring the similarity levels of local and non - local structural relationships through two reconstruction optimization objectives as loss functions. 3. Proposing an adaptive fusion strategy based on the discrimination of change intensity in the difference image, which better highlights the changed pixels and suppresses the unchanged pixels. 4. On change detection datasets with five different modality combinations, the proposed method outperforms existing methods, showing its superiority. Through these innovations, the paper provides a new method to solve the change detection problem in multi - modal remote sensing images, and has important practical significance especially in time - sensitive emergency applications.