Multi-granularity Siamese Transformer-Based Change Detection in Remote Sensing Imagery

Lei Song,Min Xia,Yao Xu,Liguo Weng,Kai Hu,Haifeng Lin,Ming Qian
DOI: https://doi.org/10.1016/j.engappai.2024.108960
IF: 8
2024-01-01
Engineering Applications of Artificial Intelligence
Abstract:In recent years, Convolutional Neural Networks (CNNs) have promoted the prosperity of Change Detection (CD). However, due to the intrinsic property of convolution kernel, this method cannot effectively model the long-distance dependency. The emergence of Vision Transformer (ViT) brings a new way to solve the problem. Based on ViT architecture, a novel Multi-Granularity remote sensing image Change Detection model (MGCDT) is proposed in this paper. We cascade several Local-Global Siamese Transformer (LGST) as backbone to extract local and global semantic discriminative features. In order to solve the serious problem of false detection and missing detection of feature boundary, a plug-and-play High Frequency Enhancement Unit (HFE) is proposed to replace the inflexible U-shaped structure to optimize the detection boundary. Considering the problem of multi- scale modeling of ground objects, a Multi-Scale Fusion Attention Unit (MSFA) is proposed, which integrates the flow of multi-scale information into the calculation process of self-attention. Finally, we utilize a Deep Feature Guidance Unit (DFG) to optimize the shallow detailed feature information. Extensive experiments show that, considering multi-granularity information, MGCDT outperforms the existing change detection algorithms on four remote sensing image change detection datasets.
What problem does this paper attempt to address?