Lightweight Structure-Aware Transformer Network for Remote Sensing Image Change Detection

Tao Lei,Yetong Xu,Hailong Ning,Zhiyong Lv,Chongdan Min,Yaochu Jin,Asoke K. Nandi
DOI: https://doi.org/10.1109/lgrs.2023.3323534
IF: 5.343
2023-01-01
IEEE Geoscience and Remote Sensing Letters
Abstract:Popular Transformer networks have been successfully applied to remote sensing (RS) image change detection (CD) identifications and achieved better results than most convolutional neural networks (CNNs), but they still suffer from two main problems. First, the computational complexity of the Transformer grows quadratically with the increase of image spatial resolution, which is unfavorable to RS images. Second, these popular Transformer networks tend to ignore the importance of fine-grained features, which results in poor edge integrity and internal tightness for largely changed objects and leads to the loss of small changed objects. To address the above issues, this letter proposes a lightweight structure-aware Transformer (LSAT) network for RS image CD. The proposed LSAT has two advantages. First, a cross-dimension interactive self-attention (CISA) module with linear complexity is designed to replace the vanilla self-attention (SA) in the visual Transformer, which effectively reduces the computational complexity while improving the feature representation ability of the proposed LSAT. Second, a structure-aware enhancement module (SAEM) is designed to enhance difference features and edge detail information, which can achieve double enhancement by difference refinement and detail aggregation to obtain fine-grained features of bi-temporal RS images. Experimental results show that the proposed LSAT achieves significant improvement in detection accuracy and offers a better tradeoff between accuracy and computational costs than most state-of-the-art (SOTA) CD methods for RS images.
What problem does this paper attempt to address?