A Dual-Stream Transformer with Diff-Attention for Multispectral and Panchromatic Classification

Lin Xu, Hao Zhu, Licheng Jiao, Wenhao Zhao, Xiaotong Li, Biao Hou, Zhongle Ren, Wenping Ma
DOI: https://doi.org/10.1109/tgrs.2023.3336466
IF: 8.2
2023-01-01
IEEE Transactions on Geoscience and Remote Sensing
Abstract: To minimize the feature redundancy of multispectral (MS) and panchromatic (PAN) images and maximize their complementary advantages, a dual-stream transformer with diff-attention (DSTD)-Net is proposed for PAN and MS classification in this article. First, for feature extraction, a self-attention and coattention (SCA) block is used to extract both specific advantageous features and common essential features. Building on that, a self-attention module strengthened by diff-attention (SSDA) is designed to attend to the difference between the two specific advantageous features. By exploiting this difference, it reduces the essential redundancy in the specific features, making them purer and better suited to classification. Finally, since the specific and common features of MS and PAN images contribute differently to classification, a multistage gated fusion (MGF) strategy is used. MGF relies mainly on gated multisource units (GMUs) to adapt the weights of the different features and fuse them, which strengthens the specific advantageous features that benefit classification. Experimental results verify the effectiveness and robustness of the proposed network. Our code is available at https://github.com/blackkiring/DSTD.
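The gated fusion idea in the abstract can be illustrated with a small sketch. The abstract does not give the GMU equations, so the code below assumes one common form of a gated unit: each input is projected through tanh, and a sigmoid gate computed from both inputs weights the two projections element by element. The function names, weight matrices, and dimensions are all hypothetical, and the sketch covers a single fusion stage only, not the multistage strategy or the authors' implementation.

```python
import math
import random


def matvec(W, x):
    """Multiply matrix W (a list of rows) by vector x."""
    return [sum(w * v for w, v in zip(row, x)) for row in W]


def sigmoid(t):
    return 1.0 / (1.0 + math.exp(-t))


def gated_fusion(x_ms, x_pan, W_ms, W_pan, W_gate):
    """Fuse two feature vectors with an element-wise gate (assumed form).

    h_ms  = tanh(W_ms  @ x_ms)
    h_pan = tanh(W_pan @ x_pan)
    z     = sigmoid(W_gate @ [x_ms; x_pan])
    fused = z * h_ms + (1 - z) * h_pan
    """
    h_ms = [math.tanh(v) for v in matvec(W_ms, x_ms)]
    h_pan = [math.tanh(v) for v in matvec(W_pan, x_pan)]
    # The gate sees the concatenation of both inputs.
    z = [sigmoid(v) for v in matvec(W_gate, x_ms + x_pan)]
    fused = [g * a + (1.0 - g) * b for g, a, b in zip(z, h_ms, h_pan)]
    return fused, z


def random_matrix(rows, cols, rng):
    """Stand-in for learned weights; values are arbitrary."""
    return [[rng.uniform(-1.0, 1.0) for _ in range(cols)] for _ in range(rows)]


# Hypothetical feature sizes for the MS and PAN branches.
rng = random.Random(0)
d_ms, d_pan, d_out = 4, 3, 5
x_ms = [rng.uniform(-1.0, 1.0) for _ in range(d_ms)]
x_pan = [rng.uniform(-1.0, 1.0) for _ in range(d_pan)]
W_ms = random_matrix(d_out, d_ms, rng)
W_pan = random_matrix(d_out, d_pan, rng)
W_gate = random_matrix(d_out, d_ms + d_pan, rng)

fused, gate = gated_fusion(x_ms, x_pan, W_ms, W_pan, W_gate)
```

Each gate value lies strictly between 0 and 1, so every fused element is a convex combination of the two projected features. A gate near 1 favors the MS projection for that element, and a gate near 0 favors the PAN projection.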