Hybrid Multi-Scale Spatial-Spectral Transformer for Hyperspectral Image Classification

Yan He,Bing Tu,Bo Liu,Yunyun Chen,Jun Li,Antonio Plaza
DOI: https://doi.org/10.1109/tgrs.2024.3443662
IF: 8.2
2024-01-01
IEEE Transactions on Geoscience and Remote Sensing
Abstract:Hyperspectral image (HSI) classification constitutes a significant foundation for remote sensing analysis. Transformer architecture establishes long-range dependencies with a self-attention mechanism (SA), which exhibits advantages in HSI classification. However, most existing transformer-based methods are inadequate in exploring the multiscale properties of hybrid spatial and spectral information inherent in HSI data. To countermeasure this problem, this work investigates a hybrid multiscale spatial-spectral framework (HMSSF). It innovatively models global dependencies across multiple scales from both spatial and spectral domains, which allows for cooperatively capturing hybrid multiscale spatial and spectral characteristics for HSI classification. Technically, a spatial-spectral token generation (SSTG) module is first designed to generate the spatial tokens and spectral tokens. Then, a multiscale SA (MSSA) is developed to achieve multiscale attention modeling by constructing different dimensional attention heads per attention layer. This mechanism is adaptively integrated into both spatial and spectral branches for hybrid multiscale feature extraction. Furthermore, a spatial-spectral attention aggregation (SSAA) module is introduced to dynamically fuse the multiscale spatial and spectral features to enhance the classification robustness. Experimental results and analysis demonstrate that the proposed method outperforms the state-of-the-art methods on several public HSI datasets.
What problem does this paper attempt to address?