Hyperspectral Image Classification based on Multi-Scale Convolutional Features and Multi-Attention Mechanisms

Qian Sun,Guangrui Zhao,Xinyuan Xia,Yu Xie,Chenrong Fang,Le Sun,Zebin Wu,Chengsheng Pan
DOI: https://doi.org/10.3390/rs16122185
IF: 5
2024-06-17
Remote Sensing
Abstract:Convolutional neural network (CNN)-based and Transformer-based methods for hyperspectral image (HSI) classification have rapidly advanced due to their unique characterization capabilities. However, the fixed kernel sizes in convolutional layers limit the comprehensive utilization of multi-scale features in HSI land cover analysis, while the Transformer's multi-head self-attention (MHSA) mechanism faces challenges in effectively encoding feature information across various dimensions. To tackle this issue, this article introduces an HSI classification method, based on multi-scale convolutional features and multi-attention mechanisms (i.e., MSCF-MAM). Firstly, the model employs a multi-scale convolutional module to capture features across different scales in HSIs. Secondly, to enhance the integration of local and global channel features and establish long-range dependencies, a feature enhancement module based on pyramid squeeze attention (PSA) is employed. Lastly, the model leverages a classical Transformer Encoder (TE) and linear layers to encode and classify the transformed spatial–spectral features. The proposed method is evaluated on three publicly available datasets—Salina Valley (SV), WHU-Hi-HanChuan (HC), and WHU-Hi-HongHu (HH). Extensive experimental results have demonstrated that the MSCF-MAM method outperforms several representative methods in terms of classification performance.
environmental sciences,geosciences, multidisciplinary,imaging science & photographic technology,remote sensing
What problem does this paper attempt to address?