Abstract:In the past, convolutional neural network (CNN) has become one of the most popular deep learning frameworks, and has been widely used in Hyperspectral image classification tasks. Convolution (Conv) in CNN uses filter weights to extract features in local receiving domain, and the weight parameters are shared globally, which more focus on the high‐frequency information of the image. Different from Conv, Transformer can obtain the long‐term dependence between long‐distance features through modelling, and adaptively focus on different regions. In addition, Transformer is considered as a low‐pass filter, which more focuses on the low‐frequency information of the image. Considering the complementary characteristics of Conv and Transformer, the two modes can be integrated for full feature extraction. In addition, the most important image features correspond to the discrimination region, while the secondary image features represent important but easily ignored regions, which are also conducive to the classification of HSIs. In this study, a complementary integrated Transformer network (CITNet) for hyperspectral image classification is proposed. Firstly, three‐dimensional convolution (Conv3D) and two‐dimensional convolution (Conv2D) are utilised to extract the shallow semantic information of the image. In order to enhance the secondary features, a channel Gaussian modulation attention module is proposed, which is embedded between Conv3D and Conv2D. This module can not only enhance secondary features, but suppress the most important and least important features. Then, considering the different and complementary characteristics of Conv and Transformer, a complementary integrated Transformer module is designed. Finally, through a large number of experiments, this study evaluates the classification performance of CITNet and several state‐of‐the‐art networks on five common datasets. The experimental results show that compared with these classification networks, CITNet can provide better classification performance.

CACFTNet:A Hybrid Cov-Attention and Cross-Layer Fusion Transformer Network for Hyperspectral Image Classification

CACFTNet: A Hybrid Cov-Attention and Cross-Layer Fusion Transformer Network for Hyperspectral Image Classification

A Novel Transformer Network with a CNN-Enhanced Cross-Attention Mechanism for Hyperspectral Image Classification

Bridging CNN and Transformer With Cross-Attention Fusion Network for Hyperspectral Image Classification

Multiscale and Cross-Level Attention Learning for Hyperspectral Image Classification

Convolution-Transformer Adaptive Fusion Network for Hyperspectral Image Classification

Center Category Focusing Transformer Network for Hyperspectral Image Classification

A U-Shaped Convolution-Aided Transformer with Double Attention for Hyperspectral Image Classification

A Complementary Integrated Transformer Network for Hyperspectral Image Classification

A Multiscale Dual-Branch Feature Fusion and Attention Network for Hyperspectral Images Classification

Attention-Guided CNN-Transformer Hybrid Network for Hyperspectral Image Classification

Fusion of Dilated Convolution in CNN and Transformer Networks for Hyperspectral Image Classification

DCTN: Dual-Branch Convolutional Transformer Network With Efficient Interactive Self-Attention for Hyperspectral Image Classification

End-to-End Convolutional Network and Spectral-Spatial Transformer Architecture for Hyperspectral Image Classification

Spectral-Spatial Fused Attention Network for Hyperspectral Image Classification

Asymmetric Coordinate Attention Spectral-Spatial Feature Fusion Network for Hyperspectral Image Classification

Adaptive Pixel-Level and Superpixel-Level Feature Fusion Transformer for Hyperspectral Image Classification

CNN and Transformer Hybrid Network for Hyperspectral Image Classification

Feature Fusion Network Model Based on Dual Attention Mechanism for Hyperspectral Image Classification

Deep Collaborative Attention Network for Hyperspectral Image Classification by Combining 2-D CNN and 3-D CNN

Convolution Transformer Fusion Splicing Network for Hyperspectral Image Classification