Lightweight Tensor Attention-Driven ConvLSTM Neural Network for Hyperspectral Image Classification

Wen-Shuai Hu,Heng-Chao Li,Yang-Jun Deng,Xian Sun,Qian Du,Antonio Plaza

DOI: https://doi.org/10.1109/jstsp.2021.3063805

IF: 7.695

2021-04-01

IEEE Journal of Selected Topics in Signal Processing

Abstract:Recurrent neural networks, especially the convolutional long short-term memory (ConvLSTM), have attracted plenty of attention and shown promising results due to their ability in modeling long-term dependencies in many research fields. In this paper, a lightweight tensor attention-driven ConvLSTM neural network (TACLNN) is proposed for hyperspectral image (HSI) classification. Firstly, to reduce the trainable parameters and memory requirements of ConvLSTM (specifically, the 2-D version of LSTM, i.e., ConvLSTM2D), a lightweight ConvLSTM2D cell is developed by utilizing tensor-train decomposition, resulting in a TT-ConvLSTM2D cell, with which a spatial-spectral TT-ConvLSTM 2-D neural network (SSTTCL2DNN) is built. However, it is inevitable for SSTTCL2DNN to obtain lower accuracies for HSI classification. To recover the accuracy loss caused by the TT-ConvLSTM2D cell in SSTTCL2DNN, a learnable tensor attention residual block (TARB) module is built to further enhance its geometrical structure. When applied to three widely used HSI benchmarks, the proposed TACLNN model outperforms several state-of-the-art methods for HSI classification. In addition, the proposed TACLNN can effectively reduce the number of parameters and storage requirements achieving higher classification accuracies as compared to other competitive baselines.

engineering, electrical & electronic

What problem does this paper attempt to address?

The problem that this paper attempts to solve is the computational complexity and storage requirements in hyperspectral image (HSI) classification, while maintaining or improving classification accuracy. Specifically, the paper proposes a lightweight tensor - attention - driven convolutional long - short - term memory neural network (TACLNN), aiming to reduce the number of parameters and memory requirements in the traditional ConvLSTM model, thereby improving computational efficiency and storage efficiency. However, reducing the number of parameters may lead to a decline in model performance. For this reason, the paper introduces a learnable tensor - attention - residual - block (TARB) module to recover the performance loss caused by the reduction of parameters and enhance the feature extraction ability of the model. The main contributions of the paper include: 1. Developed a lightweight ConvLSTM2D unit, which reduces the number of parameters and memory requirements by using tensor - train decomposition (TTD). 2. Proposed an effective TARB module for preserving the geometric structure information of hyperspectral images, achieving satisfactory classification accuracy with only two additional training parameters. These innovations enable the TACLNN model to maintain or even improve classification performance while reducing the number of parameters, and are suitable for hyperspectral image classification tasks.

Lightweight Tensor Attention-Driven ConvLSTM Neural Network for Hyperspectral Image Classification

A Novel Transformer Network with a CNN-Enhanced Cross-Attention Mechanism for Hyperspectral Image Classification

Lightweight Tensorized Neural Networks for Hyperspectral Image Classification

A Hyperspectral Image Classification Method Based on the Nonlocal Attention Mechanism of a Multiscale Convolutional Neural Network.

A Lightweight Transformer Network for Hyperspectral Image Classification

Pseudo Complex-Valued Deformable ConvLSTM Neural Network With Mutual Attention Learning for Hyperspectral Image Classification

Hyperspectral Image Classification Based on Two-Branch Spectral–Spatial-Feature Attention Network

Spatial–Spectral Feature Extraction via Deep ConvLSTM Neural Networks for Hyperspectral Image Classification

Hyperspectral Image Classification Using Attention-Based Bidirectional Long Short-Term Memory Network

LCTCS: Low-Cost and Two-Channel Sparse Network for Hyperspectral Image Classification

End-to-End Convolutional Network and Spectral-Spatial Transformer Architecture for Hyperspectral Image Classification

MSLAN: A Two-Branch Multidirectional Spectral–Spatial LSTM Attention Network for Hyperspectral Image Classification

Hyperspectral Image Classification Based on Multibranch Attention Transformer Networks

Hyperspectral Image Classification Using Spectral–Spatial Double-Branch Attention Mechanism

Ultralightweight Feature-Compressed Multihead Self-Attention Learning Networks for Hyperspectral Image Classification

Hybrid Dense Network with Dual Attention for Hyperspectral Image Classification

Bidirectional-Convolutional LSTM Based Spectral-Spatial Feature Learning for Hyperspectral Image Classification

Channel-Layer-Oriented Lightweight Spectral–Spatial Network for Hyperspectral Image Classification

Multiscale Residual Weakly Dense Network with Attention Mechanism for Hyperspectral Image Classification

DCTN: Dual-Branch Convolutional Transformer Network With Efficient Interactive Self-Attention for Hyperspectral Image Classification

Hyperspectral Image Classification Based on a 3D Octave Convolution and 3D Multiscale Spatial Attention Network