Lightweight Tensor Attention-Driven ConvLSTM Neural Network for Hyperspectral Image Classification

Wen-Shuai Hu,Heng-Chao Li,Yang-Jun Deng,Xian Sun,Qian Du,Antonio Plaza
DOI: https://doi.org/10.1109/jstsp.2021.3063805
IF: 7.695
2021-04-01
IEEE Journal of Selected Topics in Signal Processing
Abstract:Recurrent neural networks, especially the convolutional long short-term memory (ConvLSTM), have attracted plenty of attention and shown promising results due to their ability in modeling long-term dependencies in many research fields. In this paper, a lightweight tensor attention-driven ConvLSTM neural network (TACLNN) is proposed for hyperspectral image (HSI) classification. Firstly, to reduce the trainable parameters and memory requirements of ConvLSTM (specifically, the 2-D version of LSTM, i.e., ConvLSTM2D), a lightweight ConvLSTM2D cell is developed by utilizing tensor-train decomposition, resulting in a TT-ConvLSTM2D cell, with which a spatial-spectral TT-ConvLSTM 2-D neural network (SSTTCL2DNN) is built. However, it is inevitable for SSTTCL2DNN to obtain lower accuracies for HSI classification. To recover the accuracy loss caused by the TT-ConvLSTM2D cell in SSTTCL2DNN, a learnable tensor attention residual block (TARB) module is built to further enhance its geometrical structure. When applied to three widely used HSI benchmarks, the proposed TACLNN model outperforms several state-of-the-art methods for HSI classification. In addition, the proposed TACLNN can effectively reduce the number of parameters and storage requirements achieving higher classification accuracies as compared to other competitive baselines.
engineering, electrical & electronic
What problem does this paper attempt to address?
The problem that this paper attempts to solve is the computational complexity and storage requirements in hyperspectral image (HSI) classification, while maintaining or improving classification accuracy. Specifically, the paper proposes a lightweight tensor - attention - driven convolutional long - short - term memory neural network (TACLNN), aiming to reduce the number of parameters and memory requirements in the traditional ConvLSTM model, thereby improving computational efficiency and storage efficiency. However, reducing the number of parameters may lead to a decline in model performance. For this reason, the paper introduces a learnable tensor - attention - residual - block (TARB) module to recover the performance loss caused by the reduction of parameters and enhance the feature extraction ability of the model. The main contributions of the paper include: 1. Developed a lightweight ConvLSTM2D unit, which reduces the number of parameters and memory requirements by using tensor - train decomposition (TTD). 2. Proposed an effective TARB module for preserving the geometric structure information of hyperspectral images, achieving satisfactory classification accuracy with only two additional training parameters. These innovations enable the TACLNN model to maintain or even improve classification performance while reducing the number of parameters, and are suitable for hyperspectral image classification tasks.