Abstract: Recently, there has been a rising trend of employing convolutional neural networks (CNNs) to model complex, high-dimensional hyperspectral images in object-scale analysis. Whereas 1D and 2D CNNs extract only spectral or only spatial features, a 3D CNN naturally offers a more effective way to extract integrated deep spectral-spatial features simultaneously. Because convolution operates within a local receptive field, computer vision studies have incorporated attention mechanisms into 2D CNNs to exploit the relationships between features for adaptive feature refinement. However, no exploration has been reported on incorporating attention mechanisms into 3D CNNs for hyperspectral image modeling in object-scale analysis. In this study, we investigated an improved 3D CNN architecture with embedded attention modules for adaptive feature refinement in object-scale hyperspectral image modeling. In addition to an adapted channel attention module, a band attention module was specially designed to learn band-wise relationships. Based on the 3D ResNet architecture, various modifications to the arrangement and structure of the channel and band attention modules were explored systematically to achieve higher modeling performance. An exemplar hyperspectral image dataset of basil leaves, used to predict their relative chlorophyll content (RCC), was applied to evaluate the proposed model. Comprehensive comparison experiments showed that adding attention modules to the residual block of the 3D ResNet improved performance, demonstrating the effectiveness of adaptive feature refinement along the channel and band dimensions through the learned attention maps. The sequential channel-band attention module achieved the highest model performance, with a testing coefficient of determination (R²) of 0.8998. The results indicate the effectiveness of the channel and band attention embedded 3D CNN for hyperspectral image modeling in object-scale analysis.
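
As a rough illustration of what sequential channel-band attention inside a residual block could look like, the NumPy sketch below rescales a single feature tensor first along its channel axis and then along its band axis. The abstract does not specify the modules' internals, so everything here is an assumption: the squeeze-and-excitation style gating (global average pooling, a two-layer bottleneck, a sigmoid), the tensor layout (channels, bands, height, width), the reduction ratio, the random untrained weights, and all function names are hypothetical and are not the authors' implementation.

```python
import numpy as np


def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))


def gate(descriptor, w1, w2):
    """Bottleneck MLP with ReLU, then a sigmoid: one weight in (0, 1) per entry."""
    hidden = np.maximum(descriptor @ w1, 0.0)
    return sigmoid(hidden @ w2)


def channel_attention(x, w1, w2):
    """Rescale each channel of x, assumed to be shaped (C, B, H, W)."""
    descriptor = x.mean(axis=(1, 2, 3))   # average over bands and space -> (C,)
    weights = gate(descriptor, w1, w2)    # (C,)
    return x * weights[:, None, None, None]


def band_attention(x, w1, w2):
    """Rescale each spectral band of x, assumed to be shaped (C, B, H, W)."""
    descriptor = x.mean(axis=(0, 2, 3))   # average over channels and space -> (B,)
    weights = gate(descriptor, w1, w2)    # (B,)
    return x * weights[None, :, None, None]


def init_weights(rng, size, reduction=2):
    """Random stand-ins for learned gate weights (assumed reduction ratio)."""
    hidden = max(size // reduction, 1)
    w1 = rng.standard_normal((size, hidden)) * 0.1
    w2 = rng.standard_normal((hidden, size)) * 0.1
    return w1, w2


rng = np.random.default_rng(0)
C, B, H, W = 8, 16, 5, 5                         # illustrative sizes only
block_input = rng.standard_normal((C, B, H, W))  # input to the residual block
conv_out = rng.standard_normal((C, B, H, W))     # stand-in for the block's 3D conv output

cw1, cw2 = init_weights(rng, C)
bw1, bw2 = init_weights(rng, B)

# Sequential arrangement: channel attention first, then band attention.
refined = band_attention(channel_attention(conv_out, cw1, cw2), bw1, bw2)

# Residual connection: the refined features are added back to the block input.
block_output = block_input + refined
print(block_output.shape)
```

Because both gates produce weights strictly between 0 and 1, the refinement can only attenuate features in this sketch; a trained model would learn which channels and bands to keep close to 1.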

Pyramidal and conditional convolution attention network for hyperspectral image classification using limited training samples

A Hyperspectral Image Classification Method Based on the Nonlocal Attention Mechanism of a Multiscale Convolutional Neural Network

Channel and band attention embedded 3D CNN for model development of hyperspectral image in object-scale analysis

Pyramid Cascaded Convolutional Neural Network with Graph Convolution for Hyperspectral Image Classification

Hyperspectral Image Classification Based on a 3D Octave Convolution and 3D Multiscale Spatial Attention Network

Global and pyramid convolutional neural network with hybrid attention mechanism for hyperspectral image classification

A Multiscale Dilated Attention Network for Hyperspectral Image Classification

Hyperspectral Image Classification with Attention Aided CNNs

Hyperspectral image classification based on multiscale piecewise spectral-spatial attention network

Spatial–Spectral Feature Refinement for Hyperspectral Image Classification Based on Attention-Dense 3D-2D-CNN

Spectral-Spatial Fused Attention Network for Hyperspectral Image Classification

Two-Branch Convolutional Neural Network with Polarized Full Attention for Hyperspectral Image Classification

Computationally Lightweight Hyperspectral Image Classification Using a Multiscale Depthwise Convolutional Network With Channel Attention

An Effective Hyperspectral Image Classification Network Based on Multi-Head Self-Attention and Spectral-Coordinate Attention

Heterogeneous Spectral-Spatial Network with 3D Attention and MLP for Hyperspectral Image Classification Using Limited Training Samples

Multiscale Densely Connected Attention Network for Hyperspectral Image Classification

Center-similarity spectral-spatial attention network for hyperspectral image classification

A 3D Cascaded Spectral–Spatial Element Attention Network for Hyperspectral Image Classification

Attention-Aware Pseudo-3-D Convolutional Neural Network for Hyperspectral Image Classification

Consolidated Convolutional Neural Network for Hyperspectral Image Classification

A multi-scale multi-channel CNN introducing a channel-spatial attention mechanism hyperspectral remote sensing image classification method