Abstract: There has recently been a rising trend of employing convolutional neural networks (CNNs) to model complex, high-dimensional hyperspectral images in object-scale analysis. Compared with 1D and 2D CNNs, which extract only spectral or only spatial features, the 3D CNN offers a more effective way to extract integrated deep spectral-spatial features simultaneously. Because convolution operates within a local receptive field, computer vision studies have incorporated attention mechanisms into 2D CNNs to exploit the relationships between features for adaptive feature refinement. However, no exploration has been reported on incorporating attention mechanisms into 3D CNNs for modeling hyperspectral images in object-scale analysis. In this study, we investigated an improved 3D CNN architecture with embedded attention modules for adaptive feature refinement in object-scale hyperspectral image modeling. In addition to an adapted channel attention module, a band attention module was specially designed to learn band-wise relationships. Based on the 3D ResNet architecture, various modifications to the arrangement and structure of the channel and band attention modules were explored systematically to improve modeling performance. An exemplar hyperspectral image dataset of basil leaves, used to predict their relative chlorophyll content (RCC), was applied to evaluate the proposed model. Comprehensive comparison experiments showed that performance improved after attention modules were added to the residual block of the 3D ResNet, demonstrating the effectiveness of adaptive feature refinement along the channel and band dimensions through the learned attention maps. The sequential channel-band attention module achieved the highest model performance, with a coefficient of determination (R²) of 0.8998 on the test set. The results indicate the effectiveness of the channel and band attention embedded 3D CNN for modeling hyperspectral images in object-scale analysis.
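
The abstract does not give the internal design of the attention modules, so the following is only a minimal NumPy sketch of what sequential channel-then-band refinement of a 3D feature map could look like. It assumes a squeeze-and-excitation style gate (global average pooling, a two-layer bottleneck, a sigmoid); the function names, the reduction ratio, the random weights, and the (channels, bands, height, width) layout are all illustrative assumptions, not the authors' implementation.

```python
import numpy as np


def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))


def axis_attention(x, axis, w1, w2):
    """Gate a (channels, bands, height, width) feature map along one axis.

    Assumed squeeze-and-excitation style design, not the paper's module.
    """
    # Squeeze: average over every axis except the attended one.
    other_axes = tuple(i for i in range(x.ndim) if i != axis)
    descriptor = x.mean(axis=other_axes)          # shape (n,)
    # Excite: bottleneck with ReLU, then a sigmoid gate in (0, 1).
    hidden = np.maximum(w1 @ descriptor, 0.0)     # shape (n // reduction,)
    gate = sigmoid(w2 @ hidden)                   # shape (n,)
    # Reshape the gate so it broadcasts along the attended axis only.
    shape = [1] * x.ndim
    shape[axis] = x.shape[axis]
    return x * gate.reshape(shape), gate


def init_params(n_channels, n_bands, reduction=2, seed=0):
    """Random, untrained weights for illustration (hypothetical helper)."""
    rng = np.random.default_rng(seed)

    def bottleneck(n):
        h = max(n // reduction, 1)
        return rng.normal(0.0, 0.1, (h, n)), rng.normal(0.0, 0.1, (n, h))

    return {"channel": bottleneck(n_channels), "band": bottleneck(n_bands)}


def sequential_channel_band_attention(x, params):
    """Apply channel attention (axis 0) first, then band attention (axis 1)."""
    x, channel_gate = axis_attention(x, 0, *params["channel"])
    x, band_gate = axis_attention(x, 1, *params["band"])
    return x, channel_gate, band_gate


# Toy feature map: 8 channels, 16 spectral bands, 5 x 5 spatial patch.
features = np.random.default_rng(1).normal(size=(8, 16, 5, 5))
params = init_params(n_channels=8, n_bands=16)
refined, channel_gate, band_gate = sequential_channel_band_attention(features, params)
```

In a residual block, a refined map like this would typically be added to the block's input before the final activation; that step is omitted here because the abstract does not specify where inside the block the modules sit.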

FDAM: Full-Dimension Attention Module for Deep Convolutional Neural Networks

Embrace Smaller Attention: Efficient Cross-Modal Matching with Dual Gated Attention Fusion

Channel and band attention embedded 3D CNN for model development of hyperspectral image in object-scale analysis

HAM: Hybrid Attention Module in Deep Convolutional Neural Networks for Image Classification

An Attention Module for Convolutional Neural Networks

DRTAM: Dual Rank-1 Tensor Attention Module

ECA-Net: Efficient Channel Attention for Deep Convolutional Neural Networks

Adaptive Attention Module for Image Recognition Systems in Autonomous Driving

Efficient Multi-Scale Attention Module with Cross-Spatial Learning

An efficient attention module for 3d convolutional neural networks in action recognition

DECA: a novel multi-scale efficient channel attention module for object detection in real-life fire images

A Multi-scale Contextual Attention Mechanism for Convolutional Neural Networks

ELA: Efficient Local Attention for Deep Convolutional Neural Networks

A feature-wise attention module based on the difference with surrounding features for convolutional neural networks

DAMAF: dual attention network with multi-level adaptive complementary fusion for medical image segmentation

A Simple and Light-Weight Attention Module for Convolutional Neural Networks

Attentional Feature Fusion

The Costs and Benefits of Goal-Directed Attention in Deep Convolutional Neural Networks

DAS: A Deformable Attention to Capture Salient Information in CNNs