Abstract:Recently there is a rising trend of employing convolutional neural network (CNN) for modeling the complex high-dimensional hyperspectral images in object-scale analysis. Compared with 1D CNN and 2D CNN for merely extracting spectral or spatial features, the 3D CNN naturally offers a more effective method for simultaneously extracting the integrated deep spectral-spatial features. Due to the convolution characteristics of operating within a local receptive field, computer vision studies had incorporated the attention mechanism into 2D CNN to exploit the relationship between features for adaptive feature refinement. No exploration has been reported on incorporating the attention mechanism into 3D CNN for model development of hyperspectral image in object-scale analysis. In this study, we investigated an improved 3D CNN architecture with attention modules embedded for adaptive feature refinement in object-scale hyperspectral image modeling. Besides the adapted channel attention, the band attention module was specially designed to learn the band-wise relationship. Based on the 3D ResNet architecture, various modifications on the arrangement and structure of channel and band attention modules were explored systematically for higher modeling performance. An exemplar hyperspectral image dataset of basil leaves for predicting their relative chlorophyll content (RCC), was applied to evaluate the proposed model. Comprehensive comparison experiments showed performance improvement after adding attention modules into the residual block of 3D ResNet, demonstrating the effectiveness of adaptive feature refinement along channel and band dimensions through the learned attention maps. The sequential channel-band attention module achieved the highest model performance, with testing determination coefficient (R2) of 0.8998. The results indicated the effectiveness of the channel and band attention embedded 3D CNN for model development of hyperspectral image in object-scale analysis.

Spatial Decomposition and Aggregation for Attention in Convolutional Neural Networks

A Channel-Wise Spatial-Temporal Aggregation Network for Action Recognition

Channel and band attention embedded 3D CNN for model development of hyperspectral image in object-scale analysis

Improved Channel Attention Methods Via Hierarchical Pooling and Reducing Information Loss

HAM: Hybrid Attention Module in Deep Convolutional Neural Networks for Image Classification

Efficient Multi-Scale Attention Module with Cross-Spatial Learning

Channelized Axial Attention for Semantic Segmentation -- Considering Channel Relation within Spatial Attention for Semantic Segmentation

CAA : Channelized Axial Attention for Semantic Segmentation.

Spatial and Channel Dimensions Attention Feature Transfer for Better Convolutional Neural Networks

Cross-Layer Channel Attention Mechanism For Convolutional Neural Networks

An Empirical Study of Spatial Attention Mechanisms in Deep Networks

Global Attention Mechanism: Retain Information to Enhance Channel-Spatial Interactions

CAT: Learning to Collaborate Channel and Spatial Attention from Multi-Information Fusion

Coordinate Attention for Efficient Mobile Network Design

SCSA: Exploring the Synergistic Effects Between Spatial and Channel Attention

Adaptive Channel Encoding for Point Cloud Analysis

ELA: Efficient Local Attention for Deep Convolutional Neural Networks

A Spatial–Channel–Temporal-Fused Attention for Spiking Neural Networks

CSA-Net: Channel-wise Spatially Autocorrelated Attention Networks

Point Attention Network for Semantic Segmentation of 3D Point Clouds

SCA-CNN: Spatial and Channel-Wise Attention in Convolutional Networks for Image Captioning