Gabor-Modulated Grouped Separable Convolutional Network for Hyperspectral Image Classification

Zhuoyi Zhao, Xiang Xu, Jun Li, Shutao Li, Antonio Plaza
DOI: https://doi.org/10.1109/tgrs.2023.3301183
IF: 8.2
2023-08-19
IEEE Transactions on Geoscience and Remote Sensing
Abstract: Convolutional neural network (CNN)-based deep learning (DL) models have become popular in hyperspectral image classification (HSIC) and have achieved significant accuracy gains, owing to their hierarchical and nonlinear feature learning. However, overly deep network structures may induce a huge number of parameters and excessive computing overhead, and therefore require plenty of labeled samples for training. Besides, highly abstract semantic features may not be the most suitable for hyperspectral land-cover classification tasks. To address these issues, we propose a lightweight network model for HSIC, which is built on a carefully designed convolution module, namely, grouped separable convolution (GSC). Compared with the standard convolution, the GSC module combines grouped convolution with pointwise convolution, which not only greatly reduces the number of convolution kernel parameters but also suits the inherent 3-D cube structure of hyperspectral image (HSI) data. Moreover, Gabor filters are introduced to modulate the GSC kernels, so that relatively few convolution kernels, carrying additional prior orientation and scale information, suffice for feature extraction. Experiments on four real hyperspectral datasets show that the proposed model has low training cost and memory overhead. Compared with some existing deep network models that have been applied to HSIC, the proposed model achieves competitive classification accuracy with fewer training samples.
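The parameter saving claimed for GSC can be illustrated with a small weight count. This is only a sketch under stated assumptions: it treats the layers as 2-D convolutions, applies the grouped convolution before the pointwise one, ignores biases, and uses hypothetical channel sizes; the source does not give the paper's exact layer configuration.

```python
def conv_params(c_in, c_out, k, groups=1):
    """Weight count of a 2-D convolution with k x k kernels (bias ignored)."""
    if c_in % groups or c_out % groups:
        raise ValueError("channels must be divisible by groups")
    # Each output channel only sees c_in / groups input channels.
    return (c_in // groups) * c_out * k * k


def gsc_params(c_in, c_out, k, groups):
    """Assumed GSC layout: grouped k x k convolution, then 1 x 1 pointwise."""
    grouped = conv_params(c_in, c_in, k, groups)  # spatial filtering per group
    pointwise = conv_params(c_in, c_out, 1)       # mixes channels across groups
    return grouped + pointwise


# Hypothetical sizes, chosen only for illustration.
c_in, c_out, k, groups = 64, 64, 3, 8
standard = conv_params(c_in, c_out, k)
separable = gsc_params(c_in, c_out, k, groups)
print(standard, separable, round(standard / separable, 2))  # 36864 8704 4.24
```

With these illustrative sizes the grouped-plus-pointwise pair needs roughly a quarter of the weights of the standard convolution; the actual ratio depends on the kernel size and the number of groups.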
imaging science & photographic technology,remote sensing,engineering, electrical & electronic,geochemistry & geophysics
What problem does this paper attempt to address?
This paper addresses two key problems in hyperspectral image classification (HSIC):
1. **Number of parameters and computational cost**: Deep convolutional neural networks (CNNs) applied to HSIC have many layers, which leads to a large number of parameters and high computational cost. This raises the training cost and the number of labeled samples needed for training.
2. **Effectiveness of feature representation**: Highly abstract semantic features may not suit hyperspectral land-cover classification. Unlike natural image classification, HSIC depends mainly on spectral and spatial structure information, so shallow-level features such as color, texture, and shape, together with middle-level attribute features, are more important.

To address these problems, the authors propose a lightweight network model, the Gabor-Modulated Grouped Separable Convolutional Network (GS-GCN). Its main contributions are as follows:
1. **Lightweight architecture**: Grouped convolution (GC) and pointwise convolution (PC) are combined into a grouped separable convolution (GSC) module, which greatly reduces the number of convolution kernel parameters while retaining the inherent 3-D cube structure of the hyperspectral image.
2. **Gabor-modulated convolution filter**: Gabor filters with multiple orientations and scales are introduced to modulate the GSC convolution kernels, so that features carrying prior orientation and scale information can be extracted with fewer convolution kernels. This enhances the invariance of the network to rotation and other complex transformations and also improves its deep-level feature representation.
3. **Experimental verification**: A comprehensive experimental evaluation was carried out on four real hyperspectral datasets. The results show that the proposed GS-GCN model has lower training cost and memory cost, and achieves classification accuracy comparable to existing deep network models while using fewer training samples.

In conclusion, by designing a lightweight and efficient network architecture, this paper addresses the large parameter counts, high computational cost, and unsatisfactory feature representation that traditional deep learning methods face in hyperspectral image classification.
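
The idea of modulating a convolution kernel with oriented Gabor filters can be sketched as follows. The source does not specify the paper's modulation scheme, so this example assumes an element-wise product between one learned kernel and each filter of a small Gabor bank, in the style of Gabor convolutional networks; the function names, kernel values, and Gabor parameters (`sigma`, `lambd`, `gamma`, `psi`) are hypothetical.

```python
import math


def gabor_kernel(size, theta, sigma=2.0, lambd=4.0, gamma=0.5, psi=0.0):
    """Real part of a 2-D Gabor filter on a size x size grid (size must be odd)."""
    half = size // 2
    kernel = []
    for y in range(-half, half + 1):
        row = []
        for x in range(-half, half + 1):
            # Rotate coordinates by the filter orientation theta.
            xr = x * math.cos(theta) + y * math.sin(theta)
            yr = -x * math.sin(theta) + y * math.cos(theta)
            envelope = math.exp(-(xr ** 2 + (gamma * yr) ** 2) / (2 * sigma ** 2))
            carrier = math.cos(2 * math.pi * xr / lambd + psi)
            row.append(envelope * carrier)
        kernel.append(row)
    return kernel


def modulate(learned, bank):
    """Element-wise product of one learned kernel with each Gabor filter."""
    return [
        [[w * g for w, g in zip(w_row, g_row)]
         for w_row, g_row in zip(learned, gabor)]
        for gabor in bank
    ]


# Hypothetical setup: one 3 x 3 learned kernel and four orientations.
size, n_orient = 3, 4
bank = [gabor_kernel(size, math.pi * i / n_orient) for i in range(n_orient)]
learned = [[0.1, -0.2, 0.1], [0.3, 0.5, 0.3], [0.1, -0.2, 0.1]]
modulated = modulate(learned, bank)
print(len(modulated), len(modulated[0]), len(modulated[0][0]))  # 4 3 3
```

Under this assumption, a single set of learned weights yields one orientation-specific kernel per Gabor filter, which is how a small number of learned kernels can cover several orientations.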