Image Classification Using Convolutional Neural Network with Wavelet Domain Inputs

Luyuan Wang,Yankui Sun
DOI: https://doi.org/10.1049/ipr2.12466
IF: 2.3
2022-01-01
IET Image Processing
Abstract:Commonly used convolutional neural networks (CNNs) usually compress high-resolution input images. Although it reduces the computation requirements into a reasonable range, the downsampling operation causes information loss, which affects the accuracy of image classification. How to adopt high-resolution image inputs to improve the quality of input information and thus improve the classification accuracy without changing the overall structure of the pre-defined CNN model or increasing the model parameters is an important issue. Here, a CNN model with wavelet domain inputs is proposed to provide a solving scheme. Specifically, the proposed method applies wavelet packet transform or dual-tree complex wavelet transform to extract information from input images with higher resolutions in the image pre-processing stage. Some subband image channels are selected as the inputs of conventional CNNs where the first several convolutional layers are removed, so that the networks directly learn in the wavelet domain. Experiment results on the Caltech-256 dataset and the Describable Textures Dataset with the ResNet-50 show that the classification accuracy of our method can have a maximum improvement of 2.15% and 10.26%, respectively. These validate the effectiveness of our proposed scheme. This code is publicly available at .
What problem does this paper attempt to address?