Feature Selection of Infrared Spectra Analysis with Convolutional Neural Network

Jingjing Xia, Jixiong Zhang, Yanmei Xiong, Shungeng Min
DOI: https://doi.org/10.1016/j.saa.2021.120361
IF: 4.831
2021-01-01
Spectrochimica Acta Part A Molecular and Biomolecular Spectroscopy
Abstract: Data-driven deep learning analysis, especially with convolutional neural networks (CNNs), has been developed and successfully applied in many domains. However, a CNN is generally regarded as a black box, and its main drawback is a lack of interpretability. In this study, an interpretable CNN model is presented for infrared data analysis. An ascending stepwise linear regression (ASLR)-based approach was used to extract the informative neurons in the flatten layer of the trained model. The characteristics of the CNN were then used to visualize the active variables corresponding to the extracted neurons. A partial least squares (PLS) model was used for comparison, both on the performance of the extracted features and on model interpretation. On the test sets, the CNN models with extracted features yielded accuracies of 93.27%, 97.50% and 96.65% for the tablet, meat, and juice datasets, respectively, while the PLS-DA models with latent variables (LVs) obtained accuracies of 95.19%, 95.50% and 98.17%. Both the CNN and PLS models showed stable patterns in the active variables. The repeatability of the CNN model and the proposed strategies was verified by Monte Carlo cross-validation.
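The abstract does not spell out the ASLR procedure, so the following is only a minimal sketch of a generic forward ("ascending") stepwise linear regression, not the authors' implementation. It assumes the flatten-layer activations are available as a matrix `X` (samples by neurons) and that the target `y` is numeric. The function name `forward_stepwise_select`, the parameters `max_features` and `tol`, and the stopping rule are all hypothetical, and the data are synthetic.

```python
import numpy as np


def forward_stepwise_select(X, y, max_features=5, tol=1e-3):
    """Greedy forward selection: at each step, add the column of X that
    most reduces the least-squares residual sum of squares (RSS)."""
    n_samples, n_features = X.shape
    selected = []
    # RSS of an intercept-only model; also used to scale the stopping rule.
    total_ss = float(np.sum((y - y.mean()) ** 2))
    best_rss = total_ss
    while len(selected) < max_features:
        candidates = [j for j in range(n_features) if j not in selected]
        if not candidates:
            break
        rss_by_col = {}
        for j in candidates:
            # Fit intercept + already selected columns + candidate column.
            A = np.column_stack([np.ones(n_samples), X[:, selected + [j]]])
            coef, *_ = np.linalg.lstsq(A, y, rcond=None)
            rss_by_col[j] = float(np.sum((y - A @ coef) ** 2))
        j_best = min(rss_by_col, key=rss_by_col.get)
        # Assumed stopping rule: stop once the best candidate improves the
        # fit by no more than a fraction `tol` of the total sum of squares.
        if best_rss - rss_by_col[j_best] <= tol * total_ss:
            break
        selected.append(j_best)
        best_rss = rss_by_col[j_best]
    return selected


# Synthetic stand-in for flatten-layer activations: only columns 3 and 7
# carry information about the target.
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 10))
y = 2.0 * X[:, 3] - 1.5 * X[:, 7] + 0.01 * rng.normal(size=200)
selected = forward_stepwise_select(X, y)
print(selected)
```

In a setting like the one described, the selected column indices would correspond to flatten-layer neurons, which could then be traced back through the convolutional layers to spectral variables. That mapping depends on the network architecture and is not shown here.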