Pharmaceutical Discrimination by Using Sparse Denoising Autoencoder Combined with Gaussian Process Based on Near Infrared Spectrum

Zhou Jie-qian, Liu Zhen-bing, Yang Hui-hua, Zheng An-bing, Pan Xi-peng, Cao Zhi-wei, Wu Kai-yu, Yang Jin-xin, Feng Yan-chun, Yin Li-hui, Hu Chang-qin
DOI: https://doi.org/10.3964/j.issn.1000-0593(2017)08-2412-06
2017-01-01
Spectroscopy and spectral analysis
Abstract: This paper proposes a new method for discriminating pharmaceuticals from near-infrared spectra, based on a sparse denoising autoencoder (SDAE) combined with a Gaussian process (GP). First, the Mexican hat wavelet transform is used to remove noise and baseline drift from the spectral data. An SDAE network then extracts features and reduces the dimensionality of the spectra. Finally, a GP performs binary classification, using the spectral mixture (SM) kernel as its covariance function. This classification method is named wSDAG(SM). Autoencoder networks have strong representational capacity, and GP classifiers are well suited to small-sample data. By learning a representation of the input data with the SDAE, the wSDAG(SM) network obtains features that are lower-dimensional and more informative. Moreover, the SM kernel, which is highly expressive, serves as the covariance function of the GP. The wSDAG(SM) network is therefore conducive to more accurate classification of spectral data. Using near-infrared spectra of Erythromycin Ethylsuccinate and other pharmaceuticals as experimental data, several classification methods were applied after the Mexican hat wavelet transform for comparison: a BP neural network (wBP), a support vector machine (wSVM), SDAE combined with binary logistic classification (wSDAL), and SDAE combined with binary GP classification using the squared exponential (SE) kernel (wSDAG(SE)). A further method, SDAG(SM), used the same network without the Mexican hat wavelet transform. All of these methods were compared with the wSDAG(SM) network. The experimental results show that applying the wavelet transform to the spectral data effectively improves the classification accuracy and stability of SDAG(SM).
The proposed wSDAG(SM) method is superior to the other classifiers in both classification accuracy and the stability of the classification results.
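Two of the building blocks named in the abstract, the Mexican hat wavelet filter and the spectral mixture kernel, can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the function names, the single wavelet scale `a`, the filter `width`, and all hyperparameter values are assumptions chosen for illustration, and the SDAE stage and the GP classifier itself are omitted. The kernel follows the standard spectral mixture form, k(tau) = sum_q w_q * prod_p exp(-2 pi^2 tau_p^2 v_qp) * cos(2 pi tau_p mu_qp).

```python
import numpy as np


def mexican_hat(width, a):
    """Mexican hat (Ricker) wavelet sampled at `width` points with scale `a`."""
    t = np.arange(width) - (width - 1) / 2.0
    norm = 2.0 / (np.sqrt(3.0 * a) * np.pi ** 0.25)
    return norm * (1.0 - (t / a) ** 2) * np.exp(-0.5 * (t / a) ** 2)


def wavelet_filter(spectrum, a=4.0, width=41):
    """Convolve one spectrum with the wavelet at a single (assumed) scale.

    The wavelet has near-zero mean, so a constant baseline offset is
    suppressed away from the edges of the spectrum.
    """
    return np.convolve(spectrum, mexican_hat(width, a), mode="same")


def spectral_mixture_kernel(X1, X2, weights, means, variances):
    """Spectral mixture kernel between rows of X1 (n, d) and X2 (m, d).

    weights: (Q,) positive mixture weights
    means, variances: (Q, d) spectral means and positive variances
    """
    tau = X1[:, None, :] - X2[None, :, :]          # pairwise differences, (n, m, d)
    K = np.zeros(tau.shape[:2])
    for w, mu, v in zip(weights, means, variances):
        envelope = np.exp(-2.0 * np.pi ** 2 * np.sum(tau ** 2 * v, axis=-1))
        oscillation = np.prod(np.cos(2.0 * np.pi * tau * mu), axis=-1)
        K += w * envelope * oscillation
    return K


# Hypothetical usage: 5 samples of 3-dimensional encoded features, Q = 2 components.
rng = np.random.default_rng(0)
features = rng.normal(size=(5, 3))
weights = np.array([0.6, 0.4])
means = rng.uniform(0.0, 0.5, size=(2, 3))
variances = rng.uniform(0.1, 1.0, size=(2, 3))
K = spectral_mixture_kernel(features, features, weights, means, variances)
```

With a single component whose spectral mean is zero, this kernel reduces to a squared exponential kernel, which is one way to see why SM is the more expressive of the two covariance functions compared in the abstract.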