Wavelet-Based Classification and Influence Matrix Analysis Method for the Fast Discrimination of Chinese Herbal Medicines According to the Geographical Origins with Near Infrared Spectroscopy

Wenlong Li, Haibin Qu
DOI: https://doi.org/10.1142/s1793545813500612
IF: 2.396
2014-01-01
Journal of Innovative Optical Health Sciences
Abstract: A discriminant analysis technique combining wavelet transformation (WT) with the classification and influence matrix analysis (CAIMAN) method is proposed for classifying near infrared (NIR) spectra. In the proposed methodology, NIR spectra are decomposed by WT for data compression, and forward feature selection is then applied to extract the relevant information from the wavelet coefficients, reducing both classification error and model complexity. A discriminant-CAIMAN (D-CAIMAN) method is used to build the classification model in the wavelet domain from the reduced set of wavelet coefficients. An NIR data set of 265 Salviae Miltiorrhizae Radix samples from 9 geographical origins is used to test the classification performance of the algorithm. For comparison, k-nearest neighbor (KNN), linear discriminant analysis (LDA) and quadratic discriminant analysis (QDA) are also applied. D-CAIMAN with wavelet-based feature selection (WD-CAIMAN) performs best, achieving a total classification rate of 100% on both the cross-validation set and the prediction set. The WD-CAIMAN classifier also shows improved sensitivity, selectivity and model interpretability.
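The two preprocessing steps named in the abstract, wavelet compression followed by forward feature selection, can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes a Haar wavelet, keeps only the approximation coefficients as the compressed representation, and scores candidate features with a leave-one-out nearest-centroid classifier as a plain stand-in for D-CAIMAN, which is not reproduced here. The function names, the synthetic two-origin spectra and all parameter values are hypothetical.

```python
import math
import random


def haar_approx(signal, levels):
    """Haar approximation coefficients after `levels` decompositions.

    Assumes len(signal) is divisible by 2**levels. Each level halves
    the length, so the output is 2**levels times shorter than the input.
    """
    approx = list(signal)
    for _ in range(levels):
        approx = [(approx[2 * i] + approx[2 * i + 1]) / math.sqrt(2)
                  for i in range(len(approx) // 2)]
    return approx


def loo_accuracy(X, y, cols):
    """Leave-one-out accuracy of a nearest-centroid classifier on `cols`.

    Stand-in classifier (assumption): the paper uses D-CAIMAN instead.
    """
    correct = 0
    for i in range(len(X)):
        sums, counts = {}, {}
        for j, (row, label) in enumerate(zip(X, y)):
            if j == i:  # hold out sample i
                continue
            acc = sums.setdefault(label, [0.0] * len(cols))
            for k, c in enumerate(cols):
                acc[k] += row[c]
            counts[label] = counts.get(label, 0) + 1

        def sq_dist(label):
            return sum((X[i][c] - sums[label][k] / counts[label]) ** 2
                       for k, c in enumerate(cols))

        if min(sums, key=sq_dist) == y[i]:
            correct += 1
    return correct / len(X)


def forward_select(X, y, max_features):
    """Greedy forward selection: add the coefficient that most improves
    leave-one-out accuracy, and stop when no candidate improves it."""
    selected, best_score = [], 0.0
    while len(selected) < max_features:
        candidates = [c for c in range(len(X[0])) if c not in selected]
        scored = [(loo_accuracy(X, y, selected + [c]), c) for c in candidates]
        # Highest score wins; ties go to the lowest coefficient index.
        score, col = max(scored, key=lambda t: (t[0], -t[1]))
        if score <= best_score:
            break
        selected.append(col)
        best_score = score
    return selected, best_score


def synthetic_spectrum(rng, origin):
    """Hypothetical 32-point 'spectrum': one noisy band whose position
    depends on the origin label."""
    centre = 8 if origin == "A" else 20
    return [math.exp(-((i - centre) / 3.0) ** 2) + rng.gauss(0, 0.05)
            for i in range(32)]


rng = random.Random(0)
labels = ["A", "B"] * 10
spectra = [synthetic_spectrum(rng, lab) for lab in labels]
coeffs = [haar_approx(s, levels=2) for s in spectra]  # 32 points -> 8 coefficients
selected, score = forward_select(coeffs, labels, max_features=3)
print("selected coefficients:", selected, "LOO accuracy:", score)
```

On real NIR data, a library wavelet transform and the actual D-CAIMAN classifier would replace the Haar helper and the nearest-centroid scorer. The greedy loop stops as soon as an extra coefficient no longer improves the cross-validated score, which is one simple way to keep the model small.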