A Discrete Wavelet Transform-Genetic Algorithm-Cross Validation Approach For High Ratio Compression And Variable Selection Of Near-Infrared Spectral Data

Gq Wang,Xg Shao
DOI: https://doi.org/10.3321/j.issn:0253-3820.2005.02.011
IF: 1.193
2005-01-01
Chinese Journal of Analytical Chemistry
Abstract:An approach for high ratio compression and variable selection of near-infrared (NIR) spectra is proposed. The informative variables, wavelength points or approximation coefficients of discrete wavelet transform (DWT) of NIR spectra, could be selected by combination of genetic algorithm (GA) and cross-validation ( CV) procedure. These selected Variables were used in the determination of total volatile alkaloids (TVA) and total nitrogen (TN) in tobaccos by partial least squares (PLS) method. It is proved that there is almost no loss,of information when the spectral data are compressed to 3.3% of its original size. The method can significantly reduce the number of variables used in the prediction model, decrease the complexity of the model, and improve the predictive accuracy.
What problem does this paper attempt to address?