Variable Selection Based on Gray Wolf Optimization Algorithm for the Prediction of Saponin Contents in Xuesaitong Dropping Pills Using NIR Spectroscopy

Hou Yizhe,Gao Xin,Li Shunan,Cai Xiang,Li Pian,Li Wenlong,Li Zheng
DOI: https://doi.org/10.1007/s12247-022-09620-6
2022-01-01
Journal of Pharmaceutical Innovation
Abstract:Purpose A rapid quantification method for the five saponins was developed based on near infrared (NIR) spectroscopy as a quality control strategy for the Xuesaitong dropping pills (XDP). Methods In this study, NIR spectroscopy coupled with partial least squares regression (PLSR) was used to predict the contents of the five saponins in XDPs, named Notoginsenoside R1 (R1), Ginsenoside Rg1 (Rg1), Ginsenoside (Re), Ginsenoside Rb1 (Rb1) and Ginsenoside Rd (Rd). In order to obtain more accurate and robust prediction models, the gray wolf optimizer (GWO) algorithm, a new swarm intelligence algorithm, combined with PLSR algorithm were used to select the NIR spectral feature of the XDPs samples. In addition, six variable selection methods, i.e. stability competitive adaptive reweighted sampling (sCARS), genetic algorithm (GA), Monte Carlo-uninformative variable elimination (MCUVE), successive projections algorithm (SPA), bootstrapping soft shrinkage (BOSS), and variable combination population analysis (VCPA), were compared with GWO algorithm. Results The results showed that the GWO algorithm significantly improved the model prediction performance when compared with other variable selection methods. Finally, the stability of the calibration models were used as an index to evaluate the prediction performance of the models, and the results showed that the performance of the GWO was stable under 20 runs. Conclusion In summary, for the five saponins in XDPs, GWO, as an effective feature extraction algorithm for the NIR spectra, can significantly improve the prediction performance of the quantitative calibration model, so as to realize the rapid and accurate quantitative analysis of the saponins in XDPs.
What problem does this paper attempt to address?