Ensemble Regression Coefficient Analysis for Application to Near-Infrared Spectroscopy

Kaiyi Zheng, Huilian Hu, Peijin Tong, Yiping Du
DOI: https://doi.org/10.1080/00032719.2014.900776
2014-01-01
Analytical Letters
Abstract: A new variable selection method, ensemble regression coefficient analysis, is reported on the basis of model population analysis. To construct the ensemble regression coefficients, many subsets of variables are randomly selected, and a partial least squares model is calibrated on each subset. Following ensemble theory, the mean of the regression coefficients across these models is taken as the ensemble regression coefficient. The absolute value of the ensemble regression coefficient then serves as an informative vector for variable selection. The performance of ensemble regression coefficient analysis was assessed on four near-infrared datasets: two simulated datasets, one wheat dataset, and one tobacco dataset. The results showed that this approach can select important variables and yields lower errors than regression coefficient analysis and Monte Carlo uninformative variable elimination.
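The procedure outlined in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation, and several details are assumptions the abstract does not specify: the function names `pls1_coefficients` and `ensemble_regression_coefficients`, the number of models, the subset size, the number of latent variables, mean-centering without scaling, and averaging each variable's coefficient only over the models in which that variable was sampled. The data are synthetic and serve only to show the mechanics.

```python
import numpy as np


def pls1_coefficients(X, y, n_components):
    """Regression coefficients of a single-response PLS model (NIPALS),
    fitted on mean-centered copies of X and y."""
    E = X - X.mean(axis=0)          # centered predictors, deflated in the loop
    f = y - y.mean()                # centered response, deflated in the loop
    b = np.zeros(X.shape[1])
    loadings, rotations = [], []
    for _ in range(n_components):
        w = E.T @ f                 # weight vector: covariance with the response
        norm = np.linalg.norm(w)
        if norm < 1e-12:            # nothing left to explain
            break
        w = w / norm
        t = E @ w                   # scores
        tt = t @ t
        p = E.T @ t / tt            # X loadings
        q = (f @ t) / tt            # y loading
        E = E - np.outer(t, p)      # deflate X
        f = f - q * t               # deflate y
        # r maps the original centered X directly onto the score t
        r = w.copy()
        for p_j, r_j in zip(loadings, rotations):
            r -= (p_j @ w) * r_j
        loadings.append(p)
        rotations.append(r)
        b += q * r
    return b


def ensemble_regression_coefficients(X, y, n_models=300, subset_size=10,
                                     n_components=3, seed=0):
    """Mean PLS coefficient of each variable over many random variable subsets.

    Assumption: the mean is taken over the models that contain the variable.
    """
    rng = np.random.default_rng(seed)
    n_vars = X.shape[1]
    coef_sum = np.zeros(n_vars)
    counts = np.zeros(n_vars)
    for _ in range(n_models):
        idx = rng.choice(n_vars, size=subset_size, replace=False)
        coef_sum[idx] += pls1_coefficients(X[:, idx], y, n_components)
        counts[idx] += 1
    return coef_sum / np.maximum(counts, 1)


# Synthetic illustration: only variables 3 and 17 carry information about y.
rng = np.random.default_rng(42)
X = rng.normal(size=(200, 30))
y = 2.0 * X[:, 3] - 3.0 * X[:, 17] + 0.05 * rng.normal(size=200)

erc = ensemble_regression_coefficients(X, y)
ranking = np.argsort(-np.abs(erc))  # largest |coefficient| first
print("Highest-ranked variables:", ranking[:5])
```

Variables would then be retained in order of `ranking`, with the number kept chosen by a criterion such as cross-validated error; that cutoff step is left out of the sketch.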