Machine learning prediction of lignin content in poplar with Raman spectroscopy

Wenli Gao,Liang Zhou,Shengquan Liu,Ying Guan,Hui Gao,Bin Hui
DOI: https://doi.org/10.1016/j.biortech.2022.126812
IF: 11.4
2022-03-01
Bioresource Technology
Abstract:Based on features extracted from Raman spectra, regularization algorithms, SVR, DT, RF, LightGBM, CatBoost, and XGBoost were used to develop prediction models for lignin content in poplar. Firstly, Raman features extracted from FT-Raman spectra after data processing were used as input of models and determined lignin contents were output. Secondly, grid-search combined with cross-validation was used to adjust the hyper-parameters of models. Finally, the predictive models were built by aforementioned algorithms. The results indicated regularization algorithms, SVR, DT held test R<sup>2</sup> were &gt;0.80 which means the predictive values from model still deviate from measured ones. Meanwhile, RF, LightGBM, CatBoost, and XGBoost were better than above algorithms, and their test R<sup>2</sup> were &gt;0.91 which suggesting the predictive values was nearly close to measured ones. Therefore, fast and accurate methods for predicting lignin content were obtained and will be useful for screening suitable lignocellulosic resource with expected lignin content.
energy & fuels,biotechnology & applied microbiology,agricultural engineering
What problem does this paper attempt to address?