Machine Learning algorithms for in-line monitoring during Yeast Fermentations based on Raman Spectroscopy
Debiao Wu,Yaying Xu,Feng Xu,Minghao Shao,Mingzhi Huang
DOI: https://doi.org/10.1016/j.vibspec.2024.103672
IF: 2.5
2024-03-13
Vibrational Spectroscopy
Abstract:Given the intricacies and nonlinearity inherent to industrial fermentation systems, the application of process analytical technology presents considerable benefits for the direct, real-time monitoring, control, and assessment of synthetic processes. In this study, we introduce an in-line monitoring approach utilizing Raman spectroscopy for ethanol production by Saccharomyces cerevisiae. Initially, we employed feature selection techniques from the realm of machine learning to reduce the dimensionality of the Raman spectral data. Our findings reveal that feature selection results in a noteworthy reduction of over 90% in model training time, concurrently enhancing the predictive performance of glycerol and cell concentration by 14.20% and 17.10% at the root mean square error (RMSE) level. Subsequently, we conducted model retraining using 15 machine learning algorithms, with hyperparameters optimized through grid search. Our results illustrate that the post-hyperparameter adjustment model exhibits improvements in RMSE for ethanol, glycerol, glucose, and biomass by 9.73%, 4.33%, 22.22%, and 13.79%, respectively. Finally, specific machine learning algorithms, namely BaggingRegressor, Support Vector Regression, BayesianRidge, and VotingRegressor, were identified as suitable models for predicting glucose, ethanol, glycerol, and cell concentrations, respectively. Notably, the coefficient of determination (R 2 ) ranged from 0.89 to 0.97, and RMSE values ranged from 0.06 to 2.59 g/L on the testing datasets. The study highlights machine learning's effectiveness in Raman spectroscopy data analysis for improved industrial fermentation monitoring, enhancing efficiency, and offering novel modeling insights.
chemistry, physical,spectroscopy, analytical