Data mining-based screening of key points for corn starch and sugar production process

ZHANG Zhongyi, ZHANG Lei, WANG Yu, DONG Yachao, TAO Jin, LI Yi, TONG Yi, ZHUANG Yu, LIU Linlin, DU Jian
DOI: https://doi.org/10.11949/0438-1157.20230858
2023-01-01
Abstract: Producing fructose through corn deep processing suffers from outdated control and insufficiently refined production and processing. Because the process is complex, mechanism-based models are difficult to establish and optimize. Big data technology offers an effective alternative: it uses a large volume of production data to uncover process insights and identify key points. First, the crucial target variables of the process were selected. The original production data were then preprocessed by handling missing values, addressing outliers, reducing noise, and reducing dimensionality. Next, three machine learning models were constructed: random forest (RF), extreme gradient boosting (XGBoost), and an artificial neural network (ANN), all achieving R² values above 0.90. Finally, the SHAP method was used to explain the different models, validate their credibility, obtain the contribution of each feature to the prediction results, and integrate the explanations from the different models, yielding an importance ranking of the points in the production process. Combining this ranking with production experience, a mechanistic analysis was conducted to obtain the final key point table.
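The abstract does not say how the SHAP explanations from the three models were integrated. The sketch below shows one plausible scheme, purely as an assumption: each model's mean absolute SHAP values are normalized to sum to 1 so that models with different output scales become comparable, and the normalized values are then averaged per feature. The function names, the point names (`point_A`, `point_B`, `point_C`), and all numeric values are hypothetical placeholders, not data or results from the paper.

```python
from statistics import mean


def normalize(importances):
    """Scale one model's mean |SHAP| values so that they sum to 1."""
    total = sum(importances.values())
    if total <= 0:
        raise ValueError("importances must have a positive sum")
    return {name: value / total for name, value in importances.items()}


def integrate_rankings(per_model):
    """Average normalized importances across models, most important first.

    per_model maps a model name to {feature name: mean |SHAP| value}.
    This averaging rule is an assumed scheme, not the paper's stated method.
    """
    normalized = [normalize(imp) for imp in per_model.values()]
    features = set(normalized[0])
    for imp in normalized[1:]:
        if set(imp) != features:
            raise ValueError("all models must explain the same features")
    combined = {f: mean(imp[f] for imp in normalized) for f in features}
    # Sort by descending importance; break ties by name for determinism.
    return sorted(combined.items(), key=lambda kv: (-kv[1], kv[0]))


# Hypothetical mean |SHAP| values per model (illustrative numbers only).
per_model = {
    "RF": {"point_A": 0.5, "point_B": 0.3, "point_C": 0.2},
    "XGBoost": {"point_A": 4.0, "point_B": 4.0, "point_C": 2.0},
    "ANN": {"point_A": 0.1, "point_B": 0.3, "point_C": 0.1},
}

ranking = integrate_rankings(per_model)
for name, score in ranking:
    print(f"{name}: {score:.3f}")
```

In practice the per-model inputs would come from a SHAP explainer applied to each fitted model. Normalizing before averaging keeps a model with large raw SHAP magnitudes from dominating the combined ranking; rank averaging would be an equally reasonable alternative.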