Using Machine Learning Methods to Predict VOC Emissions in Chemical Production with Hourly Process Parameters

Hanyun Ye,Zhen Du,Hao Lu,Jinping Tian,Lyujun Chen,Wenhao Lin
DOI: https://doi.org/10.1016/j.jclepro.2022.133406
IF: 11.1
2022-01-01
Journal of Cleaner Production
Abstract:The control of volatile organic compounds (VOCs) is one of the key challenges in the chemical industry. This study aims to establish a high temporal-resolution prediction model of VOCs concentrations in pharmaceutical synthesis process using machine learning methods. We explored three machine learning methods, support vector machine, random forest, and extreme gradient boosting (XGBoost), with hourly process monitoring data in real production process as inputs. Key findings are as follows: (1) the average R2 of different VOCs concentration prediction models ranges from 0.40 to 0.93, (2) in most cases, the performance of RF and SVM is better than XGBoost, while the performance of RF and SVM is much close, (3) models lacking historical VOCs concentration features as inputs usually perform worse with poor R2 (ranges from 0.33 to 0.80), meanwhile the performance of models is similar when using different feature combinations with historical VOCs concentration, and (4) feature importance shows that the VOCs concentration at 1 h earlier has a great influence on the predicted VOCs con-centration at time t. To further optimize the performance of the model driven by time series data, additional process monitoring data in the distributed control system should be added. This study is meaningful for early warning and accurate control of VOCs emission in chemical processes.
What problem does this paper attempt to address?