Effect of dataset size on modeling and monitoring of chemical processes

Zheng Li,Ying Yu,Xinghua Pan,M. Nazmul Karim
DOI: https://doi.org/10.1016/j.ces.2020.115928
IF: 4.7
2020-12-01
Chemical Engineering Science
Abstract:<p>Multivariate data analysis is a powerful tool for process monitoring and data analysis. The theoretical methodology of real-time multivariate data analysis has been studied in the last decade. However, the effect of dataset size on modeling structure and fault detection ability has not been reported yet. In this paper, requirements for a minimum dataset for multivariate data analysis modeling are studied, and a practical approach is provided to evaluate the modeling structure. A method based on statistical index g<sup>2</sup> and cross-validation is proposed to determine a minimum dataset size of a valid model for statistical process monitoring. The proposed method was built on the linear PLS model and elaborated by case studies using both batch and continuous processes. This paper provides theoretical development of multivariate data analysis and demonstrates its application in chemical processes.</p>
engineering, chemical
What problem does this paper attempt to address?