Novel space projection interpolation based virtual sample generation for solving the small data problem in developing soft sensor

Qun-Xiong Zhu,De-Ping Liu,Yuan Xu,Yan-Lin He
DOI: https://doi.org/10.1016/j.chemolab.2021.104425
IF: 4.175
2021-10-01
Chemometrics and Intelligent Laboratory Systems
Abstract:In recent years, data-driven modeling has become a research hotspot in the industry. However, as many process industries capture data with minimal fluctuations in steady-state, which leads to a lack of sample information and causes the small data problem. Therefore, it is difficult to build data-driven models using small sample data in the process industries. To enhance the generalization ability of building predictive models with small size data, novel space projection interpolation based virtual sample generation is proposed (SPIVSG). In the proposed SPIVSG, we first detect the sample space sparsity using the maximum spacing of projection points and then generate valid virtual samples by using midpoint interpolation and radial basis function (RBF) interpolation to obtain the input values and output values of virtual samples at the space sparsity. To validate the effectiveness of SPIVSG, the standard dataset and purified terephthalic acid (PTA) solvent system dataset are used for Gradient Boosting Decision Tree (GBDT) modeling validation. The experimental results show that SPIVSG can effectively improve the precision of building prediction models for small size data. Besides, SPIVSG also shows better performance compared with other VSG methods.
automation & control systems,computer science, artificial intelligence,instruments & instrumentation,statistics & probability,mathematics, interdisciplinary applications,chemistry, analytical
What problem does this paper attempt to address?