Novel virtual sample generation method based on data augmentation and weighted interpolation for soft sensing with small data
Xiao-Lu Song,Yan-Lin He,Xing-Yuan Li,Qun-Xiong Zhu,Yuan Xu
DOI: https://doi.org/10.1016/j.eswa.2023.120085
IF: 8.5
2023-04-20
Expert Systems with Applications
Abstract:Data-driven soft sensing modeling plays an increasingly important role in the prediction of key variables in the process industry. Since data is an essential part of modeling, how to obtain sufficient samples to build more accurate soft sensors becomes a formidable challenge. In this paper, a virtual sample generation method based on data augmentation and weighted interpolation (DAWI-VSG) is proposed to expand the soft sensing dataset with high-quality samples. First, the original dataset is decomposed by singular value decomposition (SVD), and the features are extracted and then synthesized into a matrix to obtain new samples that can approximate the original sample set. Second, the two sample sets are merged, and outliers are detected with the improved Fast Angle-based Outlier Detection (FastABOD), which makes the data more uniformly distributed by weighted interpolation between the outliers. In addition, XGboost is utilized for predicting the outputs of the virtual samples. To verify that the effect of the proposed DAWI-VSG, simulations of the numerical function and the actual chemical process Pure terephthalic acid (PTA) were performed, and correlation analysis was introduced as a measure of whether the generated samples are consistent with the real ones. The results showed that the proposed DAWI-VSG can boost the predictive power of soft sensing by generating higher quality and more reasonable samples compared to other advanced methods.
computer science, artificial intelligence,engineering, electrical & electronic,operations research & management science