Meta pseudo label tabular-related regression model for surrogate modeling
Sungjun Kim, Jungho Kim
DOI: https://doi.org/10.1016/j.eswa.2024.125520
IF: 8.5
2024-10-19
Expert Systems with Applications
Abstract: Deep neural networks (DNNs) present significant potential as surrogate models to substitute for costly simulations. Attaining high accuracy in DNNs is crucial for effective surrogate modeling. However, this goal requires extensive labeled datasets, which are time-consuming to obtain. Consequently, it is essential to develop a semi-supervised regression methodology that enhances DNN accuracy by leveraging unlabeled data. Pseudo-labeling techniques, frequently employed in semi-supervised learning, have been shown to effectively enhance DNN accuracy. However, these methods are susceptible to confirmation bias, in which erroneous pseudo-labels are generated and then reinforced during training, degrading DNN accuracy. To mitigate this issue, we propose a meta pseudo label regression (MPLR) framework, drawing inspiration from the concept of meta pseudo labels. To validate the effectiveness of our proposed MPLR approach, we compare its performance with a supervised learning model, a co-training regression model, and a self- and mutual-teaching model. We use a quantum cascade laser dataset, which exemplifies a scenario that necessitates semi-supervised regression-based surrogate modeling due to the extensive computational time required for certain outputs. The MPLR framework can be integrated with model noise techniques such as dropout, resulting in substantial enhancements in model performance. Our results demonstrate that our proposed MPLR method surpasses other semi-supervised regression models in terms of both performance improvements and computational speed. Specifically, it achieves average enhancements in model performance metrics of 5.430 % and 4.911 % compared to the supervised learning model without dropout in small and relatively large data regimes, respectively. These improvements are notably greater than those observed with the self- and mutual-teaching model, which yields only 1.794 % and 1.500 % enhancements in the corresponding data regimes. In addition, the co-training regression model performs worse than the supervised learning model.
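The teacher-student feedback idea behind meta pseudo labels can be illustrated with a toy regression sketch. This is not the authors' MPLR implementation: the two linear models, the synthetic data, the learning rates, and the names `mse_grad` and `train_mpl_regression` are all assumptions made for illustration, and dropout is omitted. A teacher pseudo-labels unlabeled inputs, a student trains on those pseudo-labels only, and the teacher is then adjusted according to how the student's update changed the loss on the labeled data.

```python
def mse_grad(w, b, xs, ys):
    """Return (loss, dL/dw, dL/db) for the line w*x + b under mean squared error."""
    n = len(xs)
    res = [w * x + b - y for x, y in zip(xs, ys)]
    loss = sum(r * r for r in res) / n
    gw = 2.0 * sum(r * x for r, x in zip(res, xs)) / n
    gb = 2.0 * sum(res) / n
    return loss, gw, gb


def train_mpl_regression(x_lab, y_lab, x_unl, steps=500, lr_s=0.1, lr_t=0.1):
    """Toy teacher-student loop; hyperparameters are illustrative assumptions."""
    wt, bt = 0.0, 0.0  # teacher parameters
    ws, bs = 0.0, 0.0  # student parameters
    n_u = len(x_unl)
    for _ in range(steps):
        # 1) The teacher pseudo-labels the unlabeled inputs.
        y_pseudo = [wt * x + bt for x in x_unl]
        # 2) The student takes one gradient step on pseudo-labeled data only.
        _, gws, gbs = mse_grad(ws, bs, x_unl, y_pseudo)
        ws, bs = ws - lr_s * gws, bs - lr_s * gbs
        # 3) Feedback signal: gradient of the updated student's labeled loss.
        _, hw, hb = mse_grad(ws, bs, x_lab, y_lab)
        # 4) Chain rule through the student step. For pseudo-label y_i:
        #    d(ws)/d(y_i) = lr_s * 2 * x_i / n_u and d(bs)/d(y_i) = lr_s * 2 / n_u.
        d_pseudo = [lr_s * 2.0 * (hw * x + hb) / n_u for x in x_unl]
        meta_gw = sum(d * x for d, x in zip(d_pseudo, x_unl))
        meta_gb = sum(d_pseudo)
        # 5) The teacher also takes a supervised step on the labeled data
        #    (an assumption of this sketch), combined with the feedback term.
        _, sup_gw, sup_gb = mse_grad(wt, bt, x_lab, y_lab)
        wt -= lr_t * (sup_gw + meta_gw)
        bt -= lr_t * (sup_gb + meta_gb)
    return (wt, bt), (ws, bs)


# Synthetic, noise-free data from y = 2x + 1 (hypothetical, for illustration).
x_lab = [-1.0, -0.5, 0.5, 1.0]
y_lab = [2.0 * x + 1.0 for x in x_lab]
x_unl = [i / 10.0 - 1.0 for i in range(21)]

teacher, student = train_mpl_regression(x_lab, y_lab, x_unl)
student_loss, _, _ = mse_grad(student[0], student[1], x_lab, y_lab)
print("teacher:", teacher, "student:", student, "labeled MSE:", student_loss)
```

Because pseudo-labels in regression are continuous, the feedback term in this sketch can be obtained by differentiating directly through the student's update. The student never sees the true labels; they reach it only through the teacher, which is the mechanism intended to limit confirmation bias.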
computer science, artificial intelligence; engineering, electrical & electronic; operations research & management science