Parallel Computing and SGD-Based DPMM for Soft Sensor Development with Large-Scale Semisupervised Data
Weiming Shao, Le Yao, Zhiqiang Ge, Zhihuan Song
DOI: https://doi.org/10.1109/tie.2018.2874589
IF: 7.7
2019-01-01
IEEE Transactions on Industrial Electronics
Abstract: Soft sensors based on Gaussian mixture models (GMM) have been widely used in industrial process systems for modeling nonlinearity, non-Gaussianity, and uncertainties. However, there are still some challenging issues in developing high-accuracy GMM-based soft sensors. First, labeled samples are usually scarce due to technical or economic limitations, causing traditional supervised GMM-based soft sensing methods to fail to provide satisfactory performance. Second, tremendous amounts of unlabeled samples are gathered, yet how to fully exploit those unlabeled samples to improve both predictive accuracy and computational efficiency remains unresolved. In this paper, in order to deal with these issues, two computationally efficient soft sensing methods, namely the parallel computing-based semisupervised Dirichlet process mixture models (P-S²DPMM) and the stochastic gradient descent-based S²DPMM (SGD-S²DPMM), are proposed. The S²DPMM is first developed to mine information contained in both labeled and unlabeled samples for predictive accuracy enhancement, and is subsequently extended to the P-S²DPMM and SGD-S²DPMM to handle large-scale process data with sufficient and limited computing resources, respectively. Two case studies are carried out on real-world industrial processes, and the results obtained demonstrate the effectiveness of the proposed methods.
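
Illustrative note (not from the paper): the Dirichlet process mixture models named in the abstract are commonly described through the stick-breaking construction, in which the mixture weights are pi_k = v_k * prod_{j<k}(1 - v_j) with v_k drawn from Beta(1, alpha). The minimal Python sketch below draws such weights under a finite truncation. The function name `stick_breaking_weights`, the truncation level, and the value of `alpha` are assumptions chosen for illustration; the abstract does not say which representation or inference scheme the authors use.

```python
import random


def stick_breaking_weights(alpha, truncation, rng):
    """Draw mixture weights from a truncated stick-breaking construction.

    alpha      -- concentration parameter (assumed value, for illustration only)
    truncation -- number of components kept (hypothetical truncation level)
    rng        -- a random.Random instance, passed in for reproducibility
    """
    weights = []
    remaining = 1.0  # length of the stick not yet assigned to any component
    for k in range(truncation):
        # The last component takes whatever is left so the weights sum to 1.
        v = 1.0 if k == truncation - 1 else rng.betavariate(1.0, alpha)
        weights.append(v * remaining)
        remaining *= 1.0 - v
    return weights


rng = random.Random(0)
weights = stick_breaking_weights(alpha=1.0, truncation=10, rng=rng)
```

A smaller `alpha` tends to concentrate the weight on the first few components, while a larger one spreads it over more components, which is how a model of this kind can adapt the effective number of mixture components to the data.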