Sparse Modeling and Monitoring for Industrial Processes Using Sparse, Distributed Principal Component Analysis
Jian Huang,Xu Yang,Yuri A. W. Shardt,Xuefeng Yan
DOI: https://doi.org/10.1016/j.jtice.2021.04.029
IF: 5.477
2021-01-01
Journal of the Taiwan Institute of Chemical Engineers
Abstract:Driven by the strong demand for sparsity in dimensional reduction techniques, a sparse modeling and monitoring approach based on sparse, distributed principal component analysis (SDPCA) is proposed to achieve sparsity. To this end, the data set is first divided into highly correlated blocks (HCBs) and one remainder block (RB) on the basis of the mutual-information-based correlation matrix. From this, the sparse loading vectors for the HCBs are obtained using the PCA models, while for the RB, it is obtained using the sparse PCA model. It is worth noting that the sparsity in SDPCA enables the sparse loading vectors to produce interpretable principal components, which keeps the correlations between the highly correlated variables and achieves the sparsity for the weakly correlated ones. Moreover, to fully appreciate the interpretation of the sparse principal components, a fault diagnosis strategy named blockwise contribution plots is proposed by first determining the faulty block, and then, identifying the faulty variables. Compared with PCA and SPCA, the proposed SDPCA detects more faulty samples and gives more accurate diagnosis results. (c) 2021 Taiwan Institute of Chemical Engineers. Published by Elsevier B.V. All rights reserved.