Sub‐PCA Modeling and On‐line Monitoring Strategy for Batch Processes
NY Lu,FR Gao,FL Wang
DOI: https://doi.org/10.1002/aic.10024
IF: 4.167
2004-01-01
AIChE Journal
Abstract:Multivariate statistical methods such as principal component analysis (PCA) and partial least square (PLS) have been successfully used in modeling multivariable continuous processes (Kaspar and Ray, 1992; Kourti and MacGregor, 1995; Chen and McAvoy, 1998). Several extensions of the conventional PCA/PLS to batch processes have also been reported, among which multiway PCA (MPCA) model is the most widely used (Wold et al., 1987; Nomikos and MacGregor, 1994, 1995; Wise et al., 1999; Smilde, 2001). The MPCA model is ill-suited for multistage batch processes, as it takes the entire batch data as a single object, and it is difficult to reveal the changes of process correlation from stage to stage. Considering that the multiplicity of the operation stage is an inherent nature of many batch processes, each stage has its own underlying characteristics and the process can exhibit significantly different behaviors over different operation stages; it is desirable to develop a stage-based model that can reflect the inherent process stage nature to improve the process understanding and monitoring efficiency. Kosanovich et al. (1994) and Dong and McAvoy (1995) developed two MPCA/nonlinear MPCA models, utilizing the two-stage nature of a jacketed exothermic batch chemical reactor. Their results show that the two-stage models are more powerful than a single model. Their stage models, however, inherit the common weakness of the MPCA model that the unavailable future data in an evolving batch should be estimated for on-line monitoring. A new stage-based sub-PCA modeling method is proposed in this article for multistage batch processes, based on the recognition of the following: (1) a batch process may be divided into several “operation” stages reflecting its inherent process correlation nature; (2) despite that the process may be time varying, the correlation of its variables will be largely similar within the same “operation” stage. Changes in the correlation may be used to indicate changes in the process “operation” stages. We have placed a quotation mark around “operation” to indicate that the operation referred to in this article may not, and does not have to, have the exact correspondence to the physical operations of the process. Based on the above recognition, a representative model can be built for each stage, using the conventional two-way PCA model. This allows two-way PCA to be “directly” applied to a batch process after a proper stage division; a stage division algorithm is also developed in the article. A three-tank process, as an experimental verification system, is finally introduced to illustrate the effectiveness of the proposed. Consider a batch process with J process variables measured over sampling points k (k = 1, 2, …, K); a data matrix of dimensions J × K is generated from each batch run. A set of I number of normal batch runs, hence, result in a three-way process data matrix, X(I × J × K), which is the most popular data form for batch processes. The horizontal slice X(J × K) is the data matrix from each batch run. The vertical slice (I × J), a time-slice matrix that is the basic unit in the proposed modeling method, is used to obtain the process correlation at sampling time k. In PCA analysis, the loading matrix represents the information of the process correlation. The stage-based sub-PCA modeling begins with analyzing the loading matrix at each sampling interval. The time-slice matrices k will have a similar loading matrix within each “operation” stage. Different “operation” stage results in different loading matrices, reflecting that process correlation changes over different stages. Likewise, changes in the loading matrices, reflecting changes in the underlying process behavior, can be used to determine the “operation” stages. The k-means clustering will be adopted and modified to partition the K number of loading matrices. The clustering results, associated with process time span or indicator variables, can be used to define process “operation” stages. The Euclidean distance, the most popular metric, can be used to calculate the dissimilarity between two patterns. A variant k-means algorithm (Jain et al., 1999) is adopted for partitioning the K number of patterns k (k = 1, 2, …, K) to determine the optimal number of final clusters by minimizing the local squared error (for patterns within each cluster) and the global squared error (for all the patterns) by specifying a threshold θ of the minimal distance between two clusters' centers, or the maximal radius of a cluster. This algorithm transforms modeling accuracy and complexity into the specification of the threshold. A large threshold results in few clusters, but less accurate modeling. A step is added in the clustering algorithm to eliminate singular clusters that catch few patterns in the iterative clustering procedure to enhance the robustness and reliability of the partition algorithm. The above improved k-means clustering algorithm can group optimally the K patterns into C number of clusters, representing C kinds of pattern features. Since these patterns are extracted along the sampling time of a batch process, the clustering result can be directly associated with the operation time, which makes the partition of the patterns well interpretable. Normally, each cluster should contain a series of successive samples. The exception may be with a process that has the same underlying characteristics for several disjoint periods of operation time; this would result in a cluster with samples disjoint in time. In most cases, process stages can be determined based on the clustering result associated with operation time. The number of clusters may be different from the actual operation stages. For example, a process with two or more actual operation stages of similar correlation may be clustered together, resulting in a single representative loading matrix. On the other hand, a long stage of operation having significant changes in the correlation may be divided into several “operation” stages. The proposed method emphasizes the changes of process correlation rather than the physical operation. In this article, C denotes the number of “operation” stages obtained by the clustering algorithm based on the process correlation characteristic. Misclassification may occur at the beginning and end of each stage, because the k-means clustering algorithm is a hard-partition method in dealing with patterns between two neighboring clusters. Such possible misclassification has little influence in the sub PCA model development; however, it may lead to false alarm (type I error) and missing alarm (type II errors) in on-line monitoring due to batch variation. Alternative methods may be used to resolve this problem. One is to relax the monitoring conditions at the beginning and the end of each stage; the other is to associate one or more characteristic process variables with the stage division, rather than using the process time. The control limits for the Hotteling-T2 and squared prediction error (SPE) charts, which can be estimated from the reference data, should be computed in the modeling procedure. Using the above procedure, a sub-PCA model can be developed for each stage. For on-line monitoring, the T2 statistic is calculated using scores obtained by projecting the original process data onto the subspace spanned by the representative loading matrix P̄ for stage c. Then, the T2 statistic is calculated by: T = tT(S̄)−1t ∼ [Rc(I − 1)/I(I − Rc − 1)]F, and SPE is calculated by: SPEnew = eTe. For on-line monitoring, one should first determine which stage new data of the evolving batch belong to before calling the corresponding sub PCA model to obtain the two statistics. As process stages are represented by the process operation time span, one can know which stage the data belong to by checking which time span the current sampling falls in. Process monitoring is conducted by comparing the two statistics with the control limits of the corresponding stage. A three-tank system, as shown in Figure 1, is used for the verification of the proposed approach. Closed-loop control is implemented for the levels of tanks 1 and 2. The two levels are brought from their initial conditions to the set points of h1 = 300 mm and h2 = 200 mm. The level of h3 is left to float to reflect the interaction between tanks 1 and 2. The process finishes after the three levels stabilize over a period of time. The raising water levels in the tanks result in a time-varying process dynamics. Five process variables including three levels h1, h2, h3 and two flow rates Q1 and Q2 were measured every second. 120 points of historical data were collected in each batch under the normal operation. A typical process curve is shown in Figure 2 for one run. Three-tank process. Process variable measurements for three-tank process. According to the status of the two manipulated variables Q1 and Q2, the process can be classified into the following major stages: stage I, in which both manipulated variables are at saturation; stage II, in which one manipulated variable Q1 is at saturation; and stages III through V, in which no manipulated variables are at saturation. The last three stages are defined as the decreasing stage (stage III), the tuning stage (stage IV), and the steady stage (stage V) as illustrated in Figure 2. The data of the 21 normal experiments are used to yield the reference data matrix X of dimension of 21 × 5 × 120. The loading matrices calculated from the time-slice matrices are fed to the clustering algorithm, resulting in five groups as shown in Figure 3, which agrees well with the earlier theoretical analysis of Figure 2. This proves that process correlation does remain similar within each stage, changes from stage to stage. Only two or three principal components are needed for each stage to explain over 90% variations, while, for the MPCA analysis of the process, the first three principal components can explain about 50% variations. Results of loading matrices clustering algorithm. The proposed approach is put into on-line monitoring tests. Figure 4 shows the monitoring of a normal batch, where the values of the two statistics, Hotelling T2 and SPE statistics, are well below the control limits, indicating that the whole batch is free of any process abnormality. For the second case, a fault was introduced to simulate a leakage of tank 1 by opening valve S1 at the 42nd sampling. According to the above stage division, this fault occurs in the second stage. From the monitoring charts shown in Figure 5, the abnormality can be clearly detected in the second stage at 47th sampling, only five samplings after the occurrence. Monitoring charts for a normal batch. Solid line, 99% control limit; dashed line, 95% control limit; dash dot line, on-line T2 or SPE. (a) Monitoring chart of T2 statistic; (b) monitoring chart of SPE statistic. Monitoring charts for an abnormal batch. Solid line, 99% control limit; dashed line, 95% control limit; dash dot line, on-line T2 or SPE. (a) Monitoring chart of T2 statistic; (b) monitoring chart of SPE statistic. A new modeling and on-line monitoring scheme for batch process has been developed based on the fact that changes in the process correlation may relate to its “operation” stages. Dividing the process into “operation” stages by analyzing and clustering the loading matrices and constructing sub-PCA model for each stage can apply the conventional two-way PCA “directly” for batch process monitoring, without the need of predicting future data of the evolving batch.