Hybrid Unsupervised Learning Strategy for Monitoring Industrial Batch Processes

Christian W. Frey
2024-04-04
Abstract:Industrial production processes, especially in the pharmaceutical industry, are complex systems that require continuous monitoring to ensure efficiency, product quality, and safety. This paper presents a hybrid unsupervised learning strategy (HULS) for monitoring complex industrial processes. Addressing the limitations of traditional Self-Organizing Maps (SOMs), especially in scenarios with unbalanced data sets and highly correlated process variables, HULS combines existing unsupervised learning techniques to address these challenges. To evaluate the performance of the HULS concept, comparative experiments are performed based on a laboratory batch
Machine Learning,Signal Processing,Systems and Control
What problem does this paper attempt to address?
The paper primarily addresses the challenges in monitoring industrial production processes, particularly the complex system monitoring issues in the pharmaceutical industry, by proposing a Hybrid Unsupervised Learning Strategy (HULS). Specifically, the paper aims to solve the following key problems: 1. **Handling Imbalanced Datasets**: In actual industrial processes, the amount of data from different stages may be imbalanced. For example, data from certain stable stages may be much more abundant than data from transitional stages, which can make traditional methods like Self-Organizing Maps (SOMs) ineffective in identifying all stages. 2. **Handling Highly Correlated Process Variables**: In industrial production processes, there may be high correlations between multiple process variables, making it difficult for methods like SOMs to accurately model and identify anomalies in the process. 3. **Improving the Accuracy of Process Stage Identification**: For processes that include multiple distinct stages, it is crucial to reliably identify these stages and their sequence to ensure product quality. To overcome the above challenges, the paper proposes the HULS method, which combines the advantages of Instantaneous Topological Map (ITM) and SOMs. The specific approach includes: - First, using ITM to process the training dataset and generate a set of neuron weights, which can be considered as a resampling of the original dataset. - Then, using these weights as input to train a SOM, followed by utilizing the SOM for Unified-Distance Matrix (UM) transformation and Watershed Transformation (WT) to achieve effective clustering and process stage identification. Through experimental validation in laboratory batch processes, the HULS method significantly outperforms traditional SOM methods in handling imbalanced datasets, highly correlated variables, and identifying unknown process stages, especially in terms of ensuring the accuracy and robustness of process monitoring. Additionally, in terms of anomaly detection performance, the HULS method also demonstrates better recovery capabilities.