Multi-modal Hybrid Modeling Strategy Based on Gaussian Mixture Variational Autoencoder and Spatial–temporal Attention: Application to Industrial Process Prediction

Haifei Peng, Jian Long, Cheng Huang, Shibo Wei, Zhencheng Ye
DOI: https://doi.org/10.1016/j.chemolab.2023.105029
IF: 4.175
2024-01-01
Chemometrics and Intelligent Laboratory Systems
Abstract: Industrial processes are characterized by multi-modal operation and complex spatial and temporal correlations. Although several multi-modal methods have been proposed, few can effectively extract both deep multi-modal representations and these intricate spatial and temporal relationships. This paper proposes a novel multi-modal hybrid modeling strategy (GMVAE-STA) for industrial process prediction, which combines the Gaussian Mixture Variational Autoencoder (GMVAE) with a spatial–temporal attention based Gated Recurrent Unit (STA-GRU). First, the GMVAE maps the raw data to a latent space that follows a Gaussian mixture distribution, and the data with the highest probability under each Gaussian component are identified as one mode. Then, the STA-GRU captures the complex spatial and temporal relationships within each mode and makes predictions. Experimental results on the Tennessee Eastman process and a real-world fluid catalytic cracking process demonstrate the effectiveness of the proposed method in both mode classification and prediction.
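The mode-identification step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes the latent codes have already been produced by a trained encoder, that the mixture weights, means, and variances are known, and that each component has a diagonal covariance. All function names and numeric values are hypothetical and chosen only for illustration.

```python
import math


def log_gaussian_diag(z, mean, var):
    """Log density of a diagonal-covariance Gaussian evaluated at z."""
    return sum(
        -0.5 * (math.log(2.0 * math.pi * v) + (x - m) ** 2 / v)
        for x, m, v in zip(z, mean, var)
    )


def assign_modes(latents, weights, means, variances):
    """For each latent vector, return the index of the most probable component."""
    modes = []
    for z in latents:
        # Unnormalized log posterior of each component: log prior + log likelihood.
        scores = [
            math.log(w) + log_gaussian_diag(z, m, v)
            for w, m, v in zip(weights, means, variances)
        ]
        modes.append(max(range(len(scores)), key=scores.__getitem__))
    return modes


def group_by_mode(modes):
    """Collect sample indices per mode so a separate predictor can be fit to each."""
    groups = {}
    for index, mode in enumerate(modes):
        groups.setdefault(mode, []).append(index)
    return groups


# Hypothetical 2-D latent codes and a two-component mixture (illustrative values).
latents = [[0.1, -0.2], [2.9, 3.1], [0.3, 0.0], [3.2, 2.8]]
weights = [0.5, 0.5]
means = [[0.0, 0.0], [3.0, 3.0]]
variances = [[1.0, 1.0], [1.0, 1.0]]

modes = assign_modes(latents, weights, means, variances)
groups = group_by_mode(modes)
print(modes)
print(groups)
```

In the strategy the abstract describes, each group of samples would then be passed to its own STA-GRU predictor. That second stage is omitted here because the abstract does not specify the network's details.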