A network structure for industrial process fault diagnosis based on hyper feature extraction and stacked LSTM

Yanwei Ren,Ridong Zhang,Furong Gao
DOI: https://doi.org/10.1016/j.ces.2024.119745
IF: 4.7
2024-01-13
Chemical Engineering Science
Abstract:Traditional feature extraction methods are highly dependent on manual work, and manual extraction will inevitably ignore some potentially useful features. For complex systems, there is no simple one-to-one correspondence between faults and symptoms, and fault characteristics often have complex characteristics such as hierarchy, propagation correlation, and delay. In the existing research results, the convolutional neural network (CNN) has been proven to be an effective network structure. But the existing network structure is more like a classifier, and only some features of industrial data are considered in the feature extraction stage, which will cause some feature information to be lost during the training process. To effectively solve the above problems, this paper proposes an industrial process-oriented hyper feature extraction network structure. This method first extracts multi-level CNN fault features, aggregates the hierarchical feature maps and compresses them in a unified feature space to form hyper features. Then the hyper features formed by feature fusion are input to the stacked Long Short Term Memory (LSTM) network for further feature extraction. In this way, the information in the time series can be effectively transmitted, and at the same time, the problem of gradient disappearance caused by long-term training can be solved. We conducted experiments on the Tennessee-Eastman (TE) benchmark process and industrial coking oven datasets. The experimental results show that the proposed method can effectively improve the fault diagnosis accuracy of the industrial production process system.
engineering, chemical
What problem does this paper attempt to address?