Causal-Trivial Attention Graph Neural Network for Fault Diagnosis of Complex Industrial Processes
Hao Wang, Ruonan Liu, Steven X. Ding, Qinghua Hu, Zengxiang Li, Hongkuan Zhou
DOI: https://doi.org/10.1109/tii.2023.3282979
IF: 12.3
2023-01-01
IEEE Transactions on Industrial Informatics
Abstract: In modern industrial systems, components interact with each other in complex ways, which makes identifying the operational condition of a system a challenging task. The embedded components of an industrial system and their interactions can be expressed as the nodes and edges of a graph, respectively, so graph representation algorithms are powerful tools for fault diagnosis of industrial systems. As one of the most commonly used families of graph representation algorithms, Graph Neural Networks (GNNs) mainly follow the “learning to attend” paradigm. GNNs extract features from the training data and learn the statistical correlations between features and labels, so the attended graph tends to rely on non-causal features as a shortcut for prediction. Such shortcut features are unstable and depend on the data distribution of the training dataset, which reduces the generalization ability of the classifier. A causal analysis of GNN modeling for graph representation shows that shortcut features act as confounders between causal features and predictions, causing classifiers to learn spurious correlations. Therefore, to discover causal patterns and weaken the confounding effect of shortcut features, a Causal-Trivial Attention Graph Neural Network (CTA-GNN) strategy is proposed. First, node and edge representations are obtained by estimating soft masks. Second, causal features and shortcut features are separated from the graph through disentanglement. Third, the backdoor adjustment from causal theory is parameterized so that each causal feature is combined with a variety of shortcut features. Finally, comparative experiments on the Three-Phase Flow Facility (TFF) dataset illustrate the effectiveness of the proposed method.
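The three steps named in the abstract (soft masks, disentanglement, and combining each causal feature with varied shortcut features) can be illustrated with a minimal sketch. This is not the authors' implementation: the sigmoid mask, the mean readout, the additive combination, the random shuffle used to vary the shortcut features, and all function names are assumptions made for illustration only.

```python
import math
import random

def soft_masks(logits):
    """Turn per-node attention logits into complementary soft masks.

    Assumption: a sigmoid gives the causal mask, and the trivial
    (shortcut) mask is its complement, so the two sum to 1 per node.
    """
    causal = [1.0 / (1.0 + math.exp(-z)) for z in logits]
    trivial = [1.0 - m for m in causal]
    return causal, trivial

def masked_mean(node_feats, mask):
    """Graph-level readout: mean of node features weighted by a soft mask."""
    n, dim = len(node_feats), len(node_feats[0])
    return [sum(mask[i] * node_feats[i][d] for i in range(n)) / n
            for d in range(dim)]

def disentangle(node_feats, logits):
    """Split one graph into a causal and a shortcut representation."""
    causal_mask, trivial_mask = soft_masks(logits)
    return (masked_mean(node_feats, causal_mask),
            masked_mean(node_feats, trivial_mask))

def backdoor_combine(causal_batch, trivial_batch, rng):
    """Pair each causal feature with a shortcut feature from a random graph.

    Assumption: shuffling shortcut features across the batch and adding
    them stands in for stratifying over the confounder.
    """
    shuffled = trivial_batch[:]
    rng.shuffle(shuffled)
    return [[c + t for c, t in zip(cv, tv)]
            for cv, tv in zip(causal_batch, shuffled)]

# Toy batch: 4 graphs, 5 nodes each, 3 features per node (synthetic data).
rng = random.Random(0)
graphs = [[[rng.gauss(0, 1) for _ in range(3)] for _ in range(5)]
          for _ in range(4)]
logits = [[rng.gauss(0, 1) for _ in range(5)] for _ in range(4)]

pairs = [disentangle(x, s) for x, s in zip(graphs, logits)]
causal_batch = [c for c, _ in pairs]
trivial_batch = [t for _, t in pairs]
mixed = backdoor_combine(causal_batch, trivial_batch, rng)
```

In a trained model, the logits would come from a learned attention module, and a classifier applied to `mixed` would be encouraged to give the same prediction regardless of which shortcut feature was paired with a given causal feature.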
automation & control systems; computer science, interdisciplinary applications; engineering, industrial