Abstract:Recently developed fault classification methods for industrial processes are mainly data-driven. Notably, models based on deep neural networks have significantly improved fault classification accuracy owing to the inclusion of a large number of data patterns. However, these data-driven models are vulnerable to adversarial attacks; thus, small perturbations on the samples can cause the models to provide incorrect fault predictions. Several recent studies have demonstrated the vulnerability of machine learning methods and the existence of adversarial samples. This paper proposes a black-box attack method with an extreme constraint for a safe-critical industrial fault classification system: Only one variable can be perturbed to craft adversarial samples. Moreover, to hide the adversarial samples in the visualization space, a Jacobian matrix is used to guide the perturbed variable selection, making the adversarial samples in the dimensional reduction space invisible to the human eye. Using the one-variable attack (OVA) method, we explore the vulnerability of industrial variables and fault types, which can help understand the geometric characteristics of fault classification systems. Based on the attack method, a corresponding adversarial training defense method is also proposed, which efficiently defends against an OVA and improves the prediction accuracy of the classifiers. In experiments, the proposed method was tested on two datasets from the Tennessee–Eastman process (TEP) and steel plates (SP). We explore the vulnerability and correlation within variables and faults and verify the effectiveness of OVAs and defenses for various classifiers and datasets. For industrial fault classification systems, the attack success rate of our method is close to (on TEP) or even higher than (on SP) the current most effective first-order white-box attack method, which requires perturbation of all variables.

Robust Adversarial Attacks on Imperfect Deep Neural Networks in Fault Classification

Attack As Defense: Characterizing Adversarial Examples Using Robustness.

There is Limited Correlation Between Coverage and Robustness for Deep Neural Networks

One-Variable Attack on the Industrial Fault Classification System and Its Defense

Adversarial Learning from Imbalanced Data: A Robust Industrial Fault Classification Method

Improving Model Robustness Against Adversarial Examples with Redundant Fully Connected Layer.

Adversarial Attacks on Neural-Network-Based Soft Sensors: Directly Attack Output

Adversarial robustness improvement for deep neural networks

Not So Robust After All: Evaluating the Robustness of Deep Neural Networks to Unseen Adversarial Attacks

Fault Sneaking Attack: a Stealthy Framework for Misleading Deep Neural Networks

Towards Robustifying Image Classifiers against the Perils of Adversarial Attacks on Artificial Intelligence Systems

Adversarial Attacks and Defenses in Fault Detection and Diagnosis: A Comprehensive Benchmark on the Tennessee Eastman Process

Towards Deep Learning Models Resistant to Adversarial Attacks

SoK: Certified Robustness for Deep Neural Networks

Evaluating and Improving Adversarial Robustness of Machine Learning-Based Network Intrusion Detectors

DeepSafe: A Data-driven Approach for Checking Adversarial Robustness in Neural Networks

DeepDefense: Training Deep Neural Networks with Improved Robustness.

Mitigating Adversarial Attacks for Deep Neural Networks by Input Deformation and Augmentation

A Review of Adversarial Attacks in Computer Vision

An ADMM-Based Universal Framework for Adversarial Attacks on Deep Neural Networks