Abstract:Deep neural network (DNN) accelerators overcome the power and memory walls for executing neural-net models locally on edge-computing devices to support sophisticated AI applications. The advocacy of “model once, run optimized anywhere” paradigm introduces potential new security threat to edge intelligence that is methodologically different from the well-known adversarial examples. Existing adversarial examples modify the input samples presented to an AI application either digitally or physically to cause a misclassification. Nevertheless, these input-based perturbations are not robust or surreptitious on multi-view target. To generate a good adversarial example for misclassifying a real-world target of variational viewing angle, lighting and distance, a decent number of target’s samples are required to extract the rare anomalies that can cross the decision boundary. The feasible perturbations are substantial and visually perceptible. In this paper, we propose a new glitch injection attack on DNN accelerator that is capable of misclassifying a target under variational viewpoints. The glitches injected into the computation clock signal induce transitory but disruptive errors in the intermediate results of the multiply-and-accumulate (MAC) operations. The attack pattern for each target of interest consists of sparse instantaneous glitches, which can be derived from just one sample of the target. Two modes of attack patterns are derived, and their effectiveness are demonstrated on four representative ImageNet models implemented on the Deep-learning Processing Unit (DPU) of FPGA edge and its DNN development toolchain. The attack success rates are evaluated on 118 objects in 61 diverse sensing conditions, including 25 viewing angles (−60° to 60°), 24 illumination directions and 12 color temperatures. In the covert mode, the success rates of our attack exceed existing stealthy adversarial examples by more than 16.3%, with only two glitches injected into ten thousands to a million cycles for one complete inference. In the robust mode, the attack success rates on all four DNNs are more than 96.2% with an average glitch intensity of 1.4% and a maximum glitch intensity of 10.2%.

An Interpretive Adversarial Attack Method: Attacking Softmax Gradient Layer-Wise Relevance Propagation Based on Cosine Similarity Constraint and TS-Invariant

Fooling Neural Network Interpretations - Adversarial Noise to Attack Images.

Attack As Defense: Characterizing Adversarial Examples Using Robustness.

CSFAdv: Critical Semantic Fusion Guided Least-Effort Adversarial Example Attacks

Improving Model Robustness Against Adversarial Examples with Redundant Fully Connected Layer.

Stealthy and Robust Glitch Injection Attack on Deep Learning Accelerator for Target with Variational Viewpoint.

An efficient adversarial example generation algorithm based on an accelerated gradient iterative fast gradient

Enhancing Adversarial Attacks: The Similar Target Method

Invisible Adversarial Attack Against Deep Neural Networks: an Adaptive Penalization Approach

Demiguise Attack: Crafting Invisible Semantic Adversarial Perturbations with Perceptual Similarity

Analyzing the Noise Robustness of Deep Neural Networks

Adversarial Attacks Hidden in Plain Sight

A Direct Approach to Robust Deep Learning Using Adversarial Networks

Improved Forward-Backward Propagation To Generate Adversarial Examples

Adversarial sample attack method based on loss smoothing

Evading Defenses to Transferable Adversarial Examples by Translation-Invariant Attacks

Nesterov Accelerated Gradient and Scale Invariance for Adversarial Attacks

Exploring Adversarial Attacks on Neural Networks: An Explainable Approach

Enhanced Attacks on Defensively Distilled Deep Neural Networks.

Are You Confident That You Have Successfully Generated Adversarial Examples?

Mitigating Adversarial Attacks for Deep Neural Networks by Input Deformation and Augmentation