Abstract: Bit-flip attacks (BFAs) can manipulate deep neural networks (DNNs). For high-level DNN models running on deep learning (DL) frameworks such as PyTorch, BFAs that flip bits in model weights have been studied extensively and shown to be effective, and defenses have been proposed to guard model weights. However, DNNs are increasingly compiled into DNN executables by DL compilers to leverage hardware primitives. These executables exhibit distinct computation paradigms, and existing research fails to accurately capture and expose their BFA surfaces. To this end, we launch the first systematic study of BFAs on DNN executables. Prior BFAs are limited to attacking model weights and assume a strong white-box attacker with full knowledge of the victim model's weights, which is unrealistic because weights are often confidential. In contrast, we find that BFAs on DNN executables can be highly effective by exploiting the model structure (usually stored in the executable code), which requires knowing only the (often public) model structure. Importantly, such structure-based BFAs are pervasive, transferable, and more severe in DNN executables, and they evade existing defenses. To demonstrate the new attack surfaces, we assume a weaker and more realistic attacker with no knowledge of the victim model's weights. We design an automated tool that identifies vulnerable bits in victim executables with high confidence (70% vs. a 2% baseline). We show on DDR4 DRAM that only 1.4 flips on average are needed to degrade the accuracy of victim models to random guessing, including quantized models, which previously could require 23x more flips. We comprehensively evaluate 16 DNN executables, covering large-scale models trained on commonly used datasets and compiled by the two most popular DL compilers. Our findings call for incorporating security mechanisms into future DNN compilation toolchains.

Defending Bit-Flip Attack Through DNN Weight Reconstruction

One-bit Flip is All You Need: when Bit-flip Attack Meets Model Training

Improving Fault Tolerance for Reliable DNN Using Boundary-Aware Activation

Compiled Models, Built-In Exploits: Uncovering Pervasive Bit-Flip Attack Surfaces in DNN Executables

Stealthy Attack on Algorithmic-Protected DNNs via Smart Bit Flipping

Targeted Attack Against Deep Neural Networks Via Flipping Limited Weight Bits

Aegis: Mitigating Targeted Bit-flip Attacks against Deep Neural Networks

DeepNcode: Encoding-Based Protection against Bit-Flip Attacks on Neural Networks

Versatile Weight Attack Via Flipping Limited Bits

Impactful Bit-Flip Search on Full-precision Models

DNN-Defender: An in-DRAM Deep Neural Network Defense Mechanism for Adversarial Weight Attack

Adversarial Weight Prediction Networks for Defense of Industrial FDC Systems

DNN-Defender: A Victim-Focused In-DRAM Defense Mechanism for Taming Adversarial Weight Attack on DNNs

Hardly Perceptible Trojan Attack Against Neural Networks with Bit Flips

Enhancing Neural Network Robustness Against Fault Injection Through Non-linear Weight Transformations

DeepHammer: Depleting the Intelligence of Deep Neural Networks through Targeted Chain of Bit Flips

Harden Deep Neural Networks Against Fault Injections Through Weight Scaling

Attacking Graph Neural Networks with Bit Flips: Weisfeiler and Lehman Go Indifferent

Bit Error Robustness for Energy-Efficient DNN Accelerators

Toward Extremely Low Bit and Lossless Accuracy in DNNs with Progressive ADMM

Mitigating Adversarial Attacks for Deep Neural Networks by Input Deformation and Augmentation