Abstract:Deep learning with edge computing arises as a popular paradigm for powering edge devices with intelligence. As the size of deep neural networks (DNN) continually increases, model quantization, which converts the full-precision model into lower-bit representation while mostly preserving the accuracy, becomes a prerequisite for deploying a well-trained DNN on resource-limited edge devices. However, to properly quantize a DNN requires an essential amount of expert knowledge, or otherwise the model accuracy would be devastatingly affected. Alternatively, recent years witness the birth of third-party model supply chains which provide pretrained quantized neural networks (QNN) for free downloading. In this paper, we systematically analyze the potential threats of trojaned models in third-party QNN supply chains. For the first time, we describe and implement a QUAntization-SpecIfic backdoor attack (QUASI), which manipulates the quantization mechanism to inject a backdoor specific to the quantized model. In other words, the attacker-specified inputs, or triggers, would not cause misbehaviors of the trojaned model in full precision until the backdoor function is automatically completed by a normal quantization operation, producing a trojaned QNN which can be triggered with a near 100% success rate. Our proposed QUASI attack reveals several key vulnerabilities in the existing QNN supply chains: (i) QUASI demonstrates a third-party QNN released online can also be injected with backdoors, while, unlike full-precision models, there is almost no working algorithm for checking the fidelity of a QNN. (ii) More threateningly, the backdoor injected by QUASI remains inactivated in the full-precision model, which inhibits model consumers from attributing undergoing trojan attacks to the malicious model provider. As a practical implication, we alarm it can be highly risky to accept and deploy third-party QNN on edge devices at the current stage, if without future mitigation studies.

Deep Neural Network Quantization Framework for Effective Defense against Membership Inference Attacks

Hessian-based Mixed-Precision Quantization with Transition Aware Training for Neural Networks

Defensive Quantization: When Efficiency Meets Robustness

Improving the Robustness of Quantized Deep Neural Networks to White-Box Attacks using Stochastic Quantization and Information-Theoretic Ensemble Training

Double-Win Quant: Aggressively Winning Robustness of Quantized Deep Neural Networks Via Random Precision Training and Inference

Improving Robustness Against Adversarial Attacks with Deeply Quantized Neural Networks

A White Paper on Neural Network Quantization

Error-Silenced Quantization: Bridging Robustness and Compactness

Saliency Assisted Quantization for Neural Networks

Nearest is Not Dearest: Towards Practical Defense against Quantization-conditioned Backdoor Attacks

Understanding and defending against White-box membership inference attack in deep learning

Membership reconstruction attack in deep neural networks

Scalable Membership Inference Attacks via Quantile Regression

Understanding the Threats of Trojaned Quantized Neural Network in Model Supply Chains.

Quantization-aware Neural Architectural Search for Intrusion Detection

Investigating the Impact of Quantization on Adversarial Robustness

Diffence: Fencing Membership Privacy With Diffusion Models

Benchmarking the Robustness of Quantized Models

Bit Efficient Quantization for Deep Neural Networks

Quantization Aware Attack: Enhancing Transferable Adversarial Attacks by Model Quantization

Order of Magnitude Speedups for LLM Membership Inference