Abstract:The escalating menace of backdoor attacks constitutes a formidable obstacle to the ongoing advancement of deep neural networks (DNNs), particularly in the security-sensitive applications such as face recognition and self-driving. Backdoored models render deliberately incorrect predictions on the inputs with the crafted triggers while behaving normally with the benign ones. Despite demonstrating the varying degrees of threat, existing backdoor attack strategies often prioritize stealthiness and defense evasions but neglect the practical feasibility in the real-world deployment scenarios. In this paper, we develop a backdoor attack leveraging bokeh effects (BABE), which introduces the bokeh effects as the trigger. Once the backdoored model is deployed in the vision application, the model's malicious behaviors can be activated only by utilizing the captured bokeh images without any other modifications. Specially, we employ the saliency and depth estimation maps to derive the bokeh images, thereby serving as the poisoned samples. Furthermore, to avoid the latent separation of the generated poisoned images, we propose distinct attack strategies on the basis of the adversary's prior abilities. For the adversary only with the data manipulation, we retain the original semantic labels fora subset of poisoned data during the training process. For the adversary with the manipulation of both the data and models, we construct a reference model trained on the clean samples to impose constraints on the latent representations of the poisoned images. Extensive experiments demonstrate the attack effects of the proposed BABE, even on the bokeh photos captured from Digital Still Cameras (DSC) and smartphones.

Backdoor Attacks on the DNN Interpretation System

Fooling Neural Network Interpretations - Adversarial Noise to Attack Images.

Invisible Backdoor Attacks on Deep Neural Networks via Steganography and Regularization

An Invisible Backdoor Attack Based On Semantic Feature

Hidden Backdoor Attack against Semantic Segmentation Models

SATBA: An Invisible Backdoor Attack Based On Spatial Attention

Regula Sub-rosa: Latent Backdoor Attacks on Deep Neural Networks

Stand-in Backdoor: A Stealthy and Powerful Backdoor Attack

An Effective and Resilient Backdoor Attack Framework against Deep Neural Networks and Vision Transformers

SGBA: A Stealthy Scapegoat Backdoor Attack Against Deep Neural Networks

Backdoor Attacks on Image Classification Models in Deep Neural Networks

Backdoor Attacks to Deep Learning Models and Countermeasures: A Survey

Saliency Map-Based Local White-Box Adversarial Attack Against Deep Neural Networks

Invisible Backdoor Attacks on Key Regions Based on Target Neurons in Self-Supervised Learning

Reflection Backdoor: A Natural Backdoor Attack on Deep Neural Networks

Imperceptible and Multi-channel Backdoor Attack against Deep Neural Networks

BABE: Backdoor Attack with Bokeh Effects Via Latent Separation Suppression

Untargeted Backdoor Attack Against Object Detection

Imperceptible Backdoor Attack: from Input Space to Feature Representation

Investigating the Backdoor on DNNs Based on Recolorization and Reconstruction: From a Multi-Channel Perspective