Abstract: Deep learning models have been deployed in numerous real-world applications such as autonomous driving and surveillance. However, these models are vulnerable in adversarial environments. Backdoor attacks are emerging as a severe security threat: an attacker injects a backdoor trigger into a small portion of the training data so that the trained model behaves normally on benign inputs but gives incorrect predictions when the specific trigger appears. While most research on backdoor attacks focuses on image classification, backdoor attacks on object detection have not been explored, although they are equally important. Object detection has been adopted as an important module in various security-sensitive applications such as autonomous driving, so backdoor attacks on object detection could pose severe threats to human life and property. We propose four kinds of backdoor attacks for the object detection task: 1) Object Generation Attack: a trigger can falsely generate an object of the target class; 2) Regional Misclassification Attack: a trigger can change the prediction of a surrounding object to the target class; 3) Global Misclassification Attack: a single trigger can change the predictions of all objects in an image to the target class; and 4) Object Disappearance Attack: a trigger can make the detector fail to detect an object of the target class. We develop appropriate metrics to evaluate the four backdoor attacks on object detection. We perform experiments with two typical object detection models, Faster-RCNN and YOLOv3, on different datasets. More crucially, we demonstrate that even fine-tuning on another benign dataset cannot remove the backdoor hidden in the object detection model. To defend against these backdoor attacks, we propose Detector Cleanse, an entropy-based run-time detection framework that identifies poisoned testing samples for any deployed object detector.
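To make the poisoning step concrete, the following is a minimal sketch of how training data for an Object Generation Attack might be prepared: a trigger patch is stamped onto a small fraction of images and a bounding box labeled with the target class is added to the annotations. This is an illustration under stated assumptions, not the authors' implementation. The function names, the annotation format (lists of `[x1, y1, x2, y2]` boxes with integer labels), the random trigger placement, the box that tightly encloses the trigger, and the poisoning rate are all hypothetical choices that the abstract does not specify.

```python
import numpy as np


def poison_object_generation(image, boxes, labels, trigger, target_class, rng):
    """Stamp `trigger` at a random location and annotate it as a target-class object.

    image:   HxWx3 uint8 array
    boxes:   list of [x1, y1, x2, y2] boxes (assumed annotation format)
    labels:  list of int class ids, one per box
    Returns a new (image, boxes, labels) triple; the inputs are not modified.
    """
    h, w = image.shape[:2]
    th, tw = trigger.shape[:2]
    # Pick a top-left corner so the trigger fits entirely inside the image.
    y = int(rng.integers(0, h - th + 1))
    x = int(rng.integers(0, w - tw + 1))
    poisoned = image.copy()
    poisoned[y:y + th, x:x + tw] = trigger
    # Assumption: the fake object's box tightly encloses the trigger patch.
    new_boxes = [list(b) for b in boxes] + [[x, y, x + tw, y + th]]
    new_labels = list(labels) + [target_class]
    return poisoned, new_boxes, new_labels


def poison_dataset(samples, trigger, target_class, rate, seed=0):
    """Poison a fraction `rate` of `samples`; the rest are returned unchanged."""
    rng = np.random.default_rng(seed)
    n_poison = int(round(rate * len(samples)))
    chosen = set(rng.choice(len(samples), size=n_poison, replace=False).tolist())
    out = []
    for i, (img, boxes, labels) in enumerate(samples):
        if i in chosen:
            out.append(poison_object_generation(
                img, boxes, labels, trigger, target_class, rng))
        else:
            out.append((img, boxes, labels))
    return out
```

A detector trained on the output of `poison_dataset` would see the trigger consistently paired with a target-class box, which is the association the attack relies on. The other three attacks would change the annotations differently (relabeling nearby boxes, relabeling all boxes, or deleting target-class boxes) and are not covered by this sketch.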

Segmentation Based Backdoor Attack Detection

KerbNet: A QoE-aware Kernel-Based Backdoor Attack Framework

B3: Backdoor Attacks Against Black-box Machine Learning Models

Untargeted Backdoor Attack against Object Detection

Hidden Backdoor Attack against Semantic Segmentation Models

BadDet: Backdoor Attacks on Object Detection

An Invisible Backdoor Attack Based On Semantic Feature

Backdoor Attack in the Physical World

Universal Backdoor Attacks Detection via Adaptive Adversarial Probe

Escaping Backdoor Attack Detection of Deep Learning

Countering Backdoor Attacks in Image Recognition: A Survey and Evaluation of Mitigation Strategies

Parity measurements of nuclear levels using a free-electron-laser generated gamma-ray beam.

Adaptive Backdoor Attack Against Deep Neural Networks

SATBA: An Invisible Backdoor Attack Based On Spatial Attention

Imperceptible and Multi-channel Backdoor Attack against Deep Neural Networks

Clean-Label Backdoor Attacks on Video Recognition Models

Attacking by Aligning: Clean-Label Backdoor Attacks on Object Detection

A multitarget backdooring attack on deep neural networks with random location trigger

Scalable Backdoor Detection in Neural Networks

Universal Soldier: Using Universal Adversarial Perturbations for Detecting Backdoor Attacks