Abstract:Object detection is the foundation of various critical computer-vision tasks such as segmentation, object tracking, and event detection. To train an object detector with satisfactory accuracy, a large amount of data is required. However, due to the intensive workforce involved with annotating large datasets, such a data curation task is often outsourced to a third party or relied on volunteers. This work reveals severe vulnerabilities of such data curation pipeline. We propose MACAB that crafts clean-annotated images to stealthily implant the backdoor into the object detectors trained on them even when the data curator can manually audit the images. We observe that the backdoor effect of both misclassification and the cloaking are robustly achieved in the wild when the backdoor is activated with inconspicuously natural physical triggers. Backdooring non-classification object detection with clean-annotation is challenging compared to backdooring existing image classification tasks with clean-label, owing to the complexity of having multiple objects within each frame, including victim and non-victim objects. The efficacy of the MACAB is ensured by constructively i abusing the image-scaling function used by the deep learning framework, ii incorporating the proposed adversarial clean image replica technique, and iii combining poison data selection criteria given constrained attacking budget. Extensive experiments demonstrate that MACAB exhibits more than 90% attack success rate under various real-world scenes. This includes both cloaking and misclassification backdoor effect even restricted with a small attack budget. The poisoned samples cannot be effectively identified by state-of-the-art detection techniques.The comprehensive video demo is at <a class="link-external link-https" href="https://youtu.be/MA7L_LpXkp4" rel="external noopener nofollow">this https URL</a>, which is based on a poison rate of 0.14% for YOLOv4 cloaking backdoor and Faster R-CNN misclassification backdoor.

Palette: Physically-Realizable Backdoor Attacks Against Video Recognition Models

KerbNet: A QoE-aware Kernel-Based Backdoor Attack Framework

Clean-Label Backdoor Attacks on Video Recognition Models

Clean-Annotation Backdoor Attack against Lane Detection Systems in the Wild

Temporal-Distributed Backdoor Attack Against Video Based Action Recognition

Physical Invisible Backdoor Based on Camera Imaging

Backdoor Attacks Against Deep Learning Systems in the Physical World

Moiré Backdoor Attack (MBA): A Novel Trigger for Pedestrian Detectors in the Physical World

PatchBackdoor: Backdoor Attack against Deep Neural Networks without Model Modification

On the Credibility of Backdoor Attacks Against Object Detectors in the Physical World

A Motional but Temporally Consistent Physical Video Examples

Backdoor Attack in the Physical World

Towards Clean-Label Backdoor Attacks in the Physical World

A stealthy and robust backdoor attack via frequency domain transform

Few-Shot Backdoor Attacks on Visual Object Tracking

Heuristic Black-box Adversarial Attacks on Video Recognition Models

TransCAB: Transferable Clean-Annotation Backdoor to Object Detection with Natural Trigger in Real-World

Towards Decision-based Sparse Attacks on Video Recognition

Attack as Defense: Run-time Backdoor Implantation for Image Content Protection

Investigating the Backdoor on DNNs Based on Recolorization and Reconstruction: From a Multi-Channel Perspective

An Effective and Resilient Backdoor Attack Framework against Deep Neural Networks and Vision Transformers