Abstract: Deep learning has shown superior performance on complex, specialized tasks (e.g., computer vision, audio, and language processing). However, prior work has confirmed that Deep Neural Networks (DNNs) are vulnerable to carefully crafted adversarial perturbations, which cause DNNs to fail on specific tasks. In object detection, the background contributes little to object classification; adversarial perturbations added to the background do not make the attack more effective at fooling deep neural detection models, yet they introduce substantial distortion into the generated examples. Motivated by this observation, we introduce an adversarial attack algorithm named Adaptive Object-oriented Adversarial Method (AO²AM). It aims to fool deep neural object detection networks by adaptively accumulating object-based gradients and adding the resulting adaptive object-based adversarial perturbations only to the objects rather than to the whole input image. AO²AM effectively pushes the representations of the generated adversarial samples close to the decision boundary in the latent space, and forces deep neural detection networks to yield inaccurate locations and false classifications during object detection. Compared with existing adversarial attack methods, whose perturbations act on the global scale of the original inputs, the adversarial examples produced by AO²AM effectively fool deep neural object detection networks while maintaining a high structural similarity (SSIM) with the corresponding clean inputs. When attacking Faster R-CNN, AO²AM achieves an attack success rate (ASR) above 98.00% on pre-processed Pascal VOC 2007 & 2012 (Val) and an SSIM above 0.870. When fooling SSD, AO²AM achieves an SSIM exceeding 0.980 under the L2-norm constraint. On SSIM and Mean Attack Ratio, AO²AM outperforms adversarial attack methods based on global-scale perturbations.
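
The core idea of restricting perturbations to object regions can be illustrated with a minimal NumPy sketch. This is not the AO²AM implementation: the abstract does not specify how the adaptive cumulation is computed, so a standard momentum-style accumulation of normalized gradients is swapped in as a stand-in. The function names (`box_mask`, `masked_momentum_step`), the step size, and the decay factor are all hypothetical, and the gradient is a random placeholder for what would in practice be the gradient of a detector's loss with respect to the input image.

```python
import numpy as np

def box_mask(shape, boxes):
    """Binary mask of shape (H, W, 1): 1 inside any bounding box, 0 elsewhere.

    Boxes are (x1, y1, x2, y2) in pixel coordinates (an assumed convention).
    """
    h, w = shape[:2]
    mask = np.zeros((h, w, 1), dtype=np.float32)
    for x1, y1, x2, y2 in boxes:
        mask[y1:y2, x1:x2] = 1.0
    return mask

def masked_momentum_step(image, grad, velocity, mask, step=2 / 255, decay=0.9):
    """One attack step that only perturbs pixels inside the object mask.

    Momentum accumulation is an assumption standing in for the paper's
    "adaptive cumulation of object-based gradients".
    """
    g = grad * mask                                  # discard background gradients
    velocity = decay * velocity + g / (np.abs(g).sum() + 1e-12)
    perturbation = step * np.sign(velocity) * mask   # object-only perturbation
    adv = np.clip(image + perturbation, 0.0, 1.0)    # stay in valid pixel range
    return adv, velocity

# Toy usage: an 8x8 gray image with one 3x3 "object" box.
image = np.full((8, 8, 3), 0.5, dtype=np.float32)
grad = np.random.default_rng(0).standard_normal((8, 8, 3)).astype(np.float32)
mask = box_mask(image.shape, [(2, 2, 5, 5)])
adv, velocity = masked_momentum_step(image, grad, np.zeros_like(image), mask)
```

Because the mask zeroes both the gradient and the final perturbation outside the boxes, background pixels are left identical to the clean input, which is the property the abstract credits for the high SSIM of the generated examples.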

AdvFoolGen: Creating Persistent Troubles for Deep Classifiers

Fooling Neural Network Interpretations - Adversarial Noise to Attack Images

GAN Generate Adversarial Examples to Fool Deep Networks

ABCAttack: A Gradient-Free Optimization Black-Box Attack for Fooling Deep Image Classifiers

Fooling Examples: Another Intriguing Property of Neural Networks

Tailoring Adversarial Attacks on Deep Neural Networks for Targeted Class Manipulation Using DeepFool Algorithm

A Multi-objective Examples Generation Approach to Fool the Deep Neural Networks in the Black-Box Scenario

DeepFool: A Simple and Accurate Method to Fool Deep Neural Networks

An Effective Way to Boost Black-Box Adversarial Attack

AdverseGen: A Practical Tool for Generating Adversarial Examples to Deep Neural Networks Using Black-Box Approaches

DeepDefense: Training Deep Neural Networks with Improved Robustness

An Evolutionary-Based Black-Box Attack to Deep Neural Network Classifiers

Are You Confident That You Have Successfully Generated Adversarial Examples?

Adversarial Attack? Don't Panic

Patch-Wise Attack for Fooling Deep Neural Network

EnsembleFool: A Method to Generate Adversarial Examples Based on Model Fusion Strategy

Fooling deep neural detection networks with adaptive object-oriented adversarial perturbation

GenAttack: Practical Black-box Attacks with Gradient-Free Optimization

Deep Defense: Training DNNs with Improved Adversarial Robustness

Online Alternate Generator against Adversarial Attacks

Graphfool: Targeted Label Adversarial Attack on Graph Embedding