Abstract:Recent studies have highlighted that deep neural networks (DNNs) are vulnerable to adversarial attacks, even in a black-box scenario. However, most of the existing black-box attack algorithms need to make a huge amount of queries to perform attacks, which is not practical in the real world. We note one of the main reasons for the massive queries is that the adversarial example is required to be visually similar to the original image, but in many cases, how adversarial examples look like does not matter much. It inspires us to introduce a new attack called input-free attack, under which an adversary can choose an arbitrary image to start with and is allowed to add perceptible perturbations on it. Following this approach, we propose two techniques to significantly reduce the query complexity. First, we initialize an adversarial example with a gray color image on which every pixel has roughly the same importance for the target model. Then we shrink the dimension of the attack space by perturbing a small region and tiling it to cover the input image. To make our algorithm more effective, we stabilize a projected gradient ascent algorithm with momentum, and also propose a heuristic approach for region size selection. Recent studies have highlighted that deep neural networks (DNNs) are vulnerable to adversarial attacks, even in a black-box scenario. However, most of the existing black-box attack algorithms need to make a huge amount of queries to perform attacks, which is not practical in the real world. We note one of the main reasons for the massive queries is that the adversarial example is required to be visually similar to the original image, but in many cases, how adversarial examples look like does not matter much. It inspires us to introduce a new attack called input-free attack, under which an adversary can choose an arbitrary image to start with and is allowed to add perceptible perturbations on it. Following this approach, we propose two techniques to significantly reduce the query complexity. First, we initialize an adversarial example with a gray color image on which every pixel has roughly the same importance for the target model. Then we shrink the dimension of the attack space by perturbing a small region and tiling it to cover the input image. To make our algorithm more effective, we stabilize a projected gradient ascent algorithm with momentum, and also propose a heuristic approach for region size selection. Through extensive experiments, we show that with only 1,701 queries on average, we can perturb a gray image to any target class of ImageNet with a 100% success rate on InceptionV3. Besides, our algorithm has successfully defeated two real-world systems, the Clarifai food detection API and the Baidu Animal Identification API.

A Geometry-Inspired Decision-Based Attack

Fooling Neural Network Interpretations - Adversarial Noise to Attack Images.

DeepFool: A Simple and Accurate Method to Fool Deep Neural Networks

ABCAttack: A Gradient-Free Optimization Black-Box Attack for Fooling Deep Image Classifiers

Towards Query Efficient Black-box Attacks

AdvFoolGen: Creating Persistent Troubles for Deep Classifiers

Revisiting DeepFool: generalization and improvement

Tailoring Adversarial Attacks on Deep Neural Networks for Targeted Class Manipulation Using DeepFool Algorithm

Perception-Driven Imperceptible Adversarial Attack Against Decision-Based Black-Box Models

FCGSM: Fast Conjugate Gradient Sign Method for Adversarial Attack on Image Classification

Decision-Based Adversarial Attack With Frequency Mixup

FineFool: A novel DNN object contour attack on image recognition based on the attention perturbation adversarial technique

Graphfool: Targeted Label Adversarial Attack on Graph Embedding

FoolChecker: A platform to evaluate the robustness of images against adversarial attacks

Query-efficient Black-box Adversarial Attack with Customized Iteration and Sampling

Adversarial Attacks Hidden in Plain Sight

Fast Geometrically-Perturbed Adversarial Faces

Exploring Decision-based Black-box Attacks on Face Forgery Detection

Fooling deep neural detection networks with adaptive object-oriented adversarial perturbation

AutoDA: Automated Decision-based Iterative Adversarial Attacks

Query-based Adversarial Attacks on Graph with Fake Nodes