Abstract:Recent studies have highlighted that deep neural networks (DNNs) are vulnerable to adversarial attacks, even in a black-box scenario. However, most of the existing black-box attack algorithms need to make a huge amount of queries to perform attacks, which is not practical in the real world. We note one of the main reasons for the massive queries is that the adversarial example is required to be visually similar to the original image, but in many cases, how adversarial examples look like does not matter much. It inspires us to introduce a new attack called input-free attack, under which an adversary can choose an arbitrary image to start with and is allowed to add perceptible perturbations on it. Following this approach, we propose two techniques to significantly reduce the query complexity. First, we initialize an adversarial example with a gray color image on which every pixel has roughly the same importance for the target model. Then we shrink the dimension of the attack space by perturbing a small region and tiling it to cover the input image. To make our algorithm more effective, we stabilize a projected gradient ascent algorithm with momentum, and also propose a heuristic approach for region size selection. Recent studies have highlighted that deep neural networks (DNNs) are vulnerable to adversarial attacks, even in a black-box scenario. However, most of the existing black-box attack algorithms need to make a huge amount of queries to perform attacks, which is not practical in the real world. We note one of the main reasons for the massive queries is that the adversarial example is required to be visually similar to the original image, but in many cases, how adversarial examples look like does not matter much. It inspires us to introduce a new attack called input-free attack, under which an adversary can choose an arbitrary image to start with and is allowed to add perceptible perturbations on it. Following this approach, we propose two techniques to significantly reduce the query complexity. First, we initialize an adversarial example with a gray color image on which every pixel has roughly the same importance for the target model. Then we shrink the dimension of the attack space by perturbing a small region and tiling it to cover the input image. To make our algorithm more effective, we stabilize a projected gradient ascent algorithm with momentum, and also propose a heuristic approach for region size selection. Through extensive experiments, we show that with only 1,701 queries on average, we can perturb a gray image to any target class of ImageNet with a 100% success rate on InceptionV3. Besides, our algorithm has successfully defeated two real-world systems, the Clarifai food detection API and the Baidu Animal Identification API.

Black-box adversarial attacks by manipulating image attributes

You See What I Want You to See: Exploring Targeted Black-Box Transferability Attack for Hash-based Image Retrieval Systems

An Adversarial Attack Via Feature Contributive Regions

Improving Query Efficiency of Black-box Adversarial Attack

Towards Query Efficient Black-box Attacks

SemanticAdv: Generating Adversarial Examples via Attribute-conditional Image Editing

Detecting Adversarial Examples Through Image Transformation

Perception Improvement for Free: Exploring Imperceptible Black-box Adversarial Attacks on Image Classification

RetouchUAA: Unconstrained Adversarial Attack via Image Retouching

Adversarial Attacks Hidden in Plain Sight

Targeted Black-Box Adversarial Attack Method for Image Classification Models.

Invisible Adversarial Attack Against Deep Neural Networks: an Adaptive Penalization Approach

Simple Black-box Adversarial Attacks

Taking Care of the Discretization Problem:A Black-Box Adversarial Image Attack in Discrete Integer Domain

Sparse and Imperceivable Adversarial Attacks

Adv-Attribute: Inconspicuous and Transferable Adversarial Attack on Face Recognition

Attacks Meet Interpretability: Attribute-steered Detection of Adversarial Samples

A Black-box Adversarial Attack Strategy with Adjustable Sparsity and Generalizability for Deep Image Classifiers

AICAttack: Adversarial Image Captioning Attack with Attention-Based Optimization

Feature Denoising for Improving Adversarial Robustness

Taking Care of the Discretization Problem: A Comprehensive Study of the Discretization Problem and A Black-Box Adversarial Attack in Discrete Integer Domain