Abstract:Recent studies have highlighted that deep neural networks (DNNs) are vulnerable to adversarial attacks, even in a black-box scenario. However, most of the existing black-box attack algorithms need to make a huge amount of queries to perform attacks, which is not practical in the real world. We note one of the main reasons for the massive queries is that the adversarial example is required to be visually similar to the original image, but in many cases, how adversarial examples look like does not matter much. It inspires us to introduce a new attack called input-free attack, under which an adversary can choose an arbitrary image to start with and is allowed to add perceptible perturbations on it. Following this approach, we propose two techniques to significantly reduce the query complexity. First, we initialize an adversarial example with a gray color image on which every pixel has roughly the same importance for the target model. Then we shrink the dimension of the attack space by perturbing a small region and tiling it to cover the input image. To make our algorithm more effective, we stabilize a projected gradient ascent algorithm with momentum, and also propose a heuristic approach for region size selection. Recent studies have highlighted that deep neural networks (DNNs) are vulnerable to adversarial attacks, even in a black-box scenario. However, most of the existing black-box attack algorithms need to make a huge amount of queries to perform attacks, which is not practical in the real world. We note one of the main reasons for the massive queries is that the adversarial example is required to be visually similar to the original image, but in many cases, how adversarial examples look like does not matter much. It inspires us to introduce a new attack called input-free attack, under which an adversary can choose an arbitrary image to start with and is allowed to add perceptible perturbations on it. Following this approach, we propose two techniques to significantly reduce the query complexity. First, we initialize an adversarial example with a gray color image on which every pixel has roughly the same importance for the target model. Then we shrink the dimension of the attack space by perturbing a small region and tiling it to cover the input image. To make our algorithm more effective, we stabilize a projected gradient ascent algorithm with momentum, and also propose a heuristic approach for region size selection. Through extensive experiments, we show that with only 1,701 queries on average, we can perturb a gray image to any target class of ImageNet with a 100% success rate on InceptionV3. Besides, our algorithm has successfully defeated two real-world systems, the Clarifai food detection API and the Baidu Animal Identification API.

Targeted Black-Box Adversarial Attack Method for Image Classification Models.

[Vitamin E as a natural antioxidant. Preventive importance and requirement].

Improving Query Efficiency of Black-box Adversarial Attack

Towards Query Efficient Black-box Attacks

Robust Adversarial Attacks on Imperfect Deep Neural Networks in Fault Classification

Perception Improvement for Free: Exploring Imperceptible Black-box Adversarial Attacks on Image Classification

A Black-Box Attack on Optical Character Recognition Systems

FCGSM: Fast Conjugate Gradient Sign Method for Adversarial Attack on Image Classification

Taking Care of the Discretization Problem: A Comprehensive Study of the Discretization Problem and A Black-Box Adversarial Attack in Discrete Integer Domain

Neural Fingerprints for Adversarial Attack Detection

Efficient Decision-Based Black-Box Adversarial Attacks on Face Recognition

Object-Aware Transfer-Based Black-Box Adversarial Attack on Object Detector.

A Black-box NLP Classifier Attacker

Black-box Adversarial Attacks on Video Recognition Models

One-Index Vector Quantization Based Adversarial Attack on Image Classification

Square-Based Black-Box Adversarial Attack on Time Series Classification Using Simulated Annealing and Post-Processing-Based Defense

Detection Tolerant Black-Box Adversarial Attack Against Automatic Modulation Classification With Deep Learning

Black-box adversarial attacks by manipulating image attributes

Toward Robust Image Classification

ZOO: Zeroth Order Optimization based Black-box Attacks to Deep Neural Networks without Training Substitute Models

Undermining Image and Text Classification Algorithms Using Adversarial Attacks