Abstract:Recent studies have highlighted that deep neural networks (DNNs) are vulnerable to adversarial attacks, even in a black-box scenario. However, most of the existing black-box attack algorithms need to make a huge amount of queries to perform attacks, which is not practical in the real world. We note one of the main reasons for the massive queries is that the adversarial example is required to be visually similar to the original image, but in many cases, how adversarial examples look like does not matter much. It inspires us to introduce a new attack called input-free attack, under which an adversary can choose an arbitrary image to start with and is allowed to add perceptible perturbations on it. Following this approach, we propose two techniques to significantly reduce the query complexity. First, we initialize an adversarial example with a gray color image on which every pixel has roughly the same importance for the target model. Then we shrink the dimension of the attack space by perturbing a small region and tiling it to cover the input image. To make our algorithm more effective, we stabilize a projected gradient ascent algorithm with momentum, and also propose a heuristic approach for region size selection. Recent studies have highlighted that deep neural networks (DNNs) are vulnerable to adversarial attacks, even in a black-box scenario. However, most of the existing black-box attack algorithms need to make a huge amount of queries to perform attacks, which is not practical in the real world. We note one of the main reasons for the massive queries is that the adversarial example is required to be visually similar to the original image, but in many cases, how adversarial examples look like does not matter much. It inspires us to introduce a new attack called input-free attack, under which an adversary can choose an arbitrary image to start with and is allowed to add perceptible perturbations on it. Following this approach, we propose two techniques to significantly reduce the query complexity. First, we initialize an adversarial example with a gray color image on which every pixel has roughly the same importance for the target model. Then we shrink the dimension of the attack space by perturbing a small region and tiling it to cover the input image. To make our algorithm more effective, we stabilize a projected gradient ascent algorithm with momentum, and also propose a heuristic approach for region size selection. Through extensive experiments, we show that with only 1,701 queries on average, we can perturb a gray image to any target class of ImageNet with a 100% success rate on InceptionV3. Besides, our algorithm has successfully defeated two real-world systems, the Clarifai food detection API and the Baidu Animal Identification API.

Towards Query-Efficient Black-Box Attacks: A Universal Dual Transferability-Based Framework

You See What I Want You to See: Exploring Targeted Black-Box Transferability Attack for Hash-based Image Retrieval Systems

Towards Efficient Data Free Blackbox Adversarial Attack

Improving Query Efficiency of Black-box Adversarial Attack

Towards Query Efficient Black-box Attacks

Boosting Black-Box Attack with Partially Transferred Conditional Adversarial Distribution

Optimizing Latent Variables in Integrating Transfer and Query Based Attack Framework

Query-Efficient Black-box Adversarial Examples (superceded)

Query-Efficient Black-Box Adversarial Attacks Guided by a Transfer-Based Prior

Efficient Query-based Black-box Attack against Cross-modal Hashing Retrieval

Object-Aware Transfer-Based Black-Box Adversarial Attack on Object Detector.

Towards Lightweight Black-Box Attacks against Deep Neural Networks

Generalizable Black-Box Adversarial Attack With Meta Learning

Advancing Few-Shot Black-Box Attack With Alternating Training

Query Efficient Black-Box Adversarial Attack on Deep Neural Networks

STBA: Towards Evaluating the Robustness of DNNs for Query-Limited Black-box Scenario

Subspace Attack: Exploiting Promising Subspaces for Query-Efficient Black-box Attacks.

Boosting Black-Box Attack to Deep Neural Networks With Conditional Diffusion Models

Data-Free Adversarial Perturbations for Practical Black-Box Attack

Promoting Adversarial Transferability via Dual-Sampling Variance Aggregation and Feature Heterogeneity Attacks