Abstract:Deep neural networks are extremely vulnerable to attacks and threats from adversarial examples. These adversarial examples deliberately crafted by attackers can easily fool classification models by adding imperceptibly tiny perturbations on clean images. This brings a great challenge to image security for deep learning. Therefore, studying and designing attack algorithms for generating adversarial examples is essential for building robust models. Moreover, adversarial examples are transferable in that they can mislead multiple different classifiers across models. This makes black-box attacks feasible for practical applications. However, most attack methods have low success rates and weak transferability against black-box models. This is because they often overfit the model during the production of adversarial examples. To address this issue, we propose a Nadam iterative fast gradient method (NAI-FGM), which combines an improved Nadam optimizer with gradient-based iterative attacks. Specifically, we introduce the look-ahead momentum vector and the adaptive learning rate component based on the Momentum Iterative Fast Gradient Sign Method (MI-FGSM). The look-ahead momentum vector is dedicated to making the loss function converge faster and get rid of the poor local maximum. Additionally, the adaptive learning rate component is used to help the adversarial example to converge to a better extreme point by obtaining adaptive update directions according to the current parameters. Furthermore, we also carry out different input transformations to further enhance the attack performance before using NAI-FGM for attack. Finally, we consider attacking the ensemble model. Extensive experiments show that the NAI-FGM has stronger transferability and black-box attack capability than advanced momentum-based iterative attacks. In particular, when using the adversarial examples produced by way of ensemble attack to test the adversarially trained models, the NAI-FGM improves the success rate by 8% to 11% over the other attack methods. Last but not least, the NAI-DI-TI-SI-FGM combined with the input transformation achieves a success rate of 91.3% on average.

An Effective Way to Boost Black-Box Adversarial Attack.

Boosting Black-box Adversarial Attack with a Better Convergence

Improving Query Efficiency of Black-box Adversarial Attack

Boosting Adversarial Attacks with Momentum

Boosting Adversarial Attacks with Nadam Optimizer

NATTACK: Learning the Distributions of Adversarial Examples for an Improved Black-Box Attack on Deep Neural Networks

An efficient adversarial example generation algorithm based on an accelerated gradient iterative fast gradient

Adversarial sample attack method based on loss smoothing

ABCAttack: A Gradient-Free Optimization Black-Box Attack for Fooling Deep Image Classifiers

Boosting Black-Box Adversarial Attacks with Meta Learning

CAAD 2018: Iterative Ensemble Adversarial Attack

Patch-Wise Attack for Fooling Deep Neural Network

DCVAE-adv: A Universal Adversarial Example Generation Method for White and Black Box Attacks

Are You Confident That You Have Successfully Generated Adversarial Examples?

Universally Strict Black-Box Attacks Against Deep Neural Networks

A Multi-objective Examples Generation Approach to Fool the Deep Neural Networks in the Black-Box Scenario

Targeted Black-Box Adversarial Attack Method for Image Classification Models.

Adaptive Multi-scale Degradation-Based Attack for Boosting the Adversarial Transferability

Adversarial example generation with adabelief optimizer and crop invariance

Improving the Transferability of Adversarial Examples with a Noise Data Enhancement Framework and Random Erasing