Abstract:Deep neural networks are extremely vulnerable to attacks and threats from adversarial examples. These adversarial examples deliberately crafted by attackers can easily fool classification models by adding imperceptibly tiny perturbations on clean images. This brings a great challenge to image security for deep learning. Therefore, studying and designing attack algorithms for generating adversarial examples is essential for building robust models. Moreover, adversarial examples are transferable in that they can mislead multiple different classifiers across models. This makes black-box attacks feasible for practical applications. However, most attack methods have low success rates and weak transferability against black-box models. This is because they often overfit the model during the production of adversarial examples. To address this issue, we propose a Nadam iterative fast gradient method (NAI-FGM), which combines an improved Nadam optimizer with gradient-based iterative attacks. Specifically, we introduce the look-ahead momentum vector and the adaptive learning rate component based on the Momentum Iterative Fast Gradient Sign Method (MI-FGSM). The look-ahead momentum vector is dedicated to making the loss function converge faster and get rid of the poor local maximum. Additionally, the adaptive learning rate component is used to help the adversarial example to converge to a better extreme point by obtaining adaptive update directions according to the current parameters. Furthermore, we also carry out different input transformations to further enhance the attack performance before using NAI-FGM for attack. Finally, we consider attacking the ensemble model. Extensive experiments show that the NAI-FGM has stronger transferability and black-box attack capability than advanced momentum-based iterative attacks. In particular, when using the adversarial examples produced by way of ensemble attack to test the adversarially trained models, the NAI-FGM improves the success rate by 8% to 11% over the other attack methods. Last but not least, the NAI-DI-TI-SI-FGM combined with the input transformation achieves a success rate of 91.3% on average.

Stronger Adversarial Attack: Using Mini-batch Gradient

An Adversarial Attack Via Feature Contributive Regions

An efficient adversarial example generation algorithm based on an accelerated gradient iterative fast gradient

Adversarial example generation with adabelief optimizer and crop invariance

Efficient Adversarial Attack Based on Moment Estimation and Lookahead Gradient

Boosting Black-box Adversarial Attack with a Better Convergence

FCGSM: Fast Conjugate Gradient Sign Method for Adversarial Attack on Image Classification

Improving Transferability of Adversarial Examples With Input Diversity

Adversarial sample attack method based on loss smoothing

NATTACK: Learning the Distributions of Adversarial Examples for an Improved Black-Box Attack on Deep Neural Networks

Adversarial attack defense algorithm based on convolutional neural network

Nesterov Accelerated Gradient and Scale Invariance for Adversarial Attacks

Enhancing the Transferability of Adversarial Attacks through Variance Tuning

Boosting Adversarial Attacks with Nadam Optimizer

DCVAE-adv: A Universal Adversarial Example Generation Method for White and Black Box Attacks

GenAttack: Practical Black-box Attacks with Gradient-Free Optimization

Improving Adversarial Transferability with Neighbourhood Gradient Information

Stochastic Variance Reduced Ensemble Adversarial Attack for Boosting the Adversarial Transferability

Evolution Attack On Neural Networks

Curls & Whey: Boosting Black-Box Adversarial Attacks