Abstract:Deep neural networks are extremely vulnerable to attacks and threats from adversarial examples. These adversarial examples deliberately crafted by attackers can easily fool classification models by adding imperceptibly tiny perturbations on clean images. This brings a great challenge to image security for deep learning. Therefore, studying and designing attack algorithms for generating adversarial examples is essential for building robust models. Moreover, adversarial examples are transferable in that they can mislead multiple different classifiers across models. This makes black-box attacks feasible for practical applications. However, most attack methods have low success rates and weak transferability against black-box models. This is because they often overfit the model during the production of adversarial examples. To address this issue, we propose a Nadam iterative fast gradient method (NAI-FGM), which combines an improved Nadam optimizer with gradient-based iterative attacks. Specifically, we introduce the look-ahead momentum vector and the adaptive learning rate component based on the Momentum Iterative Fast Gradient Sign Method (MI-FGSM). The look-ahead momentum vector is dedicated to making the loss function converge faster and get rid of the poor local maximum. Additionally, the adaptive learning rate component is used to help the adversarial example to converge to a better extreme point by obtaining adaptive update directions according to the current parameters. Furthermore, we also carry out different input transformations to further enhance the attack performance before using NAI-FGM for attack. Finally, we consider attacking the ensemble model. Extensive experiments show that the NAI-FGM has stronger transferability and black-box attack capability than advanced momentum-based iterative attacks. In particular, when using the adversarial examples produced by way of ensemble attack to test the adversarially trained models, the NAI-FGM improves the success rate by 8% to 11% over the other attack methods. Last but not least, the NAI-DI-TI-SI-FGM combined with the input transformation achieves a success rate of 91.3% on average.

Explanation-Guided Adversarial Example Attacks

Towards Efficient Data Free Blackbox Adversarial Attack

Improving Query Efficiency of Black-box Adversarial Attack

Explainable and Transferable Adversarial Attack for ML-Based Network Intrusion Detectors

Evolution Attack On Neural Networks

Efficient Adversarial Attack Based on Moment Estimation and Lookahead Gradient

GenAttack: Practical Black-box Attacks with Gradient-Free Optimization

Towards Imperceptible and Robust Adversarial Example Attacks Against Neural Networks

Adversarial Example Games

MC-Net: Realistic Sample Generation for Black-Box Attacks

Generating Universal Language Adversarial Examples by Understanding and Enhancing the Transferability Across Neural Models

Query-Efficient Generation of Adversarial Examples for Defensive DNNs via Multi-Objective Optimization

GCSA: A New Adversarial Example-Generating Scheme Towards Black-Box Adversarial Attacks

Generating Adversarial Examples with Adversarial Networks

DCVAE-adv: A Universal Adversarial Example Generation Method for White and Black Box Attacks

Explaining and Harnessing Adversarial Examples

Adversarial example generation with adabelief optimizer and crop invariance

Boosting Adversarial Attacks with Nadam Optimizer

Adversarial examples for models of code

Enhancing the Transferability of Adversarial Examples with Noise Reduced Gradient