Based on Max-Min Framework Transferable Adversarial Attacks

Zifei Zhang,Kai Qiao,Jian Chen,Ningning Liang
DOI: https://doi.org/10.1109/icsip52628.2021.9688630
2021-01-01
Abstract:Though deep neural networks perform challenging tasks excellently, they are susceptible to adversarial examples, which mislead classifiers by applying human-imperceptible perturbations on clean inputs. Under the query-free black-box scenario, adversarial examples are hard to transfer to unknown models, and several methods have been proposed with the low transferability. To settle such issue, we design a max-min framework inspired by input transformations, which are beneficial to both the adversarial attacks and defenses. Explicitly, we decrease loss values with inputs’ affine transformations as a defense in the minimum procedure, and then increase loss values with the momentum iterative algorithm as an attack in the maximum procedure. To further improve the transferability, we determine transformed values with the max-min framework. Extensive experiments on the ImageNet dataset demonstrate that our defense-guided transferable attacks achieve impressive promotion on transferability. Experimentally, we show that the attack success rate of our method reaches to 58.38% on average, which outperforms the state-of-the-art method by 12.1% on the normally trained models and 11.13% on the adversarially trained models. Additionally, we provide a novel insight on the improvement of transferability, and our method is expected to be a benchmark for assessing the robustness of deep models.
What problem does this paper attempt to address?