Generating Universal Adversarial Perturbation with ResNet

Jian Xu,Heng Liu,Dexin Wu,Fucai Zhou,Chong-zhi Gao,Linzhi Jiang
DOI: https://doi.org/10.1016/j.ins.2020.05.099
IF: 8.1
2020-01-01
Information Sciences
Abstract:Adversarial machine learning, as a research area, has received a great deal of attention in recent years. Much of this attention has been devoted to a phenomenon called adversarial perturbation, which is human-imperceptible and can be used to craft adversarial examples. The deep neural networks are vulnerable to adversarial examples, which raises security concerns on learning algorithms due to the potentially severe consequences. It was shown there exist universal perturbations that are image-agnostic can fool the network when added to the majority of images. Since different attack strategies proposed for generating universal perturbation are still suffering from attack success rate, attack efficiency, and transferability. In this paper, we design an attack framework that uses a residual network (ResNet) to create universal perturbation. We introduce a trainable residual network generator that converts random noise into universal adversarial perturbation, which can be used to efficiently generate perturbations for any instance after being trained. Unlike traditional methods, moreover, we use a loss network to guarantee the similarity of images in content. The new generator structure and objective function make our method achieve better attack results than the existing methods. A variety of experiments conducted on the CIFAR-10 dataset reveal that our proposed attack framework constitutes an advance in the creation of universal adversarial perturbation, as it can achieve a success rate of 89%, which outperforms the similar methods, along with low perturbation norms.
What problem does this paper attempt to address?