Stronger Adversarial Attack: Using Mini-batch Gradient

Lin Yu,Ting Deng,Wenxiang Zhang,Zhigang Zeng
DOI: https://doi.org/10.1109/icaci49185.2020.9177719
2020-01-01
Abstract:Recent years, convolutional neural network (CNN) has achieved excellent performance in computer vision, but recent researches show that they’re very vulnerable to adversarial examples, which results security risks to practical applications of CNN. Research on the generation algorithm of adversarial examples is conducive to evaluate the robustness of CNNs and promotes the study of defense algorithm. Although existing attack algorithms have achieved good performance in white-box manner, they behave unsatisfactorily in black-box manner. In this paper, we propose the mini-batch gradient based iterative method which could dramatically improve the attack ability in black-box manner. The previous iterative methods such as Iterative Fast Gradient Sign Method (I-FGSM) always make the generated adversarial examples fall into over-fitting, however we use minibatch gradient information to help them jump out of the local minimum. Moreover, the proposed method can also be considered as self-ensemble or implicit ensemble attack thus have a stronger attack power. The parallel randomization layer plays core role for providing the mini-batch gradient and the implement of it is very simple since there are no trainable variables. In order to further improve the attack success rate, we combined the momentum and the proposed method. We conducted several experiments on seven networks and the highest success rate of black-box attack has reached 94.1%, which is close to the whitebox manner. Since the proposed method greatly improve the transferability of adversarial examples, we think it could help evaluate the robustness of various deep networks.
What problem does this paper attempt to address?