Improving Resistance to Adversarial Deformations by Regularizing Gradients

Pengfei Xia,Bin Li
DOI: https://doi.org/10.1016/j.neucom.2021.05.055
IF: 6
2021-01-01
Neurocomputing
Abstract:Improving the resistance of deep neural networks against adversarial attacks is important for deploying models in realistic applications. Nowadays, most defense methods are designed to resist intensity perturbations, and location perturbations have not yet attracted enough attention. However, these two types should be equally important for deep model security. In this paper, we focus on adversarial deformations, a typical class of location perturbations, and propose a defense method named flow gradient regularization to improve the resistance of models against such attacks. By theoretical analysis, we prove that regularizing flow gradients is able to get a tighter bound than regularizing input gradients. Through verifying over multiple datasets, network architectures, and adversarial deformations, our empirical results indicate that training with flow gradients performs better than training with input gradients by a large margin, and also better than adversarial training. Moreover, the proposed method can be used to combine with adversarial deformation training to improve the resistance further. Our method is now available at https://github.com/xpf/Flow-Gradient-Regularization.
What problem does this paper attempt to address?