FePN: A Robust Feature Purification Network to Defend Against Adversarial Examples.

Dongliang Cao,Kaimin Wei,Yongdong Wu,Jilian Zhang,Bingwen Feng,Jinpeng Chen
DOI: https://doi.org/10.1016/j.cose.2023.103427
IF: 5.105
2023-01-01
Computers & Security
Abstract:Deep neural networks (DNNs) have been demonstrated to be vulnerable to adversarial attacks. Existing defenses can defend against a variety of adversarial examples. However, as the perturbation budget used to create adversarial examples increases, their adversarial robustness decreases dramatically. To solve this problem, we develop a Feature Purification Network (FePN), which is based on the fact that adversarial examples are closely associated with non-robust features of data. Specifically, an adversarial learning mechanism is proposed to learn robust features by removing non-robust features from inputs. Meanwhile, two linear branches and a discriminator are designed to reconstruct high-quality natural images by exploiting robust features. FePN is a preprocessing approach that can be utilized to safeguard other models without altering them. We conduct a series of experiments on MNIST, CIFAR-10 and ImageNet. Experimental results have proven that FePN can provide effective protection compared to previous state-of-the-art approaches, as well as superior robustness against attacks with considerable perturbations.
What problem does this paper attempt to address?