Abstract:Some recent studies have demonstrated that the deep neural network (DNN) is vulnerable to adversarial examples, which contain some subtle and human-imperceptible perturbations. Although numerous countermeasures have been proposed and play a significant role, most of them all have some flaws and are only effective for certain types of adversarial examples. In the paper, we present a novel and universal countermeasure to recover multiple types of adversarial examples to benign examples before they are fed into the deep neural network. The idea is to model the mapping between adversarial examples and benign examples using a generative adversarial network (GAN). Its GAN architecture consists of a generator based on UNET, a discriminator based on ACGAN, and a newly added third-party classifier. The UNET can enhance the capacity of the generator to recover adversarial examples to benign examples. The loss function makes full use of the advantages of ACGAN and WGAN-GP to ensure the stability of the training process and accelerate its convergence. Besides, a classification loss and a perceptual loss, all from the third-party classifier, are employed to improve further the generator's capacity to eliminate adversarial perturbations. Experiments are conducted on the MNIST, CIFAR10, and IMAGENET datasets. First, we perform ablation experiments to prove the proposed countermeasure's validity. Then, we defend against seven types of state-of-the-art adversarial examples on four deep neural networks and compare them with six existing countermeasures. Finally, the experimental results demonstrate that the proposed countermeasure is universal and has a more excellent performance than other countermeasures. The experimental code is available at https://github.com/Afreadyang/IAED-GAN.

Generating adversarial examples with elastic-net regularized boundary equilibrium generative adversarial network

Generating Adversarial Examples with Adversarial Networks

Creative and Diverse Artwork Generation Using Adversarial Networks

AT-GAN: An Adversarial Generator Model for Non-constrained Adversarial Examples

APE-GAN: Adversarial Perturbation Elimination with GAN.

EAD: Elastic-Net Attacks to Deep Neural Networks via Adversarial Examples

Robust and Generalized Physical Adversarial Attacks Via Meta-GAN

A novel and universal GAN-based countermeasure to recover adversarial examples to benign examples

RA-RevGAN: Region-Aware Reversible Adversarial Example Generation Network for Privacy-Preserving Applications

Evaluation of GAN-Based Model for Adversarial Training

Hlr: Generating Adversarial Examples By High-Level Representations

The core structure of the lipopolysaccharide from the causative agent of plague, Yersinia pestis.

Attention-Guided Evolutionary Attack with Elastic-Net Regularization on Face Recognition

A General Framework for Adversarial Examples with Objectives

Robust Adversarial Examples Against Scale Transformation Via Generative Network

Perceptual-Sensitive GAN for Generating Adversarial Patches.

Any Target Can be Offense: Adversarial Example Generation via Generalized Latent Infection

NODE-AdvGAN: Improving the transferability and perceptual similarity of adversarial examples by dynamic-system-driven adversarial generative model

GRIP-GAN: an Attack-Free Defense Through General Robust Inverse Perturbation

Query-Efficient Generation of Adversarial Examples for Defensive DNNs via Multi-Objective Optimization

Evolutionary Generative Adversarial Networks