Abstract:Deep neural networks (DNNs) have achieved great success in various applications due to their strong expressive power. However, recent studies have shown that DNNs are vulnerable to adversarial examples, and these manipulated instances can mislead DNN into making false predictions. The existing methods of generating adversarial examples include pixel-level perturbation or spatial transformation of images, which cannot consider concurrently with the semantic quality of adversarial examples or success rate of attack. These methods are computationally bulky and slow to generate the adversarial examples. To solve this kind of issue, a two-stage generative adversarial networks (TSGAN) with semantic content constraints is proposed in this paper. The first-stage uses the original example dataset to train generator G, which can help the generator learn the distribution of real examples. Then, the example semantic quality constraint loss function, the adversarial loss function and the distance loss function are adopted in the second-stage, so that the generator G can continue to learn to search the distribution of the adversarial examples, and train the new generator G(adv). The adversarial examples generated by generator G(adv) are better fit the distribution of real examples, and have targeted black-box attack capability. The experiments show that the adversarial examples generated by TSGAN can achieve the success rate of attack at 98.40% in target model, 29.40% success rate in defense-oriented model. And 77.58% success rate is obtained in the transfer test attack. The results show that the adversarial examples generated by the proposed model, which has a highly attack success rate and more difficult to defense. Meanwhile, the improved adversarial examples have stronger transfer ability than the existing models. The proposed model can effectively reduce the expression of target category features of the adversarial examples, and the generated adversarial examples have better semantic quality than others.

Attribute-guided Face Adversarial Example Generation

SemanticAdv: Generating Adversarial Examples via Attribute-conditional Image Editing

Generating Adversarial Examples with Adversarial Networks

Multi-attribute Semantic Adversarial Attack Based on Cross-layer Interpolation for Face Recognition

Generating Adversarial Patterns in Facial Recognition with Visual Camouflage

Semantic Adversarial Attacks on Face Recognition through Significant Attributes

Unpaired Image-to-Image Translation Network for Semantic-based Face Adversarial Examples Generation

Generating Adversarial Examples for White-Box Attacks Based on GAN

An efficient adversarial example generation algorithm based on an accelerated gradient iterative fast gradient

Improved Forward-Backward Propagation To Generate Adversarial Examples

Exploring Adversarial Fake Images on Face Manifold

Evading Forensic Classifiers with Attribute-Conditioned Adversarial Faces

GAN Generate Adversarial Examples to Fool Deep Networks.

Restricted Black-Box Adversarial Attack Against DeepFake Face Swapping

Adversarial Attack on Fake-Faces Detectors under White and Black Box Scenarios

DCVAE-adv: A Universal Adversarial Example Generation Method for White and Black Box Attacks

Generating Adversarial Examples in Limited Queries with Image Encoding and Noise Decoding.

A Two-Stage Generative Adversarial Networks with Semantic Content Constraints for Adversarial Example Generation.

Defending against GAN-based Deepfake Attacks via Transformation-aware Adversarial Faces

Adversarial Transformation Network with Adaptive Perturbations for Generating Adversarial Examples.

GCSA: A New Adversarial Example-Generating Scheme Towards Black-Box Adversarial Attacks