Abstract: The transferability and robustness of adversarial examples are two practical and important properties for black-box adversarial attacks. In this paper, we explore effective mechanisms to boost both of them across the network hierarchy. In general, a typical network can be hierarchically divided into an output stage, an intermediate stage, and an input stage. Because the substitute model is over-specialized, the transferability and robustness of adversarial perturbations can hardly be improved in the output stage. Therefore, we focus on manipulating the intermediate and input stages, and propose a Transferable and Robust Adversarial Perturbation generation (TRAP) method. Specifically, we propose a dynamically guided mechanism that continuously calculates accurate directional guidance for perturbation generation in the intermediate stage. In the input stage, instead of employing the single-form transformation augmentations adopted in existing methods, we leverage multi-form affine transformation augmentations to enrich the input diversity and simultaneously boost the robustness and transferability of the adversarial perturbations. Extensive evaluations on the ImageNet validation set demonstrate that TRAP achieves superior transferability when attacking convolutional neural networks (CNNs) and vision transformers (ViTs) compared to closely related state-of-the-art methods. For instance, based on the ResNet-101 model, we achieve an average attack success rate of 97.5% on black-box CNN models and 70.1% on ViT models. Moreover, TRAP exhibits robust performance against various physical-world interferences, such as Gaussian blurring, Gaussian noise, JPEG compression, color distortions, image erosion, and image dilation. Additionally, we show the potential application of TRAP to proactive defense against deepfakes.
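The input-stage idea of combining several affine transformation forms can be illustrated with a minimal sketch. This is not the TRAP implementation: the function names (`random_affine_matrix`, `warp_nearest`, `augmented_views`), the parameter ranges, and the nearest-neighbour warping are all assumptions chosen for illustration, and the attack loop that would compute gradients on these views is omitted.

```python
import numpy as np

def random_affine_matrix(rng, max_rot_deg=15.0, scale_range=(0.9, 1.1),
                         max_shear_deg=10.0, max_shift=0.1):
    """Compose rotation, scaling, shear and translation into one 3x3 matrix.

    Coordinates are normalized to [-1, 1]. All ranges are illustrative
    assumptions, not values taken from the paper.
    """
    theta = np.deg2rad(rng.uniform(-max_rot_deg, max_rot_deg))
    s = rng.uniform(*scale_range)
    phi = np.deg2rad(rng.uniform(-max_shear_deg, max_shear_deg))
    tx, ty = rng.uniform(-max_shift, max_shift, size=2)

    rot = np.array([[np.cos(theta), -np.sin(theta), 0.0],
                    [np.sin(theta),  np.cos(theta), 0.0],
                    [0.0, 0.0, 1.0]])
    scale = np.diag([s, s, 1.0])
    shear = np.array([[1.0, np.tan(phi), 0.0],
                      [0.0, 1.0, 0.0],
                      [0.0, 0.0, 1.0]])
    shift = np.array([[1.0, 0.0, tx],
                      [0.0, 1.0, ty],
                      [0.0, 0.0, 1.0]])
    # Several transformation forms are combined into a single affine map.
    return shift @ rot @ shear @ scale

def warp_nearest(image, matrix):
    """Warp an (H, W) or (H, W, C) array; pixels mapped from outside are zero."""
    h, w = image.shape[:2]
    ys, xs = np.meshgrid(np.linspace(-1, 1, h), np.linspace(-1, 1, w),
                         indexing="ij")
    out_coords = np.stack([xs.ravel(), ys.ravel(), np.ones(h * w)])
    # Inverse mapping: for each output pixel, look up its source location.
    src = np.linalg.inv(matrix) @ out_coords
    src_x = np.rint((src[0] + 1) * (w - 1) / 2).astype(int)
    src_y = np.rint((src[1] + 1) * (h - 1) / 2).astype(int)
    valid = (src_x >= 0) & (src_x < w) & (src_y >= 0) & (src_y < h)

    flat = image.reshape(h * w, -1)
    out = np.zeros_like(flat)
    out[valid] = flat[src_y[valid] * w + src_x[valid]]
    return out.reshape(image.shape)

def augmented_views(image, num_views=4, seed=0):
    """Return a stack of randomly affine-transformed copies of one image."""
    rng = np.random.default_rng(seed)
    return np.stack([warp_nearest(image, random_affine_matrix(rng))
                     for _ in range(num_views)])
```

In a transfer attack of this kind, one would typically evaluate the substitute model on each view and aggregate the resulting gradients before updating the perturbation; how TRAP does this is not specified in the abstract, so that step is left out here.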

Enhancing Adversarial Transferability with Partial Blocks on Vision Transformer

Protego: Detecting Adversarial Examples for Vision Transformers Via Intrinsic Capabilities

Towards Transferable Adversarial Attacks on Image and Video Transformers

Dual stage black-box adversarial attack against vision transformer

On Improving Adversarial Transferability of Vision Transformers

Improving transferable adversarial attack for vision transformers via global attention and local drop

Improving the Transferability of Adversarial Examples with Restructure Embedded Patches

Towards transferable adversarial attacks on vision transformers for image classification

Transferable Adversarial Attacks on Vision Transformers with Token Gradient Regularization

Towards Efficient Adversarial Training on Vision Transformers

Transferable Adversarial Attack for Both Vision Transformers and Convolutional Networks Via Momentum Integrated Gradients

Improving Transferability of Adversarial Examples With Input Diversity

Downstream Transfer Attack: Adversarial Attacks on Downstream Models with Pre-trained Vision Transformers

Boosting Adversarial Transferability by Block Shuffle and Rotation

Attacking Transformers with Feature Diversity Adversarial Perturbation

On the Adversarial Robustness of Vision Transformers

Exploring Transferable and Robust Adversarial Perturbation Generation Across Network Hierarchy

Vision Transformer-based Adversarial Domain Adaptation

Bag of Tricks to Boost Adversarial Transferability

Query-Efficient Hard-Label Black-Box Attack against Vision Transformers

When Adversarial Training Meets Vision Transformers: Recipes from Training to Architecture