Abstract:Deep Neural Networks (DNNs) have been shown to be vulnerable to adversarial examples. The existence of adversarial examples significantly hinders the development of deep learning technologies in domains with high-security requirements. However, current defense methods often lack universality, being effective only against specific adversarial attacks. This study focuses on analyzing adversarial examples through changes in model attention, classifying attack algorithms into attention-shifting and attention-attenuation categories. To counter attention-shifting attacks, a defense module named Feature Pyramid-based Attention Space-guided (FPAS) is proposed, which spatially retracts the shifting attention in adversarial examples, thereby enhancing the model's overall defense capability. Attention-based Non-Local (ANL) is a proposed defense module to counter attention-attenuation attacks. This module enhances the model's focus on critical features, efficiently constructing a robust defense model with low implementation cost and minimal intrusion into the original model. By integrating FPAS and ANL into the Wide-ResNet model within a boosting framework, the study demonstrates their synergistic defense capability. Even with eight adversarial samples embedded with adversarial patches, our FPAS_at and ANL_at models demonstrated significant improvements over the baseline, enhancing the average defense rate by 5.47% and 7.74%, respectively. Extensive experiments confirm that this universal defense strategy offers comprehensive protection against adversarial attacks at a lower implementation cost compared to current mainstream defense methods, while also being adaptable for integration with existing defense strategies to further enhance adversarial robustness.

Geometric Universality of Adversarial Examples in Deep Learning

Attack As Defense: Characterizing Adversarial Examples Using Robustness.

Intriguing Properties of Adversarial Examples

How Can We Deal with Adversarial Examples?

A Simple Explanation for the Existence of Adversarial Examples with Small Hamming Distance

A Universal Defense Strategy Against Adversarial Attacks Based on Attention-Guided

Adversarial Examples in Deep Learning: Characterization and Divergence

Adversarial Margin Maximization Networks

Adversarial examples: attacks and defenses in the physical world

With Friends Like These, Who Needs Adversaries?

Adversarial Examples Exist in Two-Layer ReLU Networks for Low Dimensional Linear Subspaces

Generalizing universal adversarial perturbations for deep neural networks

Towards Transferable Unrestricted Adversarial Examples with Minimum Changes

Why do universal adversarial attacks work on large language models?: Geometry might be the answer

An Interpretive Adversarial Attack Method: Attacking Softmax Gradient Layer-Wise Relevance Propagation Based on Cosine Similarity Constraint and TS-Invariant

Adversarial Examples for Semantic Segmentation and Object Detection

Latent Adversarial Defence with Boundary-guided Generation

Adversarial Attack and Interpretability of the Deep Neural Net-Work from the Geometric Perspective

Universalization of any adversarial attack using very few test examples

Explaining and Harnessing Adversarial Examples

Likelihood Landscapes: A Unifying Principle Behind Many Adversarial Defenses