Abstract: Deep Neural Networks (DNNs) have been shown to be vulnerable to adversarial examples, and this vulnerability significantly hinders the adoption of deep learning in domains with high security requirements. Current defense methods, however, often lack universality and are effective only against specific adversarial attacks. This study analyzes adversarial examples through the changes they cause in model attention and classifies attack algorithms into two categories: attention-shifting and attention-attenuation. To counter attention-shifting attacks, a defense module named Feature Pyramid-based Attention Space-guided (FPAS) is proposed, which spatially retracts the shifted attention in adversarial examples, thereby enhancing the model's overall defense capability. To counter attention-attenuation attacks, a second defense module, Attention-based Non-Local (ANL), is proposed. This module strengthens the model's focus on critical features and efficiently yields a robust defense model with low implementation cost and minimal intrusion into the original model. By integrating FPAS and ANL into the Wide-ResNet model within a boosting framework, the study demonstrates their synergistic defense capability. Even with eight adversarial samples embedded with adversarial patches, our FPAS_at and ANL_at models showed significant improvements over the baseline, raising the average defense rate by 5.47% and 7.74%, respectively. Extensive experiments confirm that this universal defense strategy offers comprehensive protection against adversarial attacks at a lower implementation cost than current mainstream defense methods, and that it can be integrated with existing defense strategies to further enhance adversarial robustness.

EEJE: Two-Step Input Transformation for Robust DNN Against Adversarial Examples

Attack As Defense: Characterizing Adversarial Examples Using Robustness.

Towards Robust DNNs: an Taylor Expansion-Based Method for Generating Powerful Adversarial Examples.

A Universal Defense Strategy Against Adversarial Attacks Based on Attention-Guided

Efficient Two-Step Adversarial Defense for Deep Neural Networks

EagleEye: Attack-Agnostic Defense against Adversarial Inputs (Technical Report)

Learning Defense Transformers for Counterattacking Adversarial Examples

Mitigating Adversarial Attacks for Deep Neural Networks by Input Deformation and Augmentation

Image Transformation can make Neural Networks more robust against Adversarial Examples

Admix: Enhancing the Transferability of Adversarial Attacks

Detecting Adversarial Examples Through Image Transformation

DeepDefense: Training Deep Neural Networks with Improved Robustness.

Improving robustness of deep neural networks via large-difference transformation

EAD: Elastic-Net Attacks to Deep Neural Networks via Adversarial Examples

Adversarial Examples: Opportunities and Challenges

Towards Understanding and Harnessing the Effect of Image Transformation in Adversarial Detection

Are You Confident That You Have Successfully Generated Adversarial Examples?

Deep Defense: Training DNNs with Improved Adversarial Robustness

Adversarial robustness improvement for deep neural networks

Exploring Architectural Ingredients of Adversarially Robust Deep Neural Networks

Adversarial Examples: Attacks and Defenses for Deep Learning