Abstract:Deep Neural Networks (DNNs) have been shown to be vulnerable to adversarial examples. The existence of adversarial examples significantly hinders the development of deep learning technologies in domains with high-security requirements. However, current defense methods often lack universality, being effective only against specific adversarial attacks. This study focuses on analyzing adversarial examples through changes in model attention, classifying attack algorithms into attention-shifting and attention-attenuation categories. To counter attention-shifting attacks, a defense module named Feature Pyramid-based Attention Space-guided (FPAS) is proposed, which spatially retracts the shifting attention in adversarial examples, thereby enhancing the model's overall defense capability. Attention-based Non-Local (ANL) is a proposed defense module to counter attention-attenuation attacks. This module enhances the model's focus on critical features, efficiently constructing a robust defense model with low implementation cost and minimal intrusion into the original model. By integrating FPAS and ANL into the Wide-ResNet model within a boosting framework, the study demonstrates their synergistic defense capability. Even with eight adversarial samples embedded with adversarial patches, our FPAS_at and ANL_at models demonstrated significant improvements over the baseline, enhancing the average defense rate by 5.47% and 7.74%, respectively. Extensive experiments confirm that this universal defense strategy offers comprehensive protection against adversarial attacks at a lower implementation cost compared to current mainstream defense methods, while also being adaptable for integration with existing defense strategies to further enhance adversarial robustness.

Minimizing Adversarial Training Samples for Robust Image Classifiers: Analysis and Adversarial Example Generator Design

Attack As Defense: Characterizing Adversarial Examples Using Robustness.

Towards Robust DNNs: an Taylor Expansion-Based Method for Generating Powerful Adversarial Examples.

A Universal Defense Strategy Against Adversarial Attacks Based on Attention-Guided

Feature Augmentation for Adversarial Robustness

Improving Adversarial Robustness Requires Revisiting Misclassified Examples.

Robust Adversarial Examples Against Scale Transformation Via Generative Network

Are You Confident That You Have Successfully Generated Adversarial Examples?

DeepDefense: Training Deep Neural Networks with Improved Robustness.

An efficient adversarial example generation algorithm based on an accelerated gradient iterative fast gradient

Feature Denoising for Improving Adversarial Robustness

Generating Adversarial Examples with Adversarial Networks

Towards Deep Learning Models Resistant to Adversarial Attacks

A Direct Approach to Robust Deep Learning Using Adversarial Networks

Image classification adversarial attack with improved resizing transformation and ensemble models

Towards Robust Detection of Adversarial Examples

Improving Adversarial Robustness via Attention and Adversarial Logit Pairing

Robust Superpixel-Guided Attentional Adversarial Attack

Deep Defense: Training DNNs with Improved Adversarial Robustness

Towards Robust Training of Neural Networks by Regularizing Adversarial Gradients

Image Adversarial Example Generation Method Based on Adaptive Parameter Adjustable Differential Evolution