Abstract:Deep Neural Networks (DNNs) have been shown to be vulnerable to adversarial examples. The existence of adversarial examples significantly hinders the development of deep learning technologies in domains with high-security requirements. However, current defense methods often lack universality, being effective only against specific adversarial attacks. This study focuses on analyzing adversarial examples through changes in model attention, classifying attack algorithms into attention-shifting and attention-attenuation categories. To counter attention-shifting attacks, a defense module named Feature Pyramid-based Attention Space-guided (FPAS) is proposed, which spatially retracts the shifting attention in adversarial examples, thereby enhancing the model's overall defense capability. Attention-based Non-Local (ANL) is a proposed defense module to counter attention-attenuation attacks. This module enhances the model's focus on critical features, efficiently constructing a robust defense model with low implementation cost and minimal intrusion into the original model. By integrating FPAS and ANL into the Wide-ResNet model within a boosting framework, the study demonstrates their synergistic defense capability. Even with eight adversarial samples embedded with adversarial patches, our FPAS_at and ANL_at models demonstrated significant improvements over the baseline, enhancing the average defense rate by 5.47% and 7.74%, respectively. Extensive experiments confirm that this universal defense strategy offers comprehensive protection against adversarial attacks at a lower implementation cost compared to current mainstream defense methods, while also being adaptable for integration with existing defense strategies to further enhance adversarial robustness.

Attention, Please! Adversarial Defense via Activation Rectification and Preservation

Attack As Defense: Characterizing Adversarial Examples Using Robustness.

Adversarial for Good – Defending Training Data Privacy with Adversarial Attack Wisdom

Associative Adversarial Learning Based on Selective Attack

A Universal Defense Strategy Against Adversarial Attacks Based on Attention-Guided

Robust Superpixel-Guided Attentional Adversarial Attack

Defense against adversarial attacks based on color space transformation

Dual Attention Suppression Attack: Generate Adversarial Camouflage in Physical World

Reversible Attack based on Local Visual Adversarial Perturbation

Attention Masks Help Adversarial Attacks to Bypass Safety Detectors

A defense method based on attention mechanism against traffic sign adversarial samples

Give Me Your Attention: Dot-Product Attention Considered Harmful for Adversarial Patch Robustness

A Simplified Heuristic Version of Raviv's Algorithm for Using Context in Text Recognition

SAD: Saliency-based Defenses Against Adversarial Examples

Detection defense against adversarial attacks with saliency map

Attacking Adversarial Attacks as A Defense

Object-Attentional Untargeted Adversarial Attack

AICAttack: Adversarial Image Captioning Attack with Attention-Based Optimization

Perturbing Attention Gives You More Bang for the Buck: Subtle Imaging Perturbations That Efficiently Fool Customized Diffusion Models

Adversarial Attacks against Deep Saliency Models

Perception Improvement for Free: Exploring Imperceptible Black-box Adversarial Attacks on Image Classification