Abstract: Deep Neural Networks (DNNs) have been shown to be vulnerable to adversarial examples, which significantly hinders the development of deep learning technologies in domains with high security requirements. Current defense methods, however, often lack universality and are effective only against specific adversarial attacks. This study analyzes adversarial examples through the changes they induce in model attention and classifies attack algorithms into two categories: attention-shifting and attention-attenuation. To counter attention-shifting attacks, a defense module named Feature Pyramid-based Attention Space-guided (FPAS) is proposed, which spatially retracts the shifted attention in adversarial examples, thereby enhancing the model's overall defense capability. To counter attention-attenuation attacks, an Attention-based Non-Local (ANL) defense module is proposed, which strengthens the model's focus on critical features and yields a robust defense model with low implementation cost and minimal intrusion into the original model. By integrating FPAS and ANL into the Wide-ResNet model within a boosting framework, the study demonstrates their synergistic defense capability. Even with eight adversarial samples embedded with adversarial patches, the FPAS_at and ANL_at models improved significantly over the baseline, raising the average defense rate by 5.47% and 7.74%, respectively. Extensive experiments confirm that this universal defense strategy offers comprehensive protection against adversarial attacks at a lower implementation cost than current mainstream defense methods, and that it can be integrated with existing defense strategies to further enhance adversarial robustness.

Self-adaptive logit balancing for deep neural network robustness: Defence and detection of adversarial attacks

Attack As Defense: Characterizing Adversarial Examples Using Robustness

Self-Adaptive Logit Balancing for Deep Learning Robustness in Computer Vision

Adversarial robustness improvement for deep neural networks

Improving Adversarial Robustness via Attention and Adversarial Logit Pairing

Robust Adversarial Attacks on Imperfect Deep Neural Networks in Fault Classification

A Universal Defense Strategy Against Adversarial Attacks Based on Attention-Guided

Analyzing the Noise Robustness of Deep Neural Networks

An ADMM-Based Universal Framework for Adversarial Attacks on Deep Neural Networks

Defensive Dropout for Hardening Deep Neural Networks under Adversarial Attacks

Not So Robust After All: Evaluating the Robustness of Deep Neural Networks to Unseen Adversarial Attacks

Exploring Architectural Ingredients of Adversarially Robust Deep Neural Networks

SoK: Certified Robustness for Deep Neural Networks

NATTACK: Learning the Distributions of Adversarial Examples for an Improved Black-Box Attack on Deep Neural Networks

DeepDefense: Training Deep Neural Networks with Improved Robustness

DeepSafe: A Data-driven Approach for Checking Adversarial Robustness in Neural Networks

Improving Adversarial Robustness Requires Revisiting Misclassified Examples

Deep Defense: Training DNNs with Improved Adversarial Robustness

Towards Adversarial Robustness via Debiased High-Confidence Logit Alignment

Interpreting and Improving Adversarial Robustness of Deep Neural Networks With Neuron Sensitivity

Defending Adversarial Attacks by Correcting Logits