Abstract:Deep neural networks are proven to be vulnerable to fine-designed adversarial examples, and adversarial defense algorithms draw more and more attention nowadays. Pre-processing based defense is a major strategy, as well as learning robust feature representation has been proven an effective way to boost generalization. However, existing defense works lack considering different depth-level visual features in the training process. In this paper, we first highlight two novel properties of robust features from the feature distribution perspective: 1) \textbf{Diversity}. The robust feature of intra-class samples can maintain appropriate diversity; 2) \textbf{Discriminability}. The robust feature of inter-class samples should ensure adequate separation. We find that state-of-the-art defense methods aim to address both of these mentioned issues well. It motivates us to increase intra-class variance and decrease inter-class discrepancy simultaneously in adversarial training. Specifically, we propose a simple but effective defense based on decoupled visual representation masking. The designed Decoupled Visual Feature Masking (DFM) block can adaptively disentangle visual discriminative features and non-visual features with diverse mask strategies, while the suitable discarding information can disrupt adversarial noise to improve robustness. Our work provides a generic and easy-to-plugin block unit for any former adversarial training algorithm to achieve better protection integrally. Extensive experimental results prove the proposed method can achieve superior performance compared with state-of-the-art defense approaches. The code is publicly available at \href{<a class="link-external link-https" href="https://github.com/chenboluo/Adversarial-defense" rel="external noopener nofollow">this https URL</a>}{<a class="link-external link-https" href="https://github.com/chenboluo/Adversarial-defense" rel="external noopener nofollow">this https URL</a>}.

Defense Against Adversarial Attacks Using Feature Scattering-based Adversarial Training

Attack As Defense: Characterizing Adversarial Examples Using Robustness.

An Adversarial Attack Via Feature Contributive Regions

Adversarial for Good – Defending Training Data Privacy with Adversarial Attack Wisdom

Scattering Model Guided Adversarial Examples for SAR Target Recognition: Attack and Defense

Feature Augmentation for Adversarial Robustness

Enhancing Adversarial Training with Feature Separability

Adversarial Training Can Provably Improve Robustness: Theoretical Analysis of Feature Learning Process Under Structured Data

Enhancing Adversarial Robustness via Uncertainty-Aware Distributional Adversarial Training

Improving Adversarial Robustness via Decoupled Visual Representation Masking

Adversarial Training and Robustness for Multiple Perturbations

Can Adversarial Training Be Manipulated By Non-Robust Features?

Ensemble Adversarial Training: Attacks and Defenses

Exploring Robust Features for Improving Adversarial Robustness

Toward Adversarial Robustness via Semi-supervised Robust Training

On the Vulnerability of Adversarially Trained Models Against Two-faced Attacks

Understanding the Robustness of Randomized Feature Defense Against Query-Based Adversarial Attacks

Adversarial Training for Improving Model Robustness? Look at Both Prediction and Interpretation

Adv-4-Adv: Thwarting Changing Adversarial Perturbations via Adversarial Domain Adaptation

Adversarial Distributional Training for Robust Deep Learning

Successful Daptomycin Treatment for Staphylococcus lugdunensis Endocarditis