Abstract:Deep neural networks are proven to be vulnerable to fine-designed adversarial examples, and adversarial defense algorithms draw more and more attention nowadays. Pre-processing based defense is a major strategy, as well as learning robust feature representation has been proven an effective way to boost generalization. However, existing defense works lack considering different depth-level visual features in the training process. In this paper, we first highlight two novel properties of robust features from the feature distribution perspective: 1) \textbf{Diversity}. The robust feature of intra-class samples can maintain appropriate diversity; 2) \textbf{Discriminability}. The robust feature of inter-class samples should ensure adequate separation. We find that state-of-the-art defense methods aim to address both of these mentioned issues well. It motivates us to increase intra-class variance and decrease inter-class discrepancy simultaneously in adversarial training. Specifically, we propose a simple but effective defense based on decoupled visual representation masking. The designed Decoupled Visual Feature Masking (DFM) block can adaptively disentangle visual discriminative features and non-visual features with diverse mask strategies, while the suitable discarding information can disrupt adversarial noise to improve robustness. Our work provides a generic and easy-to-plugin block unit for any former adversarial training algorithm to achieve better protection integrally. Extensive experimental results prove the proposed method can achieve superior performance compared with state-of-the-art defense approaches. The code is publicly available at \href{<a class="link-external link-https" href="https://github.com/chenboluo/Adversarial-defense" rel="external noopener nofollow">this https URL</a>}{<a class="link-external link-https" href="https://github.com/chenboluo/Adversarial-defense" rel="external noopener nofollow">this https URL</a>}.

Improving Model Robustness Against Adversarial Examples with Redundant Fully Connected Layer.

Attack As Defense: Characterizing Adversarial Examples Using Robustness.

Improving Adversarial Robustness of 3D Point Cloud Classification Models

Improving the Robustness of Deep Convolutional Neural Networks Through Feature Learning

A Universal Defense Strategy Against Adversarial Attacks Based on Attention-Guided

Robust Adversarial Attacks on Imperfect Deep Neural Networks in Fault Classification

Improving Deep Neural Network Robustness with Siamese Empowered Adversarial Training

Resilience from Diversity: Population-based approach to harden models against adversarial attacks

DeepDefense: Training Deep Neural Networks with Improved Robustness.

Adversarial robustness improvement for deep neural networks

When NAS Meets Robustness: In Search of Robust Architectures against Adversarial Attacks

Are You Confident That You Have Successfully Generated Adversarial Examples?

Exploring Robust Features for Improving Adversarial Robustness

Improving Adversarial Robustness via Feature Pattern Consistency Constraint

Robust Mode Connectivity-Oriented Adversarial Defense: Enhancing Neural Network Robustness Against Diversified $\ell_p$ Attacks

Fixed Inter-Neuron Covariability Induces Adversarial Robustness

Towards Deep Learning Models Resistant to Adversarial Attacks

Towards Robustness against Unsuspicious Adversarial Examples

Improving Adversarial Robustness Requires Revisiting Misclassified Examples.

Improving the Robustness of Deep Neural Networks via Adversarial Training with Triplet Loss

Improving Adversarial Robustness via Decoupled Visual Representation Masking