Abstract:Deep neural networks are proven to be vulnerable to fine-designed adversarial examples, and adversarial defense algorithms draw more and more attention nowadays. Pre-processing based defense is a major strategy, as well as learning robust feature representation has been proven an effective way to boost generalization. However, existing defense works lack considering different depth-level visual features in the training process. In this paper, we first highlight two novel properties of robust features from the feature distribution perspective: 1) \textbf{Diversity}. The robust feature of intra-class samples can maintain appropriate diversity; 2) \textbf{Discriminability}. The robust feature of inter-class samples should ensure adequate separation. We find that state-of-the-art defense methods aim to address both of these mentioned issues well. It motivates us to increase intra-class variance and decrease inter-class discrepancy simultaneously in adversarial training. Specifically, we propose a simple but effective defense based on decoupled visual representation masking. The designed Decoupled Visual Feature Masking (DFM) block can adaptively disentangle visual discriminative features and non-visual features with diverse mask strategies, while the suitable discarding information can disrupt adversarial noise to improve robustness. Our work provides a generic and easy-to-plugin block unit for any former adversarial training algorithm to achieve better protection integrally. Extensive experimental results prove the proposed method can achieve superior performance compared with state-of-the-art defense approaches. The code is publicly available at \href{<a class="link-external link-https" href="https://github.com/chenboluo/Adversarial-defense" rel="external noopener nofollow">this https URL</a>}{<a class="link-external link-https" href="https://github.com/chenboluo/Adversarial-defense" rel="external noopener nofollow">this https URL</a>}.

Diversity supporting robustness: Enhancing adversarial robustness via differentiated ensemble predictions

Improving Adversarial Robustness Via Promoting Ensemble Diversity

Improving Adversarial Robustness via Promoting Ensemble Diversity.

DVERGE: Diversifying Vulnerabilities for Enhanced Robust Generation of Ensembles

LAFED: Towards robust ensemble models via latent feature diversification

Dynamic ensemble selection based on Deep Neural Network Uncertainty Estimation for Adversarial Robustness

Ensemble Adversarial Defense via Integration of Multiple Dispersed Low Curvature Models

Towards Robust Neural Networks via Orthogonal Diversity

On the Certified Robustness for Ensemble Models and Beyond

Adversarial Robust Decision-Making under Uncertainty Learning and Dynamic Ensemble Selection

Perturbation diversity certificates robust generalization

Deep Neural Network Ensembles against Deception: Ensemble Diversity, Accuracy and Robustness

Enhancing Adversarial Robustness via Uncertainty-Aware Distributional Adversarial Training

Synergy-of-Experts: Collaborate to Improve Adversarial Robustness

Voting based ensemble improves robustness of defensive models

Improving Adversarial Robustness of Ensemble Classifiers by Diversified Feature Selection and Stochastic Aggregation

Self-ensemble Adversarial Training for Improved Robustness

Robust Mode Connectivity-Oriented Adversarial Defense: Enhancing Neural Network Robustness Against Diversified $\ell_p$ Attacks

Exploring Model Learning Heterogeneity for Boosting Ensemble Robustness

Improving Adversarial Robustness via Decoupled Visual Representation Masking