Abstract:Deep neural networks are proven to be vulnerable to fine-designed adversarial examples, and adversarial defense algorithms draw more and more attention nowadays. Pre-processing based defense is a major strategy, as well as learning robust feature representation has been proven an effective way to boost generalization. However, existing defense works lack considering different depth-level visual features in the training process. In this paper, we first highlight two novel properties of robust features from the feature distribution perspective: 1) \textbf{Diversity}. The robust feature of intra-class samples can maintain appropriate diversity; 2) \textbf{Discriminability}. The robust feature of inter-class samples should ensure adequate separation. We find that state-of-the-art defense methods aim to address both of these mentioned issues well. It motivates us to increase intra-class variance and decrease inter-class discrepancy simultaneously in adversarial training. Specifically, we propose a simple but effective defense based on decoupled visual representation masking. The designed Decoupled Visual Feature Masking (DFM) block can adaptively disentangle visual discriminative features and non-visual features with diverse mask strategies, while the suitable discarding information can disrupt adversarial noise to improve robustness. Our work provides a generic and easy-to-plugin block unit for any former adversarial training algorithm to achieve better protection integrally. Extensive experimental results prove the proposed method can achieve superior performance compared with state-of-the-art defense approaches. The code is publicly available at \href{<a class="link-external link-https" href="https://github.com/chenboluo/Adversarial-defense" rel="external noopener nofollow">this https URL</a>}{<a class="link-external link-https" href="https://github.com/chenboluo/Adversarial-defense" rel="external noopener nofollow">this https URL</a>}.

Adversarial Defense Via the Data-Dependent Activation, Total Variation Minimization, and Adversarial Training

Adversarial Defense via Data Dependent Activation Function and Total Variation Minimization

Attack As Defense: Characterizing Adversarial Examples Using Robustness.

Deep Defense: Training DNNs with Improved Adversarial Robustness

Improving the Robustness of Deep Convolutional Neural Networks Through Feature Learning

DeepDefense: Training Deep Neural Networks with Improved Robustness.

A Universal Defense Strategy Against Adversarial Attacks Based on Attention-Guided

Minimizing Adversarial Training Samples for Robust Image Classifiers: Analysis and Adversarial Example Generator Design

Improved Adversarial Training Through Adaptive Instance-wise Loss Smoothing

Mitigating Adversarial Attacks for Deep Neural Networks by Input Deformation and Augmentation

Designing defensive techniques to handle adversarial attack on deep learning based model

Attacking Adversarial Attacks as A Defense

Efficient Two-Step Adversarial Defense for Deep Neural Networks

Improving Adversarial Robustness via Decoupled Visual Representation Masking

Improving the Robustness of Adversarial Attacks Using an Affine-Invariant Gradient Estimator

Improving Adversarial Robustness via Attention and Adversarial Logit Pairing

A hybrid adversarial training for deep learning model and denoising network resistant to adversarial examples

Enhancing adversarial robustness with randomized interlayer processing

Adversarial Training of Deep Neural Networks Guided by Texture and Structural Information

Boosting Adversarial Training with Hardness-Guided Attack Strategy

Distributed Adversarial Training to Robustify Deep Neural Networks at Scale