Abstract: Deep neural networks can be easily fooled into making incorrect predictions when the input is corrupted by adversarial perturbations: human-imperceptible artificial noise. So far, adversarial training has been the most successful defense against such adversarial attacks. This work focuses on improving adversarial training to boost adversarial robustness. We first analyze, from an instance-wise perspective, how adversarial vulnerability evolves during adversarial training. We find that, during training, an overall reduction of adversarial loss is achieved at the cost of making a considerable proportion of training samples more vulnerable to adversarial attack, which results in an uneven distribution of adversarial vulnerability among data. Such "uneven vulnerability" is prevalent across several popular robust training methods and, more importantly, relates to overfitting in adversarial training. Motivated by this observation, we propose a new adversarial training method: Instance-adaptive Smoothness Enhanced Adversarial Training (ISEAT). It jointly smooths both the input and weight loss landscapes in an adaptive, instance-specific way, enhancing robustness more for those samples with higher adversarial vulnerability. Extensive experiments demonstrate the superiority of our method over existing defense methods. Notably, our method, when combined with the latest data augmentation and semi-supervised learning techniques, achieves state-of-the-art robustness against $\ell_{\infty}$-norm constrained attacks on CIFAR10 of 59.32% for Wide ResNet34-10 without extra data, and 61.55% for Wide ResNet28-10 with extra data. Code is available at https://github.com/TreeLLi/Instance-adaptive-Smoothness-Enhanced-AT.

Push Stricter to Decide Better: A Class-Conditional Feature Adaptive Framework for Improving Adversarial Robustness

Attack As Defense: Characterizing Adversarial Examples Using Robustness

Feature Augmentation for Adversarial Robustness

Boosting Adversarial Training in Safety-Critical Systems Through Boundary Data Selection

GAAT: Group Adaptive Adversarial Training to Improve the Trade-Off Between Robustness and Accuracy

Towards Fairness-Aware Adversarial Learning

CFA: Class-wise Calibrated Fair Adversarial Training

Adaptive Feature Alignment for Adversarial Training

Adversarial Feature Alignment: Balancing Robustness and Accuracy in Deep Learning via Adversarial Training

Enhancing Robust Representation in Adversarial Training: Alignment and Exclusion Criteria

Improving Robust Fairness via Balance Adversarial Training

Improving Fast Adversarial Training via Self-Knowledge Guidance

Strength-Adaptive Adversarial Training

Class aware robust training

Class-aware domain adaptation for improving adversarial robustness

Improving Adversarial Robustness via Feature Pattern Consistency Constraint

Adversarial Training Can Provably Improve Robustness: Theoretical Analysis of Feature Learning Process Under Structured Data

FAIR-TAT: Improving Model Fairness Using Targeted Adversarial Training

Improved Adversarial Training Through Adaptive Instance-wise Loss Smoothing

To be Robust or to be Fair: Towards Fairness in Adversarial Training

Robustness through Cognitive Dissociation Mitigation in Contrastive Adversarial Training