Randomized Adversarial Training via Taylor Expansion

Gaojie Jin,Xinping Yi,Dengyu Wu,Ronghui Mu,Xiaowei Huang

2023-03-19

Abstract:In recent years, there has been an explosion of research into developing more robust deep neural networks against adversarial examples. Adversarial training appears as one of the most successful methods. To deal with both the robustness against adversarial examples and the accuracy over clean examples, many works develop enhanced adversarial training methods to achieve various trade-offs between them. Leveraging over the studies that smoothed update on weights during training may help find flat minima and improve generalization, we suggest reconciling the robustness-accuracy trade-off from another perspective, i.e., by adding random noise into deterministic weights. The randomized weights enable our design of a novel adversarial training method via Taylor expansion of a small Gaussian noise, and we show that the new adversarial training method can flatten loss landscape and find flat minima. With PGD, CW, and Auto Attacks, an extensive set of experiments demonstrate that our method enhances the state-of-the-art adversarial training methods, boosting both robustness and clean accuracy. The code is available at <a class="link-external link-https" href="https://github.com/Alexkael/Randomized-Adversarial-Training" rel="external noopener nofollow">this https URL</a>.

Machine Learning,Artificial Intelligence

What problem does this paper attempt to address?

### What problem does this paper attempt to solve? This paper primarily aims to address the trade-off between the robustness of deep neural networks against adversarial examples and the accuracy on clean samples. Specifically, the paper proposes a novel stochastic adversarial training method that introduces random noise through Taylor expansion to smooth weight updates and find flatter minima in the loss landscape. This method is designed to simultaneously improve the model's robustness to adversarial examples and accuracy on clean samples. ### Main Contributions 1. **Theoretical Analysis**: By introducing random weights, the paper theoretically explores the smoothness of weight updates and the flatness of the loss landscape, demonstrating that this method can find flatter minima. 2. **New Method Proposal**: Based on Taylor expansion, a new adversarial training method is proposed, which optimizes the first and second-order terms of the loss function by adding small Gaussian noise to the weights, thereby enhancing the model's robustness. 3. **Experimental Validation**: Extensive experiments validate the effectiveness of this method. On multiple datasets (CIFAR-10, CIFAR-100, SVHN) and different network architectures (ResNet, WideResNet, VGG, MobileNetV2), this method significantly improves adversarial robustness and accuracy on clean samples. Notably, under Auto Attack, the performance surpasses some existing methods.

Randomized Adversarial Training via Taylor Expansion

Towards Robust DNNs: an Taylor Expansion-Based Method for Generating Powerful Adversarial Examples.

GAAT: Group Adaptive Adversarial Training to Improve the Trade-Off Between Robustness and Accuracy

Output Randomization: A Novel Defense for both White-box and Black-box Adversarial Models

Toward Intrinsic Adversarial Robustness Through Probabilistic Training.

An efficient adversarial example generation algorithm based on an accelerated gradient iterative fast gradient

Towards Robust Training of Neural Networks by Regularizing Adversarial Gradients

You Only Propagate Once: Accelerating Adversarial Training Via Maximal Principle

Towards Noise-Robust Neural Networks via Progressive Adversarial Training

Boosting Adversarial Transferability by Achieving Flat Local Maxima

Towards Noise-Robust Neural Networks Via Progressive Adversarial Training

Fast Adversarial Training with Noise Augmentation: A Unified Perspective on RandStart and GradAlign

You Only Propagate Once: Painless Adversarial Training Using Maximal Principle

L G ] 1 9 Ju n 20 19 Convergence of Adversarial Training in Overparametrized Networks

Efficient Two-Step Adversarial Defense for Deep Neural Networks

Stability Analysis and Generalization Bounds of Adversarial Training

Improve Adversarial Robustness Via Probabilistic Distributions Decoupled Network While Guaranteeing Clean Performance

Variational Adversarial Defense: A Bayes Perspective for Adversarial Training.

Improving the Transferability of Adversarial Examples with a Noise Data Enhancement Framework and Random Erasing

Adversarial Training: embedding adversarial perturbations into the parameter space of a neural network to build a robust system

Strong Transferable Adversarial Attacks via Ensembled Asymptotically Normal Distribution Learning