Sliced Wasserstein adversarial training for improving adversarial robustness

Woojin Lee,Sungyoon Lee,Hoki Kim,Jaewook Lee
DOI: https://doi.org/10.1007/s12652-024-04791-1
IF: 3.662
2024-05-09
Journal of Ambient Intelligence and Humanized Computing
Abstract:Recently, deep-learning-based models have achieved impressive performance on tasks that were previously considered to be extremely challenging. However, recent works have shown that various deep learning models are susceptible to adversarial data samples. In this paper, we propose the sliced Wasserstein adversarial training method to encourage the logit distributions of clean and adversarial data to be similar to each other. We capture the dissimilarity between two distributions using the Wasserstein metric and then align distributions using an end-to-end training process. We present the theoretical background of the motivation for our study by providing generalization error bounds for adversarial data samples. We performed experiments on three standard datasets and the results demonstrate that our method is more robust against white box attacks compared to previous methods.
computer science, information systems,telecommunications, artificial intelligence
What problem does this paper attempt to address?