Introducing Adaptive Continuous Adversarial Training (ACAT) to Enhance ML Robustness

Mohamed elShehaby,Aditya Kotha,Ashraf Matrawy

2024-05-29

Abstract:Adversarial training enhances the robustness of Machine Learning (ML) models against adversarial attacks. However, obtaining labeled training and adversarial training data in network/cybersecurity domains is challenging and costly. Therefore, this letter introduces Adaptive Continuous Adversarial Training (ACAT), a method that integrates adversarial training samples into the model during continuous learning sessions using real-world detected adversarial data. Experimental results with a SPAM detection dataset demonstrate that ACAT reduces the time required for adversarial sample detection compared to traditional processes. Moreover, the accuracy of the under-attack ML-based SPAM filter increased from 69% to over 88% after just three retraining sessions.

Machine Learning,Cryptography and Security,Networking and Internet Architecture

What problem does this paper attempt to address?

The main goal of this paper is to enhance the robustness of machine learning (ML) models against adversarial attacks by introducing a new method—Adaptive Continuous Adversarial Training (ACAT). Specifically, ACAT aims to address the following issues: 1. **Lack of labeled data**: In the fields of networking and cybersecurity, obtaining labeled data for training is both difficult and expensive. ACAT addresses this issue by utilizing actual detected adversarial data for adversarial training. 2. **Efficiency of adversarial training**: Traditional adversarial training processes are time-consuming, especially in the field of cybersecurity where attack frequency is high. ACAT reduces the time required for adversarial sample detection by integrating adversarial samples online. 3. **Catastrophic forgetting**: During periodic continuous training, the model may forget previously learned knowledge. ACAT employs the Elastic Weight Consolidation (EWC) method to mitigate this phenomenon, ensuring that the model retains prior knowledge while continuously adapting to new data. Through experimental validation on a spam filtering task, ACAT not only improved the model's defense capability against adversarial attacks but also increased the accuracy of the spam filter under attack from 69% to 88%, and the decision speed was up to four times faster than traditional methods. These results indicate that ACAT can effectively accumulate knowledge about adversarial samples while avoiding the problem of catastrophic forgetting.

Introducing Adaptive Continuous Adversarial Training (ACAT) to Enhance ML Robustness

Boosting Adversarial Training in Safety-Critical Systems Through Boundary Data Selection

CAT:Collaborative Adversarial Training

SCAT: Robust Self-supervised Contrastive Learning via Adversarial Training for Text Classification

CAT: Customized Adversarial Training for Improved Robustness

Improved Adversarial Training Through Adaptive Instance-wise Loss Smoothing

CATIL: Customized Adversarial Training based on Instance Loss

Advanced Techniques for Improving Model Robustness in Adversarial Machine Learning

Sample Efficient Detection and Classification of Adversarial Attacks via Self-Supervised Embeddings

Towards Robustifying Image Classifiers against the Perils of Adversarial Attacks on Artificial Intelligence Systems

Dynamic Label Adversarial Training for Deep Learning Robustness Against Adversarial Attacks

Mutual Adversarial Training: Learning together is better than going alone

MetaAdvDet: Towards Robust Detection of Evolving Adversarial Attacks

Adaptive Adversarial Training to Improve Adversarial Robustness of DNNs for Medical Image Segmentation and Detection

ATWM: Defense against adversarial malware based on adversarial training

Improving Machine Learning Robustness via Adversarial Training

On the (Statistical) Detection of Adversarial Examples

Defending Against Unforeseen Failure Modes with Latent Adversarial Training

Towards Bridging the gap between Empirical and Certified Robustness against Adversarial Examples

AdvCat: Domain-Agnostic Robustness Assessment for Cybersecurity-Critical Applications with Categorical Inputs

A Simplified Heuristic Version of Raviv's Algorithm for Using Context in Text Recognition