Introducing Adaptive Continuous Adversarial Training (ACAT) to Enhance ML Robustness

Mohamed elShehaby,Aditya Kotha,Ashraf Matrawy
2024-05-29
Abstract:Adversarial training enhances the robustness of Machine Learning (ML) models against adversarial attacks. However, obtaining labeled training and adversarial training data in network/cybersecurity domains is challenging and costly. Therefore, this letter introduces Adaptive Continuous Adversarial Training (ACAT), a method that integrates adversarial training samples into the model during continuous learning sessions using real-world detected adversarial data. Experimental results with a SPAM detection dataset demonstrate that ACAT reduces the time required for adversarial sample detection compared to traditional processes. Moreover, the accuracy of the under-attack ML-based SPAM filter increased from 69% to over 88% after just three retraining sessions.
Machine Learning,Cryptography and Security,Networking and Internet Architecture
What problem does this paper attempt to address?
The main goal of this paper is to enhance the robustness of machine learning (ML) models against adversarial attacks by introducing a new method—Adaptive Continuous Adversarial Training (ACAT). Specifically, ACAT aims to address the following issues: 1. **Lack of labeled data**: In the fields of networking and cybersecurity, obtaining labeled data for training is both difficult and expensive. ACAT addresses this issue by utilizing actual detected adversarial data for adversarial training. 2. **Efficiency of adversarial training**: Traditional adversarial training processes are time-consuming, especially in the field of cybersecurity where attack frequency is high. ACAT reduces the time required for adversarial sample detection by integrating adversarial samples online. 3. **Catastrophic forgetting**: During periodic continuous training, the model may forget previously learned knowledge. ACAT employs the Elastic Weight Consolidation (EWC) method to mitigate this phenomenon, ensuring that the model retains prior knowledge while continuously adapting to new data. Through experimental validation on a spam filtering task, ACAT not only improved the model's defense capability against adversarial attacks but also increased the accuracy of the spam filter under attack from 69% to 88%, and the decision speed was up to four times faster than traditional methods. These results indicate that ACAT can effectively accumulate knowledge about adversarial samples while avoiding the problem of catastrophic forgetting.