Thwarting Cybersecurity Attacks with Explainable Concept Drift

Ibrahim Shaer,Abdallah Shami

2024-03-19

Abstract:Cyber-security attacks pose a significant threat to the operation of autonomous systems. Particularly impacted are the Heating, Ventilation, and Air Conditioning (HVAC) systems in smart buildings, which depend on data gathered by sensors and Machine Learning (ML) models using the captured data. As such, attacks that alter the readings of these sensors can severely affect the HVAC system operations impacting residents' comfort and energy reduction goals. Such attacks may induce changes in the online data distribution being fed to the ML models, violating the fundamental assumption of similarity in training and testing data distribution. This leads to a degradation in model prediction accuracy due to a phenomenon known as Concept Drift (CD) - the alteration in the relationship between input features and the target variable. Addressing CD requires identifying the source of drift to apply targeted mitigation strategies, a process termed drift explanation. This paper proposes a Feature Drift Explanation (FDE) module to identify the drifting features. FDE utilizes an Auto-encoder (AE) that reconstructs the activation of the first layer of the regression Deep Learning (DL) model and finds their latent representations. When a drift is detected, each feature of the drifting data is replaced by its representative counterpart from the training data. The Minkowski distance is then used to measure the divergence between the altered drifting data and the original training data. The results show that FDE successfully identifies 85.77 % of drifting features and showcases its utility in the DL adaptation method under the CD phenomenon. As a result, the FDE method is an effective strategy for identifying drifting features towards thwarting cyber-security attacks.

Cryptography and Security,Machine Learning

What problem does this paper attempt to address?

### What problems does this paper attempt to solve? This paper aims to address the impact of cybersecurity attacks in intelligent building environments on the performance of machine learning (ML) models, especially threats to heating, ventilation, and air - conditioning (HVAC) systems. Specifically, the paper focuses on the **Concept Drift (CD)** phenomenon, that is, the change in data distribution due to cybersecurity attacks, which affects the prediction accuracy of ML models. #### Main problems: 1. **The impact of cybersecurity attacks on HVAC systems**: - HVAC systems in intelligent buildings rely on sensor data and ML models for automated operations. - Cybersecurity attacks can tamper with sensor readings, leading to changes in data distribution and thus triggering CD. - CD will lead to a decline in the prediction performance of ML models, affecting the comfort of residents and energy - saving goals. 2. **Interpret and locate concept drift**: - When CD occurs, traditional ML models cannot respond effectively because they assume that the distributions of training and test data are similar. - A method is needed to identify the specific features that cause the drift (i.e., which features are affected by the attack) in order to adopt targeted mitigation strategies. - Explaining the reasons for CD is crucial for restoring model performance, especially in deep learning (DL) models. #### Proposed solutions: - The paper proposes a method named **Feature Drift Explanation (FDE)**. - FDE uses an autoencoder (AE) to reconstruct the activations of the first layer of the regression DL model and find its latent representation. - When drift is detected, FDE will replace each drifted feature with a representative feature in the training data and use the Minkowski distance to measure the difference between the modified drifted data and the original training data. - Experimental results show that FDE can successfully identify 85.77% of the drifted features and demonstrate its practicality in DL adaptation methods. #### Summary: By introducing the FDE module, this paper provides an effective method to identify and explain CD caused by cybersecurity attacks, thereby helping to restore and improve the performance of ML models in intelligent building environments.

Thwarting Cybersecurity Attacks with Explainable Concept Drift

Learn to Adapt: Robust Drift Detection in Security Domain

Evolving cybersecurity frontiers: A comprehensive survey on concept drift and feature dynamics aware machine and deep learning in intrusion detection systems

Going Proactive and Explanatory Against Malware Concept Drift

DREAM: Combating Concept Drift with Explanatory Detection and Adaptation in Malware Classification

Identifying Malicious Traffic under Concept Drift Based on Intraclass Consistency Enhanced Variational Autoencoder

Model Based Explanations of Concept Drift

Explaining Network Intrusion Detection System Using Explainable AI Framework

Optimized Deep Learning Models for Malware Detection under Concept Drift

SFC-NIDS: a sustainable and explainable flow filtering based concept drift-driven security approach for network introspection

DeepSecDrive: an Explainable Deep Learning Framework for Real-Time Detection of Cyberattack in In-Vehicle Networks

ReCDA: Concept Drift Adaptation with Representation Enhancement for Network Intrusion Detection

Concept Drift Detection in Federated Networked Systems

Explaining Drift using Shapley Values

Privacy Drift: Evolving Privacy Concerns in Incremental Learning

LSTMDD: an optimized LSTM-based drift detector for concept drift in dynamic cloud computing

Spurious Correlations in Concept Drift: Can Explanatory Interaction Help?

A Critical Assessment of Interpretable and Explainable Machine Learning for Intrusion Detection

A novel framework for concept drift detection using autoencoders for classification problems in data streams

Counteracting Concept Drift by Learning with Future Malware Predictions

Detecting and rationalizing concept drift: A feature-level approach for understanding cause–effect relationships in dynamic environments