Thwarting Cybersecurity Attacks with Explainable Concept Drift

Ibrahim Shaer,Abdallah Shami
2024-03-19
Abstract:Cyber-security attacks pose a significant threat to the operation of autonomous systems. Particularly impacted are the Heating, Ventilation, and Air Conditioning (HVAC) systems in smart buildings, which depend on data gathered by sensors and Machine Learning (ML) models using the captured data. As such, attacks that alter the readings of these sensors can severely affect the HVAC system operations impacting residents' comfort and energy reduction goals. Such attacks may induce changes in the online data distribution being fed to the ML models, violating the fundamental assumption of similarity in training and testing data distribution. This leads to a degradation in model prediction accuracy due to a phenomenon known as Concept Drift (CD) - the alteration in the relationship between input features and the target variable. Addressing CD requires identifying the source of drift to apply targeted mitigation strategies, a process termed drift explanation. This paper proposes a Feature Drift Explanation (FDE) module to identify the drifting features. FDE utilizes an Auto-encoder (AE) that reconstructs the activation of the first layer of the regression Deep Learning (DL) model and finds their latent representations. When a drift is detected, each feature of the drifting data is replaced by its representative counterpart from the training data. The Minkowski distance is then used to measure the divergence between the altered drifting data and the original training data. The results show that FDE successfully identifies 85.77 % of drifting features and showcases its utility in the DL adaptation method under the CD phenomenon. As a result, the FDE method is an effective strategy for identifying drifting features towards thwarting cyber-security attacks.
Cryptography and Security,Machine Learning
What problem does this paper attempt to address?
### What problems does this paper attempt to solve? This paper aims to address the impact of cybersecurity attacks in intelligent building environments on the performance of machine learning (ML) models, especially threats to heating, ventilation, and air - conditioning (HVAC) systems. Specifically, the paper focuses on the **Concept Drift (CD)** phenomenon, that is, the change in data distribution due to cybersecurity attacks, which affects the prediction accuracy of ML models. #### Main problems: 1. **The impact of cybersecurity attacks on HVAC systems**: - HVAC systems in intelligent buildings rely on sensor data and ML models for automated operations. - Cybersecurity attacks can tamper with sensor readings, leading to changes in data distribution and thus triggering CD. - CD will lead to a decline in the prediction performance of ML models, affecting the comfort of residents and energy - saving goals. 2. **Interpret and locate concept drift**: - When CD occurs, traditional ML models cannot respond effectively because they assume that the distributions of training and test data are similar. - A method is needed to identify the specific features that cause the drift (i.e., which features are affected by the attack) in order to adopt targeted mitigation strategies. - Explaining the reasons for CD is crucial for restoring model performance, especially in deep learning (DL) models. #### Proposed solutions: - The paper proposes a method named **Feature Drift Explanation (FDE)**. - FDE uses an autoencoder (AE) to reconstruct the activations of the first layer of the regression DL model and find its latent representation. - When drift is detected, FDE will replace each drifted feature with a representative feature in the training data and use the Minkowski distance to measure the difference between the modified drifted data and the original training data. - Experimental results show that FDE can successfully identify 85.77% of the drifted features and demonstrate its practicality in DL adaptation methods. #### Summary: By introducing the FDE module, this paper provides an effective method to identify and explain CD caused by cybersecurity attacks, thereby helping to restore and improve the performance of ML models in intelligent building environments.