Reinforcement Learning for Supply Chain Attacks Against Frequency and Voltage Control

Amr S. Mohamed,Sumin Lee,Deepa Kundur
2023-09-12
Abstract:The ongoing modernization of the power system, involving new equipment installations and upgrades, exposes the power system to the introduction of malware into its operation through supply chain attacks. Supply chain attacks present a significant threat to power systems, allowing cybercriminals to bypass network defenses and execute deliberate attacks at the physical layer. Given the exponential advancements in machine intelligence, cybercriminals will leverage this technology to create sophisticated and adaptable attacks that can be incorporated into supply chain attacks. We demonstrate the use of reinforcement learning for developing intelligent attacks incorporated into supply chain attacks against generation control devices. We simulate potential disturbances impacting frequency and voltage regulation. The presented method can provide valuable guidance for defending against supply chain attacks.
Signal Processing,Systems and Control
What problem does this paper attempt to address?
The paper primarily explores how to use Reinforcement Learning (RL) to simulate supply chain attacks and conduct intelligent attacks on frequency and voltage control devices in power systems, such as Automatic Voltage Regulators (AVR), Power System Stabilizers (PSS), and Governors. Specifically: 1. **Research Background**: With the modernization of power systems, the installation and upgrade of new equipment have exposed power systems to the risk of supply chain attacks. These attacks inject malware into the supply chain, causing network defenses to fail and executing targeted attacks on the physical layer. 2. **Research Objective**: The authors developed an intelligent attack method using reinforcement learning algorithms to demonstrate how such attacks can affect the frequency and voltage stability of power systems. Additionally, this method can provide valuable guidance for preventing supply chain attacks. 3. **Technical Means**: The paper employs the Proximal Policy Optimization (PPO) algorithm to train the malware. By simulating the effects of attacks in different scenarios, the effectiveness of the proposed method is validated. Specific experiments include: - Attacking the Governor IED, causing frequency fluctuations. - Simultaneously attacking multiple devices (such as the Governors of G1 and G3), resulting in more severe frequency fluctuations. - Attacking PSS and AVR IEDs to amplify frequency fluctuations. - Combining attacks on multiple AVR IEDs to further exacerbate frequency fluctuations. 4. **Conclusion and Outlook**: The research shows that supply chain attacks designed through reinforcement learning can significantly reduce the stability and quality of power supply in power systems. Therefore, it is necessary to develop corresponding defense mechanisms to counter such intelligent attacks in the future. The paper calls for the adoption of reinforcement learning techniques to predict and defend against intelligent supply chain attacks, thereby enhancing the security of power systems.