Optimal Attack Strategy of Power Grid based on Double Q-learning Algorithm

Xianxu Li,Wei Hu,Penglin Hou,Tao Shang,Xueqin Gao,Da Li
DOI: https://doi.org/10.1109/ei252483.2021.9713550
2021-01-01
Abstract:The development of modern communication and information technology promotes the development of traditional power system to smart grid, but it also brings great challenges to security. In this context, some random disturbance leads to line disconnected, and power flow transfer and hidden fault will lead to cascading failure. We analyzed the best attack strategy of smart grid in such an environment with randomness. First, the dynamic model of DC power flow calculation is used to simulate cascading failures. In order to adapt to random environment, we adopt the improved OPA calculation to realize load cutting and generator output adjustment process. Then the reinforcement learning algorithm is used to solve the optimal Markov strategy for multi-step attack. Based on IEEE 39 BUS system, we demonstrate the effectiveness of the algorithm. The experimental results show that the Double Q-learning algorithm has better effect in the random environment, and can finally achieve the purpose of paralyzing the power grid in a three-step topological sequence attack.
What problem does this paper attempt to address?