Optimal control with deep reinforcement learning for shunt compensations to enhance voltage stability

Shang Cao,Shiwu Liao,Shaorong Wang,Xiaotong Luo
DOI: https://doi.org/10.1109/ACPEE48638.2020.9136271
2020-01-01
Abstract:Due to the severe complexity caused by the expanding interconnection and the increasing penetration of power electronic devices, optimally control the shunt compensations is facing great challenges. Deep reinforcement learning (DRL) has tremendous potential in handling the complexity and nonlinearity in the power systems. But traditional DRL methods are difficult or even impossible to apply to tasks with large discrete action space, while the shunt compensation control exactly has extremely large discrete action space. To adopt DRL to the optimal control of shunt compensations, this paper proposed a new voltage control method based on the Wolpertinger Architecture of DRL. The proposed method embeds discrete action in a continuous space, use DDPG to produce a continuous action, and then use proximity algorithm to search the final action, which reduces the time complexity of action lookup. The proposed method was tested on IEEE 118 bus system with 620 discrete actions, and shows good performance enhancing the voltage stability.
What problem does this paper attempt to address?