Multi-objective Dynamic Optimal Dispatch Method for CPS Order of Interconnected Power Grids Using Improved Hierarchical Reinforcement Learning

YU Tao,WANG Yuming,YE Wenjia,LIU Qianjin
DOI: https://doi.org/10.13334/j.0258-8013.pcsee.2011.19.013
2011-01-01
Abstract:This paper presented an improved hierarchical reinforcement learning (HRL) algorithm to solve the curse of dimensionality problem in the multi-objective dynamic optimization of automatic generation control (AGC) order dispatch based on control performance standard (CPS). The CPS order dispatch task was decomposed into several subtasks by classifying the AGC committed units according to their response time delay of power regulating. A time-varying coordination factor was introduced between layers of HRL to speed up the algorithm. Numbers of linear combination of weights in reward function were designed to optimize hydro capacity margin and AGC production cost. The application of improved hierarchical Q-learning in the China southern power grid model shows that the proposed method can speed up the algorithm by 47%, enhance the performance of AGC systems in CPS assessment, and save AGC production cost over 5%, compared with the hierarchical Q-learning and genetic algorithm. © 2011 Chin. Soc. for Elec. Eng.
What problem does this paper attempt to address?