Q-learning-based Dynamic Optimal Allocation Algorithm for CPS Order of Interconnected Power Grids

YU Tao,WANG Yu-ming,LIU Qian-jin
DOI: https://doi.org/10.13334/j.0258-8013.pcsee.2010.07.010
2010-01-01
Abstract:The dynamic optimization of automatic generation control (AGC) order allocation based on the NERC’s control performance standard (CPS) is a problem on stochastic optimization in the AGC system for the interconnected power system. The CPS order allocation was discretized and viewed as a discrete time Markov decision process (DTMDP). The dynamic control method based on Q-learning was proposed. Reward functions in Q-learning were designed based on different optimization objectives. Thermal and hydro units were integrated, with the regulating margin for hydro units being considered, toimprove the regulating performance of the AGC system. The application of the Q-learning algorithm in the two-area load frequency control (LFC) model and China southern power grid model was presented, compared with the genetic algorithm and an engineering method. The case study shows that the Q-learning algorithm can enhance the robustness and adaptability of AGC systems in CPS assessment.
What problem does this paper attempt to address?