Research on Unit Commitment Optimization Method Based on Deep Reinforcement Learning

CHEN Zhun,PAN Yi,FAN Shixiong,XU Dan,DING Qiang,CAI Zhi
DOI: https://doi.org/10.16543/j.2095-641x.electric.power.ict.2023.03.05
2023-01-01
Abstract:Aiming at the optimization problem of large-scale unit commitment in power grid, a deep reinforcement learning method combining pointer network and reinforcement learning is proposed. Firstly, the constraints of power system and thermal power units are fully considered, and the reinforcement learning environment of unit commitment with the minimum generation cost as the objective function is established; Secondly, in terms of optimization calculation,a deep reinforcement learning method combining pointer network and actor critical model is proposed, which forms a fast mapping from prediction data to unit startup and shutdown mode, so as to achieve the purpose of quickly solving unit commitment problems. The results for systems up to 10/200 units and 24 hours show that compared with the calculation results of traditional mathematical programming method, the method proposed in this paper can get the unit commitment results more quickly. By using pointer network as the policy network of reinforcement learning model, the ability of network feature extraction can be strengthened and the accuracy of calculation results can be improved.
What problem does this paper attempt to address?