Deep Reinforcement Learning-Based Scheduling Optimization of a Wind-solar Coupled CHP Unit System Considering Source-load Uncertainty

Baohua Wang,Xinglong Gao,Xianlian Wang,Zhixiao Wang,Li Sun,Shan Hua
DOI: https://doi.org/10.1109/fasta61401.2024.10595106
2024-01-01
Abstract:Promoting the electric-thermal decoupling in combined heat and power (CHP) units is an effective way to enhance their peak-regulation capability and facilitate the integration of renewable energy. In order to address the problems of flexibility and source-load uncertainty in the operation of the CHP unit, this paper studies the deep reinforcement learning (DRL)-based efficient scheduling problem of a wind-solar coupled CHP unit system. The problem is optimized under the objective of comprehensive income for energy supply and the constraints of safe operation and supply-demand balance. Based on system operation characteristics in uncertain environments, a Markov decision process (MDP) model is designed so as to transform the scheduling optimization into a sequential decision problem. To achieve efficient and intelligent scheduling of the CHP unit, the co-optimization of electricity and heat decisions is achieved by a continuous DRL algorithm, twin delayed deep deterministic policy gradient (TD3). Simulation results under a typical day scenario in the heating season show that TD3 algorithm can promote the consumption of renewable energy and ensure the sustainable scheduling capability of the thermal storage device while satisfying power balances. Furthermore, based on the well-trained neural network, the system exhibits near-theoretically optimal results with rapid single-step scheduling duration, effectively enhancing the system’s adaptability to uncertainty.
What problem does this paper attempt to address?