Abstract:This paper proposed a Deep Reinforcement learning (DRL) approach for Combined Heat and Power (CHP) system economic dispatch which obtain adaptability for different operating scenarios and significantly decrease the computational complexity without affecting accuracy. In the respect of problem description, a vast of Combined Heat and Power (CHP) economic dispatch problems are modeled as a high-dimensional and non-smooth objective function with a large number of non-linear constraints for which powerful optimization algorithms and considerable time are required to solve it. In order to reduce the solution time, most engineering applications choose to linearize the optimization target and devices model. To avoid complicated linearization process, this paper models CHP economic dispatch problems as Markov Decision Process (MDP) that making the model highly encapsulated to preserve the input and output characteristics of various devices. Furthermore, we improve an advanced deep reinforcement learning algorithm: distributed proximal policy optimization (DPPO), to make it applicable to CHP economic dispatch problem. Based on this algorithm, the agent will be trained to explore optimal dispatch strategies for different operation scenarios and respond to system emergencies efficiently. In the utility phase, the trained agent will generate optimal control strategy in real time based on current system state. Compared with existing optimization methods, advantages of DRL methods are mainly reflected in the following three aspects: 1) Adaptability: under the premise of the same network topology, the trained agent can handle the economic scheduling problem in various operating scenarios without recalculation. 2) High encapsulation: The user only needs to input the operating state to get the control strategy, while the optimization algorithm needs to re-write the constraints and other formulas for different situations. 3) Time scale flexibility: It can be applied to both the day-ahead optimized scheduling and the real-time control. The proposed method is applied to two test system with different characteristics. The results demonstrate that the DRL method could handle with varieties of operating situations while get better optimization performance than most of other algorithms.

Multi-objective Dynamic Optimal Dispatch Method for CPS Order of Interconnected Power Grids Using Improved Hierarchical Reinforcement Learning

Optimal CPS Command Dispatch Based on Hierarchically Correlated Equilibrium Reinforcement Learning

Stochastic Optimal Generation Command Dispatch Based on Improved Hierarchical Reinforcement Learning Approach

Q-learning-based Dynamic Optimal Allocation Algorithm for CPS Order of Interconnected Power Grids

Target-Value-Competition-Based Multi-Agent Deep Reinforcement Learning Algorithm for Distributed Nonconvex Economic Dispatch

Multi-agent Deep Reinforcement Learning Algorithm for Distributed Economic Dispatch in Smart Grid.

Hierarchical Correlated Q-Learning For Multi-Layer Optimal Generation Command Dispatch

Q-Learning Approach for Hierarchical Agc Scheme of Interconnected Power Grids

Hierarchical Multi-Agent Deep Reinforcement Learning for Multi-Objective Dispatching in Smart Grid

Q-learning Based Dynamic Optimal CPS Control Methodology for Interconnected Power Systems

Multi-step backtrack Q-learning based dynamic optimal algorithm for auto generation control order dispatch

Combined heat and power system intelligent economic dispatch: A deep reinforcement learning approach

Stochastic Optimal CPS Relaxed Control Methodology for Interconnected Power Systems Using Q-Learning Method

Multi-Agent Deep Reinforcement Learning for Sectional AGC Dispatch

Data-driven Optimal Dynamic Dispatch for Hydro-PV-PHS Integrated Power Systems Using Deep Reinforcement Learning Approach

Multiagent-Based Reinforcement Learning for Optimal Reactive Power Dispatch.

A Cooperative Dispatch Algorithm for Hydrogen-Based Grid-Connection Microgrids: A Multi-Agent Reinforcement Learning Method

Deep Interactive Teaching-Learning Optimization Algorithm for Generation Command Dispatch of AGC with High-Penetration Electric Vehicles

A Deep Reinforcement Learning Algorithm for the Power Order Optimization Allocation of AGC in Interconnected Power Grids

Offline economic dispatch for multi-area power system via hierarchical reinforcement learning