Abstract: This paper proposes a deep reinforcement learning (DRL) approach to economic dispatch of combined heat and power (CHP) systems that adapts to different operating scenarios and significantly reduces computational complexity without affecting accuracy. CHP economic dispatch problems are typically modeled as a high-dimensional, non-smooth objective function with a large number of nonlinear constraints, which requires powerful optimization algorithms and considerable time to solve. To reduce the solution time, most engineering applications linearize the optimization objective and the device models. To avoid this complicated linearization process, this paper models the CHP economic dispatch problem as a Markov decision process (MDP), which keeps the model highly encapsulated and preserves the input and output characteristics of the various devices. Furthermore, we improve an advanced deep reinforcement learning algorithm, distributed proximal policy optimization (DPPO), to make it applicable to the CHP economic dispatch problem. With this algorithm, the agent is trained to explore optimal dispatch strategies for different operating scenarios and to respond efficiently to system emergencies. In the application phase, the trained agent generates the optimal control strategy in real time from the current system state. Compared with existing optimization methods, the advantages of the DRL method are mainly reflected in three aspects:
1) Adaptability: for a given network topology, the trained agent can handle the economic dispatch problem in various operating scenarios without recalculation.
2) High encapsulation: the user only needs to input the operating state to obtain the control strategy, whereas a conventional optimization algorithm requires the constraints and other formulas to be rewritten for each situation.
3) Time-scale flexibility: it can be applied to both day-ahead optimized scheduling and real-time control.
The proposed method is applied to two test systems with different characteristics. The results demonstrate that the DRL method can handle a variety of operating situations while achieving better optimization performance than most other algorithms.
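
The abstract names distributed proximal policy optimization (DPPO) without stating its update rule. As a rough illustration only, the sketch below computes the clipped surrogate objective of standard PPO, the quantity that PPO-style workers maximize. It does not reproduce the paper's modified algorithm: the function name `ppo_clipped_objective`, the clip range of 0.2, and the sample inputs are assumptions made for this example.

```python
import math


def ppo_clipped_objective(new_logps, old_logps, advantages, clip_eps=0.2):
    """Mean clipped surrogate objective of standard PPO (to be maximized).

    new_logps / old_logps: log-probabilities of the taken actions under the
    current and the data-collecting policy. advantages: advantage estimates.
    clip_eps is a hypothetical default, not a value taken from the paper.
    """
    total = 0.0
    for new_lp, old_lp, adv in zip(new_logps, old_logps, advantages):
        # Probability ratio between the current and the old policy.
        ratio = math.exp(new_lp - old_lp)
        # Restrict the ratio to [1 - eps, 1 + eps].
        clipped = max(1.0 - clip_eps, min(1.0 + clip_eps, ratio))
        # Take the more pessimistic of the two surrogate terms.
        total += min(ratio * adv, clipped * adv)
    return total / len(advantages)


if __name__ == "__main__":
    # Illustrative numbers only: one action made more likely, one less likely.
    new_lp = [math.log(0.6), math.log(0.2)]
    old_lp = [math.log(0.4), math.log(0.4)]
    print(ppo_clipped_objective(new_lp, old_lp, [1.0, -1.0]))
```

The clipping removes the incentive to move the policy far from the one that collected the data in a single update, which is what makes it practical for several workers to gather experience in parallel and share gradient updates.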

Power Management for Chiplet-Based Multicore Systems Using Deep Reinforcement Learning

Deep Reinforcement Learning-Based Power Management for Chiplet-Based Multicore Systems

Multi-core Chip Dynamic Power Management Framework Based on Reinforcement Learning

Improve the Stability and Robustness of Power Management through Model-free Deep Reinforcement Learning

An Efficient and Flexible Learning Framework for Dynamic Power and Thermal Co-Management

Multi-Core Power Management through Deep Reinforcement Learning

Online Power Management for Multi-Cores: A Reinforcement Learning Based Approach

A game-based deep reinforcement learning approach for energy-efficient computation in MEC systems

Optimizing Data Center Energy Efficiency Via Event-Driven Deep Reinforcement Learning

Autonomous Power Management With Double-Q Reinforcement Learning Method

Demand Charge Control for Energy-intensive Enterprises Based on Deep Reinforcement Learning

Modular reinforcement learning for self-adaptive energy efficiency optimization in multicore system

Q-DPM: an Efficient Model-Free Dynamic Power Management Technique

New Two-Stage Deep Reinforcement Learning for Task Admission and Channel Allocation of Wireless-Powered Mobile Edge Computing

Optimizing Energy Efficiency for Data Center via Parameterized Deep Reinforcement Learning

DeepEE: Joint Optimization of Job Scheduling and Cooling Control for Data Center Energy Efficiency Using Deep Reinforcement Learning

Energy-efficient edge intelligence for task-dependency MEC power grid networks

A Double Deep Q-Learning Model for Energy-Efficient Edge Scheduling

Combined heat and power system intelligent economic dispatch: A deep reinforcement learning approach

Cooperatively Improving Data Center Energy Efficiency Based on Multi-Agent Deep Reinforcement Learning

Deep Reinforcement Learning Based Energy Efficient Resource Allocation for Wireless Powered Edge Computing Network