Abstract:The objective of the authors' study is to investigate a dual‐layer optimization model for the economic viability and operational stability of distribution system operators and virtual power plants. To address this issue, the authors have introduced a parameter‐sharing deep reinforcement learning algorithm. Through case studies and comparisons with traditional model‐based algorithms and heuristic algorithms, the authors have demonstrated the excellent performance of their proposed method in terms of both efficiency and solution accuracy. The authors' aim is to enhance the benefits and operational efficiency for market participants while providing valuable insights for decision‐makers and researchers in this field. With the increasing integration of distributed energy resources (DERs) into distribution systems, the optimization of system operation has become complex, facing challenges such as inadequate consideration of market participants' benefits, poor computational efficiency, and data privacy concerns. This paper introduces the concept of a virtual power plant (VPP) as a solution for energy integration and management. To strike a balance between operational safety and the interests of market participants, a dual‐layer model is proposed. This model considers the benefits of both Distribution System Operators (DSO) and VPP, while also enhancing the consideration of distribution network constraints. The DSO considers AC optimal power flow and utilizes penalty functions to ensure network security in case of violations. To enhance computational efficiency and privacy, the paper presents the parameter‐sharing twin delayed deep deterministic policy gradient approach. This approach allows multiple intelligent agents to share a neural network model, effectively reducing the computational load. During the training process, only essential data is exchanged among the agents, ensuring the privacy of sensitive information. The effectiveness of the proposed model and the algorithm is validated through a case study on an IEEE 33‐node system.

Target-Value-Competition-Based Multi-Agent Deep Reinforcement Learning Algorithm for Distributed Nonconvex Economic Dispatch

Multi-agent Deep Reinforcement Learning Algorithm for Distributed Economic Dispatch in Smart Grid.

Multi-Agent Reinforcement Learning-based Distributed Economic Dispatch Considering Network attacks and Uncertain Costs

Distributed optimization for economic power dispatch with event-triggered communication

Observer-Based Multiagent Deep Reinforcement Learning: A Fully Distributed Training Scheme

A Distributed Consensus Based Algorithm for Economic Dispatch over Time‐varying Digraphs

Multiagent-Based Reinforcement Learning for Optimal Reactive Power Dispatch.

Distributed Multiagent Reinforcement Learning With Action Networks for Dynamic Economic Dispatch

Consensus Based Distributed Reinforcement Learning For Nonconvex Economic Power Dispatch In Microgrids

Distributed optimization of electricity-Gas-Heat integrated energy system with multi-agent deep reinforcement learning

Collaborative operation optimization of distribution system and virtual power plants using multi‐agent deep reinforcement learning with parameter‐sharing mechanism

A Reinforcement Learning Algorithm Based on Neural Network for Economic Dispatch

Optimal Bi-Level Bidding and Dispatching Strategy between Active Distribution Network and Virtual Alliances Using Distributed Robust Multi-Agent Deep Reinforcement Learning

Distributed $Q$ -Learning-based Online Optimization Algorithm for Unit Commitment and Dispatch in Smart Grid

Submucosal Dermoid Cyst of The Rectum in an Adult Female (a case report)

E$^2$ DNet: An Ensembling Deep Neural Network for Solving Nonconvex Economic Dispatch in Smart Grid

An Improved Deep Reinforcement Learning Method for Dispatch Optimization Strategy of Modern Power Systems

A Deep Reinforcement Learning Approach to Efficient Distributed Optimization

An adaptive active power rolling dispatch strategy for high proportion of renewable energy based on distributed deep reinforcement learning

Deep reinforcement learning based multi-level dynamic reconfiguration for urban distribution network: A cloud-edge collaboration architecture

Multi-Objective Interval Optimization Dispatch of Microgrid Via Deep Reinforcement Learning