Peer-to-Peer Energy Transactions for Prosumers Based on Improved Deep Deterministic Policy Gradient Algorithm

Hao Xiao, Xiaowei Pu, Wei Pei, Li Ma
DOI: https://doi.org/10.1109/tsg.2024.3419122
IF: 10.275
2024-01-01
IEEE Transactions on Smart Grid
Abstract: As the power market evolves, prosumers increasingly participate in energy trading both to consume renewable energy and to maximize financial gains, a trend that is shaping future energy market development. However, finding the optimal energy trading strategy for prosumers in a complex peer-to-peer (P2P) market environment remains a major challenge because of incomplete information and a large decision space. To address this problem, this paper proposes an improved deep deterministic policy gradient (IDDPG) algorithm to optimize the P2P energy trading and operation strategy of prosumers. The proposed IDDPG algorithm uses a wave-attention network to model the external interactive environment of each prosumer, which reduces the complexity of the multi-agent trading environment. Moreover, a priority sampling mechanism is introduced to learn purposefully from valuable experiences, and a cyclic learning rate is adopted to avoid local optima. Finally, the feasibility of the method is verified through case studies of P2P trading with prosumer groups of different scales. The numerical results show that the proposed IDDPG algorithm outperforms traditional reinforcement learning algorithms in both computational efficiency and convergence, and provides a reference for the P2P transaction strategy of prosumers.
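The abstract names two training devices, priority sampling and a cyclic learning rate, without giving their exact form. The following is a minimal sketch of one common variant of each, not the authors' implementation: it assumes a triangular learning-rate cycle and sampling proportional to the magnitude of the temporal-difference (TD) error. All names and default values (`cyclic_learning_rate`, `PrioritizedReplayBuffer`, `alpha`, `half_cycle`, and so on) are hypothetical and chosen only for illustration.

```python
import random


def cyclic_learning_rate(step, base_lr=1e-4, max_lr=1e-3, half_cycle=1000):
    """Triangular schedule (assumed form): the rate rises linearly from
    base_lr to max_lr over half_cycle steps, then falls back, and repeats."""
    cycle_pos = step % (2 * half_cycle)
    if cycle_pos <= half_cycle:
        frac = cycle_pos / half_cycle
    else:
        frac = 2 - cycle_pos / half_cycle
    return base_lr + (max_lr - base_lr) * frac


class PrioritizedReplayBuffer:
    """Replay buffer that samples transitions with probability proportional
    to (|TD error| + eps) ** alpha (an assumed prioritization rule)."""

    def __init__(self, capacity, alpha=0.6, eps=1e-6):
        self.capacity = capacity
        self.alpha = alpha      # 0 gives uniform sampling, 1 gives fully proportional
        self.eps = eps          # keeps zero-error transitions sampleable
        self.transitions = []
        self.priorities = []
        self.next_index = 0     # slot to overwrite once the buffer is full

    def add(self, transition, td_error):
        priority = (abs(td_error) + self.eps) ** self.alpha
        if len(self.transitions) < self.capacity:
            self.transitions.append(transition)
            self.priorities.append(priority)
        else:
            # Overwrite the oldest stored transition.
            self.transitions[self.next_index] = transition
            self.priorities[self.next_index] = priority
        self.next_index = (self.next_index + 1) % self.capacity

    def sample(self, batch_size, rng=random):
        # Draw with replacement, weighted by stored priority.
        indices = rng.choices(
            range(len(self.transitions)), weights=self.priorities, k=batch_size
        )
        return [self.transitions[i] for i in indices]
```

In a DDPG-style training loop, a sketch like this would be used by storing each transition with its current TD error, drawing minibatches with `sample`, and setting the optimizer's learning rate to `cyclic_learning_rate(step)` at every update. Periodically raising the rate is one way to help the policy escape poor local optima, which is the motivation the abstract gives.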