A Deep Reinforcement Learning-Based Bidding Strategy for Participants in a Peer-to-peer Energy Trading Scenario

Yiqun Wang, Qingyu Yang, Donghe Li
DOI: https://doi.org/10.3389/fenrg.2022.1017438
IF: 2.847
2023-01-01
Journal of Renewable and Sustainable Energy
Abstract: With the large-scale integration of distributed energy resources, an increasing number of users have become prosumers who produce, store, and consume electric energy. Peer-to-peer (P2P) energy trading, a new mechanism that allows direct energy transactions between prosumers, is becoming increasingly widespread. A crucial open problem is how to determine a trading strategy for prosumers participating in P2P energy trading that satisfies multiple optimization objectives simultaneously. To this end, this paper introduces a demand response mechanism and applies a dissatisfaction function to represent the electricity consumption of prosumers. The mid-market rate price is adopted to attract more prosumers to participate in P2P energy trading. The P2P energy trading process among multiple prosumers in a community is formulated as a Markov decision process. We design a deep reinforcement learning (DRL) method to solve for the optimal trading policy of prosumers. Through continual interaction with the environment, DRL autonomously learns the optimal strategies. Additionally, the deep deterministic policy gradient algorithm is well suited to the continuous and intricate decision problems that arise in the P2P energy trading market. By carefully constructing the reinforcement learning environment, this paper achieves multi-objective collaborative optimization. Simulation results show that the proposed algorithm and model reduce costs by 16.5% compared with direct trading between prosumers and the grid, and effectively decrease the dependence of prosumers on the main grid.
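The mid-market rate idea mentioned in the abstract can be illustrated with a minimal sketch. It assumes the commonly used definition in which the P2P price is the midpoint between the grid retail price and the feed-in tariff; the abstract does not state the paper's exact pricing rule, which may also adjust for supply and demand imbalance. The function names and price values below are hypothetical.

```python
def mid_market_rate(grid_retail_price: float, feed_in_tariff: float) -> float:
    """Return the midpoint between the grid retail price and the feed-in tariff.

    Assumption: this is the balanced-market form of mid-market rate pricing,
    not necessarily the exact rule used in the paper.
    """
    if feed_in_tariff > grid_retail_price:
        raise ValueError("feed-in tariff is expected not to exceed the retail price")
    return (grid_retail_price + feed_in_tariff) / 2.0


def p2p_benefit_per_kwh(grid_retail_price: float, feed_in_tariff: float) -> tuple[float, float]:
    """Return (buyer saving, seller gain) per kWh relative to trading with the grid."""
    price = mid_market_rate(grid_retail_price, feed_in_tariff)
    buyer_saving = grid_retail_price - price  # buyer pays less than the retail price
    seller_gain = price - feed_in_tariff      # seller earns more than the feed-in tariff
    return buyer_saving, seller_gain


# Hypothetical prices in currency units per kWh.
p2p_price = mid_market_rate(0.5, 0.25)
saving, gain = p2p_benefit_per_kwh(0.5, 0.25)
```

Under this assumption both sides of a P2P trade do at least as well as they would trading with the grid, which is the incentive the abstract refers to when it says the price attracts more prosumers.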