Deep Reinforcement Learning for Community Battery Scheduling under Uncertainties of Load, PV Generation, and Energy Prices

Jiarong Fan, Hao Wang
2023-12-04
Abstract: In response to the growing uptake of distributed energy resources (DERs), community batteries have emerged as a promising solution to support renewable energy integration, reduce peak load, and enhance grid reliability. This paper presents a deep reinforcement learning (RL) strategy, centered around the soft actor-critic (SAC) algorithm, to schedule a community battery system in the presence of uncertainties, such as solar photovoltaic (PV) generation, local demand, and real-time energy prices. We position the community battery to play a versatile role in integrating local PV energy, reducing peak load, and exploiting energy price fluctuations for arbitrage, thereby minimizing the system cost. To improve exploration and convergence during RL training, we utilize the noisy network technique. This paper conducts a comparative study of different RL algorithms, including proximal policy optimization (PPO) and deep deterministic policy gradient (DDPG) algorithms, to evaluate their effectiveness in the community battery scheduling problem. The results demonstrate the potential of RL in addressing community battery scheduling challenges and show that the SAC algorithm achieves the best performance compared to RL and optimization benchmarks.
Machine Learning, Artificial Intelligence, Optimization and Control
What problem does this paper attempt to address?
This paper addresses the problem of scheduling a community battery system under several sources of uncertainty: solar photovoltaic (PV) generation, local demand, and real-time energy prices. The authors propose a deep reinforcement learning (DRL) method, built on the soft actor-critic (SAC) algorithm, to learn the scheduling policy. With this policy, the community battery integrates local PV energy, reduces peak load, and exploits energy price fluctuations for arbitrage, thereby minimizing the system cost. To improve exploration and convergence during training, the study also adopts the noisy network technique. The paper evaluates the proposed SAC approach against other reinforcement learning algorithms, namely proximal policy optimization (PPO) and deep deterministic policy gradient (DDPG), and against model-based optimization methods. The experimental results show that SAC achieves the best performance on the community battery scheduling problem.
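To make the scheduling problem concrete, here is a minimal sketch of a single decision interval as an RL agent might see it. This is not the paper's model: the function name `step`, the parameter values (capacity, power rating, efficiency, interval length), the single price for import and export, and the reward defined as negative energy cost are all assumptions made for illustration. Peak-load charges are left out.

```python
from dataclasses import dataclass


@dataclass
class BatteryState:
    soc_kwh: float  # energy currently stored in the battery


def step(state, action_kw, pv_kw, load_kw, price,
         capacity_kwh=100.0, max_power_kw=50.0,
         efficiency=0.95, dt_h=0.5):
    """Advance one interval (hypothetical model, not the paper's).

    action_kw > 0 charges the battery, action_kw < 0 discharges it.
    Returns (next_state, grid_kw, reward).
    """
    # Clip the requested power to the assumed power rating.
    p = max(-max_power_kw, min(max_power_kw, action_kw))

    if p >= 0:
        # Charging: limited by the remaining storage headroom.
        headroom_kw = (capacity_kwh - state.soc_kwh) / (efficiency * dt_h)
        p = min(p, headroom_kw)
        new_soc = state.soc_kwh + p * efficiency * dt_h
    else:
        # Discharging: limited by the energy that can be delivered.
        available_kw = state.soc_kwh * efficiency / dt_h
        p = max(p, -available_kw)
        new_soc = state.soc_kwh + p * dt_h / efficiency

    # Net exchange with the grid; positive means import.
    grid_kw = load_kw - pv_kw + p

    # Assumption: exports are paid at the same price as imports.
    cost = price * grid_kw * dt_h
    return BatteryState(new_soc), grid_kw, -cost


next_state, grid_kw, reward = step(
    BatteryState(soc_kwh=50.0), action_kw=20.0,
    pv_kw=10.0, load_kw=30.0, price=0.2,
)
```

An agent such as SAC would choose `action_kw` from observations of the state of charge, PV output, load, and price, and would be trained to maximize the cumulative reward, which under these assumptions is the same as minimizing the energy cost.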