Optimal Coordination of Distributed Energy Resources Using Deep Deterministic Policy Gradient

Avijit Das, Di Wu
DOI: https://doi.org/10.1109/EESAT55007.2022.9998046
2022-11-08
Abstract: Recent studies have shown that reinforcement learning (RL) is a promising approach for coordination and control of distributed energy resources (DERs) under uncertainties. Many existing RL approaches, including Q-learning and approximate dynamic programming, are based on lookup table methods, which become inefficient when the problem size is large and infeasible when continuous states and actions are involved. In addition, when modeling battery energy storage systems (BESSs), the loss of life is not reasonably considered in the decision-making process. This paper proposes an innovative deep RL method for DER coordination considering BESS degradation. The proposed deep RL is designed based on an adaptive actor-critic architecture and employs an off-policy deterministic policy gradient method for determining the dispatch operation that minimizes the operation cost and BESS loss of life. Case studies were performed to validate the proposed method and demonstrate the effects of incorporating degradation models into control design.
Environmental Science, Engineering, Computer Science