A Low-Carbon Economic Dispatch Method for Power Systems with Carbon Capture Plants Based on Safe Reinforcement Learning

Qian Wang,Xueguang Zhang,Ying Xu,Zhongkai Yi,Dianguo Xu
DOI: https://doi.org/10.1109/tii.2024.3396355
IF: 12.3
2024-01-01
IEEE Transactions on Industrial Informatics
Abstract:To address the high-dimensional and complex scheduling issues in the low-carbon economic dispatch (LCED) with carbon capture plants, in this article, we propose a novel safe reinforcement learning (SRL) based on heterogeneous action space representation, which can make fast decisions for both optimal power flow and carbon capture operation. First, SRL is designed based on the feasible set to ensure that the dispatch results continuously remain within the preset range. Then, to tackle the problem of having a large number of discrete and continuous variables in the LCED, this article employs a parameterized Markov process to represent these discrete-continuous actions and uses a conditional variational autoencoder to depict heterogeneous space. To learn the correlation between discrete and continuous action spaces, a mechanism for approximating action space based on small-sample behavior cloning is proposed, and a method based on dynamic time warping for calculating environment similarity is designed for determining the value of the regularization term. Finally, numerical simulations validate the superiority and scalability of the proposed method in enhancing decision-making efficiency and promoting the low-carbon economic operation of the power system.
What problem does this paper attempt to address?