Deep reinforcement learning for PMSG wind turbine control via twin delayed deep deterministic policy gradient (TD3)

Darkhan Zholtayev,Matteo Rubagotti,Ton Duc Do
DOI: https://doi.org/10.1002/oca.3129
2024-04-08
Optimal Control Applications and Methods
Abstract:This article describes the implementation and training of the twin delayed deep deterministic policy gradient (TD3) for maximum power point tracking in wind energy conversion systems that use permanent magnet synchronous generators (PMSGs). Simulation results are provided, including a comparison with a model‐based control method based on feedback linearization and linear‐quadratic regulation. The proposed TD3‐based controller achieves satisfactory control performance and is more robust to PMSG parameter variations as compared to the presented model‐based method. This article proposes the use of a deep reinforcement learning method—and precisely a variant of the deep deterministic policy gradient (DDPG) method known as twin delayed DDPG, or TD3—for maximum power point tracking in wind energy conversion systems that use permanent magnet synchronous generators (PMSGs). An overview of the TD3 algorithm is provided, together with a detailed description of its implementation and training for the considered application. Simulation results are provided, also including a comparison with a model‐based control method based on feedback linearization and linear‐quadratic regulation. The proposed TD3‐based controller achieves a satisfactory control performance and is more robust to PMSG parameter variations as compared to the presented model‐based method. To the best of the authors' knowledge, this article presents for the first time an approach for generating both speed and current control loops using DRL for wind energy conversion systems.
automation & control systems,operations research & management science,mathematics, applied
What problem does this paper attempt to address?