Safe reward‐based deep reinforcement learning control for an electro‐hydraulic servo system

Minling Wu,Lijun Liu,Zhen Yu,Weizhou Li
DOI: https://doi.org/10.1002/rnc.6235
IF: 3.8973
2022-06-12
International Journal of Robust and Nonlinear Control
Abstract:In this article, a safe deep reinforcement learning (DRL) control method based on a safe reward shaping method is proposed and applied to the constrained control for an electro‐hydraulic servo system (EHSS). The proposed control method improves the safety of the constrained control for a nonlinear system with the minimal intervention to the optimization of the performance objective, while the convergence speed of the DRL process has accelerated. By introducing control barrier functions (CBFs) to the reward shaping, a CBF‐based potential difference term is designed to shape the safe reward, which not only provides the safe guidance for the DRL process by encoding the safety constraints of the nonlinear system, but also considers effects of the complex safety transformation on the convergence process in the DRL. Then the safe reward‐based DRL control method is presented to learn the optimal safety policy of position tracking for the EHSS with position error constraints by planning and optimizing the safety together with the performance objective. Theoretical analysis is given to demonstrate that the proposed control method with the safe reward can achieve the optimal safety performance for the constrained control system. Experimental results of the constrained control for the EHSS with system uncertainties and perturbations are also exhibited, to show that the proposed control method converges fast and performs safer and better than the conventional control methods.
automation & control systems,engineering, electrical & electronic,mathematics, applied
What problem does this paper attempt to address?