Synchronous Optimal Control Method for Nonlinear Systems with Saturating Actuators and Unknown Dynamics Using Off-Policy Integral Reinforcement Learning

Zenglian Zhang,Ruizhuo Song,Min Cao
DOI: https://doi.org/10.1016/j.neucom.2019.04.036
IF: 6
2019-01-01
Neurocomputing
Abstract:The present study establishes an approximate optimal critic learning algorithm, based on the single-network integral reinforcement learning (IRL) algorithm and intends to solve the optimal control problem for an unknown nonlinear system with saturating actuators. The value function is formulated through building generalized nonquadratic functions. In order to solve the Hamilton–Jacobi–Bellman (HJB) equation, a novel optimal scheme for the control approximation, based on the off-policy iteration is presented. Moreover, the single-neural network implementation procedure is introduced to complete the iteration algorithm. The synchronous IRL policy iteration is proposed to update the weight of the critic neural network. Finally, reasonable simulation results are provided for confirming the effectiveness of the proposed optimal approximation control technique in solving equations for a linear and oscillating systems.
What problem does this paper attempt to address?