Off-Policy Actor-Critic Structure for Optimal Control of Unknown Systems with Disturbances

Ruizhuo Song, Qinglai Wei, Qing Li
DOI: https://doi.org/10.1109/TCYB.2015.2421338
IF: 11.8
2018-01-01
IEEE Transactions on Cybernetics
Abstract: An optimal control method is developed for unknown continuous-time systems with unknown disturbances. An integral reinforcement learning (IRL) algorithm is presented to obtain the iterative control. Off-policy learning is used so that the system dynamics can be completely unknown. Neural networks are used to construct the critic and action networks. It is shown that, when unknown disturbances are present, off-policy IRL may fail to converge or may be biased. To reduce the influence of the unknown disturbances, a disturbance compensation controller is added. Using Lyapunov techniques, the weight errors are proven to be uniformly ultimately bounded. Convergence of the Hamiltonian function is also proven. A simulation study demonstrates the effectiveness of the proposed optimal control method for unknown systems with disturbances.
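To make the IRL idea in the abstract concrete, the following is a minimal Python sketch of the policy-evaluation step only. It is not the paper's algorithm: it is on-policy, disturbance-free, and uses a single quadratic weight in place of the critic and action neural networks. The scalar linear system x' = a*x + b*u, the fixed policy u = -k*x, the cost q*x^2 + r*u^2, and the function name irl_policy_evaluation are all assumptions introduced here for illustration. The sketch fits p in V(x) = p*x^2 from the interval relation V(x(t)) - V(x(t+T)) = integral of the running cost over [t, t+T], using only sampled states and costs.

```python
import math


def irl_policy_evaluation(a, b, k, q, r, x0=1.0, T=0.1, n_intervals=20, n_sub=200):
    """Estimate p in V(x) = p * x**2 for the fixed policy u = -k * x.

    Hypothetical helper for illustration. The model (a, b) is used only to
    generate trajectory samples; the estimate itself uses states and costs.
    """
    a_cl = a - b * k          # closed-loop pole, needed only to simulate data
    dt = T / n_sub            # sampling step inside each learning interval
    num = 0.0
    den = 0.0
    for i in range(n_intervals):
        # Sampled state trajectory on [i*T, (i+1)*T] (exact, since the system is linear).
        xs = [x0 * math.exp(a_cl * (i * T + j * dt)) for j in range(n_sub + 1)]
        # Running cost q*x^2 + r*u^2 along the trajectory.
        costs = [q * x * x + r * (-k * x) ** 2 for x in xs]
        # Integral reinforcement over the interval (trapezoidal rule).
        integral = dt * (sum(costs) - 0.5 * (costs[0] + costs[-1]))
        # Regressor: x(t)^2 - x(t+T)^2, so that p * phi = integral.
        phi = xs[0] ** 2 - xs[-1] ** 2
        num += phi * integral
        den += phi * phi
    # Scalar least-squares solution over all intervals.
    return num / den


if __name__ == "__main__":
    p_hat = irl_policy_evaluation(a=1.0, b=1.0, k=3.0, q=1.0, r=1.0)
    # For comparison, the model-based Lyapunov solution p = -(q + r*k^2) / (2*(a - b*k)).
    p_ref = -(1.0 + 1.0 * 3.0 ** 2) / (2.0 * (1.0 - 1.0 * 3.0))
    print(p_hat, p_ref)
```

In this sketch the estimate never uses a or b directly, which is the property IRL-style methods exploit. Handling off-policy data and unknown disturbances, as the abstract describes, requires additional terms that are not shown here.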