A novel Z-function-based completely model-free reinforcement learning method to finite-horizon zero-sum game of nonlinear system

Zhe Chen,Wenqian Xue,Ning Li,Bosen Lian,Frank L. Lewis
DOI: https://doi.org/10.1007/s11071-021-07049-z
IF: 5.741
2022-01-09
Nonlinear Dynamics
Abstract:This paper addresses the finite-horizon two-player zero-sum game for the continuous-time nonlinear system by defining a novel Z-function and proposing a completely model-free reinforcement learning (RL)-based method with reduced dimension of the basis functions. First, a model-based RL policy iteration framework is raised for reducing the order of the Hamiltonian–Jacobi–Isaacs (HJI) equation and strengthening the anti-interference capability and efficiency. This provides the basic framework for model-free algorithms. A partially model-free algorithm is then developed by applying integral RL and iterative learning control techniques to further simplify the solution seeking and remove the need for system dynamics on value function update. An integral Bellman equation is considered. The value function for the HJI equation is evaluated by a critic neural network with time-variant weights and state-dependent basis functions. In order to realize completely model-free learning, a novel Z-function is finally defined and a completely model-free algorithm is thus proposed to further remove the need for system dynamics on input update. Sufficient convergence and stability analysis is provided. The corresponding simulation results are shown to verify the validity of this algorithm.
engineering, mechanical,mechanics
What problem does this paper attempt to address?