Integral Policy Iteration for Zero-Sum Games with Completely Unknown Nonlinear Dynamics

Hongliang Li,Derong Liu,Ding Wang
DOI: https://doi.org/10.1007/978-3-642-42054-2_29
2013-01-01
Abstract:In this paper, we develop a model-free integral policy iteration algorithm to learn online the Nash equilibrium solution of two-player zero-sum differential games with completely unknown nonlinear continuous-time dynamics. The developed algorithm updates value function, control and disturbance policies simultaneously. To implement this algorithm, three neural networks are used to approximate the game value function, the control policy and the disturbance policy. The least squares method is used to estimate the unknown parameters of the neural networks. The effectiveness of the developed scheme is demonstrated by a simulation example.
What problem does this paper attempt to address?