Adaptive Dynamic Programming for Solving Non-Zero-Sum Differential Games.

Hongliang Li,Derong Liu,Ding Wang
DOI: https://doi.org/10.3182/20130902-3-CN-3020.00124
2013-01-01
Abstract:In this paper, a novel adaptive dynamic programming algorithm based on policy iteration is developed to solve online multi-player non-zero-sum differential game for continuoustime nonlinear systems. This algorithm is mathematically equivalent to the quasi-Newton's iteration in a Banach space. The implementation using neural networks is given, where a critic neural network is used to learn its value function, and an action neural network sharing the same parameters with the corresponding critic neural network is used to learn its optimal control policy for each player. All the critic and action neural networks are updated online in real-time and continuously. A simulation example is presented to demonstrate the effectiveness of the developed scheme. Copyright © 2013 IFAC.
What problem does this paper attempt to address?