Data-Driven Integral Reinforcement Learning for Continuous-Time Non-Zero-Sum Games

Yongliang Yang, Liming Wang, Hamidreza Modares, Dawei Ding, Yixin Yin, Donald Wunsch
DOI: https://doi.org/10.1109/access.2019.2923845
IF: 3.9
2019-01-01
IEEE Access
Abstract: This paper develops an integral value iteration (VI) method that finds, online, the Nash equilibrium solution of two-player non-zero-sum (NZS) differential games for linear systems with partially unknown dynamics. To guarantee closed-loop stability at the Nash equilibrium, an explicit upper bound on the discount factor is given. To show the efficacy of the presented online model-free solution, the integral VI method is compared with a model-based offline policy iteration method. A detailed theoretical analysis of the integral VI algorithm covers three aspects: positive definiteness of the updated cost functions, stability of the closed-loop system, and the conditions that guarantee monotone convergence. Finally, simulation results demonstrate the efficacy of the presented algorithms.
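To make the model-based offline policy iteration baseline mentioned in the abstract more concrete, here is a minimal sketch for a scalar two-player NZS linear-quadratic game with a discounted cost. It is an illustration under stated assumptions, not the authors' algorithm: the scalar dynamics, the quadratic value functions V_i(x) = p_i x^2, the function names, and the numeric parameters in the usage note are all hypothetical, and the paper's integral VI method itself is not reproduced here.

```python
def policy_iteration_scalar_nzs(a, b, q, r, gamma, iters=100):
    """Policy iteration for a scalar two-player NZS LQ game (illustrative only).

    Assumed dynamics: dx/dt = a*x + b[0]*u1 + b[1]*u2, with feedback u_i = -k[i]*x.
    Assumed cost of player i:
        integral of exp(-gamma*t) * (q[i]*x^2 + r[i][0]*u1^2 + r[i][1]*u2^2) dt.
    Returns the value coefficients p (V_i = p[i]*x^2) and the gains k.
    """
    k = [0.0, 0.0]  # initial policies; admissible only if a < gamma / 2
    p = [0.0, 0.0]
    for _ in range(iters):
        a_c = a - b[0] * k[0] - b[1] * k[1]  # closed-loop pole
        # A finite discounted cost only needs a_c < gamma/2, which is weaker
        # than stability (a_c < 0); hence the interest in bounding gamma.
        if a_c >= gamma / 2:
            raise ValueError("discounted cost is unbounded for these gains")
        # Policy evaluation: scalar discounted Lyapunov equation
        #   (2*a_c - gamma)*p_i + q_i + r_i1*k1^2 + r_i2*k2^2 = 0
        p = [
            (q[i] + r[i][0] * k[0] ** 2 + r[i][1] * k[1] ** 2) / (gamma - 2 * a_c)
            for i in range(2)
        ]
        # Policy improvement: each player minimizes its own Hamiltonian
        k = [b[i] * p[i] / r[i][i] for i in range(2)]
    return p, k


def coupled_are_residual(a, b, q, r, gamma, p):
    """Residual of the coupled scalar Riccati equations at coefficients p."""
    k = [b[i] * p[i] / r[i][i] for i in range(2)]
    a_c = a - b[0] * k[0] - b[1] * k[1]
    return [
        q[i] + r[i][0] * k[0] ** 2 + r[i][1] * k[1] ** 2 + (2 * a_c - gamma) * p[i]
        for i in range(2)
    ]
```

As a usage example with made-up numbers, calling policy_iteration_scalar_nzs(-1.0, [1.0, 1.0], [1.0, 1.0], [[1.0, 0.0], [0.0, 1.0]], 0.1) and passing the returned p to coupled_are_residual checks whether the iteration has settled on a solution of the coupled Riccati equations. Convergence of this kind of iteration is not guaranteed for arbitrary game parameters, so the residual check is worth keeping.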