Off-Policy Integral Reinforcement Learning Method to Solve Nonlinear Continuous-Time Multiplayer Nonzero-Sum Games.

Ruizhuo Song, Frank L. Lewis, Qinglai Wei
DOI: https://doi.org/10.1109/TNNLS.2016.2582849
IF: 14.255
2017-01-01
IEEE Transactions on Neural Networks and Learning Systems
Abstract: This paper establishes an off-policy integral reinforcement learning (IRL) method to solve nonlinear continuous-time (CT) nonzero-sum (NZS) games with unknown system dynamics. The IRL algorithm is presented to obtain the iterative control and off-policy learning is used to allow the dynamics to be completely unknown. Off-policy IRL is designed to do policy evaluation and policy improvement in the ...