Game of Marine Robots: USV Pursuit Evasion Game Using Online Reinforcement Learning

Yongkang Wang,Yong Wang,Rongxin Cui,Xinxin Guo,Weisheng Yan
DOI: https://doi.org/10.1109/ICDL55364.2023.10364460
2023-01-01
Abstract:In this article, an online reinforcement learning (RL) algorithm is studied for the pursuit evasion game of Unmanned Surface Vehicles (USVs), both of which have learning abilities compared to the traditional apparent strategy. The pursuit evasion game between the USVs is described as differential game based on the relative motion equation to overcome the weakness of data-driven learning. The solution to this differential game is obtained by using online RL. The value function, the USV1 (pursuer) strategy, and the USV2 (evader) strategy are approximated by critic, actor 1, and actor 2 neural networks (NNs), respectively. The uniformly ultimately bound (UUB) of the system states and weight errors of NNs are researched based on Lyapunov theory. The performance of the proposed strategy is verified by the simulation results.
What problem does this paper attempt to address?