Abstract:A pursuit–evasion game is a classical maneuver confrontation problem in the multi-agent systems (MASs) domain. An online decision technique based on deep reinforcement learning (DRL) was developed in this paper to address the problem of environment sensing and decision-making in pursuit–evasion games. A control-oriented framework developed from the DRL-based multi-agent deep deterministic policy gradient (MADDPG) algorithm was built to implement multi-agent cooperative decision-making to overcome the limitation of the tedious state variables required for the traditionally complicated modeling process. To address the effects of errors between a model and a real scenario, this paper introduces adversarial disturbances. It also proposes a novel adversarial attack trick and adversarial learning MADDPG (A2-MADDPG) algorithm. By introducing an adversarial attack trick for the agents themselves, uncertainties of the real world are modeled, thereby optimizing robust training. During the training process, adversarial learning was incorporated into our algorithm to preprocess the actions of multiple agents, which enabled them to properly respond to uncertain dynamic changes in MASs. Experimental results verified that the proposed approach provides superior performance and effectiveness for pursuers and evaders, and both can learn the corresponding confrontational strategy during training.

Adaptive Double Fuzzy Systems Based Q-Learning for Pursuit-Evasion Game

Large Scale Pursuit-Evasion under Collision Avoidance Using Deep Reinforcement Learning.

A Novel Method for a Pursuit–Evasion Game Based on Fuzzy Q-Learning and Model-Predictive Control

Pursuit-Evasion Games for Multi-agent Based on Reinforcement Learning with Obstacles

Adaptive Optimal Control via Q-Learning for Multi-Agent Pursuit-Evasion Games

An Open Loop Stackelberg Solution to Optimal Strategy for UAV Pursuit-Evasion Game

An Improved Approach Towards Multi-Agent Pursuit–Evasion Game Decision-Making Using Deep Reinforcement Learning

Integral Reinforcement Learning Based Dynamic Stackelberg Pursuit-Evasion Game for Unmanned Surface Vehicles

Pursuit-evasion Game Strategy of USV Based on Deep Reinforcement Learning in Complex Multi-Obstacle Environment

Cooperative Pursuit with Multiple Pursuers based on Deep Minimax Q-learning

Deep Reinforcement Learning Based Strategy for Quadrotor UAV Pursuer and Evader Problem

High-Speed Three-Dimensional Aerial Vehicle Evasion Based on a Multi-Stage Dueling Deep Q-Network

Hierarchical Maneuver Decision Method Based on PG-Option for UAV Pursuit-Evasion Game

Safety-Critical Pursuit-Evasion Differential Game Guidance for Multiple Underactuated Autonomous Surface Vehicles

Autonomous Decision Making for UAV Cooperative Pursuit-Evasion Game with Reinforcement Learning

A Pursuit-Evasion Game on a Real-City Virtual Simulation Platform Based on Multi-Agent Reinforcement Learning

Min-Max Q-Learning for Multi-Player Pursuit-Evasion Games

Receding Horizon Control Based Real-time Strategy for Missile Pursuit-evasion Game

A Fuzzy Deterministic Policy Gradient Algorithm for Pursuit-Evasion Differential Games.

Dynamic Fuzzy Q-Learning and Its Real-Time Application in Embedded System

Pursuit and evasion game between UVAs based on multi-agent reinforcement learning