Abstract: This paper addresses multi-UAV pursuit-evasion, where a group of drones cooperates to capture a fast evader in a confined environment with obstacles. Existing heuristic algorithms, which simplify the pursuit-evasion problem, often lack expressive coordination strategies and struggle to capture the evader in extreme scenarios, such as when the evader moves at high speed. Reinforcement learning (RL) has also been applied to this problem and has the potential to yield highly cooperative capture strategies. However, RL-based methods are difficult to train in complex three-dimensional scenarios with diverse task settings because the exploration space is vast, and the dynamics constraints of drones further limit the ability of RL to acquire high-performance capture strategies. In this work, we introduce a dual curriculum learning framework, named DualCL, which addresses multi-UAV pursuit-evasion in diverse environments and demonstrates zero-shot transfer to unseen scenarios. DualCL comprises two main components: the Intrinsic Parameter Curriculum Proposer, which progressively suggests intrinsic parameters from easy to hard to improve the capture capability of the drones, and the External Environment Generator, which explores unresolved scenarios and generates appropriate training distributions of external environment parameters. Simulation results show that DualCL significantly outperforms baseline methods, achieving a capture rate above 90% and reducing the capture timestep by at least 27.5% in the training scenarios. It also exhibits the best zero-shot generalization in unseen environments. Moreover, we demonstrate that our pursuit strategy transfers from simulation to real-world environments. Further details can be found on the project website at https://sites.google.com/view/dualcl.
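The two-component structure described in the abstract can be illustrated with a minimal sketch. This is not the DualCL implementation: the class names, the choice of `capture_radius` as the intrinsic parameter, the `num_obstacles` and `evader_speed` environment parameters, and every threshold below are assumptions made purely for illustration. The sketch only shows the general idea of tightening an intrinsic parameter from easy to hard while keeping an archive of environment settings that are not yet solved.

```python
import random
from dataclasses import dataclass, field


@dataclass
class IntrinsicCurriculum:
    """Hypothetical easy-to-hard schedule over one intrinsic parameter."""
    capture_radius: float = 0.6   # assumed easy starting value
    target_radius: float = 0.3    # assumed hardest value
    step: float = 0.1             # assumed decrement per promotion
    promote_at: float = 0.9       # assumed success-rate threshold

    def update(self, success_rate: float) -> float:
        # Tighten the parameter only once the current level is mastered.
        if success_rate >= self.promote_at:
            self.capture_radius = max(self.target_radius,
                                      self.capture_radius - self.step)
        return self.capture_radius


@dataclass
class EnvironmentGenerator:
    """Hypothetical archive of unresolved external environment parameters."""
    low: float = 0.2              # below this: treated as too hard for now
    high: float = 0.8             # above this: treated as solved
    archive: list = field(default_factory=list)

    def record(self, env_params: dict, success_rate: float) -> None:
        # Keep only scenarios that are neither solved nor hopeless.
        if self.low <= success_rate <= self.high:
            self.archive.append(env_params)

    def sample(self, rng: random.Random) -> dict:
        # Mix archived unresolved scenarios with fresh random ones.
        if self.archive and rng.random() < 0.7:
            return rng.choice(self.archive)
        return {"num_obstacles": rng.randint(0, 5),
                "evader_speed": rng.uniform(1.0, 2.5)}
```

In a training loop, one would call `sample` to pick the next environment, evaluate the current policy there, pass the measured success rate to `record`, and periodically call `update` with the aggregate success rate; how the actual framework schedules these steps is not specified in the abstract.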

Crafting a robotic swarm pursuit–evasion capture strategy using deep reinforcement learning

Large Scale Pursuit-Evasion under Collision Avoidance Using Deep Reinforcement Learning

Multi-Agent Cooperative Pursuit-Evasion Control Using Gene Expression Programming

Multi-Target Pursuit by a Decentralized Heterogeneous UAV Swarm using Deep Multi-Agent Reinforcement Learning

Distributed Pursuit-Evasion Game of Limited Perception USV Swarm Based on Multiagent Proximal Policy Optimization

Cooperative Pursuit with Multiple Pursuers based on Deep Minimax Q-learning

Multi-UAV Pursuit-Evasion with Online Planning in Unknown Environments by Deep Reinforcement Learning

Mapless Collaborative Navigation for a Multi-Robot System Based on the Deep Reinforcement Learning

Coordination and Control in Multiagent Systems for Enhanced Pursuit-Evasion Game Performance

Cooperative Encirclement Strategy for Multiple Drones Based on ATT-MADDPG

Learning to Play Pursuit-Evasion with Dynamic and Sensor Constraints

Game of Drones: Multi-UAV Pursuit-Evasion Game With Online Motion Planning by Deep Reinforcement Learning

Multi-robot Target Encirclement Control with Collision Avoidance via Deep Reinforcement Learning

Learning Vision-based Pursuit-Evasion Robot Policies

Coordinated control of multiple mobile robots in pursuit-evasion games

Learning Multi-Pursuit Evasion for Safe Targeted Navigation of Drones

Cooperative multi-agent target searching: a deep reinforcement learning approach based on parallel hindsight experience replay

A Dual Curriculum Learning Framework for Multi-UAV Pursuit-Evasion in Diverse Environments

Learning Evasion Strategy in Pursuit-Evasion by Deep Q-network

Nature-inspired dynamic control for pursuit-evasion of robots

Multi-robot Cooperative Pursuit via Potential Field-Enhanced Reinforcement Learning