Abstract: Unmanned Aerial Vehicles (UAVs) play a vital role in military warfare. In many battlefield mission scenarios, UAVs must fly safely to designated locations without human intervention, so an effective solution to the UAV Autonomous Motion Planning (AMP) problem can improve the mission success rate. In recent years, many studies have applied Deep Reinforcement Learning (DRL) methods to the AMP problem and achieved good results. Approaching the problem from the perspective of sampling, this paper designs a double-screening sampling method, combines it with the Deep Deterministic Policy Gradient (DDPG) algorithm, and proposes the Relevant Experience Learning-DDPG (REL-DDPG) algorithm. REL-DDPG uses a Prioritized Experience Replay (PER) mechanism to break the correlation between consecutive experiences in the experience pool and then, drawing on theory from human education, selects the experiences most similar to the current state for learning, which increases the influence of the learning process on action selection in the current state. All experiments are conducted in a complex unknown simulation environment built from the parameters of a real UAV. The training experiments show that REL-DDPG improves both the convergence speed and the converged result compared with the state-of-the-art DDPG algorithm, while the testing experiments demonstrate the applicability of the algorithm and examine its performance under different parameter settings. © 2021 Chinese Society of Aeronautics and Astronautics. Production and hosting by Elsevier Ltd. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
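
The double-screening idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `double_screen_sample`, the exponent `alpha`, the candidate count, and the use of Euclidean distance as the similarity measure are all assumptions made for illustration. The first screen draws candidates with probability proportional to their priority (in the spirit of PER), and the second screen keeps the candidates whose stored state is closest to the current state.

```python
import math
import random


def double_screen_sample(buffer, priorities, current_state,
                         n_candidates, batch_size, alpha=0.6, rng=None):
    """Hypothetical two-stage sampler (illustrative only).

    buffer      : list of transitions (state, action, reward, next_state)
    priorities  : non-negative priority per transition (e.g. |TD error|)
    alpha       : assumed exponent controlling how strongly priority matters
    """
    if not buffer or len(buffer) != len(priorities):
        raise ValueError("buffer and priorities must be non-empty and aligned")
    rng = rng or random.Random()

    # Screen 1: priority-proportional sampling (with replacement).
    weights = [p ** alpha for p in priorities]
    drawn = rng.choices(range(len(buffer)), weights=weights, k=n_candidates)
    candidates = list(dict.fromkeys(drawn))  # drop duplicates, keep order

    # Screen 2: keep the candidates most similar to the current state.
    # Euclidean distance is an assumed similarity measure.
    candidates.sort(key=lambda i: math.dist(buffer[i][0], current_state))
    return [buffer[i] for i in candidates[:batch_size]]
```

In a DDPG-style training loop, the returned batch would take the place of a uniformly sampled minibatch when updating the actor and critic. Because duplicates are removed after the first screen, the batch may contain fewer than `batch_size` transitions when the buffer is small.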

Prioritized Experience Replay–Based Path Planning Algorithm for Multiple UAVs

Path Planning Algorithm for Multiple UAVs Based on Artificial Potential Field

A Many-to-Many UAV Pursuit and Interception Strategy Based on PERMADDPG

Deep Reinforcement Learning Approach with Multiple Experience Pools for UAV's Autonomous Motion Planning in Complex Unknown Environments

Multi-UAV Path Planning Based on Potential Field Dense Reward in Unknown Environments with Static and Dynamic Obstacles

Relevant experience learning: A deep reinforcement learning method for UAV autonomous motion planning in complex unknown environments

An Improved Method towards Multi-UAV Autonomous Navigation Using Deep Reinforcement Learning

Decomposed and Prioritized Experience Replay-based MADDPG Algorithm for Multi-UAV Confrontation

UAV Path Planning Based on the Average TD3 Algorithm With Prioritized Experience Replay

Research on multi-UAV task decision-making based on improved MADDPG algorithm and transfer learning

A Motion Camouflage-Inspired Path Planning Method for UAVs Based on Reinforcement Learning

Path Planning of UAVs Based on Improved Ant Colony System

Multi-mission Path Re-planning for Multiple Unmanned Aerial Vehicles Based on Unexpected Events

Task offloading and trajectory scheduling for UAV-enabled MEC networks: An MADRL algorithm with prioritized experience replay

Pursuit Path Planning for Multiple Unmanned Ground Vehicles Based on Deep Reinforcement Learning

Multi-UAV Autonomous Path Planning in Reconnaissance Missions Considering Incomplete Information: A Reinforcement Learning Method

Delayed Soft Actor-Critic Based Path Planning Method for UAV in Dense Obstacles Environment

Deep Reinforcement Learning-based Collaborative Multi-UAV Coverage Path Planning

Path Planning for Multi-UAV Based on Improved Proximal Policy Optimization Algorithm

UAV Path Planning Based on Multi-agent Deep Reinforcement Learning

Unmanned Aerial Vehicle Path Planning Based on Improved DDQN Algorithm