Abstract:Unmanned Aerial Vehicles (UAVs) play a vital role in military warfare. In a variety of battlefield mission scenarios, UAVs are required to safely fly to designated locations without human intervention. Therefore, finding a suitable method to solve the UAV Autonomous Motion Planning (AMP) problem can improve the success rate of UAV missions to a certain extent. In recent years, many studies have used Deep Reinforcement Learning (DRL) methods to address the AMP problem and have achieved good results. From the perspective of sampling, this paper designs a sampling method with double-screening, combines it with the Deep Deterministic Policy Gradient (DDPG) algorithm, and proposes the Relevant Experience Learning-DDPG (REL-DDPG) algorithm. The REL-DDPG algorithm uses a Prioritized Experience Replay (PER) mechanism to break the correlation of continuous experiences in the experience pool, finds the experiences most similar to the current state to learn according to the theory in human education, and expands the influence of the learning process on action selection at the current state. All experiments are applied in a complex unknown simulation environment constructed based on the parameters of a real UAV. The training experiments show that REL-DDPG improves the convergence speed and the convergence result compared to the state-of-the-art DDPG algorithm, while the testing experiments show the applicability of the algorithm and investigate the performance under different parameter conditions. (c) 2021 Chinese Society of Aeronautics and Astronautics. Production and hosting by Elsevier Ltd. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).

AoI optimal UAV trajectory planning: A Deep Recurrent Reinforcement Learning Approach

Deep RL-based Trajectory Planning for AoI Minimization in UAV-assisted IoT

AoI Optimal Trajectory Planning for Cooperative UAVs: A Multi-Agent Deep Reinforcement Learning Approach

Joint AoI-Aware UAVs Trajectory Planning and Data Collection in UAV-Based IoT Systems: A Deep Reinforcement Learning Approach

Joint Power Control and UAV Trajectory Design for Information Freshness Via Deep Reinforcement Learning

A RDA-Based Deep Reinforcement Learning Approach for Autonomous Motion Planning of UAV in Dynamic Unknown Environments

Deep Reinforcement Learning-Driven UAV Data Collection Path Planning: A Study on Minimizing AoI

A Learning-Based Trajectory Planning of Multiple UAVs for AoI Minimization in IoT Networks

Relevant experience learning: A deep reinforcement learning method for UAV autonomous motion planning in complex unknown environments

AoI-Energy-Aware UAV-Assisted Data Collection for IoT Networks: A Deep Reinforcement Learning Method

A Novel AI-Based Framework for AoI-Optimal Trajectory Planning in UAV-Assisted Wireless Sensor Networks

Trajectory Planning for UAV-Assisted Data Collection in IoT Network: A Double Deep Q Network Approach

Deep Reinforcement Learning for Fresh Data Collection in UAV-assisted IoT Networks

Traffic Learning and Proactive UAV Trajectory Planning for Data Uplink in Markovian IoT Models

Trajectory Design for UAV-Based Internet of Things Data Collection: A Deep Reinforcement Learning Approach

Learning-Based Data Gathering for Information Freshness in UAV-Assisted IoT Networks

Priority-Oriented Trajectory Planning for UAV-Aided Time-Sensitive IoT Networks

Work Together to Keep Fresh: Hierarchical Learning for UAVs-assisted Data Time-Sensitive IoT

The UAV Trajectory Optimization for Data Collection from Time-Constrained IoT Devices: A Hierarchical Deep Q-Network Approach

Trajectory Planning of UAV in Wireless Powered IoT System Based on Deep Reinforcement Learning

Research on the UAV-aided Data Collection and Trajectory Design Based on the Deep Reinforcement Learning