Abstract:Despite being a widely adopted development framework for unmanned aerial vehicle (UAV), deep reinforcement learning (DRL) is often considered sample inefficient. Particularly, UAV struggles to fully explore the state and action space in environments with sparse rewards. While some exploration algorithms have been proposed to overcome the challenge of sparse rewards, they are not specifically tailored for UAV platform. Consequently, applying those algorithms to UAV path planning may lead to problems such as unstable training processes and neglect of action space comprehension, possibly causing negative impacts on the path planning results. To address the problem of sparse rewards in UAV path planning, we propose an information-theoretic exploration algorithm, Entropy Explorer (EE), specifically for UAV platform. The proposed EE generates intrinsic rewards based on state entropy and action entropy to compensate for the scarcity of extrinsic rewards. To further improve sampling efficiency, a framework integrating EE and Twin Delayed Deep Deterministic Policy Gradient (TD3) algorithms is proposed. Finally, the TD3-EE algorithm is tested in Airsim and compared against benchmarking algorithms. The simulation outcomes manifest that TD3-EE effectively stimulates the UAV to comprehensively explore both state and action spaces, thereby attaining superior performance compared to the benchmark algorithms in the realm of path planning.

Path Planning for UAV Ground Target Tracking via Deep Reinforcement Learning

Path Planning of Unmanned Aerial Vehicle in Complex Environments Based on State-Detection Twin Delayed Deep Deterministic Policy Gradient

Autonomous UAV Navigation: A DDPG-based Deep Reinforcement Learning Approach

Towards Real-Time Path Planning through Deep Reinforcement Learning for a UAV in Dynamic Environments

Multi-UAV simultaneous target assignment and path planning based on deep reinforcement learning in dynamic multiple obstacles environments

A Deep Reinforcement Learning Approach for UAV Path Planning Incorporating Vehicle Dynamics with Acceleration Control

Autonomous Obstacle Avoidance and Target Tracking of UAV Based on Deep Reinforcement Learning

Dynamic Scene Path Planning of UAVs Based on Deep Reinforcement Learning

Path Following for Autonomous Ground Vehicle Using DDPG Algorithm: A Reinforcement Learning Approach

Maneuvering target tracking of UAV based on MN-DDPG and transfer learning

Deep Reinforcement Learning-Driven UAV Data Collection Path Planning: A Study on Minimizing AoI

UAV Coverage Path Planning Based on Deep Reinforcement Learning

A Motion Camouflage-Inspired Path Planning Method for UAVs Based on Reinforcement Learning

UAV Path Planning Based on Multicritic-Delayed Deep Deterministic Policy Gradient

Path Planning of UAV Base Station Based on Deep Reinforcement Learning

Target tracking strategy using deep deterministic policy gradient

Advancements in UAV Path Planning: A Deep Reinforcement Learning Approach with Soft Actor-Critic for Enhanced Navigation

UAV Maneuvering Target Tracking in Uncertain Environments Based on Deep Reinforcement Learning and Meta-Learning

Improve Exploration in Deep Reinforcement Learning for UAV Path Planning using State and Action Entropy

Deep Reinforcement Learning for UAV Intelligent Mission Planning

Pursuit Path Planning for Multiple Unmanned Ground Vehicles Based on Deep Reinforcement Learning