Abstract:The reconnaissance of high-value targets is prerequisite for effective operations. The recent appreciation of deep reinforcement learning (DRL) arises from its success in navigation problems, but due to the competitiveness and complexity of the military field, the applications of DRL in the military field are still unsatisfactory. In this paper, an end-to-end DRL-based intelligent reconnaissance mission planning is proposed for dual unmanned aerial vehicle (dual UAV) cooperative reconnaissance missions under high-threat and dense situations. Comprehensive consideration is given to specific mission properties and parameter requirements through the whole modelling. Firstly, the reconnaissance mission is described as a Markov decision process (MDP), and the mission planning model based on DRL is established. Secondly, the environment and UAV motion parameters are standardized to input the neural network, aiming to deduce the difficulty of algorithm convergence. According to the concrete requirements of non-reconnaissance by radars, dual-UAV cooperation and wandering reconnaissance in the mission, four reward functions with weights are designed to enhance agent understanding to the mission. To avoid sparse reward, the clip function is used to control the reward value range. Finally, considering the continuous action space of reconnaissance mission planning, the widely applicable proximal policy optimization (PPO) algorithm is used in this paper. The simulation is carried out by combining offline training and online planning. By changing the location and number of ground detection areas, from 1 to 4, the model with PPO can maintain 20% of reconnaissance proportion and a 90% mission complete rate and help the reconnaissance UAV to complete efficient path planning. It can adapt to unknown continuous high-dimensional environmental changes, is generalizable, and reflects strong intelligent planning performance.

Standoff Target Tracking for Networked UAVs with Specified Performance Via Deep Reinforcement Learning

Deep Reinforcement Learning-Based End-to-End Control for UAV Dynamic Target Tracking

Deep Reinforcement Learning Multi-UAV Trajectory Control for Target Tracking

Multi-Agent Reinforcement Learning Aided Intelligent UAV Swarm for Target Tracking

UAV Maneuvering Target Tracking in Uncertain Environments Based on Deep Reinforcement Learning and Meta-Learning

Path Planning for UAV Ground Target Tracking via Deep Reinforcement Learning

Target tracking strategy using deep deterministic policy gradient

Multi-UAV Cooperative Target Tracking Based on Swarm Intelligence

3D-Trajectory and Phase-Shift Design for RIS-Assisted UAV Systems Using Deep Reinforcement Learning

Trajectory Planning for Airborne Radar in Extended Target Tracking Based on Deep Reinforcement Learning

Collaborative Reinforcement Learning Based Unmanned Aerial Vehicle (UAV) Trajectory Design for 3D UAV Tracking

Goal-Oriented UAV Communication Design and Optimization for Target Tracking: A MachineLearning Approach

Multi-target tracking for unmanned aerial vehicle swarms using deep reinforcement learning

Maneuvering target tracking of UAV based on MN-DDPG and transfer learning

SREC: Proactive Self-Remedy of Energy-Constrained UAV-Based Networks via Deep Reinforcement Learning

Joint 3D trajectory and phase shift optimization via deep reinforcement learning for RIS-assisted UAV communication systems

Multi-UAV simultaneous target assignment and path planning based on deep reinforcement learning in dynamic multiple obstacles environments

Target-Following Double Deep Q-Networks for UAVs

Trace Pheromone-Based Energy-Efficient UAV Dynamic Coverage Using Deep Reinforcement Learning

Deep Reinforcement Learning for Intelligent Dual-UAV Reconnaissance Mission Planning