Abstract: Autonomous air combat for unmanned combat air vehicles (UCAVs) is an active research topic in many countries, and maneuvering trajectory prediction is an important part of it. To address the difficulty of achieving high prediction accuracy and short prediction time simultaneously, this paper proposes a maneuvering trajectory prediction method based on a layered strategy that combines long-term maneuvering unit prediction with short-term maneuvering trajectory prediction. For long-term maneuvering unit prediction, complex trajectories are divided into 21 types of maneuvering units according to four characteristics of maneuvering trajectories, and a maneuvering unit library is established. The predictor builds on the deep echo state network (DeepESN): an autoencoder (AE) is incorporated to capture multiscale prediction input parameters, and adaptive boosting (Ada) is used to build a strong predictor and increase the prediction accuracy. Seven prediction networks are compared; the results demonstrate that the proposed method achieves the highest prediction accuracy, and its single-step prediction time of about 0.002 s meets the time requirement. For short-term maneuvering trajectory prediction, the long short-term memory (LSTM) network is analyzed, and the Gaussian random walk strategy particle swarm optimization (GWSPSO) algorithm is used to update the internal weights and biases of the network, which overcomes the vanishing-gradient and exploding-gradient problems. A data sharing method is also proposed to overcome the lack of directionality of optimization algorithms. In a comparison with four traditional networks, the proposed method performs better, and its short-term prediction time of 0.05 s is below the sampling time of 0.3 s, so it also meets the real-time requirement. Finally, the long- and short-term layered prediction method is applied to a group of complex maneuvering trajectories. The results demonstrate that the prediction accuracy is significantly increased and the real-time requirements are satisfied.
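
To make the idea of optimizing network parameters with a particle swarm instead of gradients more concrete, the following is a minimal sketch and not the GWSPSO algorithm of this paper. It assumes a standard global-best particle swarm optimizer with one added step in which a Gaussian random walk probes a point near the current best solution. The objective `sphere` is a hypothetical stand-in for the prediction loss of a network, and the function name `pso_gaussian_walk` and all parameter values (`w`, `c1`, `c2`, `sigma`) are illustrative assumptions.

```python
import random


def sphere(x):
    """Toy objective (assumption): stands in for a network's prediction loss."""
    return sum(v * v for v in x)


def pso_gaussian_walk(objective, dim=5, n_particles=20, iters=200,
                      w=0.7, c1=1.5, c2=1.5, sigma=0.1, seed=0):
    """Global-best PSO with a Gaussian random-walk probe around the best point."""
    rng = random.Random(seed)
    pos = [[rng.uniform(-5.0, 5.0) for _ in range(dim)]
           for _ in range(n_particles)]
    vel = [[0.0] * dim for _ in range(n_particles)]
    pbest = [p[:] for p in pos]
    pbest_val = [objective(p) for p in pos]
    g = min(range(n_particles), key=lambda i: pbest_val[i])
    gbest, gbest_val = pbest[g][:], pbest_val[g]

    for _ in range(iters):
        for i in range(n_particles):
            for d in range(dim):
                r1, r2 = rng.random(), rng.random()
                # Inertia + pull toward personal best + pull toward global best.
                vel[i][d] = (w * vel[i][d]
                             + c1 * r1 * (pbest[i][d] - pos[i][d])
                             + c2 * r2 * (gbest[d] - pos[i][d]))
                pos[i][d] += vel[i][d]
            val = objective(pos[i])
            if val < pbest_val[i]:
                pbest[i], pbest_val[i] = pos[i][:], val
                if val < gbest_val:
                    gbest, gbest_val = pos[i][:], val
        # Gaussian random walk (assumed form): perturb the global best and
        # keep the trial point only if it improves the objective.
        trial = [x + rng.gauss(0.0, sigma) for x in gbest]
        trial_val = objective(trial)
        if trial_val < gbest_val:
            gbest, gbest_val = trial, trial_val
    return gbest, gbest_val


best_x, best_val = pso_gaussian_walk(sphere)
print(best_val)
```

In a setting like the one described above, each particle position would hold a flattened vector of network weights and biases, and the objective would be the prediction error on training trajectories. Because no gradient is propagated through time, the search is not affected by vanishing or exploding gradients, at the cost of many more objective evaluations.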

Long and Short Term Maneuver Trajectory Prediction of UCAV Based on Deep Learning
