Abstract: Autonomous air combat for unmanned combat air vehicles (UCAVs) is an active research topic in many countries, and maneuvering trajectory prediction is an important part of it. To address the difficulty of achieving high prediction accuracy and short prediction time simultaneously, this paper proposes a maneuvering trajectory prediction method based on a layered strategy that combines long-term maneuvering unit prediction with short-term maneuvering trajectory prediction. For long-term maneuvering unit prediction, complex trajectories are divided into 21 types of maneuvering units according to four characteristics of maneuvering trajectories, and a maneuvering unit library is established. The predictor builds on the deep echo state network (DeepESN) and incorporates an autoencoder (AE) to capture multiscale input parameters. In addition, to increase prediction accuracy, adaptive boosting (Ada) is used to build a strong predictor, and seven prediction networks are compared. The results demonstrate that the proposed method achieves the highest prediction accuracy among the compared networks, with a single-step prediction time of about 0.002 s, which meets the time requirement. For short-term maneuvering trajectory prediction, the long short-term memory (LSTM) network is analyzed, and the Gaussian random walk strategy particle swarm optimization (GWSPSO) algorithm is used to update the network's internal weights and biases, which avoids the vanishing-gradient and exploding-gradient problems; a data sharing method is also proposed to compensate for the lack of directionality in optimization algorithms. In a comparison with four traditional networks, the proposed method performs better. The short-term prediction time of 0.05 s is below the sampling time of 0.3 s and therefore also meets the real-time requirement.
Finally, the long- and short-term layered prediction method is applied to a group of complex maneuvering trajectories. The results demonstrate that the prediction accuracy is significantly increased and the real-time requirements are satisfied.
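
The idea of updating network weights with a swarm optimizer instead of gradients can be illustrated with a small sketch. The code below is a generic particle swarm optimizer with a Gaussian random-walk perturbation of the global best; it is not the GWSPSO algorithm of the paper, whose details the abstract does not give. The function names (`sphere`, `pso_gaussian_walk`), all parameter values, and the toy objective are assumptions made for illustration. In a weight-training setting, each particle position would stand for a flattened vector of LSTM weights and biases, and the objective would be the prediction loss.

```python
import random


def sphere(x):
    """Toy objective (assumed stand-in for a prediction loss): sum of squares."""
    return sum(v * v for v in x)


def pso_gaussian_walk(objective, dim, n_particles=20, iters=200,
                      w=0.7, c1=1.5, c2=1.5, sigma=0.1, seed=0):
    """Generic PSO plus a Gaussian random walk around the global best.

    No gradients are used, so vanishing or exploding gradients cannot occur.
    All hyperparameter defaults here are illustrative assumptions.
    """
    rng = random.Random(seed)
    pos = [[rng.uniform(-5.0, 5.0) for _ in range(dim)]
           for _ in range(n_particles)]
    vel = [[0.0] * dim for _ in range(n_particles)]
    pbest = [p[:] for p in pos]
    pbest_val = [objective(p) for p in pos]
    g = min(range(n_particles), key=lambda i: pbest_val[i])
    gbest, gbest_val = pbest[g][:], pbest_val[g]

    for _ in range(iters):
        for i in range(n_particles):
            for d in range(dim):
                r1, r2 = rng.random(), rng.random()
                # Standard velocity update: inertia + cognitive + social terms.
                vel[i][d] = (w * vel[i][d]
                             + c1 * r1 * (pbest[i][d] - pos[i][d])
                             + c2 * r2 * (gbest[d] - pos[i][d]))
                pos[i][d] += vel[i][d]
            val = objective(pos[i])
            if val < pbest_val[i]:
                pbest[i], pbest_val[i] = pos[i][:], val
                if val < gbest_val:
                    gbest, gbest_val = pos[i][:], val
        # Gaussian random walk around the global best; kept only if it improves.
        trial = [v + rng.gauss(0.0, sigma) for v in gbest]
        trial_val = objective(trial)
        if trial_val < gbest_val:
            gbest, gbest_val = trial, trial_val
    return gbest, gbest_val


best, best_val = pso_gaussian_walk(sphere, dim=3)
```

Because the best value is only ever replaced by a strictly smaller one, `best_val` never increases over the iterations; the random-walk step simply gives the swarm an extra chance to escape stagnation near the current best.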

UCAV Autonomous Maneuvering Decision Based on Curriculum Learning Mechanism Training

UCAV Air Combat Maneuver Decisions Based on a Proximal Policy Optimization Algorithm with Situation Reward Shaping

Long and Short Term Maneuver Trajectory Prediction of UCAV Based on Deep Learning

Model-free Maneuvering Control of Fixed-Wing UAVs Based on Deep Reinforcement Learning

Research on UCAV Maneuvering Decision Method Based on Heuristic Reinforcement Learning

Mean policy-based proximal policy optimization for maneuvering decision in multi-UAV air combat

Maneuver Decision of UAV in Short-Range Air Combat Based on Deep Reinforcement Learning

UAV maneuver decision-making via deep reinforcement learning for short-range air combat

Research on Autonomous Maneuvering Decision of UCAV based on Approximate Dynamic Programming

Autonomous maneuver decision-making for a UCAV in short-range aerial combat based on an MS-DDQN algorithm

Autonomous Maneuver Decision of UCAV Air Combat Based on Double Deep Q Network Algorithm and Stochastic Game Theory

Air Combat Maneuver Decision Method Based on A3C Deep Reinforcement Learning

UAV Cooperative Air Combat Maneuvering Confrontation Based on Multi-agent Reinforcement Learning

Maneuver Decision-Making Through Automatic Curriculum Reinforcement Learning Without Handcrafted Reward functions

Maneuver Decision-Making For Autonomous Air Combat Through Curriculum Learning And Reinforcement Learning With Sparse Rewards

UAV cooperative air combat maneuver decision based on multi-agent reinforcement learning

Multi-intent autonomous decision-making for air combat with deep reinforcement learning

Cross coordination of behavior clone and reinforcement learning for autonomous within-visual-range air combat

Strategy Generation Based on DDPG with Prioritized Experience Replay for UCAV

Autonomous confrontation strategy learning evolution mechanism of unmanned system group under actual combat in the loop

Mastering air combat game with deep reinforcement learning