Trajectory Planning for Hypersonic Vehicles with Reinforcement Learning

Mingxin Zhou,Haihong Chi
DOI: https://doi.org/10.23919/CCC52363.2021.9549361
2021-07-26
Abstract:This paper discusses the problem of avoiding threats during the cruising flight of hypersonic vehicles (HV). Considering the constraints on kinematics of HV and changing environments, this paper proposes two methods of trajectory planning that taking the overload or the rotational angular velocity of the ballistic deflection angle as actions of agent. Meanwhile, the agent’s policy is optimized with policy gradient method, and Proximal Policy Optimization (PPO) and Soft Actor-Critic (SAC) are used for comparison. Experimental results show that PPO and SAC have similar performance in penetration missions. Moreover, in the complicated flight environment, the method of taking overload and exploration distance as actions has a higher penetration success ratio.
Engineering,Computer Science
What problem does this paper attempt to address?