Two-Stage Strategy to Achieve a Reinforcement Learning-Based Upset Recovery Policy for Aircraft

Huanhui Cao,Hao Xiong,Hantao Jiang,Hao Hu,Weifeng Zeng,Chaoran Li,Wenjie Lu
DOI: https://doi.org/10.1109/CAC53003.2021.9727381
2021-10-22
Abstract:Aircraft upset situations are the highest risk to civil aviation. Thus, a reliable upset recovery policy is necessary for aircraft. In this paper, a two-stage strategy to achieve a reinforcement learning (RL)-based upset recovery policy that takes time of recovery and loss of altitude into account is proposed for aircraft to recover from an arbitration upset situation to level flight. Based on the proposed two-stage strategy and Twin Delayed Deep Deterministic Policy Gradient (TD3) algorithm, an algorithm to achieve a TD3-based upset recovery policy for aircraft is developed. Experiments are conducted based on X-Plane 11 to evaluate the effectiveness of the proposed two-stage strategy and the performance of the achieved upset recovery policy in stall recovery and spin recovery.
Engineering,Computer Science
What problem does this paper attempt to address?