An Integrated Imitation and Reinforcement Learning Methodology for Robust Agile Aircraft Control with Limited Pilot Demonstration Data

Gulay Goktas Sever, Umut Demir, Abdullah Sadik Satir, Mustafa Cagatay Sahin, Nazim Kemal Ure
2023-12-27
Abstract: In this paper, we present a methodology for constructing data-driven maneuver generation models for agile aircraft that can generalize across a wide range of trim conditions and aircraft model parameters. Maneuver generation models play a crucial role in the testing and evaluation of aircraft prototypes, providing insights into the maneuverability and agility of the aircraft. However, constructing the models typically requires extensive amounts of real pilot data, which can be time-consuming and costly to obtain. Moreover, models built with limited data often struggle to generalize beyond the specific flight conditions covered in the original dataset. To address these challenges, we propose a hybrid architecture that leverages a simulation model, referred to as the source model. This open-source agile aircraft simulator shares similar dynamics with the target aircraft and allows us to generate unlimited data for building a proxy maneuver generation model. We then fine-tune this model to the target aircraft using a limited amount of real pilot data. Our approach combines techniques from imitation learning, transfer learning, and reinforcement learning to achieve this objective. To validate our methodology, we utilize real agile pilot data provided by Turkish Aerospace Industries (TAI). By employing the F-16 as the source model, we demonstrate that it is possible to construct a maneuver generation model that generalizes across various trim conditions and aircraft parameters without requiring any additional real pilot data. Our results showcase the effectiveness of our approach in developing robust and adaptable models for agile aircraft.
Artificial Intelligence, Machine Learning, Robotics, Systems and Control
What problem does this paper attempt to address?
The paper addresses how to construct a maneuver generation model for agile aircraft that adapts to a wide range of flight conditions and aircraft parameters when only limited real pilot demonstration data is available. Specifically, it focuses on four aspects:

1. **Construction of the Maneuver Generation Model**: How to build a model that generates trajectories similar to those flown by expert pilots, using limited real pilot demonstration data.
2. **Robustness of the Model**: The model must maintain good performance in unseen flight states, that is, it must recover well from states that do not appear in the training data.
3. **Transferability of the Model**: The model must transfer to different aircraft models without extensive retraining or additional data collection.
4. **Adaptability of the Model to Parameter Changes**: When aircraft parameters change (for example, weight distribution or actuator models), the model must adapt quickly without new demonstration data.

To achieve these goals, the paper proposes a hybrid architecture that combines imitation learning, transfer learning, and reinforcement learning. The steps are as follows:

1. **Generate Data Using the Source Model**: Use an open-source aircraft simulator (the source model) whose dynamics are similar to those of the target aircraft to generate a large amount of data for constructing the surrogate maneuver generation model.
2. **Fine-tune the Model to the Target Aircraft**: Use the limited real pilot data to fine-tune the surrogate model to the specific characteristics of the target aircraft.
3. **Enhance the Robustness and Adaptability of the Model**: Improve the model's performance in unseen states by introducing the C-DAgger algorithm (an improved variant of the DAgger algorithm), and further enhance its adaptability and generalization through reinforcement learning.

With this method, the paper shows how to construct a high-performance maneuver generation model that adapts to a wide range of flight conditions and aircraft parameters using limited real pilot data.
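
The robustness step above relies on a DAgger-style loop: roll out the learned policy, ask the expert what it would have done in the states the learner actually visited, add those labels to the dataset, and refit. The summary does not describe how C-DAgger modifies this loop, so the sketch below shows only plain DAgger and should not be read as the authors' algorithm. Everything in it is a hypothetical stand-in chosen for illustration: the two-state linear dynamics `A` and `B`, the saturated feedback "expert" with gain `K_EXPERT`, and the linear least-squares learner.

```python
import numpy as np

# Hypothetical toy dynamics x_{t+1} = A x_t + B u_t (not an aircraft model).
A = np.array([[1.0, 0.1], [0.0, 1.0]])
B = np.array([[0.0], [0.1]])
K_EXPERT = np.array([[2.0, 3.0]])  # hypothetical expert feedback gain


def expert_action(x):
    """Stand-in for the pilot: linear feedback saturated to [-1, 1]."""
    return np.clip(-K_EXPERT @ x, -1.0, 1.0)


def rollout(policy, x0, steps):
    """Run `policy` from x0 and return the visited states, shape (steps, 2)."""
    states = np.empty((steps, 2))
    x = np.asarray(x0, dtype=float)
    for t in range(steps):
        states[t] = x
        x = A @ x + B @ policy(x)
    return states


def label(states):
    """Query the expert on every state; returns shape (len(states), 1)."""
    return np.array([expert_action(s) for s in states])


def fit_gain(states, actions):
    """Least-squares fit of a linear policy u = -K x to the labelled data."""
    W, *_ = np.linalg.lstsq(states, actions, rcond=None)
    return -W.T  # shape (1, 2)


def dagger(n_iters=5, steps=30, seed=0):
    rng = np.random.default_rng(seed)
    # Iteration 0: behavior cloning on states from the expert's own rollout.
    data_x = rollout(expert_action, rng.uniform(-1, 1, 2), steps)
    data_u = label(data_x)
    K = fit_gain(data_x, data_u)
    for _ in range(n_iters):
        # Visit states under the *learner's* policy, not the expert's.
        visited = rollout(lambda x: -K @ x, rng.uniform(-1, 1, 2), steps)
        # Aggregate expert labels for those states and refit on all data.
        data_x = np.vstack([data_x, visited])
        data_u = np.vstack([data_u, label(visited)])
        K = fit_gain(data_x, data_u)
    return K, data_x, data_u


if __name__ == "__main__":
    K, X, U = dagger()
    print("learned gain:", K, "dataset size:", len(X))
```

The point of the design is that the training set grows with states induced by the learner's own mistakes, which is what lets the policy recover from situations absent from the original demonstrations. In the paper's setting the expert labels would have to come from pilot data or a surrogate for it; how that is done is not stated in this summary.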