Inverse Optimal Planning for Air Traffic Control

Ekaterina Tolstaya,Alejandro Ribeiro,Vijay Kumar,Ashish Kapoor
DOI: https://doi.org/10.1109/IROS40897.2019.8968460
2019-03-26
Abstract:We envision a system that concisely describes the rules of air traffic control, assists human operators and supports dense autonomous air traffic around commercial airports. We develop a method to learn the rules of air traffic control from real data as a cost function via maximum entropy inverse reinforcement learning. This cost function is used as a penalty for a search-based motion planning method that discretizes both the control and the state space. We illustrate the methodology by showing that our approach can learn to imitate the airport arrival routes and separation rules of dense commercial air traffic. The resulting trajectories are shown to be safe, feasible, and efficient.
Robotics
What problem does this paper attempt to address?
The paper attempts to address the problem of implementing Autonomous Air Traffic Control (Autonomous ATC) around busy commercial airports. Specifically, the researchers face the following challenges: 1. **Complex Rules and Strategies**: Air Traffic Controllers (ATCs) need to follow a series of complex regulations, including spacing requirements between aircraft, weather restrictions, and airport-specific takeoff and landing protocols. Additionally, experienced ATCs develop various strategies based on the complex interactions between these factors. 2. **Diverse Dynamic Variables**: The air traffic control task involves multiple deterministic and stochastic variables, such as traffic density, weather, regulatory requirements, and local geographical conditions. There may be multiple compliant path choices among these variables, and these paths can be qualitatively very different. 3. **Adaptability to New Types of Aircraft**: Existing air traffic control systems are primarily designed for standard fixed-wing aircraft and helicopters. Extending these systems to new types of aircraft (such as micro-drones and Vertical Take-Off and Landing vehicles, VTOL), which may have completely different dynamic behaviors, is a challenge. 4. **Optimization of Multiple Dynamic Agents**: As the number of aircraft increases, planning tasks need to be optimized among multiple dynamic agents, making the problem difficult to handle. To address these issues, the researchers propose a method that combines search-based motion planning and Inverse Reinforcement Learning (IRL). Specifically, they use maximum entropy inverse reinforcement learning to learn air traffic control rules from real data and use these rules as a penalty function in the search-based motion planning method. This approach can generate trajectories similar to real air traffic, and these trajectories are proven to be safe, feasible, and efficient.