Inverse Optimal Planning for Air Traffic Control

Ekaterina Tolstaya,Alejandro Ribeiro,Vijay Kumar,Ashish Kapoor

DOI: https://doi.org/10.1109/IROS40897.2019.8968460

2019-03-26

Abstract:We envision a system that concisely describes the rules of air traffic control, assists human operators and supports dense autonomous air traffic around commercial airports. We develop a method to learn the rules of air traffic control from real data as a cost function via maximum entropy inverse reinforcement learning. This cost function is used as a penalty for a search-based motion planning method that discretizes both the control and the state space. We illustrate the methodology by showing that our approach can learn to imitate the airport arrival routes and separation rules of dense commercial air traffic. The resulting trajectories are shown to be safe, feasible, and efficient.

Robotics

What problem does this paper attempt to address?

The paper attempts to address the problem of implementing Autonomous Air Traffic Control (Autonomous ATC) around busy commercial airports. Specifically, the researchers face the following challenges: 1. **Complex Rules and Strategies**: Air Traffic Controllers (ATCs) need to follow a series of complex regulations, including spacing requirements between aircraft, weather restrictions, and airport-specific takeoff and landing protocols. Additionally, experienced ATCs develop various strategies based on the complex interactions between these factors. 2. **Diverse Dynamic Variables**: The air traffic control task involves multiple deterministic and stochastic variables, such as traffic density, weather, regulatory requirements, and local geographical conditions. There may be multiple compliant path choices among these variables, and these paths can be qualitatively very different. 3. **Adaptability to New Types of Aircraft**: Existing air traffic control systems are primarily designed for standard fixed-wing aircraft and helicopters. Extending these systems to new types of aircraft (such as micro-drones and Vertical Take-Off and Landing vehicles, VTOL), which may have completely different dynamic behaviors, is a challenge. 4. **Optimization of Multiple Dynamic Agents**: As the number of aircraft increases, planning tasks need to be optimized among multiple dynamic agents, making the problem difficult to handle. To address these issues, the researchers propose a method that combines search-based motion planning and Inverse Reinforcement Learning (IRL). Specifically, they use maximum entropy inverse reinforcement learning to learn air traffic control rules from real data and use these rules as a penalty function in the search-based motion planning method. This approach can generate trajectories similar to real air traffic, and these trajectories are proven to be safe, feasible, and efficient.

Inverse Optimal Planning for Air Traffic Control

Efficient Optimization-Based Trajectory Planning for Unmanned Systems in Confined Environments

Game-theoretic Objective Space Planning

Efficient optimization-based trajectory planning

Time-Optimal Maneuver Planning in Automatic Parallel Parking Using a Simultaneous Dynamic Optimization Approach

Strategic Planning in Air Traffic Control as a Multi-objective Stochastic Optimization Problem

Learning-Initialized Trajectory Planning in Unknown Environments

Autonomous Air Traffic Controller: A Deep Multi-Agent Reinforcement Learning Approach

An Autonomous Free Airspace En-route Controller using Deep Reinforcement Learning Techniques

Real-Time Spatial Trajectory Planning for Urban Environments Using Dynamic Optimization

Real-Time On-the-Fly Motion Planning for Urban Air Mobility via Updating Tree Data of Sampling-Based Algorithms Using Neural Network Inference

A Stochastic Integer Programming Approach to Air Traffic Scheduling and Operations

Multiobjective Tactical Planning under Uncertainty for Air Traffic Flow and Capacity Management

Hierarchical Game-Theoretic Planning for Autonomous Vehicles

Learning to Plan Maneuverable and Agile Flight Trajectory with Optimization Embedded Networks

PILOT: Efficient Planning by Imitation Learning and Optimisation for Safe Autonomous Driving

Game-Theoretic Planning for Autonomous Driving among Risk-Aware Human Drivers

Integrated Behavior Planning and Motion Control for Autonomous Vehicles with Traffic Rules Compliance

Enhancing Planning for Autonomous Driving via an Iterative Optimization Framework Incorporating Safety-Critical Trajectory Generation

LUCIDGames: Online Unscented Inverse Dynamic Games for Adaptive Trajectory Prediction and Planning

Spatio-Temporal Trajectory Planning Using Search And Optimizing Method for Autonomous Driving