Scheduling for Urban Air Mobility using Safe Learning

Surya Murthy,Natasha A. Neogi,Suda Bharadwaj
DOI: https://doi.org/10.4204/EPTCS.371.7
2022-09-28
Abstract:This work considers the scheduling problem for Urban Air Mobility (UAM) vehicles travelling between origin-destination pairs with both hard and soft trip deadlines. Each route is described by a discrete probability distribution over trip completion times (or delay) and over inter-arrival times of requests (or demand) for the route along with a fixed hard or soft deadline. Soft deadlines carry a cost that is incurred when the deadline is missed. An online, safe scheduler is developed that ensures that hard deadlines are never missed, and that average cost of missing soft deadlines is minimized. The system is modelled as a Markov Decision Process (MDP) and safe model-based learning is used to find the probabilistic distributions over route delays and demand. Monte Carlo Tree Search (MCTS) Earliest Deadline First (EDF) is used to safely explore the learned models in an online fashion and develop a near-optimal non-preemptive scheduling policy. These results are compared with Value Iteration (VI) and MCTS (Random) scheduling solutions.
Machine Learning,Artificial Intelligence
What problem does this paper attempt to address?
### Problems the paper attempts to solve This paper aims to solve the scheduling problem of Urban Air Mobility (UAM) vehicles. Specifically, the paper focuses on how to schedule UAM vehicle trip requests in the presence of hard and soft deadlines to ensure: 1. **Tasks with hard deadlines are not missed**: For tasks with hard deadlines, it must be ensured that these tasks are completed within their specified deadlines. 2. **Minimize the cost of missing soft deadlines**: For tasks with soft deadlines, if the deadline is missed, a certain cost will be incurred. Therefore, these costs need to be minimized as much as possible. Each route is described by a discrete probability distribution, covering trip completion time and request arrival time (i.e., demand). In addition, each route has a fixed hard or soft deadline. Tasks with soft deadlines will incur a certain cost when the deadline is missed. ### Main challenges - **Uncertainty**: Since trip completion time and request arrival time are random, this uncertainty needs to be dealt with. - **Online scheduling**: The scheduler needs to make decisions without knowing the characteristics of future requests (i.e., non - prophet scheduling), which means it cannot predict trip execution time or request arrival intervals. - **Safety and efficiency**: Ensure that all tasks with hard deadlines are completed on time and minimize the cost of tasks with soft deadlines as much as possible. ### Solutions To solve the above problems, the paper proposes a safe learning - based scheduling method based on Markov Decision Process (MDP). The main steps include: 1. **Modeling**: Formalize the UAM scheduling problem as an infinite - duration, aperiodic, non - preemptive optimization problem and model it with MDP. 2. **Learning**: Safely learn the demand and delay distributions of UAM trip requests through sampling techniques. 3. **Policy generation**: Use Monte Carlo Tree Search (MCTS) to learn safe and scalable non - preemptive scheduling policies. 4. **Comparison and verification**: Compare the results with value iteration (VI) and random MCTS scheduling solutions to verify the effectiveness of the proposed method. Through these steps, the paper develops a system that can safely and efficiently schedule UAM vehicles in an uncertain environment.