Learning-based Online Optimization for Autonomous Mobility-on-Demand Fleet Control

Kai Jungel,Axel Parmentier,Maximilian Schiffer,Thibaut Vidal
2024-02-21
Abstract:Autonomous mobility-on-demand systems are a viable alternative to mitigate many transportation-related externalities in cities, such as rising vehicle volumes in urban areas and transportation-related pollution. However, the success of these systems heavily depends on efficient and effective fleet control strategies. In this context, we study online control algorithms for autonomous mobility-on-demand systems and develop a novel hybrid combinatorial optimization enriched machine learning pipeline which learns online dispatching and rebalancing policies from optimal full-information solutions. We test our hybrid pipeline on large-scale real-world scenarios with different vehicle fleet sizes and various request densities. We show that our approach outperforms state-of-the-art greedy, and model-predictive control approaches with respect to various KPIs, e.g., by up to 17.1% and on average by 6.3% in terms of realized profit.
Optimization and Control,Machine Learning
What problem does this paper attempt to address?
The problem that this paper attempts to solve is to implement efficient and effective fleet control strategies in the Autonomous Mobility - on - Demand (AMoD) system. Specifically, the authors studied the online control algorithms of the AMoD system and developed a new hybrid combinatorial optimization (CO) enriched machine learning (ML) pipeline, which can learn online scheduling and rebalancing strategies from the full - information optimal solutions. The solution to this problem is of great significance for improving the sustainability and economic efficiency of urban transportation systems, especially in the face of challenges such as increasing urban traffic volume and traffic - related pollution. ### Background and Objectives of the Paper With the acceleration of the urbanization process, the traffic volume in urban areas is constantly rising, which poses challenges to the sustainability and economic efficiency of the current urban transportation systems. The external effects of the transportation system, such as congestion and local emissions, not only have a serious impact on public health but also pose a threat to the environment. In addition, private cars occupy a large amount of parking and driving space in urban areas. Advances in autonomous driving technology and 5G technology make it possible to realize the Autonomous Mobility - on - Demand system, which can use a centrally - controlled fleet of self - driving vehicles to address the above - mentioned challenges. ### Research Challenges Designing efficient control algorithms for the AMoD system faces two main challenges: 1. **Decision - making under Uncertainty**: The central controller must make scheduling and rebalancing decisions without knowing future ride - request information. 2. **Computational Challenges for Large - scale Applications**: In densely populated areas, thousands of requests need to be processed per hour, which brings computational challenges to the scalability of the control mechanism. ### Solutions To address these challenges, the authors developed a new type of scalable online control strategy. This strategy receives the state input of the AMoD system through a hybrid combinatorial optimization - enriched machine learning pipeline and returns scheduling and rebalancing actions. The core idea of this pipeline is to transform the online state of the AMoD system into a combinatorial scheduling problem and learn the parameterization method of the scheduling problem from the full - information optimal solutions in historical data, so that the solution to the scheduling problem can imitate the full - information optimal solution and achieve predictive online scheduling and rebalancing. ### Experimental Verification The authors tested their hybrid pipeline in large - scale real - world scenarios, covering vehicle fleets of different scales and different request densities. The experimental results show that this method outperforms existing greedy algorithms and model - predictive control methods in multiple key performance indicators (KPIs). For example, it achieves a maximum 17.1% increase and an average 6.3% increase in profit. ### Main Contributions 1. **Proposed a new hybrid CO - ML pipeline**, which is suitable for online scheduling and rebalancing in the AMoD system. 2. **Showed how to solve the underlying CO problem**, that is, the scheduling and rebalancing problem, within polynomial time. 3. **Proposed a method based on Structured Learning (SL)** for learning the parameterization method of the underlying CO problem from the full - information optimal solutions in historical data. 4. **Developed two online control strategies**: the Sample - Based (SB) method and the Cell - Based (CB) method, both of which utilize the hybrid CO - ML pipeline. 5. **Verified the effectiveness of the proposed strategies through empirical research** and compared them with existing methods in multiple scenarios. In conclusion, by introducing a new hybrid CO - ML pipeline, this paper provides an innovative and effective solution to the online scheduling and rebalancing problems in the AMoD system.