Dual-objective Optimization of Taxi Dispatching in Simulated Road Network Based on Reinforcement Learning

Mingbo Yang, Kai Zhang, Yuhong Yuan, Yu Liang, Yuhan Dong
DOI: https://doi.org/10.1117/12.3039914
2024-01-01
Abstract: The surge in urban population has created an imbalance between residents' travel demand and available taxi resources in certain spatio-temporal contexts. This paper investigates the use of reinforcement learning (RL) to improve taxi dispatching, with particular emphasis on passenger and driver satisfaction. The optimization objective is to maximize revenue while simultaneously minimizing waiting times. We introduce a novel dual-objective optimization system for taxi dispatching based on RL. The system comprises three core modules: the traffic environment simulation module, the mathematical modeling module, and the RL-based dispatching optimization module. We design the RL reward models specifically to ensure thorough optimization of taxi scheduling. Stability is critical in urban taxi scheduling because the state and action spaces vary widely under dynamic environmental conditions. Our RL model, based on A3C, simplifies strategy adaptation by learning a single unified strategy, and it improves algorithmic stability by averaging gradients across all agents.
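The abstract does not give the reward formula or the update rule, so the following is only a minimal sketch of the two ideas it names: a single reward that trades revenue against waiting time, and a shared update built from the mean of all agents' gradients. The weighted-sum form, the weights `w_revenue` and `w_wait`, and the function names are illustrative assumptions, not the authors' model.

```python
from typing import List, Sequence


def dual_objective_reward(revenue: float, wait_minutes: float,
                          w_revenue: float = 1.0, w_wait: float = 0.1) -> float:
    """Hypothetical scalarized reward: credit revenue, penalize waiting time.

    The weighted-sum form and the default weights are assumptions made
    for illustration; the abstract does not specify them.
    """
    return w_revenue * revenue - w_wait * wait_minutes


def average_gradients(per_agent_grads: Sequence[Sequence[float]]) -> List[float]:
    """Element-wise mean of the gradient vectors reported by all agents."""
    if not per_agent_grads:
        raise ValueError("need at least one agent gradient")
    n_agents = len(per_agent_grads)
    # zip(*...) groups the i-th component of every agent's gradient together.
    return [sum(component) / n_agents for component in zip(*per_agent_grads)]


def apply_update(params: Sequence[float], grads: Sequence[float],
                 lr: float = 0.01) -> List[float]:
    """One gradient-descent step on the shared parameters."""
    return [p - lr * g for p, g in zip(params, grads)]


# Two agents report loss gradients for the same two shared parameters.
shared_grad = average_gradients([[1.0, 2.0], [3.0, 4.0]])
new_params = apply_update([1.0, 1.0], shared_grad, lr=0.5)
```

Averaging before the update means every agent contributes to one shared set of parameters, which is the sense in which a single unified strategy is learned; a production A3C setup would apply updates asynchronously and use a neural-network library rather than plain lists.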