The Rich and the Poor: A Markov Decision Process Approach to Optimizing Taxi Driver Revenue Efficiency

Huigui Rong,Xun Zhou,Chang Yang,Zubair Shafiq,Alex Liu
DOI: https://doi.org/10.1145/2983323.2983689
2016-01-01
Abstract:Taxi services play an important role in the public transportation system of large cities. Improving taxi business efficiency is an important societal problem since it could improve the income of the drivers and reduce gas emissions and fuel consumption. In recent years many big taxi transaction datasets have been collected and used in business analytics to recommend business strategies for taxi drivers. However, most of these analytical approaches only considered how to maximize the pickup chance, energy efficiency, or profit for the immediate next passenger/trip when recommending seeking routes or seeking locations. Their recommended seeking strategy may not be optimal for the overall revenue over an extended period of time as they ignored the important impact of passengers' destinations on future passenger seeking. To address these issues, this paper investigates how to increase the revenue efficiency (revenue per unit time) of taxi drivers. Given a set of historical taxi records and the current status of a vacant taxi, the goal of our work in this paper is to find the best move for this taxi to maximize the revenue in an extended business time window in the near future. This problem is challenging due to the large number of possible scenarios and uncertainty on both the taxi side and the passenger side. To solve this problem we model the passenger seeking process as a Markov Decision Process (MDP). For each one-hour time slot, we learn a different set of parameters for the MDP from data and find the best move for a vacant taxi to maximize the total revenue in that time slot. A case study and several experimental evaluations on a real dataset from a major city in China show that our proposed approach improves the revenue efficiency of inexperienced drivers by up to 15% and outperforms a baseline method in all the time slots.
What problem does this paper attempt to address?