Abstract:A fundamental question in any peer-to-peer ridesharing system is how to, both effectively and efficiently, dispatch user's ride requests to the right driver in real time. Traditional rule-based solutions usually work on a simplified problem setting, which requires a sophisticated hand-crafted weight design for either centralized authority control or decentralized multi-agent scheduling systems. Although recent approaches have used reinforcement learning to provide centralized combinatorial optimization algorithms with informative weight values, their single-agent setting can hardly model the complex interactions between drivers and orders. In this paper, we address the order dispatching problem using multi-agent reinforcement learning (MARL), which follows the distributed nature of the peer-to-peer ridesharing problem and possesses the ability to capture the stochastic demand-supply dynamics in large-scale ridesharing scenarios. Being more reliable than centralized approaches, our proposed MARL solutions could also support fully distributed execution through recent advances in the Internet of Vehicles (IoV) and the Vehicle-to-Network (V2N). Furthermore, we adopt the mean field approximation to simplify the local interactions by taking an average action among neighborhoods. The mean field approximation is capable of globally capturing dynamic demand-supply variations by propagating many local interactions between agents and the environment. Our extensive experiments have shown the significant improvements of MARL order dispatching algorithms over several strong baselines on the accumulated driver income (ADI), and order response rate measures. Besides, the simulated experiments with real data have also justified that our solution can alleviate the supply-demand gap during the rush hours, thus possessing the capability of reducing traffic congestion.

Multi-Objective Distributional Reinforcement Learning for Large-Scale Order Dispatching.

Spatial-temporal Pricing for Ride-Sourcing Platform with Reinforcement Learning

Multi-Agent Reinforcement Learning for Order-dispatching via Order-Vehicle Distribution Matching

Target-Value-Competition-Based Multi-Agent Deep Reinforcement Learning Algorithm for Distributed Nonconvex Economic Dispatch

Large-Scale Order Dispatch in On-Demand Ride-Hailing Platforms: A Learning and Planning Approach

An End-to-End Reinforcement Learning Based Approach for Micro-View Order-Dispatching in Ride-Hailing

Efficient Ridesharing Order Dispatching with Mean Field Multi-Agent Reinforcement Learning

Dynamic Order Dispatching With Multiobjective Reward Learning

Supply-Demand-aware Deep Reinforcement Learning for Dynamic Fleet Management

Deep Dispatching: A Deep Reinforcement Learning Approach for Vehicle Dispatching on Online Ride-Hailing Platform

Multi-Agent Mix Hierarchical Deep Reinforcement Learning for Large-Scale Fleet Management

NondBREM: Nondeterministic Offline Reinforcement Learning for Large-Scale Order Dispatching

A Distributed Model-Free Ride-Sharing Approach for Joint Matching, Pricing, and Dispatching using Deep Reinforcement Learning

A Clustering-Based Multi-Agent Reinforcement Learning Framework for Finer-Grained Taxi Dispatching

A Deep Value-network Based Approach for Multi-Driver Order Dispatching

A Hierarchical Reinforcement Learning Based Optimization Framework for Large-scale Dynamic Pickup and Delivery Problems

Scalable Deep Reinforcement Learning for Ride-Hailing

Multi-Stage Vehicle Dispatch for Community Group-buying Logistics via Deep Reinforcement Learning

Vehicle Dispatching and Routing of On-Demand Intercity Ride-Pooling Services: A Multi-Agent Hierarchical Reinforcement Learning Approach

Promoting Collaborative Dispatching in the Ride-Sourcing Market With a Third-Party Integrator

Online food ordering delivery strategies based on deep reinforcement learning