Abstract:The Dynamic Pickup and Delivery Problem (DPDP) is aimed at dynamically scheduling vehicles among multiple sites in order to minimize the cost when delivery orders are not known a priori. Although DPDP plays an important role in modern logistics and supply chain management, state-of-the-art DPDP algorithms are still limited on their solution quality and efficiency. In practice, they fail to provide a scalable solution as the numbers of vehicles and sites become large. In this paper, we propose a data-driven approach, Spatial-Temporal Aided Double Deep Graph Network (ST-DDGN), to solve industry-scale DPDP. In our method, the delivery demands are first forecast using spatial-temporal prediction method, which guides the neural network to perceive spatial-temporal distribution of delivery demand when dispatching vehicles. Besides, the relationships of individuals such as vehicles are modelled by establishing a graph-based value function. ST-DDGN incorporates attention-based graph embedding with Double DQN (DDQN). As such, it can make the inference across vehicles more efficiently compared with traditional methods. Our method is entirely data driven and thus adaptive, i.e., the relational representation of adjacent vehicles can be learned and corrected by ST-DDGN from data periodically. We have conducted extensive experiments over real-world data to evaluate our solution. The results show that ST-DDGN reduces 11.27% number of the used vehicles and decreases 13.12% total transportation cost on average over the strong baselines, including the heuristic algorithm deployed in our UAT (User Acceptance Test) environment and a variety of vanilla DRL methods. We are due to fully deploy our solution into our online logistics system and it is estimated that millions of USD logistics cost can be saved per year.

Solving two-stage stochastic route-planning problem in milliseconds via end-to-end deep learning

Solving Online Food Delivery Problem via an Effective Hybrid Algorithm with Intelligent Batching Strategy.

Application of Deep Reinforcement Learning Algorithm in Uncertain Logistics Transportation Scheduling

Solving Stochastic Online Food Delivery Problem Via Iterated Greedy Algorithm with Decomposition-Based Strategy

Two-Stage Solution for Meal Delivery Routing Optimization on Time-Sensitive Customer Satisfaction

A Four-stage Heuristic Algorithm for Solving On-demand Meal Delivery Routing Problem

Solving Large-Scale Dynamic Vehicle Routing Problems with Stochastic Requests

A Two-stage Learning-based Method for Large-scale On-demand Pickup and Delivery Services with Soft Time Windows

Online food ordering delivery strategies based on deep reinforcement learning

A Deep Reinforcement Learning Based Real-Time Solution Policy for the Traveling Salesman Problem

Learning to Optimize Industry-Scale Dynamic Pickup and Delivery Problems

A Deep Reinforcement Learning Approach for Online Parcel Assignment

Online Vehicle Routing With Neural Combinatorial Optimization and Deep Reinforcement Learning

A Matching Algorithm with Reinforcement Learning and Decoupling Strategy for Order Dispatching in On-Demand Food Delivery

An XGBoost-enhanced Fast Constructive Algorithm for Food Delivery Route Planning Problem

An Effective Iterated Greedy Algorithm for Online Route Planning Problem

Data-driven optimization for last-mile delivery

Multi-Stage Vehicle Dispatch for Community Group-buying Logistics via Deep Reinforcement Learning

Online Parallel Optimization Approach to Courier Routing Problems.

Solving the vehicle-drone pickup and delivery problem in road congestion: A heuristic and its deep reinforcement learning-based improvement

Optimizing Online Matching for Ride-Sourcing Services with Multi-Agent Deep Reinforcement Learning