Deep Reinforcement Learning for Demand Driven Services in Logistics and Transportation Systems: A Survey

Zefang Zong,Jingwei Wang,Tao Feng,Tong Xia,Depeng Jin,Yong Li
2024-10-28
Abstract:Recent technology development brings the boom of numerous new Demand-Driven Services (DDS) into urban lives, including ridesharing, on-demand delivery, express systems and warehousing. In DDS, a service loop is an elemental structure, including its service worker, the service providers and corresponding service targets. The service workers should transport either people or parcels from the providers to the target locations. Various planning tasks within DDS can thus be classified into two individual stages: 1) Dispatching, which is to form service loops from demand/supply distributions, and 2) Routing, which is to decide specific serving orders within the constructed loops. Generating high-quality strategies in both stages is important to develop DDS but faces several challenges. Meanwhile, deep reinforcement learning (DRL) has been developed rapidly in recent years. It is a powerful tool to solve these problems since DRL can learn a parametric model without relying on too many problem-based assumptions and optimize long-term effects by learning sequential decisions. In this survey, we first define DDS, then highlight common applications and important decision/control problems within. For each problem, we comprehensively introduce the existing DRL solutions. We also introduce open simulation environments for development and evaluation of DDS applications. Finally, we analyze remaining challenges and discuss further research opportunities in DRL solutions for DDS.
Machine Learning,Artificial Intelligence
What problem does this paper attempt to address?
The paper attempts to address the application of Deep Reinforcement Learning (DRL) techniques to optimize dispatching and routing tasks in demand-driven services (DDS) within logistics and transportation systems. Specifically, the paper focuses on the following two main issues: 1. **Dispatching Problem**: - How to form service loops based on the distribution of demand and supply. This involves matching service workers with demand, i.e., how to optimally allocate tasks given the current state of demand and workers. - Considering future demand distribution to achieve long-term optimization. - How to manage idle service workers by assigning them to other areas, forming a fleet management problem. 2. **Routing Problem**: - In the already formed service loops, how to determine the specific execution order. This involves optimizing the sequence of visiting demand points to minimize the total travel distance and reduce worker costs. - This problem can be traced back to the classic Traveling Salesman Problem (TSP) and Vehicle Routing Problem (VRP) and their variants. The paper points out that traditional exact solution methods have limitations in handling large-scale, complex scenarios, while heuristic and meta-heuristic methods, although capable of generating approximate solutions within a reasonable time, still have room for improvement. Therefore, the paper explores how to use Deep Reinforcement Learning (DRL) techniques to solve these problems, as DRL can learn strategies from experience, optimize long-term effects, and does not require excessive problem assumptions.