Abstract:With the rise of e-commerce and increasing customer requirements, logistics service providers face a new complexity in their daily planning, mainly due to efficiently handling same day deliveries. Existing multi-stage stochastic optimization approaches that allow to solve the underlying dynamic vehicle routing problem are either computationally too expensive for an application in online settings, or -- in the case of reinforcement learning -- struggle to perform well on high-dimensional combinatorial problems. To mitigate these drawbacks, we propose a novel machine learning pipeline that incorporates a combinatorial optimization layer. We apply this general pipeline to a dynamic vehicle routing problem with dispatching waves, which was recently promoted in the EURO Meets NeurIPS Vehicle Routing Competition at NeurIPS 2022. Our methodology ranked first in this competition, outperforming all other approaches in solving the proposed dynamic vehicle routing problem. With this work, we provide a comprehensive numerical study that further highlights the efficacy and benefits of the proposed pipeline beyond the results achieved in the competition, e.g., by showcasing the robustness of the encoded policy against unseen instances and scenarios.

What problem does this paper attempt to address?

The main problem that this paper attempts to solve is the Dynamic Vehicle Routing Problem (DVRP), especially the Dynamic Vehicle Routing Problem with Time Windows (DVRPTW). With the development of e - commerce and the increase in customer demands, logistics service providers are facing new and complex challenges, especially in handling same - day delivery orders. Although traditional multi - stage stochastic optimization methods can solve this problem, they are either too computationally costly in online applications or perform poorly when dealing with high - dimensional combinatorial problems, such as methods based on reinforcement learning. To overcome these drawbacks, the authors propose a new machine - learning (ML) pipeline that combines a combinatorial optimization (CO) layer to efficiently solve the dynamic vehicle routing problem. Through this method, the authors aim to provide a solution that can make high - quality scheduling decisions quickly while considering the uncertainty of future orders. Specifically, the contributions of the paper include: 1. **Methodological contributions**: The authors integrate the combinatorial optimization layer into the machine - learning pipeline, expanding the existing combinatorial - optimization - enriched machine - learning (CO - enriched ML) pipeline so that it can handle nonlinear objective functions. In addition, they also propose for the first time a CO - enriched ML pipeline that uses meta - heuristic algorithms to solve the combinatorial optimization layer. 2. **Application contributions**: The authors apply the proposed pipeline to a new dynamic vehicle routing problem introduced in the EURO Meets NeurIPS Vehicle Routing Competition. The experimental results show that this method outperforms other existing methods in solving the proposed dynamic vehicle routing problem and exhibits good robustness in unseen instances and scenarios. Through these contributions, the authors not only provide an effective solution to the challenges in dynamic vehicle routing but also demonstrate the potential of combining combinatorial optimization and machine learning in solving high - dimensional combinatorial optimization problems.

Combinatorial Optimization enriched Machine Learning to solve the Dynamic Vehicle Routing Problem with Time Windows

Combinatorial Optimization-Enriched Machine Learning to Solve the Dynamic Vehicle Routing Problem with Time Windows

Combinatorial Optimization and Machine Learning for Dynamic Inventory Routing

Online Vehicle Routing With Neural Combinatorial Optimization and Deep Reinforcement Learning

Learning-based Online Optimization for Autonomous Mobility-on-Demand Fleet Control

Logistics Distribution Route Optimization With Time Windows Based on Multi-Agent Deep Reinforcement Learning

Multi-Vehicle Routing Problems with Soft Time Windows: A Multi-Agent Reinforcement Learning Approach

Reinforcement Learning-based Approach for Dynamic Vehicle Routing Problem with Stochastic Demand

Learn to Solve Vehicle Routing Problems ASAP: A Neural Optimization Approach for Time-Constrained Vehicle Routing Problems with Finite Vehicle Fleet

A Deep Reinforcement Learning-Based Adaptive Search for Solving Time-Dependent Green Vehicle Routing Problem

Dynamic collaborative truck-drone delivery with en-route synchronization and random requests

Dynamic Vehicle-Cargo Matching Based on Adaptive Time Windows.

Reinforcement Learning for Solving Multiple Vehicle Routing Problem with Time Window

Fast Approximate Solutions using Reinforcement Learning for Dynamic Capacitated Vehicle Routing with Time Windows

Routing Optimization with Vehicle-Customer Coordination

A Two-stage Learning-based Method for Large-scale On-demand Pickup and Delivery Services with Soft Time Windows

Solving Large-Scale Dynamic Vehicle Routing Problems with Stochastic Requests

Application of Deep Reinforcement Learning Algorithm in Uncertain Logistics Transportation Scheduling

A Chronological and Cooperative Route Optimization Method for Heterogeneous Vehicle Routing Problem

Multi-Robot Routing with Time Windows: A Column Generation Approach

Optimization-based Learning for Dynamic Load Planning in Trucking Service Networks