Combinatorial Optimization enriched Machine Learning to solve the Dynamic Vehicle Routing Problem with Time Windows

Léo Baty,Kai Jungel,Patrick S. Klein,Axel Parmentier,Maximilian Schiffer
2023-04-03
Abstract:With the rise of e-commerce and increasing customer requirements, logistics service providers face a new complexity in their daily planning, mainly due to efficiently handling same day deliveries. Existing multi-stage stochastic optimization approaches that allow to solve the underlying dynamic vehicle routing problem are either computationally too expensive for an application in online settings, or -- in the case of reinforcement learning -- struggle to perform well on high-dimensional combinatorial problems. To mitigate these drawbacks, we propose a novel machine learning pipeline that incorporates a combinatorial optimization layer. We apply this general pipeline to a dynamic vehicle routing problem with dispatching waves, which was recently promoted in the EURO Meets NeurIPS Vehicle Routing Competition at NeurIPS 2022. Our methodology ranked first in this competition, outperforming all other approaches in solving the proposed dynamic vehicle routing problem. With this work, we provide a comprehensive numerical study that further highlights the efficacy and benefits of the proposed pipeline beyond the results achieved in the competition, e.g., by showcasing the robustness of the encoded policy against unseen instances and scenarios.
Optimization and Control,Systems and Control,Machine Learning
What problem does this paper attempt to address?
The main problem that this paper attempts to solve is the Dynamic Vehicle Routing Problem (DVRP), especially the Dynamic Vehicle Routing Problem with Time Windows (DVRPTW). With the development of e - commerce and the increase in customer demands, logistics service providers are facing new and complex challenges, especially in handling same - day delivery orders. Although traditional multi - stage stochastic optimization methods can solve this problem, they are either too computationally costly in online applications or perform poorly when dealing with high - dimensional combinatorial problems, such as methods based on reinforcement learning. To overcome these drawbacks, the authors propose a new machine - learning (ML) pipeline that combines a combinatorial optimization (CO) layer to efficiently solve the dynamic vehicle routing problem. Through this method, the authors aim to provide a solution that can make high - quality scheduling decisions quickly while considering the uncertainty of future orders. Specifically, the contributions of the paper include: 1. **Methodological contributions**: The authors integrate the combinatorial optimization layer into the machine - learning pipeline, expanding the existing combinatorial - optimization - enriched machine - learning (CO - enriched ML) pipeline so that it can handle nonlinear objective functions. In addition, they also propose for the first time a CO - enriched ML pipeline that uses meta - heuristic algorithms to solve the combinatorial optimization layer. 2. **Application contributions**: The authors apply the proposed pipeline to a new dynamic vehicle routing problem introduced in the EURO Meets NeurIPS Vehicle Routing Competition. The experimental results show that this method outperforms other existing methods in solving the proposed dynamic vehicle routing problem and exhibits good robustness in unseen instances and scenarios. Through these contributions, the authors not only provide an effective solution to the challenges in dynamic vehicle routing but also demonstrate the potential of combining combinatorial optimization and machine learning in solving high - dimensional combinatorial optimization problems.