Abstract:In real-world express systems, couriers need to satisfy not only the delivery demands but also the pick-up demands of customers. Delivery and pickup tasks are usually mixed together within integrated routing plans. Such a mixed routing problem can be abstracted and formulated as Vehicle Routing Problem with Mixed Delivery and Pickup (VRPMDP), which is an NP-hard combinatorial optimization problem. To solve VRPMDP, there are three major challenges as below. (a) Even though successive pickup and delivery tasks are independent to accomplish, the inter-influence between choosing pickup task or delivery task to deal with still exists. (b) Due to the two-way flow of goods between the depot and customers, the loading rate of vehicles leaving the depot affects routing decisions. (c) The proportion of deliveries and pickups will change due to the complex demand situation in real-world scenarios, which requires robustness of the algorithm. To solve the challenges above, we design an encoder-decoder based framework to generate high-quality and robust VRPMDP solutions. First, we consider a VRPMDP instance as a graph and utilize a GNN encoder to extract the feature of the instance effectively. The detailed routing solutions are further decoded as a sequence by the decoder with attention mechanism. Second, we propose a Coordinated Decision of Loading and Routing (CDLR) mechanism to determine the loading rate dynamically after the vehicle returns to the depot, thus avoiding the influence of improper loading rate settings. Finally, the model equipped with a GNN encoder and CDLR simultaneously can adapt to the changes in the proportion of deliveries and pickups. We conduct the experiments to demonstrate the effectiveness of our model. The experiments show that our method achieves desirable results and generalization ability.

Deep Reinforcement Learning-Based Multi-Agent Algorithm for Vehicle Routing Problem in Complex Logistics Scenarios

Learning to Cooperate: Application of Deep Reinforcement Learning for Online AGV Path Finding.

Logistics Distribution Route Optimization With Time Windows Based on Multi-Agent Deep Reinforcement Learning

A multi-agent deep reinforcement learning approach for solving the multi-depot vehicle routing problem

Multi-Vehicle Routing Problems with Soft Time Windows: A Multi-Agent Reinforcement Learning Approach

Applying Artificial Bee Colony Algorithm to the Multidepot Vehicle Routing Problem.

Reinforcement Learning for Solving Multiple Vehicle Routing Problem with Time Window

Multi-Task Multi-Objective Evolutionary Search Based on Deep Reinforcement Learning for Multi-Objective Vehicle Routing Problems with Time Windows

A Hybrid of Deep Reinforcement Learning and Local Search for the Vehicle Routing Problems

A deeper look back at Y

Deep Reinforcement Learning for Solving the Heterogeneous Capacitated Vehicle Routing Problem

Deep Reinforcement Learning-based Multi-AMR Path Planning Algorithm

Deep Reinforcement Learning for Multi-Truck Vehicle Routing Problems with Multi-Leg Demand Routes

Deep Reinforcement Learning Algorithm for Fast Solutions to Vehicle Routing Problem with Time-Windows

Multiobjective Vehicle Routing Optimization with Time Windows: A Hybrid Approach Using Deep Reinforcement Learning and NSGA-II

Graph attention reinforcement learning with flexible matching policies for multi-depot vehicle routing problems

Reinforcement Learning for Practical Express Systems with Mixed Deliveries and Pickups

Real-Time Multi-Vehicle Scheduling in Tasks With Dependency Relationships Using Multi-Agent Reinforcement Learning

MAPDP: Cooperative Multi-Agent Reinforcement Learning to Solve Pickup and Delivery Problems

Solving the Vehicle Routing Problem with Stochastic Travel Cost Using Deep Reinforcement Learning

Multi-agent reinforcement learning-based dynamic task assignment for vehicles in urban transportation system