Abstract: Finding optimal vehicle routes through online path planning is one of the main problems the logistics industry needs to solve. Uncertainty in the transportation system, especially in last-mile delivery of small packages, makes logistics vehicle routing more complex to compute than before. Most existing solutions make little use of newer techniques such as machine learning and rely on heuristic algorithms instead. Such solutions not only require many constraints to be set but also need long computation times in logistics networks with high demand density. To design minimum-time paths for uncertain logistics transportation, this paper proposes a new optimization strategy based on deep reinforcement learning. The strategy converts the uncertain online logistics routing problem into a vehicle path planning problem and designs an embedded pointer network to obtain the optimal solution. Given the long time needed to solve the neural network, training the parameters on supervised data is unrealistic, so this paper trains them with an unsupervised method. Because parameter training takes place offline, the strategy avoids high delay. Simulation results show that the proposed strategy effectively solves the uncertain logistics scheduling problem within limited computing time and performs significantly better than the other strategies. Compared with traditional mathematical procedures, the proposed algorithm can reduce the driving distance by 60.71%. The paper also studies how several key parameters affect the performance of the approach.
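As a rough illustration of the pointer-style decoding that the abstract alludes to, the sketch below builds a route one stop at a time by taking a masked softmax over the stops not yet visited. It is not the paper's network: the scoring rule (negative Euclidean distance), the function names `pointer_probs`, `greedy_decode`, and `route_length`, and the toy coordinates are all assumptions made for illustration, and a trained model would replace the hand-written scores with learned attention over stop embeddings.

```python
import math


def route_length(coords, route):
    """Total Euclidean length of visiting `coords` in the order given by `route`."""
    return sum(math.dist(coords[a], coords[b]) for a, b in zip(route, route[1:]))


def pointer_probs(coords, current, visited):
    """Masked softmax over candidate next stops.

    Scores are negative distances from the current stop: a hand-written
    stand-in (assumption) for the learned attention scores of a pointer network.
    """
    scores = [-math.dist(coords[current], c) for c in coords]
    # Subtract the best unvisited score for numerical stability.
    best = max(s for s, seen in zip(scores, visited) if not seen)
    exps = [0.0 if seen else math.exp(s - best) for s, seen in zip(scores, visited)]
    total = sum(exps)
    return [e / total for e in exps]


def greedy_decode(coords, start=0):
    """Build a route by repeatedly pointing at the most probable unvisited stop."""
    visited = [False] * len(coords)
    visited[start] = True
    route = [start]
    while not all(visited):
        probs = pointer_probs(coords, route[-1], visited)
        nxt = max(range(len(coords)), key=probs.__getitem__)
        visited[nxt] = True
        route.append(nxt)
    return route


# Hypothetical toy instance: four stops on a line, starting at stop 0.
stops = [(0.0, 0.0), (3.0, 0.0), (1.0, 0.0), (2.0, 0.0)]
route = greedy_decode(stops)
cost = route_length(stops, route)
```

The route cost computed at the end is the kind of label-free signal that could drive training without supervised data, for example as a negative reward; whether the paper uses exactly this signal is not stated in the abstract.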

Reinforcement Learning for Shortest Path Problem on Stochastic Time-Dependent Road Network

CTD: Cascaded Temporal Difference Learning for the Mean-Standard Deviation Shortest Path Problem

Finding Paths with Least Expected Time in Stochastic Time-Varying Networks Considering Uncertainty of Prediction Information

Routing optimization with Monte Carlo Tree Search-based multi-agent reinforcement learning

A deep reinforcement learning with dynamic spatio-temporal graph model for solving urban logistics delivery planning problems

Real-time deep reinforcement learning based vehicle navigation

Multi-Vehicle Routing Problems with Soft Time Windows: A Multi-Agent Reinforcement Learning Approach

Deep Reinforcement Learning Based Dynamic Route Planning for Minimizing Travel Time

Robust Route Planning with Distributional Reinforcement Learning in a Stochastic Road Network Environment

Route Guidance Systems Based on Real-Time Information

Distributed Adaptive Reinforcement Learning: A Method for Optimal Routing

A knowledge-assisted reinforcement learning optimization for road network design problems under uncertainty

Tensor-Based Reinforcement Learning for Network Routing

Optimising Stochastic Routing for Taxi Fleets with Model Enhanced Reinforcement Learning

Cooperative Path Planning with Asynchronous Multiagent Reinforcement Learning

Data-Driven Optimization for Dynamic Shortest Path Problem Considering Traffic Safety

Reinforcement Learning for Solving Stochastic Vehicle Routing Problem with Time Windows

Reinforcement Learning-based Approach for Dynamic Vehicle Routing Problem with Stochastic Demand

Application of Deep Reinforcement Learning Algorithm in Uncertain Logistics Transportation Scheduling

Significant Sampling for Shortest Path Routing: A Deep Reinforcement Learning Solution

Population Game-Assisted Multi-Agent Reinforcement Learning Method for Dynamic Multi-Vehicle Route Selection