Multimodal transportation routing optimization based on multi-objective Q-learning under time uncertainty

Tie Zhang,Jia Cheng,Yanbiao Zou
DOI: https://doi.org/10.1007/s40747-023-01308-9
IF: 6.7
2024-01-16
Complex & Intelligent Systems
Abstract:Abstract Multimodal transportation is a modern way of cargo transportation. With the increasing demand for cargo transportation, higher requirements are being placed on multimodal transportation multi-objective routing optimization. In multimodal transportation multi-objective routing optimization, in response to the limitations of classical algorithms in solving large-scale problems with multiple nodes and modes of transport, the limitations of directed transportation networks in the application, and the uncertainty of transport time, this paper proposes an optimization framework based on multi-objective weighted sum Q -learning, combined with the proposed undirected multiple-node network, and characterizes the uncertainty of time with a positively skewed distribution. The undirected multiple-node transportation network can better simulate cargo transportation and characterize transfer information, facilitate the modification of origin and destination, and avoid suboptimal solutions due to the manual setting of wrong route directions. The network is combined with weighted sum Q -learning to solve multimodal transportation multi-objective routing optimization problems faster and better. When modeling the uncertainty of transport time, a positively skewed distribution is used. The three objectives of transport cost, carbon emission cost, and transport time were studied and compared with PSO, GA, AFO, NSGA-II, and MOPSO. The experimental results show that compared with PSO, GA, and AFO using a directed transportation network, the proposed method has a significant improvement in optimization results and running time, and the running time is shortened by 26 times. The proposed method can better solve the boundary of the Pareto front and dominate the partial solutions of NSGA-II and MOPSO. The effect of time uncertainty on the performance of the algorithm is more significant in transport orders with high time weight. With the increase in uncertainty, the reliability of the route decreases. The effectiveness of the proposed method is verified.
computer science, artificial intelligence
What problem does this paper attempt to address?
The problem that this paper attempts to solve is the multi - objective path optimization problem in multimodal transportation, especially in the case of uncertain transportation time. Specifically, the paper focuses on how to find the optimal transportation path while considering the three objectives of transportation cost, carbon emission cost and transportation time. The paper points out that traditional algorithms have limitations in solving problems with large - scale multi - nodes and multiple transportation modes. Especially when the transportation network is modeled as a directed graph, sub - optimal solutions are likely to occur and the route direction needs to be set manually. In addition, there are uncertainties in the actual transportation process, such as changes in transportation time, which make the problem more complicated. To meet these challenges, the paper proposes an optimization framework based on multi - objective weighted sum and Q - learning. This framework combines an undirected multi - node transportation network model to more realistically simulate the cargo transportation process, and uses a positively skewed distribution to describe the uncertainty of transportation time. In this way, the paper aims to provide a faster and more effective method for solving the multi - objective path optimization problem in multimodal transportation, especially in the case of uncertain transportation time. The main contributions of the paper include: 1. Proposing an undirected multi - node multimodal transportation network model, which solves the sub - optimal solution problem caused by incorrect manual setting of route directions, and the problem of needing to modify or re - establish the network when the starting point or destination of the goods changes. 2. Constructing a multimodal transportation model with time uncertainty based on a positively skewed distribution to better describe the uncertainty of transportation time. 3. Combining the established network model, proposing an optimization framework based on multi - objective weighted sum and Q - learning for solving the multimodal transportation path optimization problem under uncertain conditions, and analyzing its convergence. Through these methods, the paper not only improves the performance of the algorithm in optimization results and running time, but also verifies the effectiveness of the proposed method, especially when dealing with transportation orders with high time weights, the performance of the algorithm is more significantly affected by time uncertainty. As the uncertainty increases, the reliability of the route will decrease, but the proposed framework can better solve the boundary problem of the Pareto front and is superior to some NSGA - II and MOPSO solutions.