Reinforcement Learning for Intermodal Transportation Planning with Time Windows and Limited Cargo Capacity

Hadi Aghazadeh,Xin Wang
DOI: https://doi.org/10.1145/3615895.3628166
2023-01-01
Abstract:This paper addresses the enhancement of practical intermodal transportation efficiency by synergizing train and truck utilization for the seamless movement of freights across diverse geographical locations in a country. The core challenge revolves around determining the optimal allocation of freights, either individually by trucks or collectively with train mode in each location, while accounting for time windows constraints associated with each order and the inherent capacity limitations of the transportation modes. To tackle this complex optimization problem, we employ the renowned Q-Learning reinforcement learning algorithm. This enables the derivation of an optimal dispatch policy predicated on the selection of appropriate transportation modes. In order to establish a robust benchmark for comparison, we introduce three baseline metaheuristic models: Tabu Search, Simulated Annealing, and a hybrid approach merging Tabu Search with Simulated Annealing. Our methodologies undergo rigorous testing using realistic datasets of varying sizes. The outcomes distinctly demonstrate the superior performance of Q-learning over the baseline models. Furthermore, the Q-Learning approach showcases its efficacy in real-time scenario handling, even when confronted with substantial intermodal transportation challenges on a large scale.
What problem does this paper attempt to address?