Joint optimization of steel plate shuffling and truck loading sequencing based on deep reinforcement learning

Zhezhuang Xu,Jinlong Wang,Meng Yuan,Yazhou Yuan,Boyu Chen,Qingdong Zhang,Cailian Chen,Xinping Guan
DOI: https://doi.org/10.1016/j.aei.2024.102392
IF: 8.8
2024-02-03
Advanced Engineering Informatics
Abstract:Steel plate is one of the most valuable steel products which is highly customized in specification according to the demands of users. In this case, the outbound scheduling of steel plates is a challenging issue since its efficiency and complexity are impacted by both steel plate shuffling and truck loading sequencing. To overcome this challenge, we propose to jointly optimize steel plate shuffling and truck loading sequencing (SPS-TLS) by utilizing the data of steel plates and trucks collected by Industrial Internet of Things (IIoT). The SPS-TLS problem is firstly transformed as an orders scheduling problem which is formulated as a mixed-integer linear programming (MILP) model. Then an alternating iteration algorithm based on deep reinforcement learning (AltDRL) is proposed to solve the SPS-TLS problem. In AltDRL, the deep Q network (DQN) with prioritized experience replay (PER) and the heuristic algorithm are combined to iteratively obtain the near-optimal shuffling position of blocking plates and truck sequence. Experiments are executed based on data collected from a real steel logistics park. The results confirm that AltDRL can significantly reduce the number of plate shuffles and improve the outbound scheduling efficiency of steel plates.
engineering, multidisciplinary,computer science, artificial intelligence
What problem does this paper attempt to address?