Deep Reinforcement Learning Based Optimization Algorithm for Permutation Flow-Shop Scheduling

Zixiao Pan, Ling Wang, Jingjing Wang, Jiawen Lu
DOI: https://doi.org/10.1109/tetci.2021.3098354
2021-01-01
IEEE Transactions on Emerging Topics in Computational Intelligence
Abstract: As a paradigm analogous to the human learning process, reinforcement learning (RL) has become an emerging topic in computational intelligence (CI). Combining RL with CI is a promising way to develop efficient algorithms for complex combinatorial optimization (CO) problems such as machine scheduling. In this paper, we propose an efficient optimization algorithm based on deep RL for the permutation flow-shop scheduling problem (PFSP), with the objective of minimizing the maximum completion time (makespan). First, a new deep neural network (PFSPNet) is designed for the PFSP to produce end-to-end output without any limitation on problem size. Second, an actor-critic RL method is used to train the PFSPNet without relying on the collection of high-quality labelled data. Third, an improvement strategy is designed to refine the solution provided by the PFSPNet. Simulation results and statistical comparisons show that the proposed deep-RL-based optimization algorithm obtains better results than existing heuristics in similar computational time when solving the PFSP.
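To make the objective concrete, the following is a minimal sketch of how the maximum completion time of a given job permutation can be evaluated in a permutation flow shop, using the standard completion-time recurrence. It is not taken from the paper: the function name `makespan`, the `proc[job][machine]` layout, and the small instance `PROC` are assumptions chosen for illustration, and the sketch does not cover PFSPNet, its training, or the improvement strategy.

```python
from typing import Sequence


def makespan(perm: Sequence[int], proc: Sequence[Sequence[int]]) -> int:
    """Return the maximum completion time of a job permutation.

    proc[j][m] is the processing time of job j on machine m (assumed layout).
    Every job visits the machines in the same order 0, 1, ..., M-1, and
    every machine processes the jobs in the order given by perm.
    """
    n_machines = len(proc[0])
    # finish[m]: completion time on machine m of the most recently scheduled job
    finish = [0] * n_machines
    for job in perm:
        for m in range(n_machines):
            # A job can start on machine m once machine m is free and the
            # same job has finished on machine m - 1.
            ready = finish[m - 1] if m > 0 else 0
            finish[m] = max(finish[m], ready) + proc[job][m]
    return finish[-1]


# Hypothetical instance: 3 jobs, 2 machines.
PROC = [
    [3, 2],  # job 0
    [1, 4],  # job 1
    [2, 1],  # job 2
]

print(makespan([0, 1, 2], PROC))  # 10
print(makespan([1, 0, 2], PROC))  # 8
```

On this toy instance, reordering the jobs from (0, 1, 2) to (1, 0, 2) reduces the makespan from 10 to 8, which is the kind of difference between permutations that a scheduling algorithm for the PFSP tries to exploit.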