A Decomposition-Based Q-Learning Evolutionary Algorithm for a Transportation-Assembly Collaborative Optimization Problem

Rong Hu, Teng-Fei Zhang, Zi-Qi Zhang, Bin Qian, Ling Wang, Jian-Bo Yang
DOI: https://doi.org/10.2139/ssrn.4263890
2022-01-01
SSRN Electronic Journal
Abstract: With the upgrading of the manufacturing industry to smart manufacturing, the coordination between transportation and assembly has become a core competitive factor for manufacturing enterprises. This paper studies a transportation-assembly collaborative optimization problem (TACOP) in which minimizing cycle time (CT) is the primary objective and minimizing transportation cost (TC) and inventory level (IL) are the secondary objectives. Since TACOP consists of two coupled sub-problems, namely the vehicle routing problem (VRP) and the simple assembly line balancing problem (SALBP), a decomposition-based Q-learning evolutionary algorithm (DQEA) is designed to solve it. First, the assembly agent addresses the SALBP and obtains an assembly scheme with a high-quality CT; this scheme is then extended into a diverse set of assembly solutions with the same CT but different arrangements. Second, these assembly solutions are sliced by a load-balancing slicing method (SM_LB) to produce a set of VRP sub-problems, which also helps reduce IL. Third, since these VRPs are independent, the transportation agent can solve them in parallel, quickly yielding the transportation solutions that correspond to the assembly solutions. For both the SALBP and the VRPs, the agent adopts an off-policy learning scheme to execute global and local search adaptively, learning valuable information from good actions and quickly guiding the search toward promising regions. Extensive experiments and computational comparisons are conducted to confirm the effectiveness and efficiency of DQEA in solving TACOP.
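The off-policy, adaptive choice between global and local search described in the abstract can be illustrated with a minimal tabular Q-learning sketch. This is not the authors' DQEA: the state names, operator names, reward values, and the parameters alpha, gamma, and epsilon are all assumptions made for illustration, since the abstract does not specify them.

```python
import random

# Hypothetical operator and state names; the abstract does not define them.
ACTIONS = ("global_search", "local_search")
STATES = ("improved", "stagnated")


def new_q_table():
    """Create a Q-table with every state-action value set to zero."""
    return {s: {a: 0.0 for a in ACTIONS} for s in STATES}


def choose_action(q, state, epsilon, rng):
    """Epsilon-greedy behaviour policy: explore with probability epsilon."""
    if rng.random() < epsilon:
        return rng.choice(ACTIONS)
    return max(ACTIONS, key=lambda a: q[state][a])


def q_update(q, state, action, reward, next_state, alpha=0.1, gamma=0.9):
    """Standard Q-learning update.

    It is off-policy because the target uses the greedy value of the next
    state, regardless of which action the behaviour policy takes next.
    """
    best_next = max(q[next_state].values())
    q[state][action] += alpha * (reward + gamma * best_next - q[state][action])
    return q[state][action]


if __name__ == "__main__":
    rng = random.Random(0)
    q = new_q_table()
    # Assumed reward: 1.0 when the chosen operator improved the solution.
    q_update(q, "stagnated", "local_search", 1.0, "improved", alpha=0.5)
    print(choose_action(q, "stagnated", epsilon=0.0, rng=rng))
```

In an evolutionary loop, each generation would observe the current search state, pick an operator with `choose_action`, apply it to the population, and feed the resulting improvement back through `q_update`, so operators that recently paid off are selected more often.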