A Bidirectional Parameter Transfer Reinforcement Learning Approach for Bi-Objectives Traveling Salesman Problem

Hao Gao, Xin Xu, Changxin Zhang, Xing Zhou
DOI: https://doi.org/10.1109/ACAIT56212.2022.10137925
2022-01-01
Abstract: In recent years, learning-based approaches for solving combinatorial optimization problems have received increasing research interest. However, solving multi-objective optimization problems (MOPs) remains challenging. In this paper, we propose a bidirectional parameter transfer, attention-based reinforcement learning approach for the bi-objective traveling salesman problem (BOTSP). The approach is based on a dynamic context attention neural network trained with the rollout REINFORCE algorithm. Specifically, BOTSP is first decomposed into a series of static subproblems, and bidirectional parameter transfer methods are then proposed for training the subproblems sequentially. Once the model has been trained, Pareto optimal solutions can be obtained on problem instances of different scales. Extensive experiments on BOTSP were conducted to illustrate the effectiveness and advantages of the proposed approach. Compared with several algorithms, the proposed method achieves state-of-the-art performance in hypervolume and inference efficiency. In particular, the method applies to problem instances of different scales without extra learning, and experimental results demonstrate strong generalization ability across tasks.
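To make the decomposition and the hypervolume metric concrete, here is a minimal Python sketch. The abstract does not state which scalarization is used to decompose BOTSP, so the evenly spaced weighted-sum weights below are an assumption, and the function names (weight_vectors, pareto_front, hypervolume_2d) are hypothetical helpers rather than anything from the paper. Both objectives are assumed to be minimized.

```python
from typing import List, Tuple

Point = Tuple[float, float]  # (objective 1, objective 2), both minimized


def weight_vectors(n: int) -> List[Point]:
    """Evenly spaced weight pairs (w1, w2) with w1 + w2 = 1.

    Assumption: one scalar subproblem per weight pair (weighted sum).
    Requires n >= 2.
    """
    return [(i / (n - 1), 1 - i / (n - 1)) for i in range(n)]


def pareto_front(points: List[Point]) -> List[Point]:
    """Return the non-dominated points, sorted by the first objective."""
    front: List[Point] = []
    best_f2 = float("inf")
    # After sorting by f1 (then f2), a point is non-dominated only if it
    # strictly improves on the best f2 seen so far.
    for f1, f2 in sorted(set(points)):
        if f2 < best_f2:
            front.append((f1, f2))
            best_f2 = f2
    return front


def hypervolume_2d(points: List[Point], ref: Point) -> float:
    """Area dominated by the points and bounded by the reference point."""
    front = [p for p in pareto_front(points) if p[0] < ref[0] and p[1] < ref[1]]
    hv = 0.0
    prev_f2 = ref[1]
    # Sweep left to right; each point adds one horizontal strip.
    for f1, f2 in front:
        hv += (ref[0] - f1) * (prev_f2 - f2)
        prev_f2 = f2
    return hv
```

For example, hypervolume_2d([(1, 3), (2, 2), (3, 1), (3, 3)], (4, 4)) ignores the dominated point (3, 3) and sums three strips of area 3, 2 and 1. A larger hypervolume means the solution set covers more of the objective space relative to the chosen reference point.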