A Response Surface Model Approach to Parameter Estimation of Reinforcement Learning for the Travelling Salesman Problem

André L. C. Ottoni, Erivelton G. Nepomuceno, Marcos S. de Oliveira
DOI: https://doi.org/10.1007/s40313-018-0374-y
2018-03-02
Journal of Control, Automation and Electrical Systems
Abstract: This paper reports the use of a response surface model (RSM) and reinforcement learning (RL) to solve the travelling salesman problem (TSP). In contrast to heuristic approaches to estimating the parameters of RL, the method proposed here allows a systematic estimation of the learning rate and discount factor parameters. The Q-learning and SARSA algorithms were applied to standard problems from the TSPLIB library. Computational results demonstrate that the use of RSM is capable of producing better solutions to both symmetric and asymmetric tests of TSP.
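To make the roles of the two estimated parameters concrete, the following minimal sketch shows the standard tabular Q-learning and SARSA update rules, in which the learning rate (alpha) and discount factor (gamma) appear. The TSP encoding used here (state = current city, action = next city, reward = negative travel distance) and all function and variable names are illustrative assumptions, not details taken from the paper.

```python
def q_learning_update(q, s, a, r, s_next, actions_next, alpha, gamma):
    """Off-policy update: bootstrap from the best action available in s_next.

    q is a dict mapping (state, action) to a value; missing entries count as 0.
    actions_next lists the cities still unvisited from s_next (assumed encoding).
    """
    best_next = max((q.get((s_next, b), 0.0) for b in actions_next), default=0.0)
    old = q.get((s, a), 0.0)
    q[(s, a)] = old + alpha * (r + gamma * best_next - old)
    return q[(s, a)]


def sarsa_update(q, s, a, r, s_next, a_next, alpha, gamma):
    """On-policy update: bootstrap from the action actually chosen in s_next.

    a_next is None when s_next is terminal (no city left to visit).
    """
    next_value = 0.0 if a_next is None else q.get((s_next, a_next), 0.0)
    old = q.get((s, a), 0.0)
    q[(s, a)] = old + alpha * (r + gamma * next_value - old)
    return q[(s, a)]


# Hypothetical step: travel from city 0 to city 1 at distance 2 (reward -2),
# with cities 2 and 3 still unvisited.
q_table = {(1, 2): -1.0, (1, 3): -4.0}
q_value = q_learning_update(dict(q_table), 0, 1, -2.0, 1, [2, 3], alpha=0.5, gamma=0.9)
s_value = sarsa_update(dict(q_table), 0, 1, -2.0, 1, 3, alpha=0.5, gamma=0.9)
```

In this sketch the two rules differ only in which next-step value they bootstrap from, so the same pair (alpha, gamma) can yield different learned values under each algorithm. That is one reason the parameters might reasonably be estimated separately for each.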