A Multi-Agv Routing Planning Method Based on Deep Reinforcement Learning and Recurrent Neural Network

Yishuai Lin,Gang Hue,Liang Wang,Qingshan Li,Jiawei Zhu
DOI: https://doi.org/10.1109/jas.2023.123300
2023-01-01
IEEE/CAA Journal of Automatica Sinica
Abstract:Dear Editor, This letter presents a multi-automated guided vehicles (AGV) routing planning method based on deep reinforcement learning (DRL) and recurrent neural network (RNN), specifically utilizing proximal policy optimization (PPO) and long short-term memory (LSTM). Compared to traditional AGV pathing planning methods using genetic algorithm, ant colony optimization algorithm, etc., our proposed method has a higher degree of adaptability to deal with temporary changes of tasks or sudden failures of AGVs. Furthermore, our novel routing method, which uses LSTM to take into account temporal step information, provides a more optimized performance in terms of rewards and convergence speed as compared to existing PPO-based routing methods for AGVs.
What problem does this paper attempt to address?