A modified Q-learning algorithm for robot path planning in a digital twin assembly system

Xiaowei Guo,Gongzhuang Peng,Yingying Meng
DOI: https://doi.org/10.1007/s00170-021-08597-9
IF: 3.563
2022-01-10
The International Journal of Advanced Manufacturing Technology
Abstract:Product assembly is an important stage in complex product manufacturing. How to intelligently plan the assembly process based on dynamic product and environment information has become a pressing issue that needs to be addressed. For this reason, this research has constructed a digital twin assembly system, including virtual and real interactive feedback, data fusion analysis, and decision-making iterative optimization modules. In the virtual space, a modified Q-learning algorithm is proposed to solve the path planning problem in product assembly. The proposed algorithm speeds up the convergence speed by adding a dynamic reward function, optimizes the initial Q table by introducing knowledge and experience through the case-based reasoning algorithm, and prevents entry into the trapped area through the obstacle avoiding method. Finally, the six-joint robot UR10 is taken as an example to verify the performance of the algorithm in the three-dimensional pathfinding space. The experimental results show that the performance of the modified Q-learning algorithm is significantly better than the original Q-learning algorithm in both convergence efficiency and the optimization effect.
engineering, manufacturing,automation & control systems
What problem does this paper attempt to address?