Deep reinforcement learning in playing Tetris with robotic arm experiment

Yu Yan,Peng Liu,Jin Zhao,Chengxi Zhang,Guangwei Wang
DOI: https://doi.org/10.1177/01423312221114694
IF: 2.146
2022-08-19
Transactions of the Institute of Measurement and Control
Abstract:Transactions of the Institute of Measurement and Control, Ahead of Print. Tetris has been an important field for research in deep reinforcement learning (DRL). However, most studies about Tetris are focused on simulation validation, and a few attempts are conducted in the real-world environment. In this paper, the DRL algorithms are trained in the constructed Tetris simulation environment, after that they are deployed into the real-world Tetris experiments. The dynamic timesteps method is integrated into the proximal policy optimization (PPO) method to accelerate its training speed, which reaches the goal of the game within 1483 episodes. With the help of multiple recognition and segmented moving techniques, the robotic arm provides accurate and robust performance to play real-world Tetris. The effectiveness of the developed system is experimentally verified; the experimental results show that the proposed algorithm achieved superior performance compared with conventional method and Deep Q-Network (DQN) in real-world Tetris environments.
automation & control systems,instruments & instrumentation
What problem does this paper attempt to address?