Solving the Rubik’s cube with deep reinforcement learning and search

Forest Agostinelli,Stephen McAleer,Alexander Shmakov,Pierre Baldi
DOI: https://doi.org/10.1038/s42256-019-0070-z
IF: 23.8
2019-07-15
Nature Machine Intelligence
Abstract:The Rubik’s cube is a prototypical combinatorial puzzle that has a large state space with a single goal state. The goal state is unlikely to be accessed using sequences of randomly generated moves, posing unique challenges for machine learning. We solve the Rubik’s cube with DeepCubeA, a deep reinforcement learning approach that learns how to solve increasingly difficult states in reverse from the goal state without any specific domain knowledge. DeepCubeA solves 100% of all test configurations, finding a shortest path to the goal state 60.3% of the time. DeepCubeA generalizes to other combinatorial puzzles and is able to solve the 15 puzzle, 24 puzzle, 35 puzzle, 48 puzzle, Lights Out and Sokoban, finding a shortest path in the majority of verifiable cases.
computer science, artificial intelligence, interdisciplinary applications
What problem does this paper attempt to address?