Mastering Curling with RL-revised Decision Tree

Junjie Lin, Yuhao Gong, Jian Zhao, Wengang Zhou, Houqiang Li
DOI: https://doi.org/10.1109/cog57401.2023.10333135
2023-01-01
Abstract: Curling, also known as "chess on ice", is a sport popular worldwide. It tests the physical and mental strength of the participants and showcases the beauty of movement and stillness and the wisdom of trade-offs. Previous curling AI was usually based on decision trees, which require extensive hand-crafted prior knowledge and often produce unexpected bugs in extreme situations. In recent years, a growing number of reinforcement learning (RL) algorithms have been proposed for competitive games. However, AI derived from RL is very unstable when playing against unseen opponents. In this work, we develop a curling AI in a novel way that uses both decision trees and reinforcement learning. The policy of our AI is defined by a decision tree, and we detect the tree's flaws through reinforcement learning. Training a policy model against the decision tree not only helps to mend those flaws but also provides a way to examine the strength and stability of the tree itself. This approach combines the advantages of RL and decision trees, enhancing the strength and generalization capacity of the policy. Our AI ranked first among 67 teams in the 2022 RLChina spring competition.