Towards automating Codenames spymasters with deep reinforcement learning

Sherman Siu
DOI: https://doi.org/10.48550/arXiv.2212.14104
2022-12-29
Abstract:Although most reinforcement learning research has centered on competitive games, little work has been done on applying it to co-operative multiplayer games or text-based games. Codenames is a board game that involves both asymmetric co-operation and natural language processing, which makes it an excellent candidate for advancing RL research. To my knowledge, this work is the first to formulate Codenames as a Markov Decision Process and apply some well-known reinforcement learning algorithms such as SAC, PPO, and A2C to the environment. Although none of the above algorithms converge for the Codenames environment, neither do they converge for a simplified environment called ClickPixel, except when the board size is small.
Computation and Language,Artificial Intelligence,Machine Learning
What problem does this paper attempt to address?