Unsupervised State Representation Learning in Atari

Ankesh Anand, Evan Racah, Sherjil Ozair, Yoshua Bengio, Marc-Alexandre Côté, R Devon Hjelm
DOI: https://doi.org/10.48550/arXiv.1906.08226
IF: 5.414
2019-06-19
Machine Learning
Abstract: State representation learning, or the ability to capture latent generative factors of an environment, is crucial for building intelligent agents that can perform a wide variety of tasks. Learning such representations without supervision from rewards is a challenging open problem. We introduce a method that learns state representations by maximizing mutual information across spatially and temporally distinct features of a neural encoder of the observations. We also introduce a new benchmark based on Atari 2600 games where we evaluate representations based on how well they capture the ground truth state variables. We believe this new framework for evaluating representation learning models will be crucial for future representation learning research. Finally, we compare our technique with other state-of-the-art generative and contrastive representation learning methods. The code associated with this work is available at https://github.com/mila-iqia/atari-representation-learning
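The abstract does not say which mutual information estimator is used. As an illustration only, the sketch below assumes a contrastive InfoNCE-style objective, one common lower bound on mutual information, applied to features of two consecutive frames. The function name `infonce_loss`, the arrays `z_t` and `z_next`, and the random "features" are all hypothetical stand-ins for the output of a real encoder; this is not the authors' implementation.

```python
import numpy as np

def infonce_loss(anchors, positives):
    """InfoNCE loss for a batch (illustrative sketch, not the paper's code).

    Row i of `positives` is treated as the positive sample for row i of
    `anchors`; every other row in the batch serves as a negative.
    """
    # scores[i, j] = dot product between anchor i and candidate j
    scores = anchors @ positives.T
    # Numerically stable log-softmax over each row
    scores = scores - scores.max(axis=1, keepdims=True)
    log_probs = scores - np.log(np.exp(scores).sum(axis=1, keepdims=True))
    # Cross-entropy where the correct "class" for row i is column i
    return -np.mean(np.diag(log_probs))

rng = np.random.default_rng(0)
batch, dim = 8, 16

# Hypothetical features: z_next is a slightly perturbed copy of z_t,
# mimicking temporally adjacent frames that share most of their state.
z_t = rng.normal(size=(batch, dim))
z_next = z_t + 0.1 * rng.normal(size=(batch, dim))

# Shift rows by one so that every anchor is paired with the wrong frame.
z_mismatched = np.roll(z_next, 1, axis=0)

loss_matched = infonce_loss(z_t, z_next)
loss_mismatched = infonce_loss(z_t, z_mismatched)
print(f"matched: {loss_matched:.4f}  mismatched: {loss_mismatched:.4f}")
```

Correctly paired frames give a lower loss than deliberately mismatched ones. Under this assumed objective, minimizing the loss with respect to the encoder pushes features of temporally adjacent observations to be predictive of one another, which is the sense in which mutual information across temporally distinct features is maximized.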