Toybox: A Suite of Environments for Experimental Evaluation of Deep Reinforcement Learning

Emma Tosch, Kaleigh Clary, John Foley, David Jensen
DOI: https://doi.org/10.48550/arXiv.1905.02825
IF: 5.414
2019-05-07
Machine Learning
Abstract: Evaluation of deep reinforcement learning (RL) is inherently challenging. In particular, learned policies are largely opaque, and hypotheses about the behavior of deep RL agents are difficult to test in black-box environments. Considerable effort has gone into addressing opacity, but almost no effort has been devoted to producing high-quality environments for experimental evaluation of agent behavior. We present TOYBOX, a new high-performance, open-source* subset of Atari environments re-designed for the experimental evaluation of deep RL. We show that TOYBOX enables a wide range of experiments and analyses that are impossible in other environments. *https://kdl-umass.github.io/Toybox/