UvA-DARE ( Digital Academic Repository ) Using Informative Behavior to Increase Engagement While Learning from Human Reward
Guangliang Li,Shimon Whiteson,W. B. Knox,H. Hung
2015-01-01
Abstract:In this work, we address a relatively unexplored aspect of designing agents that learn fromhuman reward.We investigate howan agent’s non-task behavior can affect a human trainer’s training and agent learning. We use the TAMER framework, which facilitates the training of agents by human-generated reward signals, i.e., judgements of the quality of the agent’s actions, as the foundation for our investigation. Then, starting from the premise that the interaction between the agent and the trainer should be bi-directional, we propose two new training interfaces to increase a human trainer’s active involvement in the training process and thereby improve the agent’s task performance. One provides information on the agent’s uncertainty which is a metric calculated as data coverage, the other on its performance. Our results from a 51-subject user study show that these interfaces can induce the trainers to train longer and give more feedback. The agent’s performance, however, increases only in response to the addition of performance-oriented information, not by sharing uncertainty levels. These results suggest that the organizational maxim about human behavior, “you get what you measure”—i.e., sharing metrics with people causes them to focus on optimizing those metrics while de-emphasizing other objectives—also applies to the training of agents. Using principle component analysis, we show how trainers in the two conditions train agents differently. In addition, by simulating the influence of the agent’s uncertainty–informative B Guangliang Li g.li@uva.nl Shimon Whiteson shimon.whiteson@cs.ox.ac.uk W. Bradley Knox bradknox@mit.edu Hayley Hung h.hung@tudelft.nl 1 University of Amsterdam, Amsterdam, The Netherlands 2 University of Oxford, Oxford, UK 3 Massachusetts Institute of Technology, Cambridge, USA 4 Technical University of Delft, Delft, The Netherlands