Is Reinforcement Learning the Choice of Human Learners?

Menghai Pan,Weixiao Huang,Yanhua Li,Xun Zhou,Zhenming Liu,Jie Bao,Yu Zheng,Jun Luo
DOI: https://doi.org/10.1145/3397536.3422246
2020-01-01
Abstract:Learning to make optimal decisions is a common yet complicated task. While computer agents can learn to make decisions by running reinforcement learning (RL), it remains unclear how human beings learn. In this paper, we perform the first data-driven case study on taxi drivers to validate whether humans mimic RL to learn. We categorize drivers into three groups based on their performance trends and analyze the correlations between human drivers and agents trained using RL. We discover that drivers that become more efficient at earning over time exhibit similar learning patterns to those of agents, whereas drivers that become less efficient tend to do the opposite. Our study (1) provides evidence that some human drivers do adapt RL when learning, (2) enhances the deep understanding of taxi drivers' learning strategies, (3) offers a guideline for taxi drivers to improve their earnings, and (4) develops a generic analytical framework to study and validate human learning strategies.
What problem does this paper attempt to address?