Research on the Combihation of Bayesian Learning and Reinforcement Learning

CHEN Fei,WANG Ben-Nian,GAO Yang,CHEN Zhao-Qian,CHEN Shi-Fu
DOI: https://doi.org/10.3969/j.issn.1002-137X.2006.02.049
2006-01-01
Computer Science
Abstract:A central problem in reinforcement learning is balancing exploration of untested actions against exploitation of actions that are known to be good.Bayesian learning is a probability method that makes optimal decision based on known probability distribution and recently observed data.So combination of Bayesian learning and reinforcement learning the agent can choose the strategy of exploration or exploitation based on its own experience and newly incoming knowledge.In this paper,we introduce single-agent Bayesian reinforcement learning and multi-agent Bayesian reinforce- ment learning.Single-agent Bayesian reinforcement learning includes Bayesian Q-learning,model-based Bayesian learn ing and Bayesian DP,and muhi-agent Bayesian reinforcement learning includes Bayesian imitation,Bayesian coordina- tion and Bayesian reinforcement learning for coalition formation under uncertainty.At last,some unsolved problems in Bayesian reinforcement learning are discussed.
What problem does this paper attempt to address?