Reinforcement Learning Theory,Algorithms and Its Application

ZHANG Rubo,GU Guochang,LIU Zhaode,WANG Xingce
DOI: https://doi.org/10.3969/j.issn.1000-8152.2000.05.002
2000-01-01
Abstract:The term,reinforcement learning,comes from behavior psychology that takes behavior leaming as trial and error,by which the states of environment are mapped into corresponding actions.First,the main algorithms,temporal difference, \%Q \%learning and adaptive heuristic critic,are roundly introduced.Then,the application of reinforcement leaming is presented.Finally,some present research projects of reinforcement learning are discussed.
What problem does this paper attempt to address?