Review of Deep Reinforcement Learning and Discussions on the Development of Computer Go

ZHAO Dong-bin,SHAO Kun,ZHU Yuan-heng,LI Dong,CHEN Ya-ran,WANG Hai-tao,LIU De-rong,ZHOU Tong,WANG Cheng-hong
DOI: https://doi.org/10.7641/cta.2016.60173
2016-01-01
Abstract:Deep reinforcement learning which incorporates both the advantages of the perception of deep learning and the decision making of reinforcement learning is able to output control signal directly based on input images. This mech-anism makes the artificial intelligence much close to human thinking modes. Deep reinforcement learning has achieved remarkable success in terms of theory and application since it is proposed. ‘Chuyihao–AlphaGo’, a computer Go deve-loped by Google DeepMind, based on deep reinforcement learning, beat the world’s top Go player Lee Sedol 4:1 in March 2016. This becomes a new milestone in artificial intelligence history. This paper surveys the development course of deep reinforcement learning, reviews the history of computer Go concurrently, analyzes the algorithms features, and discusses the research directions and application areas, in order to provide a valuable reference to the development of control theory and applications in a new direction.
What problem does this paper attempt to address?