Reinforcement Learning Method of Continuous State Adaptively Discretized Based on K-means Clustering

Feng WEN,Zong-hai CHEN,Rui ZHUO,Guang-ming ZHOU
DOI: https://doi.org/10.3321/j.issn:1001-0920.2006.02.005
2006-01-01
Abstract:A K-means clustering based reinforcement learning method is proposed, which uses clustering algorithm to adaptively discretize continuous state space. The learning of this method is divided into two processes, state space learning using K-means clustering algorithm for adaptive discretization of continuous states and policy learning using Sarsa algorithm for finding optimal policy. Simulation conducted on reinforcement learning benchmark problem with continuous state shows that the proposed method can adaptively discretize continuous state space and learn optimal policy in the end. Comparison with CMAC network based reinforcement learning method shows that the proposed method has advantages of saving memory and reducing computation time.
What problem does this paper attempt to address?