Option Automatic Generation in Hierarchical Reinforcement Learning

Shen Jing, Gu Guochang, Liu Haibo
DOI: https://doi.org/10.3321/j.issn:1002-8331.2005.34.002
2005-01-01
Abstract: There are currently three typical approaches to hierarchical reinforcement learning, namely Option, HAM, and MAXQ, but the open problem of generating hierarchies automatically has not been solved well. Focusing on the first approach, this paper presents an algorithm for automatic Option generation. The algorithm takes the state space explored by the agent in the initial learning phase and clusters the states using an artificial immune network. Based on the clustered state sets, the intra-option policies are learned by an experience replay procedure, which yields the Options. The validity of the algorithm is demonstrated by simulation experiments.
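The pipeline described in the abstract (explore, cluster the visited states, then learn one policy per cluster by replaying stored experience) can be illustrated with a minimal sketch. This is not the authors' implementation: plain k-means stands in for the artificial immune network clustering, and the toy grid world, the pseudo-reward of 1 for leaving a cluster, and all names and parameters (`step`, `explore`, `cluster_states`, `learn_option`, `generate_options`, `alpha`, `gamma`) are assumptions made for illustration only.

```python
import random
from collections import defaultdict

ACTIONS = [(0, 1), (0, -1), (1, 0), (-1, 0)]  # hypothetical 4-connected grid moves

def step(state, action, width=6, height=3):
    """Toy deterministic grid (assumed): move if inside bounds, else stay."""
    x, y = state[0] + action[0], state[1] + action[1]
    return (x, y) if 0 <= x < width and 0 <= y < height else state

def explore(start, n_steps, rng):
    """Initial learning phase: random walk that records (s, a, s') transitions."""
    transitions, s = [], start
    for _ in range(n_steps):
        a = rng.randrange(len(ACTIONS))
        s2 = step(s, ACTIONS[a])
        transitions.append((s, a, s2))
        s = s2
    return transitions

def cluster_states(states, k, rng, iters=20):
    """Plain k-means on coordinates; a stand-in for immune-network clustering."""
    centers = rng.sample(states, k)
    groups = []
    for _ in range(iters):
        groups = [[] for _ in range(k)]
        for s in states:
            dists = [(s[0] - c[0]) ** 2 + (s[1] - c[1]) ** 2 for c in centers]
            groups[dists.index(min(dists))].append(s)
        # Recompute each center as the mean of its group; keep it if the group is empty.
        centers = [
            (sum(p[0] for p in g) / len(g), sum(p[1] for p in g) / len(g)) if g else c
            for g, c in zip(groups, centers)
        ]
    return [set(g) for g in groups if g]

def learn_option(cluster, transitions, rng, sweeps=50, alpha=0.5, gamma=0.9):
    """Experience replay: Q-learning over stored transitions that start in the cluster.
    Assumed pseudo-reward: 1 when a transition leaves the cluster, 0 otherwise."""
    q = defaultdict(float)
    replay = [t for t in transitions if t[0] in cluster]
    for _ in range(sweeps):
        rng.shuffle(replay)
        for s, a, s2 in replay:
            if s2 in cluster:
                target = gamma * max(q[(s2, b)] for b in range(len(ACTIONS)))
            else:
                target = 1.0  # the option terminates once the agent exits the cluster
            q[(s, a)] += alpha * (target - q[(s, a)])
    policy = {s: max(range(len(ACTIONS)), key=lambda b: q[(s, b)]) for s in cluster}
    # An option as a triple: initiation set, intra-option policy, termination test.
    return {
        "initiation": cluster,
        "policy": policy,
        "terminates": lambda s: s not in cluster,
    }

def generate_options(k=2, n_steps=2000, seed=0):
    """Explore, cluster the visited states, then learn one option per cluster."""
    rng = random.Random(seed)
    transitions = explore((0, 0), n_steps, rng)
    visited = sorted({s for s, _, _ in transitions} | {s2 for _, _, s2 in transitions})
    clusters = cluster_states(visited, k, rng)
    return [learn_option(c, transitions, rng) for c in clusters]
```

Calling `generate_options()` returns a list of dictionaries, one per cluster, each holding an initiation set, a greedy action index for every state in that set, and a termination test. Reusing the transitions gathered during exploration means no further interaction with the environment is needed to learn the option policies, which is the point of the replay step.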