New Method of Hierarchical Reinforcement Learning

SHEN Jing,GU Guo-chang,LIU Hai-bo
2006-01-01
Abstract:A novel method of hierarchical reinforcement learning which named OMQ was presented by integrating Options into MAXQ.In OMQ,MAXQ was used as the basic framework to design hierarchies experientially and learn online,and the Option was used to construct hierarchies automatically.The performance of OMQ was demonstrated in taxi domain and compared with Option and MAXQ.The simulation results show that the OMQ is more practical than Option and MAXQ in partially known environment.
What problem does this paper attempt to address?