PCFBPI: A Point Clustering Feature Based Policy Iteration Algorithm

Feng Liu,Chongjun Wang,Jidong Ge,Bin Luo
DOI: https://doi.org/10.1109/ictai.2015.30
2015-01-01
Abstract:The exponential growth of the size of the search space has always been an obstacle to POMDP planning. Heuristics are often used to reduce the search space size and improve computational efficiency. As the advantage of the feature of POMDP problems should be taken into deeper consideration, we analyze the clustering feature of reachable space of POMDP problems and apply policy iteration based on this clustering feature. With insights from theoretical analysis, we have developed a practical POMDP algorithm Point Clustering Feature Based Policy Iteration (PCFBPI). Empirically, PCFBPI is competitive with PBPI in terms of solution quality and convergence efficiency on some large-scale problems.
What problem does this paper attempt to address?