Point-Based Pomdp Algorithms Using Pre-Sampled Belief Set

Jun Lu,Aihua Bian,Chongjun Wang,Shifu Chen
2007-01-01
Abstract:PBA(Point-based POMDP Algorithm) is a kind of algorithms for solving problems of Partially observable Markov decision processes by using backup operations on the it representative belief points". The core process in PBA is the selection of belief points. In the optimal condition, the same result as the exact algorithm is achieved by including all the witness points in each step and applying backup operations on them. In this paper, the idea of sampling a large belief set B before PBA is presented. We introduce four sampling algorithms firstly and present the algorithm BPBVI: an improved algorithm based on PBVI[5] using pre-sampled belief set. Experimental results illustrated that BPBVI performs better than PBVI.
What problem does this paper attempt to address?