A Search Space Utility Optimization Based Online POMDP Planning Algorithm

Qingya Wang,Feng Liu
DOI: https://doi.org/10.1109/ictai56018.2022.00109
2022-01-01
Abstract:Online planning algorithms based on Monte Carlo tree search have been widely used to solve POMDP problems, but they still have difficulty handling large-scale scenarios. Although POMCPOW adapts to large-scale problems by using weighted particle filter and double progressive widening, it does not differentiate between the contributions of different levels, thus cannot make the best of the fixed number of simulations. A Partially Observable Monte Carlo Planning algorithm based on Search Space Utility Optimization (POMCP-SSUO) is proposed in this paper. POMCP-SSUO takes the level of simulated nodes as a factor of progressive widening, with which the upper bound of branches is negatively correlated. The proposed algorithm improves POMCPOW by exploring more valuable nodes and generating the search space with optimal utility. Experimental results demonstrate that POMCP-SSUO outperforms POMCPOW with varying numbers of simulations on three large benchmark problems.
What problem does this paper attempt to address?