Abstract:Proactively searching for objects like humans may be a basic requirement for intelligent service robots. The path planning during the searching process can be modeled as a Partially Observable Markov Decision Process (POMDP). In this work, we propose a Probabilistic Voronoi Diagram (PVD) for object search strategy planning on the basis of POMDP. Firstly, an environmental knowledge base is constructed to record the information of objects, and a Bidirectional Encoder Representations from Transformers (BERT) model [1] is trained to encode and decompose the semantic knowledge in the environment. In order to reveal the interrelationships between objects, a Gaussian Mixture Model (GMM) is adopted using the information within the environmental knowledge base. In order to accelerate the searching efficiency, the Generalized Voronoi Diagram (GVD) is introduced to discretize the map and generate the environmental topological map. In order to further establish the spatial correlation between objects, we propose combining the GVD topological map with the GMM to generate the PVD, which can respond to the probability distribution of objects. On the basis of PVD, we further model the object search problem as a POMDP problem by considering the region search cost and the distance traveled cost of the robot in performing solutions. When making observations and updates, the object-to-object relationships in the knowledge base are extracted by the robot to optimize decisions when observing objects related to the target object. Both real-world experimental studies and simulations reveal that our algorithm is very close to human search strategies and outperforms other state-of-the-art algorithms in terms of trajectory length and running time.

Voronoi Progressive Widening: Efficient Online Solvers for Continuous State, Action, and Observation POMDPs

Online algorithms for POMDPs with continuous state, action, and observation spaces

Sparse tree search optimality guarantees in POMDPs with continuous observation spaces

Observation-Based Optimization for POMDPs with Continuous State, Observation, and Action Spaces.

Adaptive Online Packing-guided Search for POMDPs

Bayesian Optimized Monte Carlo Planning

Policy Graph Pruning And Optimization In Monte Carlo Value Iteration For Continuous-State Pomdps

A Search Space Utility Optimization Based Online POMDP Planning Algorithm

Multilevel Monte-Carlo for Solving POMDPs Online

PODDP: Partially Observable Differential Dynamic Programming for Latent Belief Space Planning

Simplified POMDP Planning with an Alternative Observation Space and Formal Performance Guarantees

Scaling Long-Horizon Online POMDP Planning via Rapid State Space Sampling

Improving Online POMDP Planning Algorithms with Decaying Q Value

A Partially Observable Monte Carlo Planning Algorithm Based on Path Modification.

Online POMDP Planning with Anytime Deterministic Guarantees

Planning with Partially Observable Markov Decision Processes: Advances in Exact Solution Method

An Active Robot Object Search Strategy Based on Probabilistic Voronoi Diagram and POMDP

PLEASE: Palm Leaf Search for POMDPs with Large Observation Spaces.

A Probabilistic Forward Search Value Iteration Algorithm for POMDP

Monte Carlo Information-Oriented Planning

A Surprisingly Simple Continuous-Action POMDP Solver: Lazy Cross-Entropy Search Over Policy Trees