Hierarchical Path-planning from Speech Instructions with Spatial Concept-based Topometric Semantic Mapping

Akira Taniguchi,Shuya Ito,Tadahiro Taniguchi
DOI: https://doi.org/10.3389/frobt.2024.1291426
2024-06-21
Abstract:Assisting individuals in their daily activities through autonomous mobile robots, especially for users without specialized knowledge, is crucial. Specifically, the capability of robots to navigate to destinations based on human speech instructions is essential. While robots can take different paths to the same goal, the shortest path is not always the best. A preferred approach is to accommodate waypoint specifications flexibly, planning an improved alternative path, even with detours. Additionally, robots require real-time inference capabilities. This study aimed to realize a hierarchical spatial representation using a topometric semantic map and path planning with speech instructions, including waypoints. This paper presents Spatial Concept-based Topometric Semantic Mapping for Hierarchical Path Planning (SpCoTMHP), integrating place connectivity. This approach offers a novel integrated probabilistic generative model and fast approximate inference across hierarchy levels. A formulation based on control as probabilistic inference theoretically supports the proposed path planning algorithm. We conducted experiments in home environments using the Toyota Human Support Robot on the SIGVerse simulator and in a lab-office environment with the real robot, Albert. Users issued speech commands specifying the waypoint and goal, such as "Go to the bedroom via the corridor." Navigation experiments using speech instructions with a waypoint demonstrated a performance improvement of SpCoTMHP over the baseline hierarchical path planning method with heuristic path costs (HPP-I), in terms of the weighted success rate at which the robot reaches the closest target and passes the correct waypoints, by 0.590. The computation time was significantly accelerated by 7.14 seconds with SpCoTMHP compared to baseline HPP-I in advanced tasks.
Robotics,Artificial Intelligence
What problem does this paper attempt to address?
The problem this paper attempts to address is how to navigate an autonomous mobile robot to a destination based on human voice commands, especially for users without specialized knowledge. Specifically, the paper focuses on the following points: 1. **Flexibility in Path Planning**: Although robots can find different paths to the same goal, the shortest path is not always the best choice. A better approach is to be able to flexibly specify waypoints to plan improved alternative routes, even if they include detours. 2. **Real-time Reasoning Ability**: Robots need to have real-time reasoning capabilities to quickly respond to user commands. 3. **Multi-level Spatial Representation**: Spatial representation includes semantic, topological, and metric levels, each capturing different aspects of the environment. The paper aims to achieve a multi-level spatial representation based on a topometric semantic map, combined with voice commands for path planning. 4. **Environment-specific Knowledge**: Robots need to have environment-specific knowledge to handle situations where specific location names provided in everyday natural language commands are uncommon or where multiple locations in the environment share the same name. To achieve these goals, the paper proposes a method called "Spatial Concept-based Topometric Semantic Mapping for Hierarchical Path Planning (SpCoTMHP)." This method combines local connectivity, provides a new integrated probabilistic generative model, and enables fast approximate reasoning between hierarchical levels. Experimental results show that compared to baseline methods, SpCoTMHP significantly improves success rates and computation time.