Integrating Intent Understanding and Optimal Behavior Planning for Behavior Tree Generation from Human Instructions

Xinglin Chen,Yishuai Cai,Yunxin Mao,Minglong Li,Wenjing Yang,Weixia Xu,Ji Wang
2024-06-27
Abstract:Robots executing tasks following human instructions in domestic or industrial environments essentially require both adaptability and reliability. Behavior Tree (BT) emerges as an appropriate control architecture for these scenarios due to its modularity and reactivity. Existing BT generation methods, however, either do not involve interpreting natural language or cannot theoretically guarantee the BTs' success. This paper proposes a two-stage framework for BT generation, which first employs large language models (LLMs) to interpret goals from high-level instructions, then constructs an efficient goal-specific BT through the Optimal Behavior Tree Expansion Algorithm (OBTEA). We represent goals as well-formed formulas in first-order logic, effectively bridging intent understanding and optimal behavior planning. Experiments in the service robot validate the proficiency of LLMs in producing grammatically correct and accurately interpreted goals, demonstrate OBTEA's superiority over the baseline BT Expansion algorithm in various metrics, and finally confirm the practical deployability of our framework. The project website is <a class="link-external link-https" href="https://dids-ei.github.io/Project/LLM-OBTEA/" rel="external noopener nofollow">this https URL</a>.
Artificial Intelligence,Human-Computer Interaction,Robotics
What problem does this paper attempt to address?
The problem that this paper attempts to solve is how robots can autonomously perform tasks according to human natural - language instructions in daily scenarios such as homes, restaurants or factories. Specifically, the paper focuses on two main challenges: 1. **Intent Understanding**: Due to the openness of natural languages, robots need to be able to understand users' intents, clarify the goals of tasks, and possess sufficient advanced skills to complete these tasks. 2. **Reliable Plan Generation**: Robots must be able to automatically generate reliable plans and execute these plans safely and robustly. To solve these problems, the paper proposes a two - stage framework for generating Behavior Trees (BT) from human instructions. The specific steps are as follows: 1. **First Stage: Intent Understanding** - Use large - language models (LLM) to interpret the goals in user instructions. LLM improves the syntactic and semantic accuracy of goal interpretation through prompt engineering and reflective feedback mechanisms. - Goals are represented as Well - Formed Formulas (WFF) in first - order logic, thus effectively connecting intent understanding and optimal behavior planning. 2. **Second Stage: Optimal Behavior Planning** - Design the Optimal Behavior Tree Expansion Algorithm (OBTEA), which can construct a behavior tree that can theoretically ensure the completion of goals at the minimum cost. - OBTEA generates the final optimal behavior tree by exploring, expanding and compressing sub - trees. Verified by experiments, this framework performs excellently in service robot scenarios. It not only improves the accuracy and syntactic correctness of intent understanding but also significantly reduces the total cost of behavior trees and the number of trigger times of conditional nodes, showing its potential in actual deployment.