Improved Inference of Human Intent by Combining Plan Recognition and Language Feedback

Ifrah Idrees, Tian Yun, Naveen Sharma, Yunxin Deng, Nakul Gopalan, George Konidaris, Stefanie Tellex

2023-10-04

Abstract: Conversational assistive robots can aid people, especially those with cognitive impairments, to accomplish various tasks such as cooking meals, performing exercises, or operating machines. However, to interact with people effectively, robots must recognize human plans and goals from noisy observations of human actions, even when the user acts sub-optimally. Previous works on Plan and Goal Recognition (PGR) as planning have used hierarchical task networks (HTN) to model the actor/human. However, these techniques are insufficient as they do not engage the user through natural modes of interaction such as language. Moreover, they have no mechanism to let users, especially those with cognitive impairments, know of a deviation from their original plan or of any sub-optimal actions taken towards their goal. We propose a novel framework for plan and goal recognition in partially observable domains, Dialogue for Goal Recognition (D4GR), which enables a robot to rectify its belief in human progress by asking clarification questions about noisy sensor data and sub-optimal human actions. We evaluate the performance of D4GR over two simulated domains: kitchen and blocks. With language feedback and the world state information in a hierarchical task model, we show that, at the highest sensor noise, the D4GR framework performs 1% better than HTN in goal accuracy in both domains. For plan accuracy, D4GR outperforms HTN by 4% in the kitchen domain and 2% in the blocks domain. The ALWAYS-ASK oracle outperforms our policy by 3% in goal recognition and 7% in plan recognition. D4GR achieves this while asking 68% fewer questions than an oracle baseline. We also demonstrate a real-world robot scenario in the kitchen domain, validating the improved plan and goal recognition of D4GR in a realistic setting.

Robotics, Artificial Intelligence, Human-Computer Interaction

What problem does this paper attempt to address?

The paper addresses the problem of recognizing human intent during interactions between assistive robots and people. Specifically, it focuses on accurately identifying a user's plans and goals from noisy observations while the user (especially one with cognitive impairments) performs a task. Existing methods based on Hierarchical Task Networks (HTN) can model human behavior, but they cannot interact with users and give no feedback when users deviate from their original plan or take sub-optimal actions.

To close this gap, the paper proposes a new framework, Dialogue for Goal Recognition (D4GR), which resolves uncertainty about noisy sensor data and sub-optimal human actions by asking clarification questions in natural language, thereby improving the robot's estimate of human intent. The main contribution is the combination of the task representation capabilities of HTN with the sequential decision-making capabilities of Partially Observable Markov Decision Processes (POMDP). Using language feedback and world state information, the framework improves goal and plan recognition accuracy without asking questions at every step.

Experimental results in two simulated domains, kitchen and blocks, show that at the highest sensor noise D4GR improves goal recognition accuracy over HTN by 1% in both domains, and plan recognition accuracy by 4% in the kitchen domain and 2% in the blocks domain, while asking 68% fewer questions than an oracle baseline. Additionally, D4GR has been deployed on a social robot named Kuri, demonstrating its ability to assist users in completing tasks in a real-world kitchen scenario.
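The idea of correcting a belief over goals with clarification questions can be illustrated with a small sketch. This is not the D4GR implementation: the paper combines an HTN with a POMDP policy, whereas the code below swaps in a flat Bayesian filter over goals and a simple entropy threshold for deciding when to ask. All names, plans, accuracies, and the threshold are hypothetical values chosen for illustration.

```python
import math

# Hypothetical goals and the steps that belong to each goal's plan.
PLANS = {
    "make_tea": {"boil_water", "add_teabag"},
    "make_coffee": {"boil_water", "add_coffee"},
    "wash_hands": {"turn_on_faucet", "use_soap"},
}


def normalize(belief):
    """Rescale a dict of non-negative weights so they sum to 1."""
    total = sum(belief.values())
    return {goal: p / total for goal, p in belief.items()}


def update_with_observation(belief, step, plans, accuracy=0.8):
    """Bayes update: a goal whose plan contains `step` explains the
    observation with probability `accuracy`, otherwise 1 - accuracy."""
    posterior = {}
    for goal, prior in belief.items():
        likelihood = accuracy if step in plans[goal] else 1.0 - accuracy
        posterior[goal] = prior * likelihood
    return normalize(posterior)


def update_with_answer(belief, step, said_yes, plans, accuracy=0.95):
    """Treat a 'yes' to "Did you just do <step>?" as a more reliable
    observation; on 'no', discard the sensor reading and keep the belief."""
    if said_yes:
        return update_with_observation(belief, step, plans, accuracy)
    return dict(belief)


def entropy(belief):
    """Shannon entropy in bits of the belief over goals."""
    return -sum(p * math.log2(p) for p in belief.values() if p > 0)


def should_ask(belief, threshold=1.0):
    """Assumed ask rule: ask only while the belief is still uncertain."""
    return entropy(belief) > threshold


if __name__ == "__main__":
    belief = normalize({goal: 1.0 for goal in PLANS})
    belief = update_with_observation(belief, "boil_water", PLANS)
    print("after noisy observation:", belief, "ask?", should_ask(belief))
    belief = update_with_answer(belief, "add_teabag", True, PLANS)
    print("after user's answer:", belief, "ask?", should_ask(belief))
```

In this toy setting the noisy observation of a shared step leaves two goals equally likely, so the rule asks; one confirmed answer then concentrates the belief on a single goal and no further question is needed. A threshold rule of this kind is one simple way to trade question count against accuracy, which is the trade-off the paper's learned policy addresses.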
