Conveying Autonomous Robot Capabilities through Contrasting Behaviour Summaries

Peter Du,Surya Murthy,Katherine Driggs-Campbell

2023-04-02

Abstract:As advances in artificial intelligence enable increasingly capable learning-based autonomous agents, it becomes more challenging for human observers to efficiently construct a mental model of the agent's behaviour. In order to successfully deploy autonomous agents, humans should not only be able to understand the individual limitations of the agents but also have insight on how they compare against one another. To do so, we need effective methods for generating human interpretable agent behaviour summaries. Single agent behaviour summarization has been tackled in the past through methods that generate explanations for why an agent chose to pick a particular action at a single timestep. However, for complex tasks, a per-action explanation may not be able to convey an agents global strategy. As a result, researchers have looked towards multi-timestep summaries which can better help humans assess an agents overall capability. More recently, multi-step summaries have also been used for generating contrasting examples to evaluate multiple agents. However, past approaches have largely relied on unstructured search methods to generate summaries and require agents to have a discrete action space. In this paper we present an adaptive search method for efficiently generating contrasting behaviour summaries with support for continuous state and action spaces. We perform a user study to evaluate the effectiveness of the summaries for helping humans discern the superior autonomous agent for a given task. Our results indicate that adaptive search can efficiently identify informative contrasting scenarios that enable humans to accurately select the better performing agent with a limited observation time budget.

Robotics,Artificial Intelligence,Multiagent Systems

What problem does this paper attempt to address?

### Problems the Paper Attempts to Solve This paper aims to address how humans can efficiently understand the behavior capabilities of multiple autonomous agents and accurately determine which agent performs better in a specific task within a limited time. With the advancement of artificial intelligence, the capabilities of autonomous agents are becoming increasingly powerful, making it difficult for human observers to quickly construct mental models of these agents' behaviors. For the successful deployment of autonomous agents, humans need to understand not only the limitations of a single agent but also be able to compare the strengths and weaknesses of different agents. Specifically, the paper focuses on the following points: 1. **Limitations of Single-Agent Behavior Summarization**: Existing methods typically generate behavior summaries by explaining why an agent chose a specific action at a particular time step. However, for complex tasks, the explanation of a single action may not convey the agent's overall strategy. 2. **Need for Multi-Step Summarization**: Researchers have turned to multi-step summarization to better help humans assess the overall capabilities of an agent. However, past methods mostly rely on unstructured search methods and require agents to have discrete action spaces. 3. **Challenges of Comparative Behavior Summarization**: When comparing multiple agents, generating meaningful comparative summaries is particularly challenging. Existing methods lack structured approaches when exploring the scenario space and are not suitable for agents with continuous action spaces. To address these issues, the paper proposes an adaptive search method that can efficiently generate comparative behavior summaries, supporting agents with continuous state and action spaces. Through user studies, the effectiveness of this method is validated, demonstrating its ability to help humans accurately choose the better-performing agent within a limited observation time.

Conveying Autonomous Robot Capabilities through Contrasting Behaviour Summaries

SummAct: Uncovering User Intentions Through Interactive Behaviour Summarisation

Learning to Summarize and Answer Questions about a Virtual Robot's Past Actions

Agent Q: Advanced Reasoning and Learning for Autonomous AI Agents

Discovering User-Interpretable Capabilities of Black-Box Planning Agents

Towards reconciling usability and usefulness of policy explanations for sequential decision-making systems

Improving Human-AI Collaboration With Descriptions of AI Behavior

Scene Augmentation Methods for Interactive Embodied AI Tasks

My Actions Speak Louder Than Your Words: When User Behavior Predicts Their Beliefs about Agents' Attributes

Balancing Both Behavioral Quality and Diversity in Unsupervised Skill Discovery

Select to Perfect: Imitating desired behavior from large multi-agent data

Analysing Explanation-Related Interactions in Collaborative Perception-Cognition-Communication-Action

Demo2Code: From Summarizing Demonstrations to Synthesizing Code via Extended Chain-of-Thought

Explainable Artificial Intelligence: Evaluating the Objective and Subjective Impacts of xAI on Human-Agent Interaction

Mapping the Design Space of Human-AI Interaction in Text Summarization

Let people fail! Exploring the influence of explainable virtual and robotic agents in learning-by-doing tasks

AGENT: A Benchmark for Core Psychological Reasoning

Effects of Explanation Types on User Satisfaction and Performance in Human-agent Teams

Multi-Agent Strategy Explanations for Human-Robot Collaboration

Unveiling Disparities in Web Task Handling Between Human and Web Agent

Mixed-Initiative Human-Robot Teaming under Suboptimality with Online Bayesian Adaptation