Abstract: Remarkable advancements in modern generative foundation models have enabled the development of sophisticated and highly capable autonomous agents that can observe their environment, invoke tools, and communicate with other agents to solve problems. Although such agents can communicate with users through natural language, their complexity and wide-ranging failure modes present novel challenges for human-AI interaction. Building on prior research and informed by a communication grounding perspective, we contribute to the study of *human-agent communication* by identifying and analyzing twelve key communication challenges that these systems pose. These include challenges in conveying information from the agent to the user, challenges in enabling the user to convey information to the agent, and overarching challenges that need to be considered across all human-agent communication. We illustrate each challenge through concrete examples and identify open directions of research. Our findings provide insights into critical gaps in human-agent communication research and serve as an urgent call for new design patterns, principles, and guidelines to support transparency and control in these systems.

What problem does this paper attempt to address?

This paper addresses the communication challenges that arise between modern AI agents and their human users. Generative foundation models have made these agents more complex and more capable across a wide range of tasks, but that capability also brings new and harder communication problems. Specifically, the paper focuses on the following aspects:

1. **Challenges in Information Conveyance**:
   - **Challenges from Agent to User**: How can the agent effectively convey its actions, intentions, and results to the user? For example, how should the agent help the user verify whether its actions meet expectations (X1), how can it keep its behavior consistent (X2), and how should it select an appropriate level of detail (X3)?
   - **Challenges from User to Agent**: How can the user effectively convey their goals, preferences, and constraints to the agent? For example, how should the user express which preferences need to be respected (U1), and how should the agent adjust its behavior according to user feedback (U3)?
2. **General Communication Difficulties**:
   - Some challenges cut across all communication between the agent and the user, such as which parts of the interaction history to take into account in the current exchange (X4). Avoiding inconsistency and reducing the user's cognitive burden also need to be addressed.

### Specific Problem Summary

- **X1: How can the agent help the user verify its actions?** Although modern agents are powerful, they are not perfect. Errors may occur when the agent performs a task, especially a complex or multi-step one. A mechanism is therefore needed that lets the user easily verify the agent's actions, so that both parties share a common understanding of the task.
- **X2: How can the agent convey consistent behavior?** The agent's behavior may appear inconsistent because of randomness in the underlying model and changes in the environment. This inconsistency prevents the user from building an accurate mental model, which in turn undermines trust. The agent's behavior should therefore be as consistent as possible, or at least appear consistent to the user.
- **X3: How should the agent select an appropriate level of information detail?** Too much information may overburden the user, while too little may lead to misunderstandings. The agent needs to balance the two so that the user can understand its actions without being overloaded.
- **X4: What past interactions should the agent consider during communication?** Context is crucial for communication. The agent should consider which past interactions help the current exchange in order to better understand and meet the user's needs.

By analyzing these problems, the paper aims to guide the future design of intelligent agents that support transparency and controllability, and thereby enable more effective collaboration.

Challenges in Human-Agent Communication
