Abstract:Large Language Models have shown exceptional generative abilities in various natural language and generation tasks. However, possible anthropomorphization and leniency towards failure cases have propelled discussions on emergent abilities of Large Language Models especially on Theory of Mind (ToM) abilities in Large Language Models. While several false-belief tests exists to verify the ability to infer and maintain mental models of another entity, we study a special application of ToM abilities that has higher stakes and possibly irreversible consequences : Human Robot Interaction. In this work, we explore the task of Perceived Behavior Recognition, where a robot employs a Large Language Model (LLM) to assess the robot's generated behavior in a manner similar to human observer. We focus on four behavior types, namely - explicable, legible, predictable, and obfuscatory behavior which have been extensively used to synthesize interpretable robot behaviors. The LLMs goal is, therefore to be a human proxy to the agent, and to answer how a certain agent behavior would be perceived by the human in the loop, for example "Given a robot's behavior X, would the human observer find it explicable?". We conduct a human subject study to verify that the users are able to correctly answer such a question in the curated situations (robot setting and plan) across five domains. A first analysis of the belief test yields extremely positive results inflating ones expectations of LLMs possessing ToM abilities. We then propose and perform a suite of perturbation tests which breaks this illusion, i.e. Inconsistent Belief, Uninformative Context and Conviction Test. We conclude that, the high score of LLMs on vanilla prompts showcases its potential use in HRI settings, however to possess ToM demands invariance to trivial or irrelevant perturbations in the context which LLMs lack.

Language Models Represent Beliefs of Self and Others

TimeToM: Temporal Space is the Key to Unlocking the Door of Large Language Models' Theory-of-Mind

Minding Language Models' (Lack of) Theory of Mind: A Plug-and-Play Multi-Character Belief Tracker

Through the Theory of Mind's Eye: Reading Minds with Multimodal Video Large Language Models

Theory of Mind in Large Language Models: Examining Performance of 11 State-of-the-Art models vs. Children Aged 7-10 on Advanced Tests

Theory of Mind for Multi-Agent Collaboration via Large Language Models

ToM-LM: Delegating Theory of Mind Reasoning to External Symbolic Executors in Large Language Models

Think Twice: Perspective-Taking Improves Large Language Models' Theory-of-Mind Capabilities

Theory of Mind abilities of Large Language Models in Human-Robot Interaction : An Illusion?

Do Large Language Models Know What Humans Know?

Neural Theory-of-Mind? On the Limits of Social Intelligence in Large LMs

Theory of Mind May Have Spontaneously Emerged in Large Language Models

Zero, Finite, and Infinite Belief History of Theory of Mind Reasoning in Large Language Models

Perceptions to Beliefs: Exploring Precursory Inferences for Theory of Mind in Large Language Models

Computational Language Acquisition with Theory of Mind

How FaR Are Large Language Models From Agents with Theory-of-Mind?

Evaluating Large Language Models in Theory of Mind Tasks

Language models and psychological sciences

Multi-ToM: Evaluating Multilingual Theory of Mind Capabilities in Large Language Models

Large Language Models as Neurolinguistic Subjects: Identifying Internal Representations for Form and Meaning

Testing theory of mind in large language models and humans