Abstract: Some of the required characteristics for a true machine theory of mind (MToM) include the ability to (1) reproduce the full diversity of human thought and behavior, (2) develop a personalized model of an individual with very limited data, and (3) provide an explanation for behavioral predictions grounded in the cognitive processes of the individual. We propose that a certain class of cognitive models provides an approach that is well suited to meeting those requirements. Grounding the models in a mechanistic framework, namely a cognitive architecture such as ACT‐R, naturally fulfills the third requirement by mapping behavior to cognitive mechanisms. Exploiting a modeling paradigm such as instance‐based learning addresses the first requirement by translating variations in individual experience into a diversity of behavior. Mechanisms such as knowledge tracing and model tracing allow a specific run of the cognitive model to be aligned with a given individual's behavior trace, fulfilling the second requirement. We illustrate these principles with a cognitive model of decision‐making in a search and rescue task in the Minecraft simulation environment. We demonstrate that cognitive models personalized to individual human players can provide the MToM capability to optimize artificial intelligence agents by diagnosing the underlying causes of observed human behavior, projecting the future effects of potential interventions, and managing the adaptive process of shaping human behavior. Examples of the inputs provided by such analytic cognitive agents include predictions of cognitive load, probability of error, estimates of player self‐efficacy, and trust calibration. Finally, we discuss implications for future research and applications to collective human–machine intelligence.

MindCraft: Theory of Mind Modeling for Situated Dialogue in Collaborative Tasks

TeamCraft: A Benchmark for Multi-Modal Multi-Agent Systems in Minecraft

Towards Collaborative Plan Acquisition through Theory of Mind Modeling in Situated Dialogue

MindForge: Empowering Embodied Agents with Theory of Mind for Lifelong Collaborative Learning

Mutual Theory of Mind in Human-AI Collaboration: An Empirical Study with LLM-driven AI Agents in a Real-time Shared Workspace Task

Cognitive Models for Machine Theory of Mind

Modeling Theory of Mind in Multi-Agent Games Using Adaptive Feedback Control

Emergence of Theory of Mind Collaboration in Multiagent Systems

Solving Dialogue Grounding Embodied Task in a Simulated Environment using Further Masked Language Modeling

ToM2C: Target-oriented Multi-agent Communication and Cooperation with Theory of Mind

MindDial: Belief Dynamics Tracking with Theory-of-Mind Modeling for Situated Neural Dialogue Generation

A Brain-Inspired Model of Theory of Mind

MuMA-ToM: Multi-modal Multi-Agent Theory of Mind

Theory of Mind for Multi-Agent Collaboration via Large Language Models

Limits of Theory of Mind Modelling in Dialogue-Based Collaborative Plan Acquisition

Improving Agent Interactions in Virtual Environments with Language Models

Mathematical Models of Theory of Mind

MineDojo: Building Open-Ended Embodied Agents with Internet-Scale Knowledge

MindAgent: Emergent Gaming Interaction

See and Think: Embodied Agent in Virtual Environment

A Bayesian theory of mind approach to modeling cooperation and communication