Abstract: Theory of Mind (ToM), the cognitive ability to reason about the mental states of ourselves and others, is the foundation of social interaction. Although ToM comes naturally to humans, it poses a significant challenge to even the most advanced Large Language Models (LLMs). Because ToM reasoning involves complex logical chains, especially in higher-order ToM questions, simply applying reasoning methods such as Chain of Thought (CoT) does not improve the ToM capabilities of LLMs. We present TimeToM, which constructs a temporal space and uses it as the foundation for improving the ToM capabilities of LLMs in multiple scenarios. Specifically, within the temporal space, we construct a Temporal Belief State Chain (TBSC) for each character. Inspired by the cognition perspective of the social world model, we divide the TBSC into self-world beliefs and social-world beliefs, which align with first-order ToM (first-order beliefs) and higher-order ToM (higher-order beliefs) questions, respectively. Moreover, we design a novel tool, the belief solver, which considers belief communication between characters in the temporal space and can transform a character's higher-order beliefs into another character's first-order beliefs within the belief communication period. Experimental results indicate that TimeToM can dramatically improve the reasoning performance of LLMs on ToM questions while taking a big step towards coherent and robust ToM reasoning.


### What problems does this paper attempt to solve?

This paper aims to address the challenges that large language models (LLMs) face in "Theory of Mind" (ToM) reasoning. Specifically:

1. **Complexity of ToM reasoning**: ToM refers to the human ability to reason about one's own and others' mental states, which is the basis of social interaction. Although ToM is a natural ability for humans, it remains a significant challenge for the most advanced LLMs, especially on higher-order ToM problems.
2. **Limitations of existing methods**: Methods such as Chain of Thought (CoT) have improved the reasoning ability of LLMs on some tasks, but they are not effective for ToM reasoning, especially on higher-order ToM problems.
3. **Temporal order and logical errors**: Common mistakes made by LLMs in ToM reasoning include ignoring the temporal order of events, generating outputs that violate common sense, and confusing the reasoning logic of higher-order ToM problems.

To solve these problems, the paper proposes **TimeToM** (Temporal Space is the Key to Unlocking the Door of Large Language Models' Theory-of-Mind), which improves the ToM reasoning ability of LLMs by constructing a temporal space. The method has three parts:

- **Constructing a temporal space**: Adding a timeline to a story or conversation so that LLMs can more clearly understand the temporal order of events.
- **Constructing a Temporal Belief State Chain (TBSC)**: Building a temporal belief state chain for each character from the events that character perceives, divided into self-world beliefs and social-world beliefs.
- **Designing a belief solver**: Using the temporal belief communication between characters to transform higher-order ToM problems into lower-order ToM problems, which simplifies the reasoning process.

Experimental results show that TimeToM significantly improves the reasoning performance of LLMs on ToM problems, especially on higher-order ToM problems.
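The first two steps above can be illustrated with a minimal Python sketch. It is not the paper's implementation: in TimeToM these steps are carried out by prompting an LLM, whereas here the set of characters who witness each event is written out by hand. The story, the function names `build_temporal_space` and `build_tbsc`, and the witness sets are all hypothetical and exist only for illustration.

```python
from collections import defaultdict

def build_temporal_space(events):
    """Attach a time label t1, t2, ... to each event in story order."""
    return [(f"t{i}", text) for i, (text, _) in enumerate(events, start=1)]

def build_tbsc(events):
    """Build one belief state chain per character.

    A character's chain holds only the time-stamped events that the
    character witnessed (assumption: witnesses are given explicitly).
    """
    chains = defaultdict(list)
    for i, (text, witnesses) in enumerate(events, start=1):
        for name in witnesses:
            chains[name].append((f"t{i}", text))
    return dict(chains)

# Hypothetical story: each event is paired with the characters present.
story = [
    ("John, Bob and Alice enter the kitchen.", {"John", "Bob", "Alice"}),
    ("John puts the apple in the basket.", {"John", "Bob", "Alice"}),
    ("Alice leaves the kitchen.", {"John", "Bob"}),
    ("Bob moves the apple to the fridge.", {"John", "Bob"}),
]

timeline = build_temporal_space(story)
tbsc = build_tbsc(story)

for name in sorted(tbsc):
    print(name, [t for t, _ in tbsc[name]])
```

In this toy story Alice's chain stops at `t2`, so a first-order question about where Alice thinks the apple is would be answered from her chain alone, not from the full timeline.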
### Formula representation

The paper does not rely on complex mathematics, but its key steps can be written compactly. In the expressions below, $M$ is read as the LLM, each $p$ as a step-specific prompt, and $\|$ as concatenation of the prompt with the input.

- Constructing a temporal space:
  \[ x_t = M(p_{cts} \| x) \]
- Constructing a Temporal Belief State Chain:
  \[ c_{tbsc} = M(p_{tbsc} \| x_t) \]
- Generating self-world beliefs:
  \[ c_{tbsc\text{-}self} = M(p_{self} \| c_{tbsc}) \]
- Answering ToM questions:
  \[ y_{first\_order} = M(p_{qa} \| c_{tbsc\text{-}self}) \]
  \[ y_{higher\_order} = M(p_{qa} \| c_{tbsc}) \]
- Temporal set parsing:
  \[ T_{John} = [t_1, t_2, t_3, t_4, t_5, t_6, t_7] \]
  \[ T_{Bob} = [t_1, t_2, t_3, t_4, t_5] \]
  \[ T_{Alice} = [t_1, t_2, t_3, t_6, t_7] \]
- Belief communication:
  \[ BC_{John,Bob} = T_{John} \cap T_{Bob} \]
  \[ BC_{John,Bob,Alice} = T_{John} \cap T_{Bob} \cap T_{Alice} \]

Together, these steps are what allow TimeToM to improve the ToM reasoning ability of LLMs across scenarios.
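The belief communication step is a plain set intersection, so it can be checked directly. The sketch below uses the three temporal sets given above; the helper name `belief_communication` and the dictionary layout are assumptions made for this example, not names from the paper.

```python
def belief_communication(timelines, *names):
    """Return the time steps shared by all named characters.

    This is the intersection of their temporal sets, sorted by the
    numeric part of each label (assumes labels of the form 't<number>').
    """
    common = set(timelines[names[0]])
    for name in names[1:]:
        common &= set(timelines[name])
    return sorted(common, key=lambda t: int(t[1:]))

# Temporal sets as listed in the summary above.
timelines = {
    "John": ["t1", "t2", "t3", "t4", "t5", "t6", "t7"],
    "Bob": ["t1", "t2", "t3", "t4", "t5"],
    "Alice": ["t1", "t2", "t3", "t6", "t7"],
}

bc_john_bob = belief_communication(timelines, "John", "Bob")
bc_all = belief_communication(timelines, "John", "Bob", "Alice")

print(bc_john_bob)  # ['t1', 't2', 't3', 't4', 't5']
print(bc_all)       # ['t1', 't2', 't3']
```

With these sets, John and Bob share the period `t1` to `t5`, while all three characters share only `t1` to `t3`. A higher-order question such as "where does John think Bob thinks the object is?" can then be restricted to the shared period and answered as a first-order question about Bob.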

TimeToM: Temporal Space is the Key to Unlocking the Door of Large Language Models' Theory-of-Mind
