Abstract:Theory of Mind (ToM)-the cognitive ability to reason about mental states of ourselves and others, is the foundation of social interaction. Although ToM comes naturally to humans, it poses a significant challenge to even the most advanced Large Language Models (LLMs). Due to the complex logical chains in ToM reasoning, especially in higher-order ToM questions, simply utilizing reasoning methods like Chain of Thought (CoT) will not improve the ToM capabilities of LLMs. We present TimeToM, which constructs a temporal space and uses it as the foundation to improve the ToM capabilities of LLMs in multiple scenarios. Specifically, within the temporal space, we construct Temporal Belief State Chain (TBSC) for each character and inspired by the cognition perspective of the social world model, we divide TBSC into self-world beliefs and social world beliefs, aligning with first-order ToM (first-order beliefs) and higher-order ToM (higher-order beliefs) questions, respectively. Moreover, we design a novel tool-belief solver that, by considering belief communication between characters in temporal space, can transform a character's higher-order beliefs into another character's first-order beliefs under belief communication period. Experimental results indicate that TimeToM can dramatically improve the reasoning performance of LLMs on ToM questions while taking a big step towards coherent and robust ToM reasoning.
What problem does this paper attempt to address?
### What problems does this paper attempt to solve?
This paper aims to address the challenges that large language models (LLMs) face in "Theory of Mind" (ToM) reasoning. Specifically:
1. **Complexity of ToM reasoning**: ToM refers to the human ability to reason about one's own and others' mental states, which is the basis of social interaction. Although ToM is a natural ability for humans, it remains a significant challenge for the most advanced LLMs, especially when dealing with high - order ToM problems.
2. **Limitations of existing methods**: Current methods such as Chain of Thought (CoT) have improved the reasoning ability of LLMs in some tasks, but they are not effective in ToM reasoning, especially in high - order ToM problems.
3. **Temporal order and logical errors**: Common mistakes made by LLMs in ToM reasoning include ignoring the temporal order of events, generating outputs that violate common sense, and confusing the reasoning logic of high - order ToM problems.
To solve these problems, the paper proposes **TIMETOM** (Temporal Space is the Key to Unlocking the Door of Large Language Models’ Theory - of - Mind), which improves the ToM reasoning ability of LLMs by constructing a temporal space. Specific methods include:
- **Constructing a temporal space**: Adding a timeline to a story or conversation so that LLMs can more clearly understand the temporal order of events.
- **Constructing a Temporal Belief State Chain (TBSC)**: Building a temporal belief state chain for each character based on the events they perceive, divided into self - world beliefs and social - world beliefs.
- **Designing a Belief Solver**: By considering the temporal belief communication between characters, transforming high - order ToM problems into low - order ToM problems, thereby simplifying the reasoning process.
Experimental results show that TIMETOM significantly improves the reasoning performance of LLMs on ToM problems, especially in high - order ToM problems.
### Formula representation
The paper does not involve complex mathematical formulas, but for the sake of accuracy and readability of the presentation, the following is the Markdown - format representation of some key steps:
- Constructing a temporal space:
\[
x_t = M(p_{cts}\|x)
\]
- Constructing a Temporal Belief State Chain:
\[
c_{tbsc} = M(p_{tbsc}\|x_t)
\]
- Generating self - world beliefs:
\[
c_{tbsc - self} = M(p_{self}\|c_{tbsc})
\]
- Answering ToM questions:
\[
y_{first\_order} = M(p_{qa}\|c_{tbsc - self})
\]
\[
y_{higher\_order} = M(p_{qa}\|c_{tbsc})
\]
- Temporal set parsing:
\[
T_{John} = [t_1, t_2, t_3, t_4, t_5, t_6, t_7]
\]
\[
T_{Bob} = [t_1, t_2, t_3, t_4, t_5]
\]
\[
T_{Alice} = [t_1, t_2, t_3, t_6, t_7]
\]
- Belief communication:
\[
BC_{John,Bob} = T_{John}\cap T_{Bob}
\]
\[
BC_{John,Bob,Alice} = T_{John}\cap T_{Bob}\cap T_{Alice}
\]
These formulas and methods work together to enable TIMETOM to significantly improve the ToM reasoning ability of LLMs in various scenarios.