MindGames: Targeting Theory of Mind in Large Language Models with Dynamic Epistemic Modal Logic

Damien Sileo, Antoine Lernould
2023-11-07
Abstract: Theory of Mind (ToM) is a critical component of intelligence but its assessment remains the subject of heated debates. Prior research applied human ToM assessments to natural language processing models using either human-created standardized tests or rule-based templates. However, these methods primarily focus on simplistic reasoning and require further validation. Here, we leverage dynamic epistemic logic to isolate a particular component of ToM and to generate controlled problems. We also introduce new verbalization techniques to express these problems in English natural language. Our findings indicate that some language model scaling (from 70M to 6B and 350M to 174B) does not consistently yield results better than random chance. While GPT-4 demonstrates superior epistemic reasoning capabilities, there is still room for improvement. Our code and datasets are publicly available (https://huggingface.co/datasets/sileod/mindgames, https://github.com/sileod/llm-theory-of-mind).
Computation and Language, Artificial Intelligence
What problem does this paper attempt to address?
This paper addresses how to evaluate the Theory of Mind (ToM) capabilities of large language models (LLMs). The authors use dynamic epistemic logic (DEL) to generate controlled problems that isolate a particular component of ToM, then verbalize these problems in English so they can be posed as natural language reasoning tasks to language models of different scales. They find that some language model scaling (from 70M to 6B and from 350M to 174B) does not consistently yield results better than random chance. GPT-4 demonstrates superior epistemic reasoning capabilities, but it still leaves room for improvement. The resulting dataset, MindGames, and the accompanying code are publicly available.
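To make the idea of dynamic epistemic logic more concrete, here is a minimal sketch of a possible-worlds model with a public announcement. It is an illustration only and not the authors' generation pipeline: the two-agent "muddy forehead" setup, the observation rule (each agent sees everyone but itself), and the helper names `indistinguishable`, `knows`, and `announce` are all assumptions chosen for the example.

```python
from itertools import product


def indistinguishable(agent, w1, w2):
    # Assumption: an agent observes every other agent's bit but not its own,
    # so two worlds look the same to it if they agree everywhere else.
    return all(a == b for i, (a, b) in enumerate(zip(w1, w2)) if i != agent)


def knows(agent, prop, world, worlds):
    # The agent knows `prop` at `world` if `prop` holds in every world
    # the agent cannot tell apart from `world`.
    return all(prop(w) for w in worlds if indistinguishable(agent, world, w))


def announce(prop, worlds):
    # Public announcement: discard every world where the statement is false.
    return [w for w in worlds if prop(w)]


def agent0_muddy(w):
    return w[0]


def someone_muddy(w):
    return any(w)


# Each world assigns True (muddy) or False (clean) to each of two agents.
worlds = list(product([False, True], repeat=2))
actual = (True, False)  # hypothetical scenario: agent 0 muddy, agent 1 clean

# Before any announcement, agent 0 cannot rule out being clean.
before = knows(0, agent0_muddy, actual, worlds)

# After "someone is muddy" is announced, agent 0 sees that agent 1 is clean
# and can conclude that it must be the muddy one.
updated = announce(someone_muddy, worlds)
after = knows(0, agent0_muddy, actual, updated)

# Second-order question: does agent 1 know that agent 0 knows?
second_order = knows(
    1, lambda w: knows(0, agent0_muddy, w, updated), actual, updated
)

print(before, after, second_order)
```

The second-order query is the kind of nested belief question that a ToM problem targets: agent 1 cannot see its own forehead, so it cannot be sure that agent 0 had enough information to reach a conclusion. Turning such a model and query into an English premise and hypothesis is a separate verbalization step that this sketch does not attempt.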