Abstract: In the social world, humans can infer and reason about others' mental states (such as emotions, beliefs, and intentions), a capability known as Theory of Mind (ToM). At the same time, humans' own mental states evolve in response to social situations, a capability we refer to as socialization. Together, these capabilities form the foundation of human social interaction. In the era of artificial intelligence (AI), especially with the development of large language models (LLMs), we raise an intriguing question: How do LLMs perform in terms of ToM and socialization capabilities? And more broadly, can these AI models truly enter and navigate the real social world? Existing research evaluates LLMs' ToM and socialization capabilities by positioning LLMs as passive observers from a third-person perspective, rather than as active participants. However, compared to the third-person perspective, observing and understanding the world from an egocentric first-person perspective is a natural approach for both humans and AI agents. The ToM and socialization capabilities of LLMs from a first-person perspective, a crucial attribute for advancing embodied AI agents, remain unexplored. To answer these questions and bridge this research gap, we introduce EgoSocialArena, a novel framework designed to evaluate and investigate the ToM and socialization capabilities of LLMs from a first-person perspective. It encompasses two evaluation environments, a static environment and an interactive environment, with seven scenarios, including Daily Life, Counterfactual, New World, Blackjack, Number Guessing, and Limit Texas Hold'em, totaling 2,195 data entries. With EgoSocialArena, we conduct a comprehensive evaluation of nine advanced LLMs and report key insights regarding the future development of LLMs as well as the capability levels of the most advanced LLMs currently available.

LLM Theory of Mind and Alignment: Opportunities and Risks

Do LLMs Exhibit Human-Like Reasoning? Evaluating Theory of Mind in LLMs for Open-Ended Responses

Towards A Holistic Landscape of Situated Theory of Mind in Large Language Models

The benefits, risks and bounds of personalizing the alignment of large language models to individuals

Empathy and the Right to Be an Exception: What LLMs Can and Cannot Do

LLMs achieve adult human performance on higher-order theory of mind tasks

LLMs and the Human Condition

LLMs grasp morality in concept

A Survey on Human-Centric LLMs

The Moral Turing Test: Evaluating Human-LLM Alignment in Moral Decision-Making

Rethinking Machine Ethics -- Can LLMs Perform Moral Reasoning through the Lens of Moral Theories?

Entering Real Social World! Benchmarking the Theory of Mind and Socialization Capabilities of LLMs from a First-person Perspective

Exploring the psychology of LLMs' Moral and Legal Reasoning

Moral Alignment for LLM Agents

Navigating LLM Ethics: Advancements, Challenges, and Future Directions

Mind Your Theory: Theory of Mind Goes Deeper Than Reasoning

Language Model Alignment in Multilingual Trolley Problems

Alignment Between the Decision-Making Logic of LLMs and Human Cognition: A Case Study on Legal LLMs

Large Language Models: The Need for Nuance in Current Debates and a Pragmatic Perspective on Understanding

FairMindSim: Alignment of Behavior, Emotion, and Belief in Humans and LLM Agents Amid Ethical Dilemmas