Large Language Models Can Infer Personality from Free-Form User Interactions

Heinrich Peters,Moran Cerf,Sandra C. Matz
2024-05-20
Abstract:This study investigates the capacity of Large Language Models (LLMs) to infer the Big Five personality traits from free-form user interactions. The results demonstrate that a chatbot powered by GPT-4 can infer personality with moderate accuracy, outperforming previous approaches drawing inferences from static text content. The accuracy of inferences varied across different conversational settings. Performance was highest when the chatbot was prompted to elicit personality-relevant information from users (mean r=.443, range=[.245, .640]), followed by a condition placing greater emphasis on naturalistic interaction (mean r=.218, range=[.066, .373]). Notably, the direct focus on personality assessment did not result in a less positive user experience, with participants reporting the interactions to be equally natural, pleasant, engaging, and humanlike across both conditions. A chatbot mimicking ChatGPT's default behavior of acting as a helpful assistant led to markedly inferior personality inferences and lower user experience ratings but still captured psychologically meaningful information for some of the personality traits (mean r=.117, range=[-.004, .209]). Preliminary analyses suggest that the accuracy of personality inferences varies only marginally across different socio-demographic subgroups. Our results highlight the potential of LLMs for psychological profiling based on conversational interactions. We discuss practical implications and ethical challenges associated with these findings.
Human-Computer Interaction,Artificial Intelligence,Computation and Language,Computers and Society,Machine Learning
What problem does this paper attempt to address?
The problem that this paper attempts to solve is to explore whether large - language models (LLMs) can infer the Big Five personality traits (i.e., openness, conscientiousness, extraversion, agreeableness, and neuroticism) from users' free - form interactions. Specifically, researchers evaluate the ability of a GPT - 4 - based chatbot to infer personality characteristics in different interaction modes by designing different experimental conditions. These modes include: 1. **Evaluation condition**: The chatbot is instructed to try to understand the user's Big Five personality traits without making them aware of it. 2. **Acquaintance condition**: The chatbot is instructed to have a natural conversation with the user in an attempt to understand their personality. 3. **Assistant condition**: The chatbot interacts with the user as a "helpful assistant" and does not take the initiative to ask personal questions. In addition, the study also explores the impact of different interaction modes on user experience and whether the personality traits inferred by LLM differ among different sociodemographic groups. Through the exploration of these issues, the study aims to reveal the potential of LLM in psychological assessment and the ethical challenges of its application.