Large Language Models Can Infer Personality from Free-Form User Interactions

Heinrich Peters,Moran Cerf,Sandra C. Matz

2024-05-20

Abstract:This study investigates the capacity of Large Language Models (LLMs) to infer the Big Five personality traits from free-form user interactions. The results demonstrate that a chatbot powered by GPT-4 can infer personality with moderate accuracy, outperforming previous approaches drawing inferences from static text content. The accuracy of inferences varied across different conversational settings. Performance was highest when the chatbot was prompted to elicit personality-relevant information from users (mean r=.443, range=[.245, .640]), followed by a condition placing greater emphasis on naturalistic interaction (mean r=.218, range=[.066, .373]). Notably, the direct focus on personality assessment did not result in a less positive user experience, with participants reporting the interactions to be equally natural, pleasant, engaging, and humanlike across both conditions. A chatbot mimicking ChatGPT's default behavior of acting as a helpful assistant led to markedly inferior personality inferences and lower user experience ratings but still captured psychologically meaningful information for some of the personality traits (mean r=.117, range=[-.004, .209]). Preliminary analyses suggest that the accuracy of personality inferences varies only marginally across different socio-demographic subgroups. Our results highlight the potential of LLMs for psychological profiling based on conversational interactions. We discuss practical implications and ethical challenges associated with these findings.

Human-Computer Interaction,Artificial Intelligence,Computation and Language,Computers and Society,Machine Learning

What problem does this paper attempt to address?

The problem that this paper attempts to solve is to explore whether large - language models (LLMs) can infer the Big Five personality traits (i.e., openness, conscientiousness, extraversion, agreeableness, and neuroticism) from users' free - form interactions. Specifically, researchers evaluate the ability of a GPT - 4 - based chatbot to infer personality characteristics in different interaction modes by designing different experimental conditions. These modes include: 1. **Evaluation condition**: The chatbot is instructed to try to understand the user's Big Five personality traits without making them aware of it. 2. **Acquaintance condition**: The chatbot is instructed to have a natural conversation with the user in an attempt to understand their personality. 3. **Assistant condition**: The chatbot interacts with the user as a "helpful assistant" and does not take the initiative to ask personal questions. In addition, the study also explores the impact of different interaction modes on user experience and whether the personality traits inferred by LLM differ among different sociodemographic groups. Through the exploration of these issues, the study aims to reveal the potential of LLM in psychological assessment and the ethical challenges of its application.

Large Language Models Can Infer Personality from Free-Form User Interactions

Large Language Models Can Infer Psychological Dispositions of Social Media Users

PersonaLLM: Investigating the Ability of Large Language Models to Express Personality Traits

Artificial Intelligence and Personality: Large Language Models’ Ability to Predict Personality Type

PersonaLLM: Investigating the Ability of GPT-3.5 to Express Personality Traits and Gender Differences

Can ChatGPT Assess Human Personalities? A General Evaluation Framework

Humanity in AI: Detecting the Personality of Large Language Models

Personality Traits in Large Language Models

Is persona enough for personality? Using ChatGPT to reconstruct an agent's latent personality from simple descriptions

Can ChatGPT Read Who You Are?

Limited Ability of LLMs to Simulate Human Psychological Behaviours: a Psychometric Analysis

Challenging the Validity of Personality Tests for Large Language Models

Can large language models help predict results from a complex behavioural science study?

Can Large Language Models Assess Personality from Asynchronous Video Interviews? A Comprehensive Evaluation of Validity, Reliability, Fairness, and Rating Patterns

PersonalityChat: Conversation Distillation for Personalized Dialog Modeling with Facts and Traits

Eliciting Big Five Personality Traits in Large Language Models: A Textual Analysis with Classifier-Driven Approach

Dynamic Generation of Personalities with Large Language Models

LMLPA: Language Model Linguistic Personality Assessment

Large language models and humans converge in judging public figures' personalities

Eliciting Personality Traits in Large Language Models