PersonaLLM: Investigating the Ability of Large Language Models to Express Personality Traits

Hang Jiang,Xiajie Zhang,Xubo Cao,Cynthia Breazeal,Deb Roy,Jad Kabbara

2024-04-02

Abstract:Despite the many use cases for large language models (LLMs) in creating personalized chatbots, there has been limited research on evaluating the extent to which the behaviors of personalized LLMs accurately and consistently reflect specific personality traits. We consider studying the behavior of LLM-based agents which we refer to as LLM personas and present a case study with GPT-3.5 and GPT-4 to investigate whether LLMs can generate content that aligns with their assigned personality profiles. To this end, we simulate distinct LLM personas based on the Big Five personality model, have them complete the 44-item Big Five Inventory (BFI) personality test and a story writing task, and then assess their essays with automatic and human evaluations. Results show that LLM personas' self-reported BFI scores are consistent with their designated personality types, with large effect sizes observed across five traits. Additionally, LLM personas' writings have emerging representative linguistic patterns for personality traits when compared with a human writing corpus. Furthermore, human evaluation shows that humans can perceive some personality traits with an accuracy of up to 80%. Interestingly, the accuracy drops significantly when the annotators were informed of AI authorship.

Computation and Language,Artificial Intelligence,Human-Computer Interaction

What problem does this paper attempt to address?

The paper aims to explore the ability of large language models (LLMs) to express personality traits. Specifically, researchers define LLM-based agents (referred to as LLM personas) that are prompted to generate content reflecting specific personality traits. The core question of the study is to verify whether these LLM personas can accurately reflect the personality traits they are assigned when completing the Big Five Inventory (BFI) test. Additionally, the study explores how these agents with specific personality traits are perceived by humans, i.e., whether humans can accurately perceive these personality traits through the stories generated by the agents. The main objectives of the study include: 1. **Verifying whether LLM personas can reflect the specified personality traits when completing the BFI assessment** (RQ1). 2. **Analyzing whether there are specific language patterns in the stories generated by LLM personas** (RQ2). 3. **Evaluating the perceptions of human and LLM evaluators regarding the stories generated by LLM personas** (RQ3). 4. **Examining whether humans and LLMs can accurately identify the Big Five personality traits from the stories** (RQ4).

PersonaLLM: Investigating the Ability of Large Language Models to Express Personality Traits

PersonaLLM: Investigating the Ability of GPT-3.5 to Express Personality Traits and Gender Differences

Humanity in AI: Detecting the Personality of Large Language Models

Personality Traits in Large Language Models

Identifying Multiple Personalities in Large Language Models with External Evaluation

LMLPA: Language Model Linguistic Personality Assessment

PersLLM: A Personified Training Approach for Large Language Models

Large Language Models Can Infer Personality from Free-Form User Interactions

Illuminating the Black Box: A Psychometric Investigation into the Multifaceted Nature of Large Language Models

Limited Ability of LLMs to Simulate Human Psychological Behaviours: a Psychometric Analysis

Editing Personality for LLMs

Revisiting the Reliability of Psychological Scales on Large Language Models

Eliciting Big Five Personality Traits in Large Language Models: A Textual Analysis with Classifier-Driven Approach

Do LLMs Possess a Personality? Making the MBTI Test an Amazing Evaluation for Large Language Models

Personality testing of Large Language Models: Limited temporal stability, but highlighted prosociality

Can ChatGPT Assess Human Personalities? A General Evaluation Framework

Self-assessment, Exhibition, and Recognition: a Review of Personality in Large Language Models

LLMs Simulate Big Five Personality Traits: Further Evidence

Large Language Models Can Infer Psychological Dispositions of Social Media Users

Is persona enough for personality? Using ChatGPT to reconstruct an agent's latent personality from simple descriptions

Challenging the Validity of Personality Tests for Large Language Models