Large language models and humans converge in judging public figures' personalities

Xubo Cao, Michal Kosinski
DOI: https://doi.org/10.1093/pnasnexus/pgae418
2024-09-19
PNAS Nexus
Abstract: ChatGPT-4 and 600 human raters evaluated 226 public figures' personalities using the Ten-Item Personality Inventory. The correlations between ChatGPT-4 and aggregate human ratings ranged from r = 0.76 to 0.87, outperforming the models specifically trained to make such predictions. Notably, the model was not provided with any training data or feedback on its performance. We discuss the potential explanations and practical implications of ChatGPT-4's ability to mimic human responses accurately.