Exploring Personality and Online Social Engagement: An Investigation of MBTI Users on Twitter

Partha Kadambi
DOI: https://doi.org/10.48550/arXiv.2109.06402
2021-09-14
Abstract: Text-based personality prediction by computational models is an emerging field with the potential to significantly improve on key weaknesses of survey-based personality assessment. We investigate 3848 profiles from Twitter with self-labeled Myers-Briggs personality traits (MBTI) - a framework closely related to the Five Factor Model of personality - to better understand how text-based digital traces from social engagement online can be used to predict user personality traits. We leverage BERT, a state-of-the-art NLP architecture based on deep learning, to analyze various sources of text that hold most predictive power for our task. We find that biographies, statuses, and liked tweets contain significant predictive power for all dimensions of the MBTI system. We discuss our findings and their implications for the validity of the MBTI and the lexical hypothesis, a foundational theory underlying the Five Factor Model that links language use and behavior. Our results hold optimistic implications for personality psychologists, computational linguists, and other social scientists aiming to predict personality from observational text data and explore the links between language and core behavioral traits.
Computation and Language, Machine Learning
What problem does this paper attempt to address?
The paper explores how personality traits manifest in online social interactions by analyzing the social engagement and information consumption patterns of Twitter users who self-identified their Myers-Briggs Type Indicator (MBTI) personality types. Its main objectives are:

1. **Understanding how personality traits manifest on social media**: By analyzing the social media behavior of users with different MBTI personality types, the authors explore the semantic space behind these behaviors and the relationship between personality traits and online information consumption preferences.
2. **Validating the effectiveness and theoretical basis of the MBTI system**: By building deep learning models that predict users' MBTI traits and evaluating their prediction accuracy, the authors indirectly test the validity of the MBTI system and its theoretical basis. In particular, they test whether the four dimensions of MBTI (Introversion/Extraversion, Sensing/Intuition, Thinking/Feeling, Judging/Perceiving) can be accurately predicted from text data, and what the prediction results imply for the MBTI system itself.
3. **Exploring the relationship between various forms of social participation and personality traits**: The study examines how different types of user text (such as personal bios, status updates, and liked tweets) reflect personality traits, in order to understand the association between different forms of digital traces and those traits.

Through this research, the paper aims to give psychologists, computational linguists, and other social scientists new tools and methods for understanding and predicting personality traits, especially when social media data is used for personality assessment.
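To make the four-dimension framing above concrete, here is a minimal sketch of how a self-reported four-letter MBTI type could be split into four binary labels, one per dimension. It assumes each dimension is treated as an independent binary target, which the summary suggests but does not confirm. The helper name `mbti_to_binary_labels`, the `DIMENSIONS` constant, and the 0/1 encoding are illustrative choices, not taken from the paper.

```python
# Hypothetical helper: names and the 0/1 encoding are illustrative assumptions,
# not the authors' implementation.
DIMENSIONS = (
    ("I", "E"),  # Introversion / Extraversion
    ("S", "N"),  # Sensing / Intuition
    ("T", "F"),  # Thinking / Feeling
    ("J", "P"),  # Judging / Perceiving
)


def mbti_to_binary_labels(mbti_type: str) -> dict[str, int]:
    """Map a four-letter MBTI type to one binary label per dimension.

    The first letter of each pair is encoded as 0 and the second as 1.
    """
    code = mbti_type.strip().upper()
    if len(code) != 4:
        raise ValueError(f"expected a four-letter MBTI type, got {mbti_type!r}")

    labels = {}
    for letter, (first, second) in zip(code, DIMENSIONS):
        if letter not in (first, second):
            raise ValueError(f"unexpected letter {letter!r} in {mbti_type!r}")
        labels[f"{first}/{second}"] = 0 if letter == first else 1
    return labels


if __name__ == "__main__":
    print(mbti_to_binary_labels("INTJ"))
```

Under this encoding, each of the four labels could feed a separate binary text classifier, so prediction accuracy can be reported per dimension rather than over all 16 types at once.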