Personality Analysis for Social Media Users using Arabic language and its Effect on Sentiment Analysis

Mokhaiber Dandash,Masoud Asadpour
2024-07-23
Abstract:Social media is heading towards more and more personalization, where individuals reveal their beliefs, interests, habits, and activities, simply offering glimpses into their personality traits. This study, explores the correlation between the use of Arabic language on twitter, personality traits and its impact on sentiment analysis. We indicated the personality traits of users based on the information extracted from their profile activities, and the content of their tweets. Our analysis incorporated linguistic features, profile statistics (including gender, age, bio, etc.), as well as additional features like emoticons. To obtain personality data, we crawled the timelines and profiles of users who took the 16personalities test in Arabic on <a class="link-external link-http" href="http://16personalities.com" rel="external noopener nofollow">this http URL</a>. Our dataset, "AraPers", comprised 3,250 users who shared their personality results on twitter. We implemented various machine learning techniques, to reveal personality traits and developed a dedicated model for this purpose, achieving a 74.86% accuracy rate with BERT, analysis of this dataset proved that linguistic features, profile features and derived model can be used to differentiate between different personality traits. Furthermore, our findings demonstrated that personality affect sentiment in social media. This research contributes to the ongoing efforts in developing robust understanding of the relation between human behaviour on social media and personality features for real-world applications, such as political discourse analysis, and public opinion tracking.
Computation and Language
What problem does this paper attempt to address?
The main objective of this paper is to explore the relationship between the personality traits of Arabic social media users and their sentiment analysis. The study specifically attempts to address the following key issues: 1. **The relationship between personality traits and social media behavior**: The paper explores how the behavior of Arabic social media users on Twitter reflects their personality traits. Researchers identify these traits based on the information users display in their profiles and the content of their tweets. 2. **Factors influencing sentiment analysis**: The paper also examines how personality traits affect sentiment analysis on social media. Researchers analyze the emotional expressions of different personality types and explore the connection between these emotions and personality traits. 3. **Challenges in processing Arabic**: Given the complexity and richness of the Arabic language, the study also discusses the unique challenges of processing this language, including its morphological features, lexical variations, and lack of standardization. 4. **Dataset creation**: To overcome these challenges, the authors constructed a dataset named "AraPers," which includes the profiles of 3,250 users who completed 16 types of personality tests. These users shared their test results on Twitter, allowing researchers to link social media behavior with personality test results. 5. **Application of machine learning**: The study employs various machine learning techniques, such as deep learning models like BERT, to reveal personality traits and assess the accuracy of these models in predicting personality traits. The results show that linguistic features, profile statistics, and other characteristics can be used to distinguish different personality traits. In summary, this paper aims to reveal the impact of personality traits on emotional expression by analyzing the behavior and language use of Arabic social media users, and to explore how these findings can be utilized for more accurate social media analysis. This research is significant for understanding the relationship between human online behavior and personality traits, especially in fields such as political discourse analysis and public opinion tracking.