Is Personality Prediction Possible Based on Reddit Comments?

Robert Deimann,Till Preidt,Shaptarshi Roy,Jan Stanicki
2024-08-29
Abstract:In this assignment, we examine whether there is a correlation between the personality type of a person and the texts they wrote. In order to do this, we aggregated datasets of Reddit comments labeled with the Myers-Briggs Type Indicator (MBTI) of the author and built different supervised classifiers based on BERT to try to predict the personality of an author given a text. Despite experiencing issues with the unfiltered character of the dataset, we can observe potential in the classification.
Computation and Language
What problem does this paper attempt to address?