Exploring Social Media Posts for Depression Identification: A Study on Reddit Dataset

Nandigramam Sai Harshit, Nilesh Kumar Sahu, Haroon R. Lone
DOI: https://doi.org/10.48550/arxiv.2405.06656
2024-04-16
Computation and Language
Abstract: Depression is one of the most common mental disorders, affecting an individual's personal and professional life. In this work, we investigate whether social media posts can be used to identify depression in individuals. As a preliminary study, we extracted and analyzed the top Reddit posts made in 2022 from depression-related forums. The collected posts were labeled as depressive or non-depressive using the UMLS Metathesaurus. The pre-processed data were then fed to classical machine learning models, which achieved an accuracy of 92.28% in classifying posts as depressive or non-depressive.
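The label-then-classify pipeline described in the abstract can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration: the `DEPRESSION_TERMS` list is a hypothetical stand-in for a UMLS Metathesaurus lookup, the four posts are invented toy data, and multinomial Naive Bayes is just one example of a classical model (the abstract does not say which models were used). This is not the authors' implementation.

```python
import math
import re
from collections import Counter

# Hypothetical stand-in for a UMLS Metathesaurus lookup: a tiny hand-picked
# term list. The vocabulary and matching rules actually used are not stated
# in the abstract.
DEPRESSION_TERMS = {"hopeless", "worthless", "depressed", "suicidal"}


def tokenize(text):
    """Lowercase a post and split it into alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


def label_post(text):
    """Label a post 1 (depressive) if it contains any listed term, else 0."""
    return int(any(tok in DEPRESSION_TERMS for tok in tokenize(text)))


class NaiveBayes:
    """Multinomial Naive Bayes with add-one (Laplace) smoothing."""

    def fit(self, texts, labels):
        self.classes = sorted(set(labels))
        self.doc_counts = Counter(labels)
        self.word_counts = {c: Counter() for c in self.classes}
        for text, label in zip(texts, labels):
            self.word_counts[label].update(tokenize(text))
        self.vocab = set().union(*self.word_counts.values())
        return self

    def predict(self, text):
        # Words never seen in training are ignored.
        tokens = [t for t in tokenize(text) if t in self.vocab]
        total_docs = sum(self.doc_counts.values())
        best_class, best_score = None, -math.inf
        for c in self.classes:
            total_words = sum(self.word_counts[c].values())
            # Log prior plus smoothed log likelihood of each token.
            score = math.log(self.doc_counts[c] / total_docs)
            for tok in tokens:
                count = self.word_counts[c][tok] + 1
                score += math.log(count / (total_words + len(self.vocab)))
            if score > best_score:
                best_class, best_score = c, score
        return best_class


# Invented toy posts, for illustration only (not from the Reddit dataset).
posts = [
    "I feel hopeless and worthless every day",
    "Feeling depressed and hopeless again tonight",
    "Went hiking with friends and had a great day",
    "Great movie night with friends",
]
labels = [label_post(p) for p in posts]
model = NaiveBayes().fit(posts, labels)
print(model.predict("hopeless and tired tonight"))
print(model.predict("great day with friends"))
```

Because the labels in this sketch come from the same keywords the classifier later sees, its accuracy on such data says little about real-world performance; a realistic evaluation would hold out posts for testing and compare several models.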