Multi Class Depression Detection Through Tweets using Artificial Intelligence

Muhammad Osama Nusrat, Waseem Shahzad, Saad Ahmed Jamal
2024-04-19
Abstract: Depression is a significant issue nowadays. According to the World Health Organization (WHO), over 280 million individuals were living with depression in 2023. If the problem is not taken seriously, this number will increase rapidly. About 4.89 billion individuals are social media users. People express their feelings and emotions on platforms such as Twitter, Facebook, Reddit, and Instagram. These platforms contain valuable information that can be used for research purposes. Considerable research has been conducted across various social media platforms, but certain limitations persist. In particular, previous studies focused only on detecting depression and its intensity in tweets, and their dataset labeling contained inaccuracies. In this research work, five types of depression (bipolar, major, psychotic, atypical, and postpartum) were predicted using tweets from the Twitter database, labeled with a lexicon-based method. Explainable AI was used to provide reasoning by highlighting the parts of tweets that represent the type of depression. Bidirectional Encoder Representations from Transformers (BERT) was used for feature extraction and training. Machine learning and deep learning methodologies were used to train the model. The BERT model presented the most promising results, achieving an overall accuracy of 0.96.
Computation and Language, Artificial Intelligence
What problem does this paper attempt to address?
This paper addresses the detection of depression on social media platforms, especially Twitter, by analyzing users' tweets, and goes further by predicting the specific type of depression. Specifically, the research aims to:

1. **Detect depression**: Use artificial intelligence to identify, from their tweets, whether Twitter users show signs of depression.
2. **Predict depression types**: Beyond detection, predict five main types of depression (bipolar disorder, major depression, psychotic depression, atypical depression, and postpartum depression), which previous studies have rarely addressed.
3. **Improve accuracy**: Address the inaccurate dataset labels of previous studies and improve model accuracy by using a lexicon-based (dictionary-based) labeling method.
4. **Explain model decisions**: Use Explainable AI to explain the model's predictions by highlighting the parts of a tweet that represent a specific type of depression.

The authors construct a dataset covering the different types of depression, use the BERT model for feature extraction and training, and combine machine learning and deep learning methods. The paper reports an overall accuracy of 0.96 for the BERT model, which suggests progress in multi-class depression detection.
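
The lexicon-based labeling step mentioned above can be illustrated with a minimal sketch. The paper's actual lexicon and matching rules are not given here, so the keyword lists in `LEXICON` and the helper `label_tweet` below are hypothetical, chosen only to show the general idea: count how many terms from each type's word list occur in a tweet and assign the type with the most matches.

```python
import re

# Hypothetical lexicon: these keywords are illustrative assumptions,
# not the terms used in the paper.
LEXICON = {
    "bipolar": ["bipolar", "manic", "mood swings"],
    "major": ["hopeless", "worthless", "major depression"],
    "psychotic": ["hallucination", "delusion", "hearing voices"],
    "atypical": ["oversleeping", "overeating", "atypical depression"],
    "postpartum": ["postpartum", "after giving birth", "new mom"],
}


def label_tweet(text):
    """Return the depression type whose lexicon terms match most often.

    Returns None when no term matches. Ties go to the type listed
    first in LEXICON, which is an arbitrary choice for this sketch.
    """
    lowered = text.lower()
    scores = {}
    for label, terms in LEXICON.items():
        count = 0
        for term in terms:
            # Match whole words or phrases only, so that a term does not
            # fire inside a longer, unrelated word.
            pattern = r"\b" + re.escape(term) + r"\b"
            count += len(re.findall(pattern, lowered))
        scores[label] = count
    best = max(scores, key=scores.get)
    return best if scores[best] > 0 else None
```

For example, `label_tweet("Feeling so manic today, my bipolar is acting up")` matches two terms from the hypothetical bipolar list and none from the others. A real pipeline would likely need a clinically informed lexicon plus handling for negation and inflected forms, which this sketch leaves out.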