Harnessing Large Language Models Over Transformer Models for Detecting Bengali Depressive Social Media Text: A Comprehensive Study

Ahmadul Karim Chowdhury,Md. Saidur Rahman Sujon,Md. Shirajus Salekin Shafi,Tasin Ahmmad,Sifat Ahmed,Khan Md Hasib,Faisal Muhammad Shah

2024-01-14

Abstract:In an era where the silent struggle of underdiagnosed depression pervades globally, our research delves into the crucial link between mental health and social media. This work focuses on early detection of depression, particularly in extroverted social media users, using LLMs such as GPT 3.5, GPT 4 and our proposed GPT 3.5 fine-tuned model DepGPT, as well as advanced Deep learning models(LSTM, Bi-LSTM, GRU, BiGRU) and Transformer models(BERT, BanglaBERT, SahajBERT, BanglaBERT-Base). The study categorized Reddit and X datasets into "Depressive" and "Non-Depressive" segments, translated into Bengali by native speakers with expertise in mental health, resulting in the creation of the Bengali Social Media Depressive Dataset (BSMDD). Our work provides full architecture details for each model and a methodical way to assess their performance in Bengali depressive text categorization using zero-shot and few-shot learning techniques. Our work demonstrates the superiority of SahajBERT and Bi-LSTM with FastText embeddings in their respective domains also tackles explainability issues with transformer models and emphasizes the effectiveness of LLMs, especially DepGPT, demonstrating flexibility and competence in a range of learning contexts. According to the experiment results, the proposed model, DepGPT, outperformed not only Alpaca Lora 7B in zero-shot and few-shot scenarios but also every other model, achieving a near-perfect accuracy of 0.9796 and an F1-score of 0.9804, high recall, and exceptional precision. Although competitive, GPT-3.5 Turbo and Alpaca Lora 7B show relatively poorer effectiveness in zero-shot and few-shot situations. The work emphasizes the effectiveness and flexibility of LLMs in a variety of linguistic circumstances, providing insightful information about the complex field of depression detection models.

Computation and Language

What problem does this paper attempt to address?

The problem that this paper aims to solve is the detection of depression in Bengali social media texts. Specifically, the research focuses on the early identification of depression, especially among extroverted social media users, by using large - language models (such as GPT 3.5, GPT 4 and the DepGPT model proposed by the author), advanced deep - learning models (such as LSTM, Bi - LSTM, GRU, BiGRU) and transformer models (such as BERT, BanglaBERT, SahajBERT, BanglaBERT - Base). The goal of the research is to evaluate the performance of these models in classifying Bengali depressive texts, especially to evaluate their performance through zero - shot and few - shot learning techniques. In addition, this study also emphasizes the effectiveness and flexibility of large - language models, especially their adaptability and competitiveness in various learning environments. The research supports its goal by creating a dataset named Bengali Social Media Depressive Dataset (BSMDD), which contains depressive and non - depressive texts collected from Reddit and X (formerly Twitter) platforms and translated into Bengali by native speakers. The research not only compares the performance of different models, but also explores the interpretability issues of transformer models, and emphasizes the superior performance of the DepGPT model in zero - shot and few - shot scenarios, achieving near - perfect accuracy and F1 scores. In summary, the key problems that this paper attempts to solve are to improve the accuracy of detecting depressive content in Bengali social media, reduce the error rate and shorten the training time, while exploring which type of language model (deep - learning model, traditional transformer model or large - language model) is the most effective in this specific task.

Harnessing Large Language Models Over Transformer Models for Detecting Bengali Depressive Social Media Text: A Comprehensive Study

Detection of Depression Severity Using Bengali Social Media Posts on Mental Health: Study Using Natural Language Processing Techniques

Enhancing Depressive Post Detection in Bangla: A Comparative Study of TF-IDF, BERT and FastText Embeddings

Detecting Level of Depression from Social Media Posts for the Low-resource Bengali Language

Advancing Depression Detection on Social Media Platforms Through Fine-Tuned Large Language Models

A Novel Text Mining Approach for Mental Health Prediction Using Bi-LSTM and BERT Model

Novel Transformer Based Contextualized Embedding and Probabilistic Features for Depression Detection From Social Media

Depression Classification From Tweets Using Small Deep Transfer Learning Language Models

Linguistic Analysis of Hindi-English Mixed Tweets for Depression Detection

Advanced Comparative Analysis of Machine Learning and Transformer Models for Depression and Suicide Detection in Social Media Texts

Predicting the language of depression from multivariate twitter data using a feature‐rich hybrid deep learning model

AI-Enhanced Mental Health Diagnosis: Leveraging Transformers for Early Detection of Depression Tendency in Textual Data

An hybrid deep learning approach for depression prediction from user tweets using feature-rich CNN and bi-directional LSTM

Depression detection in social media posts using transformer-based models and auxiliary features

Exhaustive Study into Machine Learning and Deep Learning Methods for Multilingual Cyberbullying Detection in Bangla and Chittagonian Texts

Enabling Early Health Care Intervention by Detecting Depression in Users of Web-Based Forums using Language Models: Longitudinal Analysis and Evaluation

DeepLearningBrasil@LT-EDI-2023: Exploring Deep Learning Techniques for Detecting Depression in Social Media Text

Early Depression Detection from Social Network Using Deep Learning Techniques

Deep Learning Based Cyberbullying Detection in Bangla Language

DEPTWEET: A Typology for Social Media Texts to Detect Depression Severities

Depression symptoms modelling from social media text: an LLM driven semi-supervised learning approach