Predictive Analytics in Mental Health Leveraging LLM Embeddings and Machine Learning Models for Social Media Analysis

Ahmad Radwan,Mohannad Amarneh,Hussam Alawneh,Huthaifa I. Ashqar,Anas AlSobeh,Aws Abed Al Raheem Magableh
DOI: https://doi.org/10.4018/ijwsr.338222
2024-02-14
International Journal of Web Services Research
Abstract:The prevalence of stress-related disorders has increased significantly in recent years, necessitating scalable methods to identify affected individuals. This paper proposes a novel approach utilizing large language models (LLMs), with a focus on OpenAI's generative pre-trained transformer (GPT-3) embeddings and machine learning (ML) algorithms to classify social media posts as indicative or not of stress disorders. The aim is to create a preliminary screening tool leveraging online textual data. GPT-3 embeddings transformed posts into vector representations capturing semantic meaning and linguistic nuances. Various models, including support vector machines, random forests, XGBoost, KNN, and neural networks, were trained on a dataset of >10,000 labeled social media posts. The top model, a support vector machine, achieved 83% accuracy in classifying posts displaying signs of stress.
computer science, information systems, software engineering
What problem does this paper attempt to address?