Adapting Mental Health Prediction Tasks for Cross-lingual Learning via Meta-Training and In-context Learning with Large Language Model

Zita Lifelo,Huansheng Ning,Sahraoui Dhelim
2024-04-14
Abstract:Timely identification is essential for the efficient handling of mental health illnesses such as depression. However, the current research fails to adequately address the prediction of mental health conditions from social media data in low-resource African languages like Swahili. This study introduces two distinct approaches utilising model-agnostic meta-learning and leveraging large language models (LLMs) to address this gap. Experiments are conducted on three datasets translated to low-resource language and applied to four mental health tasks, which include stress, depression, depression severity and suicidal ideation prediction. we first apply a meta-learning model with self-supervision, which results in improved model initialisation for rapid adaptation and cross-lingual transfer. The results show that our meta-trained model performs significantly better than standard fine-tuning methods, outperforming the baseline fine-tuning in macro F1 score with 18\% and 0.8\% over XLM-R and mBERT. In parallel, we use LLMs' in-context learning capabilities to assess their performance accuracy across the Swahili mental health prediction tasks by analysing different cross-lingual prompting approaches. Our analysis showed that Swahili prompts performed better than cross-lingual prompts but less than English prompts. Our findings show that in-context learning can be achieved through cross-lingual transfer through carefully crafted prompt templates with examples and instructions.
Computation and Language,Artificial Intelligence,Computers and Society,Machine Learning
What problem does this paper attempt to address?
The paper aims to address the following issues: 1. **Mental Health Prediction in Low-Resource Languages**: Current research has not adequately addressed the problem of predicting mental health status (such as depression) from social media data, especially in low-resource African languages like Swahili. 2. **Cross-Language Transfer Learning**: Proposes two methods to improve the task of mental health prediction in low-resource languages, including the use of Model-Agnostic Meta-Learning (MAML) and Large Language Models (LLMs) for cross-language transfer learning. Specifically, the paper explores the following aspects: - **Meta-Learning Framework**: Utilizes a meta-learning model for self-supervision to improve model initialization, enabling rapid adaptation and cross-language transfer. - **Application of LLMs in Contextual Learning**: Leverages the contextual learning capabilities of LLMs to evaluate their performance accuracy in the Swahili mental health prediction task and analyzes different cross-language prompting methods. Experimental results show that the meta-trained model significantly outperforms standard fine-tuning methods in macro F1 score, with improvements of 18% and 0.8% on XLM-R and mBERT, respectively. Additionally, context learning through carefully designed prompt templates can achieve cross-language transfer. Swahili prompts perform better than cross-language prompts but are slightly inferior to English prompts. These findings suggest that with carefully designed prompt templates, effective mental health prediction can be achieved in low-resource languages.