MentalQA: An Annotated Arabic Corpus for Questions and Answers of Mental Healthcare

Hassan Alhuzali,Ashwag Alasmari,Hamad Alsaleh
2024-05-21
Abstract:Mental health disorders significantly impact people globally, regardless of background, education, or socioeconomic status. However, access to adequate care remains a challenge, particularly for underserved communities with limited resources. Text mining tools offer immense potential to support mental healthcare by assisting professionals in diagnosing and treating patients. This study addresses the scarcity of Arabic mental health resources for developing such tools. We introduce MentalQA, a novel Arabic dataset featuring conversational-style question-and-answer (QA) interactions. To ensure data quality, we conducted a rigorous annotation process using a well-defined schema with quality control measures. Data was collected from a question-answering medical platform. The annotation schema for mental health questions and corresponding answers draws upon existing classification schemes with some modifications. Question types encompass six distinct categories: diagnosis, treatment, anatomy \& physiology, epidemiology, healthy lifestyle, and provider choice. Answer strategies include information provision, direct guidance, and emotional support. Three experienced annotators collaboratively annotated the data to ensure consistency. Our findings demonstrate high inter-annotator agreement, with Fleiss' Kappa of $0.61$ for question types and $0.98$ for answer strategies. In-depth analysis revealed insightful patterns, including variations in question preferences across age groups and a strong correlation between question types and answer strategies. MentalQA offers a valuable foundation for developing Arabic text mining tools capable of supporting mental health professionals and individuals seeking information.
Computation and Language
What problem does this paper attempt to address?
The problem this paper attempts to address is the lack of Arabic mental health resources, particularly in the development of text mining tools to support mental health care. Specifically, the goal of the paper is to create a novel Arabic mental health question-answering dataset, MentalQA, to support mental health professionals and individuals seeking information. This dataset includes questions posed by patients and answers provided by professional doctors, ensuring data quality through a rigorous annotation process. Additionally, the paper explores the relationship between different types of questions and answering strategies, and analyzes the preference differences in questioning among different genders and age groups. By conducting sentiment analysis, word frequency analysis, and response behavior analysis on this data, the researchers hope to provide valuable resources for building effective communication channels and improving the performance of mental health support systems.