DEPAC: a Corpus for Depression and Anxiety Detection from Speech

Mashrura Tasnim,Malikeh Ehghaghi,Brian Diep,Jekaterina Novikova
2023-06-20
Abstract:Mental distress like depression and anxiety contribute to the largest proportion of the global burden of diseases. Automated diagnosis systems of such disorders, empowered by recent innovations in Artificial Intelligence, can pave the way to reduce the sufferings of the affected individuals. Development of such systems requires information-rich and balanced corpora. In this work, we introduce a novel mental distress analysis audio dataset DEPAC, labeled based on established thresholds on depression and anxiety standard screening tools. This large dataset comprises multiple speech tasks per individual, as well as relevant demographic information. Alongside, we present a feature set consisting of hand-curated acoustic and linguistic features, which were found effective in identifying signs of mental illnesses in human speech. Finally, we justify the quality and effectiveness of our proposed audio corpus and feature set in predicting depression severity by comparing the performance of baseline machine learning models built on this dataset with baseline models trained on other well-known depression corpora.
Audio and Speech Processing,Computation and Language,Machine Learning,Sound
What problem does this paper attempt to address?
This paper aims to address the issue of insufficient datasets in automatic diagnosis systems for depression and anxiety. Specifically: - **Problem Background**: Mental illnesses such as depression and anxiety contribute significantly to the global burden of disease. Existing automated diagnostic systems suffer from small dataset sizes and lack of language diversity, leading to model overfitting and affecting diagnostic accuracy. - **Paper Objective**: To construct a high-quality, diverse audio dataset (DEPAC) for the detection of depression and anxiety, and to propose a set of carefully selected acoustic and linguistic features to improve the identification of speech-based digital biomarkers for mental illnesses. Through these efforts, the paper hopes to lay the foundation for the development of more accurate and effective automatic diagnosis systems for depression and anxiety.