Multi-Local Attention for Speech-Based Depression Detection

A. Esposito, Xuri Ge, A. Vinciarelli, Fuxiang Tao, Wei Ma
DOI: https://doi.org/10.1109/ICASSP49357.2023.10095757
2023-06-04
Abstract: This article shows that an attention mechanism, Multi-Local Attention, can improve a depression detection approach based on Long Short-Term Memory networks. Besides yielding higher performance metrics (e.g., Accuracy and F1 Score), Multi-Local Attention improves two other aspects of the approach, both important from an application point of view. The first is how effectively a confidence score associated with the detection outcome identifies the speakers more likely to be classified correctly. The second is the amount of speaking time needed to classify a speaker as depressed or non-depressed. The experiments were performed on read speech and involved 109 participants, 55 of whom were diagnosed with depression by professional psychiatrists. The results show accuracies of up to 88.0% (F1 Score 88.0%).
Computer Science, Psychology