Speech-based recognition and estimating severity of PTSD using machine learning

Jiawei Hu, Chunxiao Zhao, Congrong Shi, Ziyi Zhao, Zhihong Ren
DOI: https://doi.org/10.1016/j.jad.2024.07.015
2024-10-01
Abstract: Background: Traditional methods for diagnosing post-traumatic stress disorder (PTSD) rely primarily on interviews, which are costly and lack objective indices. Integrating biomarkers and machine learning techniques into the diagnostic process has the potential to help clinicians assess PTSD accurately. Methods: We assembled a dataset of recordings from 76 individuals diagnosed with PTSD and 60 healthy controls. Using the openSMILE framework, we extracted acoustic features from these recordings and applied a random forest algorithm for feature selection. The selected features were then used as inputs for six classification models and one regression model. Results: Classification models using a set of 18 features yielded robust binary predictions of PTSD. Notably, the random forest (RF) model achieved the highest accuracy (0.975) and the highest AUC (1.0). The regression model showed significant predictive capability for PCL-5 scores (MSE = 0.90, MAE = 0.76, R² = 0.10, p < 0.001), with a correlation coefficient of 0.33 (p < 0.01) between predicted and actual values. Limitations: First, the feature selection process may compromise model stability, potentially leading to overestimated results. Second, it is difficult to elucidate the biological mechanisms underlying the speech differences between PTSD patients and healthy individuals. Third, the regression model has limited predictive power for PTSD severity. Conclusions: Distinct speech patterns differentiate PTSD patients from controls. The classification models accurately distinguish the two groups. The regression model gauges PTSD severity, but further validation on larger datasets is needed.
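The regression results above are reported as four metrics (MSE, MAE, R², and a correlation coefficient). As a rough illustration of how these quantities are computed from paired actual and predicted scores, here is a minimal Python sketch. The function name `regression_metrics` and the sample values are hypothetical and are not taken from the study; the abstract does not state how the authors computed or scaled their scores.

```python
import math


def regression_metrics(actual, predicted):
    """Return MSE, MAE, R^2 and Pearson r for paired actual/predicted values."""
    if len(actual) != len(predicted) or len(actual) < 2:
        raise ValueError("need two equal-length sequences with at least 2 items")
    n = len(actual)

    # Per-sample prediction errors.
    errors = [a - p for a, p in zip(actual, predicted)]
    mse = sum(e * e for e in errors) / n
    mae = sum(abs(e) for e in errors) / n

    mean_a = sum(actual) / n
    mean_p = sum(predicted) / n
    ss_tot = sum((a - mean_a) ** 2 for a in actual)   # variation in actual values
    ss_pred = sum((p - mean_p) ** 2 for p in predicted)
    if ss_tot == 0 or ss_pred == 0:
        raise ValueError("R^2 and Pearson r are undefined for constant inputs")

    # R^2: share of the variation in actual values explained by the predictions.
    ss_res = sum(e * e for e in errors)
    r2 = 1.0 - ss_res / ss_tot

    # Pearson correlation between actual and predicted values.
    cov = sum((a - mean_a) * (p - mean_p) for a, p in zip(actual, predicted))
    pearson_r = cov / math.sqrt(ss_tot * ss_pred)

    return {"mse": mse, "mae": mae, "r2": r2, "pearson_r": pearson_r}


# Hypothetical scores for illustration only (not data from the study).
actual = [1.0, 2.0, 3.0, 4.0]
predicted = [1.5, 2.5, 2.5, 3.5]
metrics = regression_metrics(actual, predicted)
print(metrics)
```

Note that a correlation of 0.33 corresponds to a squared correlation of about 0.11, which is close to the reported R² of 0.10; the two need not match exactly, since R² computed this way also penalizes bias and scale errors in the predictions.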