Automatic speech-based assessment to discriminate Parkinson's disease from essential tremor with a cross-language approach

Cristian David Rios-Urrego,Jan Rusz,Juan Rafael Orozco-Arroyave
DOI: https://doi.org/10.1038/s41746-024-01027-6
IF: 15.2
2024-02-18
npj Digital Medicine
Abstract:Parkinson's disease (PD) and essential tremor (ET) are prevalent movement disorders that mainly affect elderly people, presenting diagnostic challenges due to shared clinical features. While both disorders exhibit distinct speech patterns—hypokinetic dysarthria in PD and hyperkinetic dysarthria in ET—the efficacy of speech assessment for differentiation remains unexplored. Developing technology for automatic discrimination could enable early diagnosis and continuous monitoring. However, the lack of data for investigating speech behavior in these patients has inhibited the development of a framework for diagnostic support. In addition, phonetic variability across languages poses practical challenges in establishing a universal speech assessment system. Therefore, it is necessary to develop models robust to the phonetic variability present in different languages worldwide. We propose a method based on Gaussian mixture models to assess domain adaptation from models trained in German and Spanish to classify PD and ET patients in Czech. We modeled three different speech dimensions: articulation, phonation, and prosody and evaluated the models' performance in both bi-class and tri-class classification scenarios (with the addition of healthy controls). Our results show that a fusion of the three speech dimensions achieved optimal results in binary classification, with accuracies up to 81.4 and 86.2% for monologue and /pa-ta-ka/ tasks, respectively. In tri-class scenarios, incorporating healthy speech signals resulted in accuracies of 63.3 and 71.6% for monologue and /pa-ta-ka/ tasks, respectively. Our findings suggest that automated speech analysis, combined with machine learning is robust, accurate, and can be adapted to different languages to distinguish between PD and ET patients.
health care sciences & services,medical informatics
What problem does this paper attempt to address?
The paper aims to address the diagnostic differentiation between Parkinson's disease (PD) and essential tremor (ET), two movement disorders. These diseases have many similarities in clinical presentation, especially in the early stages, leading to misdiagnosis. Although they differ in speech patterns—PD patients exhibit hypokinetic dysarthria, while ET patients exhibit hyperkinetic dysarthria—the effectiveness of distinguishing them based on speech assessment has not been fully explored. The paper proposes a method that uses Gaussian mixture models (GMM) for domain adaptation of speech data from PD and ET patients with different language backgrounds, and classification through support vector machines (SVM). This method considers three different speech dimensions: articulation, phonation, and prosody, and is evaluated in both binary classification (PD vs. ET) and ternary classification (including healthy controls) scenarios. The study results show that in controlled tasks (such as repeating specific syllables "/pa-ta-ka/") and spontaneous speech tasks (such as monologues), the method combining the three speech dimensions achieved classification accuracies of 86.2% and 81.4%, respectively. Additionally, in the ternary classification scenario including healthy controls, the method combining the three speech dimensions achieved an accuracy of 63.3% in the monologue task, while the method using only the prosody dimension achieved an accuracy of 71.6% in the controlled task. These findings suggest that automatic assessment based on speech analysis combined with machine learning techniques can accurately differentiate between PD and ET. This technology can adapt to different language environments and has potential clinical application value, especially in early diagnosis.