Task-voting for schizophrenia spectrum disorders prediction using machine learning across linguistic feature domains

Rui He,Víctor Ortiz-García de la Foz,Luis Manuel Fernández Cacho,Philipp Homan,Iris Sommer,Rosa Ayesa-Arriola,Wolfram Hinzen
DOI: https://doi.org/10.1101/2024.08.31.24312886
2024-09-03
Abstract:Background Identifying schizophrenia spectrum disorders (SSD) from spontaneous speech features is a key focus in computational psychiatry today. Methods We present a task-voting procedure using various speech-elicitation tasks to predict SSD in Spanish, followed by ablation studies highlighting the roles of different tasks and feature domains. Speech from five tasks was recorded from 92 subjects (41 controls and 49 with SSD). A total of 319 features were automatically extracted and selected based intra-feature correlations and ANOVA F-values, covering acoustic-prosodic, morphosyntactic, and semantic similarity metrics from pretrained embeddings. Results Twenty-four features were preselected. ExtraTrees-based classification using these features yielded accuracy of 0.840 on hold-out data. Ablating picture descriptions impaired performance most, followed by story reading, retelling, and free speech. Removing morphosyntactic measures impaired performance most, followed by acoustic and semantic measures. Mixed-effect models suggested significant group differences on all 24 features. In SSD, speech patterns were slower and more variable temporally, while variations in pitch, amplitude, and sound intensity decreased. Semantic similarity between speech and prompts decreased, while minimal distances from embedding centroids to each word increased, and word-to-word similarity arrays became more predictable, all replicating patterns previously documented in other languages. Morphosyntactically, SSD patients used more first-person pronouns together with less third-person pronouns, and more punctuations and negations. Semantic metrics correlated with a range of positive symptoms, and multiple acoustic-prosodic features with negative symptoms. Conclusions This study highlights the importance of combining different speech tasks and features for SSD detection, apart from validating previously found patterns in psychosis for Spanish.
What problem does this paper attempt to address?