Automated remote speech-based testing of individuals with cognitive decline: Bayesian agreement of transcription accuracy

Alexandra König,Stefanie Köhler,Johannes Tröger,Emrah Düzel,Wenzel Glanz,Michaela Butryn,Elisa Mallick,Josef Priller,Slawek Altenstein,Annika Spottke,Okka Kimmich,Björn Falkenburger,Antje Osterrath,Jens Wiltfang,Claudia Bartels,Ingo Kilimann,Christoph Laske,Matthias H Munk,Sandra Roeske,Ingo Frommann,Daniel C Hoffmann,Frank Jessen,Michael Wagner,Nicklas Linz,Stefan Teipel
DOI: https://doi.org/10.1002/dad2.70011
2024-10-06
Abstract:Introduction: We investigated the agreement between automated and gold-standard manual transcriptions of telephone chatbot-based semantic verbal fluency testing. Methods: We examined 78 cases from the Screening over Speech in Unselected Populations for Clinical Trials in AD (PROSPECT-AD) study, including cognitively normal individuals and individuals with subjective cognitive decline, mild cognitive impairment, and dementia. We used Bayesian Bland-Altman analysis of word count and the qualitative features of semantic cluster size, cluster switches, and word frequencies. Results: We found high levels of agreement for word count, with a 93% probability of a newly observed difference being below the minimally important difference. The qualitative features had fair levels of agreement. Word count reached high levels of discrimination between cognitively impaired and unimpaired individuals, regardless of transcription mode. Discussion: Our results support the use of automated speech recognition particularly for the assessment of quantitative speech features, even when using data from telephone calls with cognitively impaired individuals in their homes. Highlights: High levels of agreement were found between automated and gold-standard manual transcriptions of telephone chatbot-based semantic verbal fluency testing, particularly for word count.The qualitative features had fair levels of agreement.Word count reached high levels of discrimination between cognitively impaired and unimpaired individuals, regardless of transcription mode.Automated speech recognition for the assessment of quantitative and qualitative speech features, even when using data from telephone calls with cognitively impaired individuals in their homes, seems feasible and reliable.
What problem does this paper attempt to address?