Abstract: As the world’s population ages, the impact of Alzheimer’s disease (AD) is projected to grow in the coming decades. Noninvasive and cost-effective methods of detecting AD are therefore essential for early prevention and mitigation of this progressive disease and for alleviating its expected global impact. This study analyzes audio processing techniques and transcription methodologies to optimize the detection of AD through natural language processing (NLP) of spontaneous speech. We enhanced audio fidelity using Boll Spectral Subtraction and evaluated the transcription accuracy of state-of-the-art AI services (the locally run Wav2Vec and Whisper, and the cloud-based IBM Cloud and Rev AI) against traditional manual transcription. The choice between local and cloud-based solutions hinges on a trade-off between privacy, ongoing costs, and computational requirements. Using word embeddings from OpenAI’s GPT, we trained Support Vector Machine (SVM) classifiers to analyze the transcripts and refine detection accuracy. Our findings reveal that AI-driven transcriptions significantly outperform their manual counterparts when classifying AD and Control samples. Among locally run systems, Wav2Vec with enhanced audio achieved the highest accuracy and F1 scores (0.99 for both metrics); among cloud-based methods, Rev AI with unenhanced audio led with comparable scores (0.96 for both metrics). The study also finds that including interviewer speech in recordings degrades model performance, and we recommend excluding such interactions to improve data quality for AD classification algorithms. Our evaluation demonstrates that AI transcription (both cloud and local) and NLP technologies in their current forms can accurately classify AD, as well as probable AD and mild cognitive impairment (MCI), a prodromal stage of AD, but suffer from a lack of available training data. The insights from this research lay the groundwork for future advances in the noninvasive monitoring and early detection of cognitive impairments through linguistic analysis.
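To make the two reported metrics concrete, the following is a minimal sketch of how accuracy and an F1 score can be computed for a two-class AD versus Control task. It is illustrative only: the function name `accuracy_and_f1`, the label strings, and the choice of AD as the positive class are assumptions, and the abstract does not state whether the reported F1 is computed for one class or averaged over both.

```python
from typing import Sequence, Tuple


def accuracy_and_f1(
    y_true: Sequence[str],
    y_pred: Sequence[str],
    positive: str = "AD",  # assumption: AD is treated as the positive class
) -> Tuple[float, float]:
    """Return (accuracy, F1 for the positive class) for paired label lists."""
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    if not y_true:
        raise ValueError("at least one sample is required")

    pairs = list(zip(y_true, y_pred))
    # Confusion-matrix counts with respect to the positive class.
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)

    # Accuracy: fraction of samples whose predicted label matches the truth.
    accuracy = sum(1 for t, p in pairs if t == p) / len(pairs)

    # F1 = 2*TP / (2*TP + FP + FN), the harmonic mean of precision and recall.
    denominator = 2 * tp + fp + fn
    f1 = 2 * tp / denominator if denominator else 0.0
    return accuracy, f1


if __name__ == "__main__":
    # Hypothetical labels, not data from the study.
    truth = ["AD", "AD", "AD", "Control", "Control", "Control"]
    predicted = ["AD", "AD", "Control", "Control", "Control", "AD"]
    acc, f1 = accuracy_and_f1(truth, predicted)
    print(f"accuracy={acc:.2f} f1={f1:.2f}")
```

With these hypothetical labels, four of six predictions are correct and there are two true positives, one false positive, and one false negative, so both metrics come out to 2/3.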

Exploring Multimodal Approaches for Alzheimer's Disease Detection Using Patient Speech Transcript and Audio Data

Identification of Alzheimer's Disease Patients Based on Oral Speech Features

Multimodal Fusion for Alzheimer’s Disease Recognition

Detecting Alzheimer’s Disease from Speech Using Neural Networks with Bottleneck Features and Data Augmentation

Multimodal Deep Learning Models for Detecting Dementia From Speech and Transcripts

Temporal Integration of Text Transcripts and Acoustic Features for Alzheimer's Diagnosis Based on Spontaneous Speech

Leveraging Pretrained Representations with Task-related Keywords for Alzheimer's Disease Detection

Connected Multi-speech Task for Detecting Alzheimer’s Disease Using a Two-Layer Model

Classifying Alzheimer's Disease Using Audio and Text-Based Representations of Speech

CDA: A Contrastive Data Augmentation Method for Alzheimer’s Disease Detection

A Transfer Learning Method for Detecting Alzheimer's Disease Based on Speech and Natural Language Processing

Combining Speech and Drawing Data for Alzheimer's Disease Detecting

An approach for assisting diagnosis of Alzheimer's disease based on natural language processing

Multimodal Inductive Transfer Learning for Detection of Alzheimer's Dementia and its Severity

The Optimization of a Natural Language Processing Approach for the Automatic Detection of Alzheimer’s Disease Using GPT Embeddings

Noninvasive automatic detection of Alzheimer's disease from spontaneous speech: a review

Detecting Alzheimer's Disease from Continuous Speech Using Language Models

End-to-End ASR-Enhanced Neural Network for Alzheimer’s Disease Diagnosis

Exploiting Pre-Trained ASR Models for Alzheimer's Disease Recognition Through Spontaneous Speech