Abstract:As the impact of Alzheimer’s disease (AD) is projected to grow in the coming decades as the world’s population ages, the development of noninvasive and cost-effective methods of detecting AD is essential for the early prevention and mitigation of the progressive disease, alleviating its expected global impact. This study analyzes audio processing techniques and transcription methodologies to optimize the detection of AD through the natural language processing (NLP) of spontaneous speech. We enhanced audio fidelity using Boll Spectral Subtraction and evaluated the transcription accuracy of state-of-the-art AI services—locally-based Wav2Vec and Whisper, alongside cloud-based IBM Cloud and Rev AI—against traditional manual transcription methods. The choice between local and cloud-based solutions hinges on a trade-off between privacy, ongoing costs, and computational requirements. Leveraging OpenAI’s GPT for word embeddings, we enhanced the training of Support Vector Machine (SVM) classifiers, which were crucial in analyzing transcripts and refining detection accuracy. Our findings reveal that AI-driven transcriptions significantly outperform manual counterparts when classifying AD and Control samples, with Wav2Vec using enhanced audio exhibiting the highest accuracy and F-1 scores (0.99 for both metrics) for locally based systems and Rev AI using unenhanced audio leading cloud-based methods with comparable precision (0.96 for both metrics). The study also uncovers the detrimental effect of including interviewer speech in recordings on model performance, advocating for the exclusion of such interactions to improve data quality for AD classification algorithms. Our comprehensive evaluation demonstrates that AI transcription (both Cloud and Local) and NLP technologies in their current forms can classify AD, as well as probable AD and mild cognitive impairment (MCI), a prodromal stage of AD, accurately but suffer from a lack of available training data. The insights garnered from this research lay the groundwork for future advancements in the noninvasive monitoring and early detection of cognitive impairments through linguistic analysis.

Exploring the Topics of Audio Words for Detecting Alzheimer's Disease from Spontaneous Speech

Identification of Alzheimer's Disease Patients Based on Oral Speech Features

Exploring Multimodal Approaches for Alzheimer's Disease Detection Using Patient Speech Transcript and Audio Data

Leveraging Pretrained Representations with Task-Related Keywords for Alzheimer’s Disease Detection

Leveraging Pretrained Representations with Task-related Keywords for Alzheimer's Disease Detection

Detecting Alzheimer’s Disease from Speech Using Neural Networks with Bottleneck Features and Data Augmentation

Detecting Alzheimer's Disease from Continuous Speech Using Language Models.

Exploring linguistic feature and model combination for speech recognition based automatic AD detection

Noninvasive automatic detection of Alzheimer's disease from spontaneous speech: a review

Efficient Pause Extraction and Encode Strategy for Alzheimer's Disease Detection Using Only Acoustic Features from Spontaneous Speech

Detecting Alzheimer's Disease Based on Acoustic Features Extracted from Pre-trained Models

Artificial Intelligence-Enabled End-To-End Detection and Assessment of Alzheimer's Disease Using Voice

An approach for assisting diagnosis of Alzheimer's disease based on natural language processing

Exploiting Pre-Trained ASR Models for Alzheimer's Disease Recognition Through Spontaneous Speech

Towards Within-Class Variation in Alzheimer's Disease Detection from Spontaneous Speech

The Optimization of a Natural Language Processing Approach for the Automatic Detection of Alzheimer’s Disease Using GPT Embeddings

A Transfer Learning Method for Detecting Alzheimer's Disease Based on Speech and Natural Language Processing

The Optimization of a Natural Language Processing Approach for the Automatic Detection of Alzheimer's Disease Using GPT Embeddings

Temporal Integration of Text Transcripts and Acoustic Features for Alzheimer's Diagnosis Based on Spontaneous Speech

Classifying Alzheimer's Disease Using Audio and Text-Based Representations of Speech

Multimodal Deep Learning Models for Detecting Dementia From Speech and Transcripts