Abstract: Large amounts of labeled data are a prerequisite to training accurate and reliable machine learning models. In the medical domain in particular, this requirement is a stumbling block, because accurately labeled data are hard to obtain. DementiaBank is a publicly available corpus of spontaneous speech samples from a picture description task. It is widely used to study the language characteristics of patients with Alzheimer's disease (AD) and to train classification models that distinguish patients with AD from healthy controls. The corpus is relatively small, a limitation that is further exacerbated when restricting to the balanced subset used in the Alzheimer's Dementia Recognition through Spontaneous Speech (ADReSS) challenge. We build on previous work showing that the performance of traditional machine learning models on DementiaBank can be improved by adding normative data from other sources, and evaluate whether such extrinsic data can further improve the performance of state-of-the-art deep learning methods on the ADReSS challenge dementia detection task. To this end, we developed a new corpus of professionally transcribed recordings from the Wisconsin Longitudinal Study (WLS), yielding 1366 additional Cookie Theft Task transcripts and increasing the available training data by an order of magnitude. Using these data in conjunction with DementiaBank is challenging because the WLS metadata corresponding to these transcripts do not contain dementia diagnoses. However, the cognitive status of WLS participants can be inferred from the results of several cognitive tests available in the WLS data, including semantic verbal fluency. In this work, we evaluate the utility of the WLS ‘controls’ (participants without indications of abnormal cognitive status), both alone and in conjunction with inferred ‘cases’ (participants with such indications), for training deep learning models to discriminate between language produced by patients with dementia and healthy controls.
We find that incorporating WLS data when training a BERT model on ADReSS data improves its performance on the ADReSS dementia detection task, supporting the hypothesis that WLS data add value in this context. We also demonstrate that weighted cost functions and additional prediction targets may be effective ways to address issues arising from class imbalance and confounding effects due to data provenance.
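
To illustrate what a weighted cost function can look like when classes are imbalanced, the following is a minimal sketch in plain Python. It is not the loss used in this work: the inverse-frequency weighting scheme, the function names, and the toy label counts are all assumptions chosen for illustration.

```python
import math
from collections import Counter


def inverse_frequency_weights(labels):
    """Assumed weighting scheme: n_samples / (n_classes * class_count).

    Rare classes receive larger weights, so their errors cost more.
    """
    counts = Counter(labels)
    n_samples = len(labels)
    n_classes = len(counts)
    return {cls: n_samples / (n_classes * cnt) for cls, cnt in counts.items()}


def weighted_cross_entropy(probs, labels, weights):
    """Weighted mean negative log-likelihood for binary labels.

    probs[i] is the predicted probability that sample i belongs to class 1.
    The sum is normalized by the total weight of the samples.
    """
    eps = 1e-12  # guards against log(0)
    total = 0.0
    weight_sum = 0.0
    for p, y in zip(probs, labels):
        p_true = p if y == 1 else 1.0 - p
        w = weights[y]
        total += -w * math.log(max(p_true, eps))
        weight_sum += w
    return total / weight_sum


# Toy data (hypothetical): nine controls (0) and one inferred case (1).
toy_labels = [0] * 9 + [1]
toy_weights = inverse_frequency_weights(toy_labels)

# Same model confidence, but the single error falls on a different class.
miss_minority = [0.1] * 9 + [0.1]          # the one case is misclassified
miss_majority = [0.9] + [0.1] * 8 + [0.9]  # one control is misclassified

loss_minority = weighted_cross_entropy(miss_minority, toy_labels, toy_weights)
loss_majority = weighted_cross_entropy(miss_majority, toy_labels, toy_weights)
```

Under these assumed weights, an error on the rare class contributes more to the loss than an error on the common class, which is the general mechanism by which a weighted cost function counteracts class imbalance.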

Text Classification by Contrastive Learning and Cross-lingual Data Augmentation for Alzheimer’s Disease Detection

Identification of Alzheimer's Disease Patients Based on Oral Speech Features

Attention-based and Micro Designed EfficientNetB2 for Diagnosis of Alzheimer’s Disease

Exploring Multimodal Approaches for Alzheimer's Disease Detection Using Patient Speech Transcript and Audio Data

Exploiting Pre-Trained ASR Models for Alzheimer's Disease Recognition Through Spontaneous Speech

Cross-lingual Alzheimer's Disease detection based on paralinguistic and pre-trained features

A Transfer Learning Method for Detecting Alzheimer's Disease Based on Speech and Natural Language Processing

An approach for assisting diagnosis of Alzheimer's disease based on natural language processing

Exploring linguistic feature and model combination for speech recognition based automatic AD detection

Towards Within-Class Variation in Alzheimer's Disease Detection from Spontaneous Speech

Detecting Alzheimer’s Disease from Speech Using Neural Networks with Bottleneck Features and Data Augmentation

Leveraging Pretrained Representations with Task-related Keywords for Alzheimer's Disease Detection

Leveraging Prompt Learning and Pause Encoding for Alzheimer's Disease Detection

Detecting dementia in Mandarin Chinese using transfer learning from a parallel corpus

Multimodal fusion for Alzheimer’s disease recognition

Comparison of AI with and without hand-crafted features to classify Alzheimer's disease in different languages

Classifying Alzheimer's Disease Using Audio and Text-Based Representations of Speech

Multimodal Deep Learning Models for Detecting Dementia From Speech and Transcripts

Crossing the “Cookie Theft” Corpus Chasm: Applying What BERT Learns From Outside Data to the ADReSS Challenge Dementia Detection Task

Detecting Alzheimer's Disease from Continuous Speech Using Language Models