Abstract:Background: The discovery of early, non-invasive biomarkers for the identification of "preclinical" or "pre-symptomatic" Alzheimer's disease and other dementias is a key issue in the field, especially for research purposes, the design of preventive clinical trials, and drafting population-based health care policies. Complex behaviors are natural candidates for this. In particular, recent studies have suggested that speech alterations might be one of the earliest signs of cognitive decline, frequently noticeable years before other cognitive deficits become apparent. Traditional neuropsychological language tests provide ambiguous results in this context. In contrast, the analysis of spoken language productions by Natural Language Processing (NLP) techniques can pinpoint language modifications in potential patients. This interdisciplinary study aimed at using NLP to identify early linguistic signs of cognitive decline in a population of elderly individuals. Methods: We enrolled 96 participants (age range 50-75): 48 healthy controls (CG) and 48 cognitively impaired participants: 16 participants with single domain amnestic Mild Cognitive Impairment (aMCI), 16 with multiple domain MCI (mdMCI) and 16 with early Dementia (eD). Each subject underwent a brief neuropsychological screening composed by MMSE, MoCA, GPCog, CDT, and verbal fluency (phonemic and semantic). The spontaneous speech during three tasks (describing a complex picture, a typical working day and recalling a last remembered dream) was then recorded, transcribed and annotated at various linguistic levels. A multidimensional parameter computation was performed by a quantitative analysis of spoken texts, computing rhythmic, acoustic, lexical, morpho-syntactic, and syntactic features. Results: Neuropsychological tests showed significant differences between controls and mdMCI, and between controls and eD participants; GPCog, MoCA, PF, and SF also discriminated between controls and aMCI. In the linguistic experiments, a number of features regarding lexical, acoustic and syntactic aspects were significant in differentiating between mdMCI, eD, and CG (non-parametric statistical analysis). Some features, mainly in the acoustic domain also discriminated between CG and aMCI. Conclusions: Linguistic features of spontaneous speech transcribed and analyzed by NLP techniques show significant differences between controls and pathological states (not only eD but also MCI) and seems to be a promising approach for the identification of preclinical stages of dementia. Long duration follow-up studies are needed to confirm this assumption.

Automatic speech analysis for detecting cognitive decline of older adults

Identification of Alzheimer's Disease Patients Based on Oral Speech Features

Selecting and Analyzing Speech Features for the Screening of Mild Cognitive Impairment

Automated Classification of Cognitive Decline and Probable Alzheimer's Dementia Across Multiple Speech and Language Domains

Dementia Detection by Analyzing Spontaneous Mandarin Speech.

Detection of Mild Cognitive Impairment From Non-Semantic, Acoustic Voice Features: The Framingham Heart Study

Automated assessment of speech production and prediction of MCI in older adults

A new strategy on Early diagnosis of cognitive impairment via novel cross-lingual language markers: a non-invasive description and AI analysis for the cookie theft picture

Cross-cultural difference and validation of the Chinese version of Montreal Cognitive Assessment in older adults residing in Eastern China: preliminary findings.

Design and development of the intelligent voice recognition‐based cognitive assessment WeChat mini‐program

An explainable machine learning model of cognitive decline derived from speech

Automatic detection of Mild Cognitive Impairment using high-dimensional acoustic features in spontaneous speech

A Comparison of Connected Speech Tasks for Detecting Early Alzheimer’s Disease and Mild Cognitive Impairment Using Natural Language Processing and Machine Learning

Screening for early Alzheimer's disease: enhancing diagnosis with linguistic features and biomarkers

A comparison of different connected-speech tasks for detecting mild cognitive impairment using multivariate pattern analysis

Develop a Diagnostic Tool for Dementia Using Machine Learning and Non-Imaging Features

Shanghai Cognitive Screening: A Mobile Cognitive Assessment Tool Using Voice Recognition to Detect Mild Cognitive Impairment and Dementia in the Community.

Screening for early Alzheimer’s disease: enhancing diagnosis with linguistic features and biomarkers

Analysis of Speech Features in Alzheimer's Disease with Machine Learning: A Case-Control Study

Developing a machine learning model for detecting depression, anxiety, and apathy in older adults with mild cognitive impairment using speech and facial expressions: A cross-sectional observational study

Speech Analysis by Natural Language Processing Techniques: A Possible Tool for Very Early Detection of Cognitive Decline?