Abstract:Background Early identification of Alzheimer’s disease or mild cognitive impairment can help guide direct prevention and supportive treatments, improve outcomes, and reduce medical costs. Existing advanced diagnostic tools are mostly based on neuroimaging and suffer from certain problems in cost, reliability, repeatability, accessibility, ease of use, and clinical integration. To address these problems, we developed, evaluated, and implemented an early diagnostic tool using machine learning and non-imaging factors. Methods and results A total of 654 participants aged 65 or older from the Nursing Home in Hangzhou, China were identified. Information collected from these patients includes dementia status and 70 demographic, cognitive, socioeconomic, and clinical features. Logistic regression, support vector machine (SVM), neural network, random forest, extreme gradient boosting (XGBoost), least absolute shrinkage and selection operator (LASSO), and best subset models were trained, tuned, and internally validated using a novel double cross validation algorithm and multiple evaluation metrics. The trained models were also compared and externally validated using a separate dataset with 1,100 participants from four communities in Zhejiang Province, China. The model with the best performance was then identified and implemented online with a friendly user interface. For the nursing dataset, the top three models are the neural network (AUROC = 0.9435), XGBoost (AUROC = 0.9398), and SVM with the polynomial kernel (AUROC = 0.9213). With the community dataset, the best three models are the random forest (AUROC = 0.9259), SVM with linear kernel (AUROC = 0.9282), and SVM with polynomial kernel (AUROC = 0.9213). The F1 scores and area under the precision-recall curve showed that the SVMs, neural network, and random forest were robust on the unbalanced community dataset. Overall the SVM with the polynomial kernel was found to be the best model. The LASSO and best subset models identified 17 features most relevant to dementia prediction, mostly from cognitive test results and socioeconomic characteristics. Conclusion Our non-imaging-based diagnostic tool can effectively predict dementia outcomes. The tool can be conveniently incorporated into clinical practice. Its online implementation allows zero barriers to its use, which enhances the disease’s diagnosis, improves the quality of care, and reduces costs.

Enhancing identification performance of cognitive impairment high-risk based on a semi-supervised learning method

Develop a Diagnostic Tool for Dementia Using Machine Learning and Non-Imaging Features

Identification of Dementia & Mild Cognitive Impairment in Chinese Elderly Using Machine Learning

Self-paced Semi-Supervised Feature Selection with Application to Multi-Modal Alzheimer’s Disease Classification

Predicting mild cognitive impairment among Chinese older adults: a longitudinal study based on long short-term memory networks and machine learning

A Hybrid Intelligent Diagnosis Approach for Quick Screening of Alzheimer's Disease Based on Multiple Neuropsychological Rating Scales.

Note on certain peculiar Cells of the Cornea described by Dr Thin.

Prediction of cognitive impairment using higher order item response theory and machine learning models

Prediction of Cognitive Impairment Risk among Older Adults: A Machine-Learning Based Comparative Study and Model Development.

Enhancing Early Detection of Cognitive Decline in the Elderly: A Comparative Study Utilizing Large Language Models in Clinical Notes

Recognition of Mild Cognitive Impairment in the Elderly Based on Machine Learning

Early Stage Identification of Alzheimer's Disease Using a Two-stage Ensemble Classifier

An explainable machine learning based prediction model for Alzheimer's disease in China longitudinal aging study

Predicting Dementia Risk for Elderly Community Dwellers in Primary Care Services Using Subgroup-specific Prediction Models

Improving an Electronic Health Record–Based Clinical Prediction Model Under Label Deficiency: Network-Based Generative Adversarial Semisupervised Approach

Semi-Supervised Approaches to Efficient Evaluation of Model Prediction Performance

Using Machine Learning to Predict Cognitive Impairment Among Middle-Aged and Older Chinese: A Longitudinal Study

Using Deep Learning to Identify Patients with Cognitive Impairment in Electronic Health Records

Prediction of future cognitive impairment among the community elderly: A machine-learning based approach

Boosting Alzheimer Diagnosis Accuracy with the Help of Incomplete Privileged Information