Intelligent Diagnosis of Alzheimer's Disease Based on Machine Learning

Mingyang Li, Hongyu Liu, Yixuan Li, Zejun Wang, Yuan Yuan, Honglin Dai
2024-02-14
Abstract: This study uses the Alzheimer's Disease Neuroimaging Initiative (ADNI) dataset to explore early detection and disease progression in Alzheimer's disease (AD). We apply data preprocessing strategies that include imputing missing values with the random forest algorithm and handling outliers and invalid records, so that the limited data available can be used as fully as possible. Using Spearman correlation coefficient analysis, we identify features that are strongly correlated with AD diagnosis. With these features we build and test three machine learning models: random forest, XGBoost, and support vector machine (SVM). Of the three, XGBoost gives the best diagnostic performance, achieving an accuracy of 91%. Overall, the study overcomes the challenge of missing data and provides insights into the early detection of Alzheimer's disease, which gives it both research value and practical significance.
Machine Learning, Applications
What problem does this paper attempt to address?
This paper addresses early detection of Alzheimer's disease (AD) and the study of its progression. Working from the Alzheimer's Disease Neuroimaging Initiative (ADNI) dataset, the authors investigate methods for early diagnosis of AD. Their preprocessing uses the random forest algorithm to fill in missing data and also handles outliers and invalid data, which lets them make full use of limited data resources. Spearman correlation coefficient analysis is then used to identify features strongly related to AD diagnosis, and three machine learning models are built on those features: random forest, XGBoost, and support vector machine (SVM). Among them, the XGBoost model performed best, achieving 91% accuracy. Overall, the study overcomes the challenge of missing data and provides insights for the early detection of AD, with both academic significance and potential clinical application value.
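The feature-screening step can be illustrated with a minimal sketch of Spearman rank correlation in plain Python. This is not the authors' code: the feature names (`score_a`, `score_b`, `noise`), the toy values, the 0/1/2 label encoding, and the 0.5 cutoff are all hypothetical and chosen only for illustration. The summary does not state which features or threshold the paper used.

```python
from statistics import mean


def average_ranks(values):
    """Return 1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with values[order[i]].
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared
        i = j + 1
    return ranks


def pearson(x, y):
    """Pearson correlation; returns 0.0 for a constant input (an assumption)."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    vx = sum((a - mx) ** 2 for a in x)
    vy = sum((b - my) ** 2 for b in y)
    if vx == 0 or vy == 0:
        return 0.0
    return cov / (vx * vy) ** 0.5


def spearman(x, y):
    """Spearman correlation is the Pearson correlation of the rank vectors."""
    return pearson(average_ranks(x), average_ranks(y))


def select_features(columns, labels, threshold=0.5):
    """Keep features whose |Spearman rho| with the label meets the threshold."""
    scores = {name: spearman(vals, labels) for name, vals in columns.items()}
    kept = [name for name, rho in scores.items() if abs(rho) >= threshold]
    kept.sort(key=lambda name: abs(scores[name]), reverse=True)
    return kept, scores


# Toy data (made up): three hypothetical features for six subjects.
toy_columns = {
    "score_a": [30, 29, 27, 25, 22, 18],
    "score_b": [1, 2, 3, 4, 5, 6],
    "noise": [5, 1, 4, 2, 6, 3],
}
# Hypothetical ordinal diagnosis labels: 0, 1, 2 in increasing severity.
toy_labels = [0, 0, 1, 1, 2, 2]

kept, scores = select_features(toy_columns, toy_labels)
print(kept)
```

Because Spearman correlation works on ranks, it captures monotonic relationships without assuming linearity, which suits ordinal diagnosis labels. The selected features would then be passed to the classifiers the paper names (random forest, XGBoost, SVM).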