Ranking and filtering of neuropathology features in the machine learning evaluation of dementia studies

Mohammed D. Rajab,Teruka Taketa,Stephen B. Wharton,Dennis Wang,and for the Alzheimer's Disease Neuroimaging Initiative Cognitive Function and Ageing Neuropathology Study,Cognitive Function and Ageing Neuropathology Study,and for the Alzheimer's Disease Neuroimaging Initiative
DOI: https://doi.org/10.1111/bpa.13247
IF: 7.611
2024-02-21
Brain Pathology
Abstract:Methodology for dementia classification using CFAS and ADNI datasets. The dementia classification methodology was developed and executed in three key stages: design, implementation, and evaluation. After acquiring neuropathology data, we carried out pre‐processing and determined the correlation between different features. Utilising seven filter methods, we ranked all neuropathological features. Subsequently, we explored the connection between feature–feature correlation and feature ranking across all applied filter methods. Thereafter, classifiers were evaluated using various feature subsets, depending on their interrelations. Early diagnosis of dementia diseases, such as Alzheimer's disease, is difficult because of the time and resources needed to perform neuropsychological and pathological assessments. Given the increasing use of machine learning methods to evaluate neuropathology features in the brains of dementia patients, it is important to investigate how the selection of features may be impacted and which features are most important for the classification of dementia. We objectively assessed neuropathology features using machine learning techniques for filtering features in two independent ageing cohorts, the Cognitive Function and Aging Studies (CFAS) and Alzheimer's Disease Neuroimaging Initiative (ADNI). The reliefF and least loss methods were most consistent with their rankings between ADNI and CFAS; however, reliefF was most biassed by feature–feature correlations. Braak stage was consistently the highest ranked feature and its ranking was not correlated with other features, highlighting its unique importance. Using a smaller set of highly ranked features, rather than all features, can achieve a similar or better dementia classification performance in CFAS (60%–70% accuracy with Naïve Bayes). This study showed that specific neuropathology features can be prioritised by feature filtering methods, but they are impacted by feature–feature correlations and their results can vary between cohort studies. By understanding these biases, we can reduce discrepancies in feature ranking and identify a minimal set of features needed for accurate classification of dementia.
pathology,neurosciences,clinical neurology
What problem does this paper attempt to address?