Alzheimer Disease Detection from Raman Spectroscopy of the Cerebrospinal Fluid via Topological Machine Learning

Francesco Conti, Martina Banchelli, Valentina Bessi, Cristina Cecchi, Fabrizio Chiti, Sara Colantonio, Cristiano D'Andrea, Marella de Angelis, Davide Moroni, Benedetta Nacmias, Maria Antonietta Pascali, Sandro Sorbi, Paolo Matteini
DOI: https://doi.org/10.48550/arXiv.2309.03664
2023-09-07
Abstract: The cerebrospinal fluid (CSF) of 19 subjects who received a clinical diagnosis of Alzheimer's disease (AD) as well as of 5 pathological controls have been collected and analysed by Raman spectroscopy (RS). We investigated whether the raw and preprocessed Raman spectra could be used to distinguish AD from controls. First, we applied standard Machine Learning (ML) methods obtaining unsatisfactory results. Then, we applied ML to a set of topological descriptors extracted from raw spectra, achieving a very good classification accuracy (>87%). Although our results are preliminary, they indicate that RS and topological analysis together may provide an effective combination to confirm or disprove a clinical diagnosis of AD. The next steps will include enlarging the dataset of CSF samples to validate the proposed method better and, possibly, to understand if topological data analysis could support the characterization of AD subtypes.
Machine Learning
What problem does this paper attempt to address?
This paper asks whether Alzheimer's disease (AD) can be detected from cerebrospinal fluid (CSF) by combining Raman spectroscopy (RS) with topological machine learning (TML). Specifically, the research aims to:

1. **Distinguish AD patients from the control group**: Determine whether the Raman spectra of CSF can effectively separate AD patients from pathological controls.
2. **Improve diagnostic accuracy**: Standard machine-learning methods perform poorly on this task, so the study extracts topological features and tests whether they significantly improve classification accuracy.
3. **Explore the potential of RS and TML**: Evaluate whether Raman spectroscopy combined with topological data analysis can confirm or exclude a clinical diagnosis, and whether the approach could help characterize different subtypes of AD.

### Research Background

Alzheimer's disease is a common neurodegenerative disease that affects tens of millions of people worldwide. Diagnosis currently requires a series of neurological examinations, and a definitive diagnosis can only be made by analysing brain tissue after the patient's death. Innovative, cost-effective and accurate diagnostic methods are therefore needed. Raman spectroscopy, a fast, efficient and non-invasive diagnostic tool, has shown potential for detecting specific biomarkers in body fluids.

### Method Overview

1. **Data collection**:
   - CSF samples were collected from 19 AD patients and 5 pathological controls (including patients with vascular dementia, hydrocephalus and multiple sclerosis).
   - Five Raman spectra were acquired from each sample using a micro-Raman spectrometer.
2. **Data preprocessing and feature extraction**:
   - Several transformations were applied to the raw Raman spectra, including the Fourier transform, the Welch transform and autocorrelation.
   - Persistence diagrams (PDs) were extracted and converted into feature vectors using vectorization methods such as persistence images, persistence landscapes, persistence silhouettes and Betti curves.
3. **Machine-learning classification**:
   - Classifiers such as the Support Vector Classifier (SVC), Random Forest Classifier and Ridge Classifier were used.
   - Leave One Patient Out (LOPO) cross-validation was used to evaluate model performance.

### Main Results

- The Fourier-transformed Raman spectra performed best in the classification task, reaching an accuracy of 87.5%.
- H0 features extracted directly from the raw Raman spectra also achieved an accuracy of over 83%, which is consistent with previous research results.

### Conclusion

This study shows that Raman spectroscopy combined with topological data analysis may provide an effective method for the diagnosis of AD. Future research will enlarge the sample size, validate the method further, and explore its potential for characterizing AD subtypes.
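To make the feature-extraction step in the Method Overview more concrete, here is a minimal sketch of H0 persistence for a one-dimensional signal, followed by a Betti-curve vectorization. It assumes a sublevel-set filtration, in which samples enter in order of increasing intensity; the summary does not state which filtration the authors used. The function names `h0_persistence` and `betti_curve` and the toy signal are illustrative, not taken from the paper.

```python
import math


def h0_persistence(signal):
    """Return sublevel-set H0 (birth, death) pairs of a 1-D signal.

    Assumption: samples enter the filtration in order of increasing value,
    and neighbouring samples are connected once both are present.
    """
    n = len(signal)
    order = sorted(range(n), key=lambda i: signal[i])
    parent = [None] * n  # None marks a sample not yet in the filtration
    birth = {}           # component root -> value at which it was born

    def find(i):
        # Union-find lookup with path halving.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    pairs = []
    for i in order:
        parent[i] = i
        birth[i] = signal[i]
        for j in (i - 1, i + 1):
            if 0 <= j < n and parent[j] is not None:
                ri, rj = find(i), find(j)
                if ri == rj:
                    continue
                # Elder rule: the component born later dies at this level.
                old, young = (ri, rj) if birth[ri] <= birth[rj] else (rj, ri)
                if birth[young] < signal[i]:  # skip zero-persistence pairs
                    pairs.append((birth[young], signal[i]))
                parent[young] = old
    # The component of the global minimum never dies.
    pairs.append((signal[order[0]], math.inf))
    return pairs


def betti_curve(pairs, thresholds):
    """Count the intervals alive (birth <= t < death) at each threshold."""
    return [sum(1 for b, d in pairs if b <= t < d) for t in thresholds]


# Toy signal with three local minima (hypothetical values, not real spectra).
toy_signal = [0.0, 2.0, 1.0, 3.0, 0.5, 4.0]
toy_pairs = h0_persistence(toy_signal)
# -> [(1.0, 2.0), (0.5, 3.0), (0.0, inf)]
toy_curve = betti_curve(toy_pairs, [0.0, 0.75, 1.5, 2.5, 3.5])
# -> [1, 2, 3, 2, 1]
```

Each local minimum gives birth to a connected component, and the component dies when it merges with an older one at a local maximum. The Betti curve turns the resulting diagram into a fixed-length vector that a standard classifier can consume.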
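The Leave One Patient Out evaluation named in the Method Overview can be sketched as follows: every fold holds out all spectra of one patient, so spectra from the same person never appear in both training and test sets. This is a sketch under stated assumptions. The nearest-centroid classifier is only a stand-in for the SVC, Random Forest and Ridge classifiers mentioned above, the one-dimensional features are synthetic, and accuracy is counted per spectrum, which may differ from how the authors aggregated predictions.

```python
def leave_one_patient_out(patient_ids):
    """Yield (held_out_patient, train_indices, test_indices) per patient."""
    for held_out in sorted(set(patient_ids)):
        train = [i for i, p in enumerate(patient_ids) if p != held_out]
        test = [i for i, p in enumerate(patient_ids) if p == held_out]
        yield held_out, train, test


def fit_centroids(X, y):
    """Stand-in classifier: store the mean feature vector of each class."""
    sums, counts = {}, {}
    for x, label in zip(X, y):
        acc = sums.setdefault(label, [0.0] * len(x))
        for k, v in enumerate(x):
            acc[k] += v
        counts[label] = counts.get(label, 0) + 1
    return {lab: [s / counts[lab] for s in acc] for lab, acc in sums.items()}


def predict_centroid(centroids, x):
    """Return the label of the closest class centroid (squared distance)."""
    return min(
        centroids,
        key=lambda c: sum((a - b) ** 2 for a, b in zip(centroids[c], x)),
    )


def lopo_accuracy(features, labels, patient_ids, fit, predict):
    """Fraction of spectra classified correctly across all LOPO folds."""
    correct = 0
    for _, train, test in leave_one_patient_out(patient_ids):
        model = fit([features[i] for i in train], [labels[i] for i in train])
        correct += sum(predict(model, features[i]) == labels[i] for i in test)
    return correct / len(labels)


# Synthetic layout mirroring the study design: 24 patients, 5 spectra each,
# the first 19 patients labelled AD and the last 5 labelled control.
patient_ids = [p for p in range(24) for _ in range(5)]
labels = ["AD" if p < 19 else "control" for p in patient_ids]
# Hypothetical, perfectly separable 1-D features (not real data).
features = [[0.0] if lab == "AD" else [1.0] for lab in labels]

folds = list(leave_one_patient_out(patient_ids))
accuracy = lopo_accuracy(
    features, labels, patient_ids, fit_centroids, predict_centroid
)
```

Grouping the split by patient matters here because each sample contributes five spectra: a plain random split would let near-duplicate spectra from one patient leak into the training set and inflate the reported accuracy.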