An Explainable-AI approach for Diagnosis of COVID-19 using MALDI-ToF Mass Spectrometry

Venkata Devesh Reddy Seethi,Zane LaCasse,Prajkta Chivte,Joshua Bland,Shrihari S. Kadkol,Elizabeth R. Gaillard,Pratool Bharti,Hamed Alhoori
DOI: https://doi.org/10.48550/arXiv.2109.14099
2023-05-24
Abstract:The severe acute respiratory syndrome coronavirus type-2 (SARS-CoV-2) caused a global pandemic and immensely affected the global economy. Accurate, cost-effective, and quick tests have proven substantial in identifying infected people and mitigating the spread. Recently, multiple alternative platforms for testing coronavirus disease 2019 (COVID-19) have been published that show high agreement with current gold standard real-time polymerase chain reaction (RT-PCR) results. These new methods do away with nasopharyngeal (NP) swabs, eliminate the need for complicated reagents, and reduce the burden on RT-PCR test reagent supply. In the present work, we have designed an artificial intelligence-based (AI) testing method to provide confidence in the results. Current AI applications for COVID-19 studies often lack a biological foundation in the decision-making process, and our AI approach is one of the earliest to leverage explainable AI (X-AI) algorithms for COVID-19 diagnosis using mass spectrometry. Here, we have employed X-AI to explain the decision-making process on a local (per-sample) and global (all samples) basis underscored by biologically relevant features. We evaluated our technique with data extracted from human gargle samples and achieved a testing accuracy of 94.12%. Such techniques would strengthen the relationship between AI and clinical diagnostics by providing biomedical researchers and healthcare workers with trustworthy and, most importantly, explainable test results
Machine Learning,Artificial Intelligence
What problem does this paper attempt to address?
The main problem that this paper attempts to solve is to develop a new COVID - 19 diagnostic method based on Explainable Artificial Intelligence (X - AI) and Matrix - Assisted Laser Desorption/Ionization Time - of - Flight Mass Spectrometry (MALDI - ToF MS). Specifically, the research aims to: 1. **Improve detection efficiency and accuracy**: By using non - invasive saliva samples and the fast, low - cost MALDI - ToF MS technology, replace the traditional nasopharyngeal swab and real - time polymerase chain reaction (RT - PCR) tests. This method not only reduces the need for complex reagents but also alleviates the pressure on laboratory reagent supplies for RT - PCR tests. 2. **Enhance the interpretability of diagnostic results**: Current artificial intelligence applications in COVID - 19 research often lack a biologically - based decision - making process. The method proposed in this paper is one of the earliest studies to use X - AI algorithms for COVID - 19 diagnosis, which can explain the decision - making process, provide local (per - sample) and global (all - samples) explanations, making the results more credible and transparent. 3. **Reduce the risk of model overfitting**: By selecting appropriate feature engineering techniques and machine learning algorithms (such as random forest), ensure that the model will not overfit due to high - dimensional data while maintaining good interpretability and performance. ### Specific problems and solutions - **Problem**: Existing COVID - 19 detection methods (such as RT - PCR), although accurate, are complex to operate and costly, and are difficult to promote on a large scale. **Solution**: Adopt MALDI - ToF MS combined with AI technology, use saliva samples for fast, low - cost detection, reduce the need for complex reagents, and increase the detection speed. - **Problem**: The decision - making process of traditional AI models in medical diagnosis is not transparent, and it is difficult to gain the trust of clinicians. **Solution**: Introduce X - AI algorithms to ensure that the decision - making process of AI models can be explained, thereby enhancing the trust of doctors and patients. - **Problem**: How to extract effective features from complex mass spectrometry data to improve the accuracy of the model. **Solution**: Process mass spectrometry data through various feature engineering techniques (such as AUC ratio, statistical features, etc.) and use machine learning algorithms such as random forest for classification, ultimately achieving a test accuracy of 94.12%. ### Summary This research has developed an efficient, accurate, and interpretable COVID - 19 diagnostic method by combining MALDI - ToF MS and X - AI technologies, providing new ideas and technical support for future medical diagnosis.