Explainable AI for gut microbiome-based diagnostics: colorectal cancer as a case study
Animesh Acharjee
DOI: https://doi.org/10.1515/dx-2023-0062
2023-06-17
Diagnosis
Abstract:To the Editor, An increasing number of studies have found a correlation between gut microbiome composition changes and colorectal cancer (CRC) [1], [2], [3]. Researchers have utilised machine learning (ML) algorithms to identify potential CRC biomarkers and to distinguish CRC patients from healthy individuals [4]. Identification of the diagnostics microbes will help tremendously to develop a non-invasive biomarker for the CRC patients. Recent microbiome studies frequently employ random forest [5, 6] due to its predictive ability and capacity to generate feature importance. Random forest is a decision tree-based algorithm and proven to be sophisticated enough to capture bacteria that are typically associated with CRC, however it is unable to identify species that are only significant for a subset of patients. Recently, Rynazal et al., published an exciting paper on [7] on explainable AI and their application on microbiome data sets derived from CRC patients. This study investigated the feasibility of employing a method called as "Shapley Additive Explanations (SHAP)" technique for analysing gastrointestinal microbiome data [7]. The difference between normal ML methods compared to the explainable AI method is interpretability. For example: Using Random Forest method, we can rank the important variable (here microbiome) called as "Gini importance" and we rely solely on global explanation techniques. Local explanations enable us to observe distinct bacterial contribution patterns among CRC patients. One of the examples provided Rynazal et al., would be, Clostridium symbiosum was found to be the most influential bacteria for the patient with subject ID SAMD00114911. In Figure 1, shows the application of the explainable AI and discovery of the interpretable microbes for therapeutic intervention. Shows the application of the explainable AI and discovery of the interpretable microbes for therapeutic intervention. In contrast, Eubacterium eligens displays the opposite pattern, indicating that a higher abundance of this species is associated with a lower CRC risk. This understanding of the direction of effects cannot be obtained using commonly employed global explanation methods, such as the random forest's built-in feature importance. This study further investigated the benefits of using SHAP for individual feature contribution analyses and CRC subtyping. In addition, a Python library was developed to assist microbiome researchers with comparable analyses. Based on the contribution of each bacterial species to the classifier, the study discovered that SHAP can generate feature contributions for every single ML prediction and classify the disease group into subgroups of CRC patients. This study investigated the feasibility of using explainable AI for CRC classification based on the gastrointestinal microbiome. SHAP was utilised to acquire more individualised feature importance that can be used to identify potential bacterial CRC biomarkers. This method will benefit the microbiome community and encourage researchers to utilise local explanations for a more personalised identification of feature importance and subtyping and a huge potential for microbiome based diagnostics [8]. CRC is just an example of such applications however, it can be used other cancer types or other diseases like inflammatory bowel diseases [9] or metabolic diseases to make an interpretable model and hence a new avenue towards precision medicine [10]. Research funding: None declared. Author contributions: Animesh Acharjee is responsible for the writing, conceptualization, acquisition of data, and interpretation of the data. Competing interests: The author declares no conflicts of interest regarding this article. Informed consent: Not applicable. Ethical approval: Not applicable. 1. Roelands, J, Kuppen, PJK, Ahmed, EI, Mall, R, Masoodi, T, Singh, P, et al.. An integrated tumor, immune and microbiome atlas of colon cancer. Nat Med 2023;29:1273–86. https://doi.org/10.1038/s41591-023-02324-5. Search in Google Scholar PubMed PubMed Central 2. Rebersek, M. Gut microbiome and its role in colorectal cancer. BMC Cancer 2021;21:1325. https://doi.org/10.1186/s12885-021-09054-2. Search in Google Scholar PubMed PubMed Central 3. Ternes, D, Karta, J, Tsenkova, M, Wilmes, P, Haan, S, Letellier, E. Microbiome in colorectal cancer: how to get from meta-omics to mechanism? Trends Microbiol 2020;28:401–23. https:// -Abstract Truncated-