Evaluation of gut microbiota predictive potential associated with phenotypic characteristics to identify multifactorial diseases

Danielle Cristina Fonseca,Ilanna Marques Gomes da Rocha,Bianca Depieri Balmant,Leticia Callado,Ana Paula Aguiar Prudêncio,Juliana Tepedino Martins Alves,Raquel Susana Torrinhas,Gabriel da Rocha Fernandes,Dan Linetzky Waitzberg
DOI: https://doi.org/10.1080/19490976.2023.2297815
IF: 12.2
2024-01-19
Gut Microbes
Abstract:Gut microbiota has been implicated in various clinical conditions, yet the substantial heterogeneity in gut microbiota research results necessitates a more sophisticated approach than merely identifying statistically different microbial taxa between healthy and unhealthy individuals. Our study seeks to not only select microbial taxa but also explore their synergy with phenotypic host variables to develop novel predictive models for specific clinical conditions. Design: We assessed 50 healthy and 152 unhealthy individuals for phenotypic variables (PV) and gut microbiota (GM) composition by 16S rRNA gene sequencing. The entire modeling process was conducted in the R environment using the Random Forest algorithm. Model performance was assessed through ROC curve construction. Results: We evaluated 52 bacterial taxa and pre-selected PV ( p < 0.05) for their contribution to the final models. Across all diseases, the models achieved their best performance when GM and PV data were integrated. Notably, the integrated predictive models demonstrated exceptional performance for rheumatoid arthritis (AUC = 88.03%), type 2 diabetes (AUC = 96.96%), systemic lupus erythematosus (AUC = 98.4%), and type 1 diabetes (AUC = 86.19%). Conclusion: Our findings underscore that the selection of bacterial taxa based solely on differences in relative abundance between groups is insufficient to serve as clinical markers. Machine learning techniques are essential for mitigating the considerable variability observed within gut microbiota. In our study, the use of microbial taxa alone exhibited limited predictive power for health outcomes, while the integration of phenotypic variables into predictive models substantially enhanced their predictive capabilities. What is Already Known on this Subject? What Are the New Findings? How This Study Might Advance Research
gastroenterology & hepatology,microbiology
What problem does this paper attempt to address?