Bayesian modeling of co-occurrence microbial interaction networks

Tejasv Bedi,Bencong Zhu,Michael L. Neugent,Kevin C. Lutz,Nicole J. De Nisco,Qiwei Li
DOI: https://doi.org/10.48550/arXiv.2404.09194
2024-04-14
Abstract:The human body consists of microbiomes associated with the development and prevention of several diseases. These microbial organisms form several complex interactions that are informative to the scientific community for explaining disease progression and prevention. Contrary to the traditional view of the microbiome as a singular, assortative network, we introduce a novel statistical approach using a weighted stochastic infinite block model to analyze the complex community structures within microbial co-occurrence microbial interaction networks. Our model defines connections between microbial taxa using a novel semi-parametric rank-based correlation method on their transformed relative abundances within a fully connected network framework. Employing a Bayesian nonparametric approach, the proposed model effectively clusters taxa into distinct communities while estimating the number of communities. The posterior summary of the taxa community membership is obtained based on the posterior probability matrix, which could naturally solve the label switching problem. Through simulation studies and real-world application to microbiome data from postmenopausal patients with recurrent urinary tract infections, we demonstrate that our method has superior clustering accuracy over alternative approaches. This advancement provides a more nuanced understanding of microbiome organization, with significant implications for disease research.
Methodology
What problem does this paper attempt to address?
The problem that this paper attempts to solve is to detect community structures in microbial co - occurrence networks, and estimate the parameters of each community as well as the number of communities. Specifically, the authors propose a new statistical method - the Weighted Stochastic Infinite Block Model (WSIBM) - to analyze the complex community structures within microbial communities. This method establishes connections between taxa by defining the transformed relative abundances of microbial taxa using a new semi - parametric rank - correlation method within a fully - connected network framework. Through the Bayesian non - parametric method, this model can effectively cluster taxa into different communities and estimate the number of communities. In addition, this model also solves the label - switching problem and naturally obtains a summary of the community membership of taxa through the posterior probability matrix. ### Key points: 1. **Microbial co - occurrence network**: The species in microbial communities interact to form complex networks, which are of great significance for explaining disease progression and prevention. 2. **Community detection**: Traditional microbiomes are regarded as single, homogeneous networks, while this paper proposes a new model to detect community structures in microbial co - occurrence networks. 3. **Weighted Stochastic Infinite Block Model (WSIBM)**: This model uses a semi - parametric rank - correlation method to process the transformed relative abundance data and estimates the number of communities and parameters through the Bayesian non - parametric method. 4. **Label - switching problem**: The label - switching problem is naturally solved through the posterior probability matrix. 5. **Practical applications**: Through simulation studies and practical applications (such as the microbiome data of postmenopausal women with recurrent urinary tract infections), the superiority of this method in terms of community detection accuracy has been proven. ### Formula examples: - **Fisher transformation**: It is used to map the correlation coefficient to the real number line. The formula is as follows: \[ W_{jj'} = F(R_{jj'})=\frac{1}{2}\ln\left(\frac{1 + R_{jj'}}{1 - R_{jj'}}\right) \] - **Posterior probability matrix**: It is used to solve the label - switching problem. The formula is as follows: \[ M_{j,j'}=\frac{1}{B}\sum_{b = 1}^{B}I(z_j^{(b)}=z_{j'}^{(b)}) \] ### Conclusion: This study provides a more detailed method to understand the organizational structure of the microbiome and has an important impact on disease research. By introducing the WSIBM model, researchers can more accurately detect community structures in microbial co - occurrence networks, thus laying the foundation for future microbiome and human health research.