Assembling bacterial puzzles: piecing together functions into microbial pathways

Henri Chung, Iddo Friedberg, Yana Bromberg
DOI: https://doi.org/10.1101/2024.03.27.587058
2024-04-18
Abstract: Functional metagenomics enables the study of unexplored bacterial diversity, gene families, and pathways essential to microbial communities. However, discovering biological insights with these data is impeded by the scarcity of quality annotations. Here, we use a co-occurrence-based analysis of predicted microbial protein functions to uncover pathways in genomic and metagenomic biological systems. Our approach, based on phylogenetic profiles, improves the identification of functional relationships, or participation in the same biochemical pathway, between enzymes over a comparable homology-based approach. We optimized the design of our profiles to identify potential pathways using minimal data, clustered functionally related enzyme pairs into multi-enzymatic pathways, and evaluated our predictions against reference pathways in KEGG. We then demonstrated a novel extension of this approach to predict inter-bacterial protein interactions amongst members of a marine microbiome. Most significantly, we show our method predicts emergent biochemical pathways between known and unknown functions. Thus, our work establishes a basis for identifying the potential functional capacities of the entire metagenome, capturing previously unknown and abstract functions into discrete putative pathways.
Bioinformatics
What problem does this paper attempt to address?
This paper addresses three problems in microbiome function prediction:

1. **Lack of high-quality annotations**: Most microorganisms have not yet been cultured or described, so high-quality functional annotations are scarce in functional metagenomics. This scarcity limits the biological insight that can be drawn from the data.
2. **Identifying unknown functions**: The paper proposes a co-occurrence-based analysis of predicted microbial protein functions to identify potential metabolic pathways, in particular emergent pathways that link known and unknown functions.
3. **Cross-species protein interactions**: The paper extends the method to predict protein interactions between different bacteria in a marine microbiome.

Specifically, the authors build phylogenetic profiles and compare how protein functions co-occur across microbial genomes. According to the abstract, this approach identifies functional relationships between enzymes (participation in the same biochemical pathway) better than a comparable homology-based approach. The authors optimized the design of their profiles to identify potential pathways using minimal data, clustered functionally related enzyme pairs into multi-enzymatic pathways, and evaluated the predictions against reference pathways in KEGG. They then applied the method to marine metagenomic data to predict inter-bacterial protein interactions and emergent pathways.

In summary, the paper's main goal is to improve the understanding of unknown functions and their interactions in the microbiome through a co-occurrence analysis of predicted functions, providing a basis for identifying the potential functional capacities of an entire metagenome.
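The general idea of phylogenetic profiling described above can be illustrated with a minimal sketch. It is not the authors' implementation: the toy profiles, the function labels (`enzyme_A`, `unknown_X`, and so on), the use of Jaccard similarity as the co-occurrence score, the 0.7 threshold, and the connected-components clustering are all assumptions chosen for illustration. Each function is represented by the set of genomes in which it is predicted to occur; functions with similar sets are treated as functionally related, and related pairs are grouped into putative pathways.

```python
from itertools import combinations

# Hypothetical toy profiles: function label -> genomes where it is predicted.
profiles = {
    "enzyme_A": {"g1", "g2", "g3"},
    "enzyme_B": {"g1", "g2", "g3"},
    "enzyme_C": {"g1", "g2", "g3", "g4"},
    "enzyme_D": {"g4", "g5"},
    "unknown_X": {"g4", "g5"},
}


def jaccard(a, b):
    """Co-occurrence score: shared genomes divided by all genomes in either set."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def related_pairs(profiles, threshold):
    """Return (function, function, score) for pairs scoring at or above threshold."""
    pairs = []
    for f1, f2 in combinations(sorted(profiles), 2):
        score = jaccard(profiles[f1], profiles[f2])
        if score >= threshold:
            pairs.append((f1, f2, score))
    return pairs


def cluster_pathways(pairs):
    """Group related pairs into putative pathways (connected components)."""
    neighbors = {}
    for f1, f2, _ in pairs:
        neighbors.setdefault(f1, set()).add(f2)
        neighbors.setdefault(f2, set()).add(f1)

    seen, clusters = set(), []
    for start in sorted(neighbors):
        if start in seen:
            continue
        stack, component = [start], set()
        while stack:
            node = stack.pop()
            if node in component:
                continue
            component.add(node)
            stack.extend(neighbors[node] - component)
        seen |= component
        clusters.append(sorted(component))
    return clusters


# Assumed threshold; a real analysis would tune this against reference pathways.
pairs = related_pairs(profiles, threshold=0.7)
pathways = cluster_pathways(pairs)
print(pathways)
```

In this toy data, `unknown_X` ends up in the same cluster as `enzyme_D` because the two occur in exactly the same genomes, which mirrors how a co-occurrence approach can place an unannotated function next to a known one without relying on sequence homology.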