Metagenome-assembled genomes of Estonian Microbiome cohort reveal novel species and their links with prevalent diseases

Kateryna Pantiukh,Oliver Aasmets,Kertu Liis Krigul,Elin Org
DOI: https://doi.org/10.1101/2024.07.06.602324
2024-07-09
Abstract:Deep metagenomic data from population studies enables genome recovery and construction of population-specific references, including new species and uncovering microbial diversity that global references might miss. We constructed an Estonian population-specific reference of metagenome-assembled genomes (MAGs) from 1,878 stool samples of the EstMB-deep cohort. We assembled 84,762 MAGs, representing 2,257 species, including 353 potentially novel species (15.6%). Additionally, 607 species (26.9%) were not present in the global Unified Human Gastrointestinal Genome (UHGG) reference database and may therefore be population-specific. We further demonstrated the value of de novo assembly of bacterial genomes by analysing associations with 33 prevalent diseases and detected 44 significant associations for 15 diseases, including with 9 potentially new species and 5 species absent from UHGG. The correlations, especially with new species, demonstrate that de novo bacterial genome assembly from population cohorts can provide significant novel insights linking the microbiome with prevalent diseases and uncovering population-specific differences.
Bioinformatics
What problem does this paper attempt to address?