: a capture probe design toolkit for genetic diversity reconstructions from ancient environmental DNA
Zihao Huang,Zhengquan Gu,Yuanyang Cai,Ruairidh Macleod,Zhe Xue,Haoran Dong,Søren Overballe-Petersen,Shanlin Liu,Yu Gao,Hao Li,Sha Tang,Xianmin Diao,Morten Egevang Joergensen,Christoph Dockter,Lasse Vinner,Eske Willerslev,Fahu Chen,Hongru Wang,Yucheng Wang
DOI: https://doi.org/10.1101/2024.09.02.610737
2024-09-03
Abstract:Ancient environmental DNA (aeDNA) is now commonly used in paleoecology and evolutionary ecology, yet due to difficulties in gaining sufficient genome coverage on individual species from metagenome data, its genetic perspectives remain largely uninvestigated. Hybridization capture has proven as an effective approach for enriching the DNA of target species, thus increasing the genome coverage of sequencing data and enabling population and evolutionary genetics analysis. However, to date there is no tool available for designing capture probe sets tailored for aeDNA based population genetics. Here we present , an efficient, flexible and easy-to-use program toolkit that provides a complete workflow for capture probe design, assessment and validation. By benchmarking a probe set for foxtail millet, an annual grass, made by the workflow, we demonstrate a remarkable increase of capturing efficiency, with the target taxa recovery rate improved by 577-fold, and the genome coverage achieved by soil capture-sequencing data even higher than data directly shotgun sequenced from the plant tissues. Probes that underwent our filtering panels show notably higher efficiency. The capture sequencing data enabled accurate population and evolutionary genetic analysis, by effectively inferring the fine-scale genetic structures and patterns, as well as the genotypes on functional genes.
Bioinformatics