An efficient pipeline for ancient DNA mapping and recovery of endogenous ancient DNA from whole‐genome sequencing data

Wenhao Xu, Yu Lin, Keliang Zhao, Haimeng Li, Yinping Tian, Jacob Njaramba Ngatia, Yue Ma, Sunil Kumar Sahu, Huabing Guo, Xiaosen Guo, Yan Chun Xu, Huan Liu, Karsten Kristiansen, Tianming Lan, Xinying Zhou
DOI: https://doi.org/10.1002/ece3.7056
IF: 3.167
2020-12-21
Ecology and Evolution
Abstract: Ancient DNA research has developed rapidly over the past few decades due to improvements in PCR and next‐generation sequencing (NGS) technologies, but challenges still exist. One major challenge in ancient DNA research is recovering genuine endogenous ancient DNA sequences from raw sequencing data. This is often difficult because ancient DNA is degraded and contamination levels are high, especially homologous contamination, which has a genetic background extremely similar to that of the real ancient DNA. In this study, we collected whole‐genome sequencing (WGS) data from six ancient samples to compare different mapping algorithms. To explore more effective methods for separating endogenous DNA from homologous contamination, we attempted to recover reads based on the ancient DNA specific characteristics of deamination, depurination, and DNA fragmentation with different parameters. We propose a quick and improved pipeline for separating endogenous ancient DNA while simultaneously decreasing homologous contamination to very low proportions. Our goal in this research was to develop useful recommendations for ancient DNA mapping and for separation of endogenous DNA to facilitate future studies of ancient DNA.
ecology, evolutionary biology
What problem does this paper attempt to address?
This paper addresses a major challenge in ancient DNA research: recovering genuine endogenous ancient DNA sequences from raw sequencing data. The task is difficult because ancient DNA is highly degraded and contamination levels are high, especially homologous contamination, whose genetic background is extremely similar to that of the genuine ancient DNA. To tackle this, the authors propose a fast, improved pipeline that isolates endogenous ancient DNA while reducing the proportion of homologous contamination to very low levels. Specifically, they compare different mapping algorithms on whole‐genome sequencing data from six ancient samples, then test read‐recovery filters based on characteristics specific to ancient DNA (deamination, depurination, and fragmentation) under different parameter settings. The paper also offers recommendations for ancient DNA mapping and for separating endogenous DNA, intended to support future ancient DNA studies.
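The idea of recovering reads by ancient DNA characteristics can be illustrated with a minimal sketch. It is not the authors' pipeline: the function names, the 100 bp length cutoff, and the 3 base terminal window are hypothetical values chosen for illustration, and the sketch assumes each read is already aligned without gaps to a reference segment of equal length. It keeps a read only if it is short (fragmentation) and shows a C to T mismatch near the 5' end or a G to A mismatch near the 3' end (the usual signature of deamination); depurination is not modeled.

```python
from typing import Iterable, List, Tuple


def has_terminal_deamination(read: str, ref: str, window: int = 3) -> bool:
    """Return True if the read shows a deamination-like mismatch at either end.

    Checks for reference C read as T within the first `window` bases, or
    reference G read as A within the last `window` bases.
    `window` is a hypothetical parameter, not a value from the paper.
    """
    if window < 1:
        raise ValueError("window must be at least 1")
    if len(read) != len(ref):
        raise ValueError("read and reference segment must have equal length (ungapped)")
    read, ref = read.upper(), ref.upper()
    # 5' end: reference C observed as T in the read
    five_prime = any(r == "C" and q == "T" for q, r in zip(read[:window], ref[:window]))
    # 3' end: reference G observed as A in the read
    three_prime = any(r == "G" and q == "A" for q, r in zip(read[-window:], ref[-window:]))
    return five_prime or three_prime


def keep_putative_endogenous(
    pairs: Iterable[Tuple[str, str]],
    max_len: int = 100,  # assumed fragment-length cutoff, for illustration only
    window: int = 3,
) -> List[Tuple[str, str]]:
    """Keep (read, reference) pairs that are short and carry terminal damage."""
    return [
        (read, ref)
        for read, ref in pairs
        if len(read) <= max_len and has_terminal_deamination(read, ref, window)
    ]
```

A filter like this trades yield for purity: requiring terminal damage discards undamaged endogenous reads along with modern contaminants, which is why the cutoffs would need to be tuned per sample rather than taken from this sketch.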