A high coverage Mesolithic aurochs genome and effective leveraging of ancient cattle genomes using whole genome imputation

Jolijn A M Erven,Amelie Scheu,Marta Pereira Verdugo,Lara Cassidy,Ningbo Chen,Birgit Gehlen,Martin Street,Ole Madsen,Victoria E Mullin
DOI: https://doi.org/10.1093/molbev/msae076
IF: 10.7
2024-04-26
Molecular Biology and Evolution
Abstract:Ancient genomic analyses are often restricted to utilising pseudo-haploid data due to low genome coverage. Leveraging low coverage data by imputation to calculate phased diploid genotypes that enable haplotype-based interrogation and SNP calling at unsequenced positions is highly desirable. This has not been investigated for ancient cattle genomes despite these being compelling subjects for archaeological, evolutionary and economic reasons. Here we test this approach by sequencing a Mesolithic European aurochs (18.49x; 9852-9376 calBCE), an Early Medieval European cow (18.69x; 427-580 calCE), and combine these with published individuals; two ancient and three modern. We downsample these genomes (0.25x, 0.5x, 1.0x, 2.0x) and impute diploid genotypes, utilising a reference panel of 171 published modern cattle genomes that we curated for 21.7 million (Mn) phased single-nucleotide polymorphisms (SNPs). We recover high densities of correct calls with an accuracy of >99.1% at variant sites for the lowest downsample depth of 0.25x, increasing to >99.5% for 2.0x (transversions only, minor allele frequency (MAF) ≥ 2.5%). The recovery of SNPs correlates with coverage, on average 58% of sites are recovered for 0.25x increasing to 87% for 2.0x, utilising an average of 3.5 million (Mn) transversions (MAF ≥2.5%), even in the aurochs, despite the highest temporal distance from the modern reference panel. Our imputed genomes behave similarly to directly called data in allele-frequency-based analyses; for example consistently identifying runs of homozygosity >2mb, including a long homozygous region in the Mesolithic European aurochs.
genetics & heredity,biochemistry & molecular biology,evolutionary biology
What problem does this paper attempt to address?