Insights into Treponema pallidum genomics from modern and ancient genomes using a novel mapping strategy

Marta Pla-Diaz,Guelfirde Akguel,Martyna Molak,Louis du Plessis,Hanna Panagiotopoulou,Karolina Doan,Wieslaw Bogdanowicz,Pawel Dąbrowski,Maciej Oziemblowski,Barbara Kwiatkowska,Jacek Szczurowski,Joanna Grzelak,Natasha Arora,Kerttu Majander,Fernando Gonzalez-Candelas,Verena Schuenemann
DOI: https://doi.org/10.1101/2023.02.08.526988
2024-09-02
Abstract:Background: Treponemal diseases pose significant global health risks, presenting severe challenges to public health due to their serious health impacts if left untreated. Despite numerous genomic studies on Treponema pallidum and the known possible biases introduced by the choice of the reference genome used for mapping, few investigations have addressed how these biases affect phylogenetic and evolutionary analysis of these bacteria. In this study, we assessed the impact of selecting an appropriate genomic reference on phylogenetic and evolutionary analyses of T. pallidum. Results: We designed a multiple-reference-based (MRB) mapping strategy using four different reference genomes and compared it to traditional single-reference mapping. To conduct this comparison, we created a genomic dataset comprising 77 modern and ancient genomes from the three subspecies of T. pallidum, including a newly sequenced 17th-century genome (35X coverage) of a syphilis-causing strain (designated as W86). Our findings show that recombination detection was consistent across different references, but the choice of reference significantly affected ancient genome reconstruction and phylogenetic inferences. The high-coverage W86 genome obtained here also provided a new calibration point for Bayesian molecular clock dating, improving the reconstruction of the evolutionary history of treponemal diseases. Additionally, we identified novel recombination events, positive selection targets, and refined dating estimates for key events in the species' history. Conclusions: This study highlights the importance of considering methodological implications and reference genome bias in High-Throughput Sequencing-based whole-genome analysis of T. pallidum, especially of ancient or low-coverage samples, contributing to a deeper understanding of this pathogen and its subspecies.
Microbiology
What problem does this paper attempt to address?