Assessing genome conservation on pangenome graphs with PanSel

Matthias Zytnicki
DOI: https://doi.org/10.1101/2024.04.26.591236
2024-10-08
Abstract:Motivation: With more and more telomere-to-telomere genomes assembled, pangenomes make it possible to capture the genomic diversity of a species. Because they introduce less biases, pangenomes, represented as graphs, tend to supplant the usual linear representation of a reference genome, augmented with variations. However, this major change requires new tools adapted to this data structure. Among the numerous questions that can be addressed to a pangenome graph is the search for conserved or divergent genes. Results: In this article, we present a new tool, named PanSel, which computes a conservation score for each segment of the genome, and finds genomic regions that are significantly conserved, or divergent. Availability: PanSel, written in C++11 with no dependency, is available at https://github.com/mzytnicki/pansel. Contact:matthias.zytnicki@inrae.fr
Bioinformatics
What problem does this paper attempt to address?