Distances between Extension Spaces of Phylogenetic Trees

Maria Alejandra Valdez Cabrera, Amy D Willis
2024-06-29
Abstract: Phylogenetic trees summarize evolutionary relationships between organisms, and tools to analyze collections of phylogenetic trees enable contrasts between different genes' ancestry. The BHV metric space has enabled the analysis of collections of trees that share a common set of leaves, but many genes are not shared, even between closely related species. BHV extension spaces represent trees with non-identical leaf sets in a common BHV space, but limited analytical tools exist for extension spaces. We define the distance between two phylogenetic trees with non-identical leaf sets as the shortest BHV distance between their extension spaces, and develop a reduced gradient algorithm to compute this distance. We study the scalability of our algorithm and apply it to analyze gene trees spanning multiple domains of life. Our distance and algorithm offer a fully general, interpretable approach to analyzing both ancient and recent evolutionary divergence.
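The distance defined in the abstract can be written compactly as follows. This is only a sketch: the symbols are illustrative assumptions and are not taken from the paper. Here T_1 and T_2 are trees with leaf sets L_1 and L_2, N is a common leaf set assumed to contain both, E_N(T_i) is the extension space of T_i in the BHV space on N, and d_BHV is the BHV geodesic distance in that space.

```latex
% Assumed notation (not from the paper): E_N(T_i) is the extension space of T_i
% in the BHV space on a common leaf set N containing L_1 and L_2.
\[
  d(T_1, T_2) \;=\; \min_{x \in E_N(T_1),\; y \in E_N(T_2)} d_{\mathrm{BHV}}(x, y)
\]
```

Under this reading, computing the distance is a constrained minimization over pairs of points, one in each extension space, which is the problem the reduced gradient algorithm is described as solving.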
Quantitative Methods