Modeling Mito-nuclear Compatibility and its Role in Species Identification

Debora Princepe,Marcus A. M. de Aguiar,Marcus A.M. de Aguiar
DOI: https://doi.org/10.48550/arXiv.2010.05656
2020-10-12
Populations and Evolution
Abstract:Mitochondrial genetic material is widely used for phylogenetic reconstruction and as a barcode for species identification. Here we study how mito-nuclear interactions affect the accuracy of species identification by mtDNA, as well as the speciation process itself. We simulate the evolution of a population of individuals who carry a recombining nuclear genome and a mitochondrial genome inherited maternally. We compare a null model fitness landscape that lacks any mito-nuclear interaction against a scenario in which interactions influence fitness. Fitness is assigned to individuals according to their mito-nuclear compatibility, which drives the coevolution of the nuclear and mitochondrial genomes. When the population breaks into distinct species we analyze the accuracy of mtDNA barcode for species identification. Remarkably, we find that species identification by mtDNA is equally accurate in the presence or absence of mito-nuclear coupling and that the success of the DNA barcode derives mainly from population geographical isolation during speciation. Nevertheless, selection imposed by mito-nuclear compatibility influences the diversification process and leaves signatures in the genetic content and spatial distribution of the populations, in three ways: phylogenetic trees are more balanced; clades correlate strongly with the spatial distribution and; there is a substantial increase in the intraspecies mtDNA similarity. We compare the evolutionary patterns observed in our model to empirical data from copepods (\textit{T. californicus}). We find good qualitative agreement in the geographic patterns and the topology of the phylogenetic tree, provided the model includes selection based on mito-nuclear interactions. These results highlight the role of mito-nuclear compatibility in the speciation process and its reconstruction from genetic data.
What problem does this paper attempt to address?