The reconstructed tree in the lineage-based model of protracted speciation

Amaury Lambert, Hélène Morlon, Rampal S. Etienne
DOI: https://doi.org/10.48550/arXiv.1301.5512
2013-01-23
Abstract: A popular line of research in evolutionary biology is the use of time-calibrated phylogenies for the inference of diversification processes. This requires computing the likelihood of a given ultrametric tree as the reconstructed tree produced by a given model of diversification. Etienne & Rosindell (2012) proposed a lineage-based model of diversification, called protracted speciation, where species remain incipient during a random duration before turning good species, and showed that this can explain the slowdown in lineage accumulation observed in real phylogenies. However, they were unable to provide a general likelihood formula. Here, we present a likelihood formula for protracted speciation models, where rates at which species turn good or become extinct can depend both on their age and on time. Our only restrictive assumption is that speciation rate does not depend on species status. Our likelihood formula utilizes a new technique, based on the contour of the phylogenetic tree and first developed in Lambert (2010). We consider the reconstructed trees spanned by all extant species, by all good extant species, or by all representative species, which are either good extant species or incipient species representative of some good extinct species. Specifically, we prove that each of these trees is a coalescent point process, that is, a planar, ultrametric tree where the coalescence times between two consecutive tips are independent, identically distributed random variables. We characterize the common distribution of these coalescence times in some, biologically meaningful, special cases for which the likelihood reduces to an elegant analytical formula or becomes numerically tractable.
Populations and Evolution, Probability
What problem does this paper attempt to address?
This paper addresses how to use time-calibrated phylogenies to infer the process of species diversification. Doing so requires computing the likelihood of a given ultrametric tree (i.e., a tree in which all tips are equidistant from the root) as the reconstructed tree produced by a given diversification model. Etienne & Rosindell (2012) proposed a lineage-based model, called protracted speciation, in which a new species remains "incipient" for a random duration before becoming a "good" species. The model can explain the slowdown in lineage accumulation observed in real phylogenies, but they were unable to provide a general likelihood formula. This paper presents such a formula for protracted speciation models in which the rates at which species turn good or become extinct can depend both on species age and on time. The only restrictive assumption is that the speciation rate does not depend on species status (incipient or good). The formula relies on a technique based on the contour of the phylogenetic tree, first developed in Lambert (2010). The authors prove that the reconstructed trees spanned by all extant species, by all good extant species, or by all representative species (good extant species, or incipient species representative of some good extinct species) are coalescent point processes: planar, ultrametric trees in which the coalescence times between consecutive tips are independent, identically distributed random variables. They also characterize the common distribution of these coalescence times in some biologically meaningful special cases, for which the likelihood reduces to an analytical formula or becomes numerically tractable.
In summary, the main objective of this paper is to provide a general likelihood formula for the protracted speciation model in order to better understand and infer the dynamic process of species diversification.
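The coalescent point process property is what makes the likelihood tractable: if the coalescence times are independent draws from a common distribution, the likelihood of a tree factorizes over its node depths. The following is a minimal sketch of that idea, not the paper's formula. It assumes the usual convention that the tree is conditioned on a stem age `stem_age` and ends at the first draw exceeding it, so the likelihood is the product of the densities of the observed depths times the probability that one draw exceeds `stem_age`. The function names and the exponential distribution used in the usage lines are hypothetical placeholders; the paper's coalescence-time distribution depends on the protracted speciation rates and is not reproduced here.

```python
import math
import random


def simulate_cpp(sample_depth, stem_age, rng):
    """Draw i.i.d. coalescence depths until one exceeds stem_age.

    Returns the depths drawn before that stopping event; a tree with
    n tips has n - 1 such depths.
    """
    depths = []
    while True:
        h = sample_depth(rng)
        if h > stem_age:
            return depths
        depths.append(h)


def cpp_log_likelihood(depths, stem_age, log_density, survival):
    """Log-likelihood of node depths under a CPP conditioned on stem_age.

    log_density(h) is the log density of one coalescence time and
    survival(t) is the probability that one coalescence time exceeds t.
    """
    if any(h < 0 or h > stem_age for h in depths):
        return float("-inf")  # depths outside [0, stem_age] are impossible
    return sum(log_density(h) for h in depths) + math.log(survival(stem_age))


if __name__ == "__main__":
    rate = 1.0  # placeholder exponential rate, for illustration only
    rng = random.Random(0)
    depths = simulate_cpp(lambda r: r.expovariate(rate), 2.0, rng)
    loglik = cpp_log_likelihood(
        depths,
        2.0,
        lambda h: math.log(rate) - rate * h,
        lambda t: math.exp(-rate * t),
    )
    print(len(depths) + 1, "tips, log-likelihood", loglik)
```

Passing the density and survival function as arguments keeps the factorization separate from the choice of model, so a distribution derived for a particular special case could be substituted without changing the likelihood routine.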