| Literature DB >> 31970528 |
Giovanni Laudanno1, Bart Haegeman2, Rampal S Etienne3.
Abstract
Molecular phylogenies have been increasingly recognized as an important source of information on species diversification. For many models of macroevolution, analytical likelihood formulas have been derived to infer macroevolutionary parameters from phylogenies. A few years ago, a general framework to numerically compute such likelihood formulas was proposed, which accommodates models that allow speciation and/or extinction rates to depend on diversity. This framework calculates the likelihood as the probability of the diversification process being consistent with the phylogeny from the root to the tips. However, while some readers found the framework presented in Etienne et al. (Proc R Soc Lond B Biol Sci 279(1732):1300-1309, 2012) convincing, others still questioned it (personal communication), despite numerical evidence that for special cases the framework yields the same (i.e., within double precision) numerical value for the likelihood as analytical formulas do that were independently derived for these special cases. Here we prove analytically that the likelihoods calculated in the new framework are correct for all special cases with known analytical likelihood formula. Our results thus add substantial mathematical support for the overall coherence of the general framework.Entities:
Keywords: Birth–death process; Diversification; Likelihood model; Macroevolution; Phylogenetic trees
Mesh:
Year: 2020 PMID: 31970528 PMCID: PMC6976549 DOI: 10.1007/s11538-020-00698-y
Source DB: PubMed Journal: Bull Math Biol ISSN: 0092-8240 Impact factor: 1.758
Fig. 1a Full tree where missing species are plotted as red dashed lines: the ones ending in a cross become extinct before the present, whereas the ones ending with a red dot are unsampled species at the present; b Corresponding reconstructed tree in which only extant species are present. This is the type of tree we usually work with because actual phylogenetic trees are usually obtained from molecular data taken from extant species; c Lineages-through-time plot: The green line represents the number of lineages leading to extant species (k), the red line represents lineages leading to extinct or unsampled species (m), and the blue line represents the total number of lineages ()
Fig. 2An example of how to build a likelihood for a tree with tips. We start with a vector at the crown age. We use and to evolve the vector across the entire tree (on branches and nodes, respectively) up to the present time according to . At the present time, the likelihood accounting for missing species will be proportional to the th component of the vector