Literature DB >> 25843219

Do tree split probabilities determine the branch lengths?

Benny Chor1, Mike Steel2.   

Abstract

The evolution of aligned DNA sequence sites is generally modeled by a Markov process operating along the edges of a phylogenetic tree. It is well known that the probability distribution on the site patterns at the tips of the tree determines the tree topology, and its branch lengths. However, the number of patterns is typically much larger than the number of edges, suggesting considerable redundancy in the branch length estimation. In this paper we ask whether the probabilities of just the 'edge-specific' patterns (the ones that correspond to a change of state on a single edge) suffice to recover the branch lengths of the tree, under a symmetric 2-state Markov process. We first show that this holds provided the branch lengths are sufficiently short, by applying the inverse function theorem. We then consider whether this restriction to short branch lengths is necessary. We show that for trees with up to four leaves it can be lifted. This leaves open the interesting question of whether this holds in general. Our results also extend to certain Markov processes on more than 2-states, such as the Jukes-Cantor model.
Copyright © 2015 Elsevier Ltd. All rights reserved.

Keywords:  Evolutionary model; Hadamard transform; Inverse function theorem; Markov process; Phylogenetic tree reconstruction; Systems of polynomial equations

Mesh:

Year:  2015        PMID: 25843219     DOI: 10.1016/j.jtbi.2015.03.024

Source DB:  PubMed          Journal:  J Theor Biol        ISSN: 0022-5193            Impact factor:   2.691


  1 in total

1.  GENESIS - The GENEric SImulation System for Modelling State Transitions.

Authors:  Matthew S Gillman
Journal:  J Open Res Softw       Date:  2017-09-20
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.