Literature DB >> 28434023

Developing a statistically powerful measure for quartet tree inference using phylogenetic identities and Markov invariants.

Jeremy G Sumner1, Amelia Taylor2, Barbara R Holland3, Peter D Jarvis3.   

Abstract

Recently there has been renewed interest in phylogenetic inference methods based on phylogenetic invariants, alongside the related Markov invariants. Broadly speaking, both these approaches give rise to polynomial functions of sequence site patterns that, in expectation value, either vanish for particular evolutionary trees (in the case of phylogenetic invariants) or have well understood transformation properties (in the case of Markov invariants). While both approaches have been valued for their intrinsic mathematical interest, it is not clear how they relate to each other, and to what extent they can be used as practical tools for inference of phylogenetic trees. In this paper, by focusing on the special case of binary sequence data and quartets of taxa, we are able to view these two different polynomial-based approaches within a common framework. To motivate the discussion, we present three desirable statistical properties that we argue any invariant-based phylogenetic method should satisfy: (1) sensible behaviour under reordering of input sequences; (2) stability as the taxa evolve independently according to a Markov process; and (3) explicit dependence on the assumption of a continuous-time process. Motivated by these statistical properties, we develop and explore several new phylogenetic inference methods. In particular, we develop a statistically bias-corrected version of the Markov invariants approach which satisfies all three properties. We also extend previous work by showing that the phylogenetic invariants can be implemented in such a way as to satisfy property (3). A simulation study shows that, in comparison to other methods, our new proposed approach based on bias-corrected Markov invariants is extremely powerful for phylogenetic inference. The binary case is of particular theoretical interest as-in this case only-the Markov invariants can be expressed as linear combinations of the phylogenetic invariants. A wider implication of this is that, for models with more than two states-for example DNA sequence alignments with four-state models-we find that methods which rely on phylogenetic invariants are incapable of satisfying all three of the stated statistical properties. This is because in these cases the relevant Markov invariants belong to a class of polynomials independent from the phylogenetic invariants.

Keywords:  Markov chains; Phylogenetic invariants; Quartets; Representation theory

Mesh:

Substances:

Year:  2017        PMID: 28434023     DOI: 10.1007/s00285-017-1129-2

Source DB:  PubMed          Journal:  J Math Biol        ISSN: 0303-6812            Impact factor:   2.259


  15 in total

1.  Multiple maxima of likelihood in phylogenetic trees: an analytic approach.

Authors:  B Chor; M D Hendy; B R Holland; D Penny
Journal:  Mol Biol Evol       Date:  2000-10       Impact factor: 16.240

2.  Phylogenetic invariants for the general Markov model of sequence mutation.

Authors:  Elizabeth S Allman; John A Rhodes
Journal:  Math Biosci       Date:  2003-12       Impact factor: 2.144

3.  Lie Markov models.

Authors:  J G Sumner; J Fernández-Sánchez; P D Jarvis
Journal:  J Theor Biol       Date:  2011-12-29       Impact factor: 2.691

4.  Markov invariants and the isotropy subgroup of a quartet tree.

Authors:  J G Sumner; P D Jarvis
Journal:  J Theor Biol       Date:  2009-02-01       Impact factor: 2.691

5.  Markov invariants, plethysms, and phylogenetics.

Authors:  J G Sumner; M A Charleston; L S Jermiin; P D Jarvis
Journal:  J Theor Biol       Date:  2008-04-07       Impact factor: 2.691

Review 6.  Genome-wide genetic marker discovery and genotyping using next-generation sequencing.

Authors:  John W Davey; Paul A Hohenlohe; Paul D Etter; Jason Q Boone; Julian M Catchen; Mark L Blaxter
Journal:  Nat Rev Genet       Date:  2011-06-17       Impact factor: 53.242

7.  Dimensional Reduction for the General Markov Model on Phylogenetic Trees.

Authors:  Jeremy G Sumner
Journal:  Bull Math Biol       Date:  2017-02-10       Impact factor: 1.758

8.  Low-parameter phylogenetic inference under the general markov model.

Authors:  Barbara R Holland; Peter D Jarvis; Jeremy G Sumner
Journal:  Syst Biol       Date:  2012-08-22       Impact factor: 15.683

9.  A rate-independent technique for analysis of nucleic acid sequences: evolutionary parsimony.

Authors:  J A Lake
Journal:  Mol Biol Evol       Date:  1987-03       Impact factor: 16.240

10.  The neighbor-joining method: a new method for reconstructing phylogenetic trees.

Authors:  N Saitou; M Nei
Journal:  Mol Biol Evol       Date:  1987-07       Impact factor: 16.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.