Literature DB >> 22875145

Identifiability and inference of non-parametric rates-across-sites models on large-scale phylogenies.

Elchanan Mossel1, Sebastien Roch.   

Abstract

Mutation rate variation across loci is well known to cause difficulties, notably identifiability issues, in the reconstruction of evolutionary trees from molecular sequences. Here we introduce a new approach for estimating general rates-across-sites models. Our results imply, in particular, that large phylogenies are typically identifiable under rate variation. We also derive sequence-length requirements for high-probability reconstruction. Our main contribution is a novel algorithm that clusters sites according to their mutation rate. Following this site clustering step, standard reconstruction techniques can be used to recover the phylogeny. Our results rely on a basic insight: that, for large trees, certain site statistics experience concentration-of-measure phenomena.

Mesh:

Year:  2012        PMID: 22875145     DOI: 10.1007/s00285-012-0571-4

Source DB:  PubMed          Journal:  J Math Biol        ISSN: 0303-6812            Impact factor:   2.259


  14 in total

1.  Unidentifiable divergence times in rates-across-sites models.

Authors:  Steven N Evans; Tandy Warnow
Journal:  IEEE/ACM Trans Comput Biol Bioinform       Date:  2004 Jul-Sep       Impact factor: 3.710

2.  Pitfalls of heterogeneous processes for phylogenetic reconstruction.

Authors:  Daniel Stefankovic; Eric Vigoda
Journal:  Syst Biol       Date:  2007-02       Impact factor: 15.683

3.  Phylogeny of mixture models: robustness of maximum likelihood and non-identifiable distributions.

Authors:  Daniel Stefankovic; Eric Vigoda
Journal:  J Comput Biol       Date:  2007-03       Impact factor: 1.479

4.  Phylogenetic mixtures on a single tree can mimic a tree of another topology.

Authors:  Frederick A Matsen; Mike Steel
Journal:  Syst Biol       Date:  2007-10       Impact factor: 15.683

5.  Mixed-up trees: the structure of phylogenetic mixtures.

Authors:  Frederick A Matsen; Elchanan Mossel; Mike Steel
Journal:  Bull Math Biol       Date:  2008-01-03       Impact factor: 1.758

6.  A basic limitation on inferring phylogenies by pairwise sequence comparisons.

Authors:  Mike Steel
Journal:  J Theor Biol       Date:  2008-10-22       Impact factor: 2.691

7.  Rate-variation need not defeat phylogenetic inference through pairwise sequence comparisons.

Authors:  Jihua Wu; Edward Susko
Journal:  J Theor Biol       Date:  2010-01-04       Impact factor: 2.691

8.  Phylogenetic methods come of age: testing hypotheses in an evolutionary context.

Authors:  J P Huelsenbeck; B Rannala
Journal:  Science       Date:  1997-04-11       Impact factor: 47.728

9.  Reconstructing trees when sequence sites evolve at variable rates.

Authors:  M A Steel; L A Székely; M D Hendy
Journal:  J Comput Biol       Date:  1994       Impact factor: 1.479

10.  Full reconstruction of Markov models on evolutionary trees: identifiability and consistency.

Authors:  J T Chang
Journal:  Math Biosci       Date:  1996-10-01       Impact factor: 2.144

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.