Literature DB >> 29186593

Complex Models of Sequence Evolution Require Accurate Estimators as Exemplified with the Invariable Site Plus Gamma Model.

Lam-Tung Nguyen1, Arndt von Haeseler1,2, Bui Quang Minh1.   

Abstract

The invariable site plus $\Gamma$ model (I$+\Gamma)$ is widely used to model rate heterogeneity among alignment sites in maximum likelihood and Bayesian phylogenetic analyses. The proof that the I$+$ continuous $\Gamma$ model is identifiable (model parameters can be inferred correctly given enough data) has increased the creditability of its application to phylogeny reconstruction. However, most phylogenetic software implement the I$+$ discrete $\Gamma$ model, whose identifiability is likely but unproven. How well the parameters of the I$+$ discrete $\Gamma$ model are estimated is still disputed. Especially the correlation between the fraction of invariable sites and the fractions of sites with a slow evolutionary rate is discussed as being problematic. We show that optimization heuristics as implemented in frequently used phylogenetic software (PhyML, RAxML, IQ-TREE, and MrBayes) cannot always reliably estimate the shape parameter, the proportion of invariable sites, and the tree length. Here, we propose an improved optimization heuristic that accurately estimates the three parameters. While research efforts mainly focus on tree search methods, our results signify the equal importance of verifying and developing effective estimation methods for complex models of sequence evolution.

Entities:  

Mesh:

Year:  2018        PMID: 29186593      PMCID: PMC6204645          DOI: 10.1093/sysbio/syx092

Source DB:  PubMed          Journal:  Syst Biol        ISSN: 1063-5157            Impact factor:   15.683


  15 in total

1.  Maximum likelihood estimation of phylogenetic trees is consistent when substitution rates vary according to the invariable sites plus gamma distribution.

Authors:  J S Rogers
Journal:  Syst Biol       Date:  2001 Sep-Oct       Impact factor: 15.683

2.  A gamma mixture model better accounts for among site rate heterogeneity.

Authors:  Itay Mayrose; Nir Friedman; Tal Pupko
Journal:  Bioinformatics       Date:  2005-09-01       Impact factor: 6.937

3.  MODELTEST: testing the model of DNA substitution.

Authors:  D Posada; K A Crandall
Journal:  Bioinformatics       Date:  1998       Impact factor: 6.937

4.  Seq-Gen: an application for the Monte Carlo simulation of DNA sequence evolution along phylogenetic trees.

Authors:  A Rambaut; N C Grassly
Journal:  Comput Appl Biosci       Date:  1997-06

5.  Maximum likelihood estimation of the heterogeneity of substitution rate among nucleotide sites.

Authors:  X Gu; Y X Fu; W H Li
Journal:  Mol Biol Evol       Date:  1995-07       Impact factor: 16.240

6.  A simple method for estimating evolutionary rates of base substitutions through comparative studies of nucleotide sequences.

Authors:  M Kimura
Journal:  J Mol Evol       Date:  1980-12       Impact factor: 2.395

7.  BEAGLE: an application programming interface and high-performance computing library for statistical phylogenetics.

Authors:  Daniel L Ayres; Aaron Darling; Derrick J Zwickl; Peter Beerli; Mark T Holder; Paul O Lewis; John P Huelsenbeck; Fredrik Ronquist; David L Swofford; Michael P Cummings; Andrew Rambaut; Marc A Suchard
Journal:  Syst Biol       Date:  2011-10-01       Impact factor: 15.683

8.  MrBayes 3.2: efficient Bayesian phylogenetic inference and model choice across a large model space.

Authors:  Fredrik Ronquist; Maxim Teslenko; Paul van der Mark; Daniel L Ayres; Aaron Darling; Sebastian Höhna; Bret Larget; Liang Liu; Marc A Suchard; John P Huelsenbeck
Journal:  Syst Biol       Date:  2012-02-22       Impact factor: 15.683

9.  RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies.

Authors:  Alexandros Stamatakis
Journal:  Bioinformatics       Date:  2014-01-21       Impact factor: 6.937

10.  The impact of modelling rate heterogeneity among sites on phylogenetic estimates of intraspecific evolutionary rates and timescales.

Authors:  Fangzhi Jia; Nathan Lo; Simon Y W Ho
Journal:  PLoS One       Date:  2014-05-05       Impact factor: 3.240

View more
  2 in total

1.  Genetic variation in aneuploidy prevalence and tolerance across Saccharomyces cerevisiae lineages.

Authors:  Eduardo F C Scopel; James Hose; Douda Bensasson; Audrey P Gasch
Journal:  Genetics       Date:  2021-04-15       Impact factor: 4.562

2.  A phylogeny for the Drosophila montium species group: A model clade for comparative analyses.

Authors:  William R Conner; Emily K Delaney; Michael J Bronski; Paul S Ginsberg; Timothy B Wheeler; Kelly M Richardson; Brooke Peckenpaugh; Kevin J Kim; Masayoshi Watada; Ary A Hoffmann; Michael B Eisen; Artyom Kopp; Brandon S Cooper; Michael Turelli
Journal:  Mol Phylogenet Evol       Date:  2020-12-31       Impact factor: 4.286

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.