Literature DB >> 18852108

Evaluating the robustness of phylogenetic methods to among-site variability in substitution processes.

Mark T Holder1, Derrick J Zwickl, Christophe Dessimoz.   

Abstract

Computer simulations provide a flexible method for assessing the power and robustness of phylogenetic inference methods. Unfortunately, simulated data are often obviously atypical of data encountered in studies of molecular evolution. Unrealistic simulations can lead to conclusions that are irrelevant to real-data analyses or can provide a biased view of which methods perform well. Here, we present a software tool designed to generate data under a complex codon model that allows each residue in the protein sequence to have a different set of equilibrium amino acid frequencies. The software can obtain maximum-likelihood estimates of the parameters of the Halpern and Bruno model from empirical data and a fixed tree; given an arbitrary tree and a fixed set of parameters, the software can then simulate artificial datasets.We present the results of a simulation experiment using randomly generated tree shapes and substitution parameters estimated from 1610 mammalian cytochrome b sequences.We tested tree inference at the amino acid, nucleotide and codon levels and under parsimony, maximum-likelihood, Bayesian and distance criteria (for a total of more than 650 analyses on each dataset). Based on these simulations, nucleotide-level analyses seem to be more accurate than amino acid and codon analyses. The performance of distance-based phylogenetic methods appears to be quite sensitive to the choice of model and the form of rate heterogeneity used. Further studies are needed to assess the generality of these conclusions. For example, fitting parameters of the Halpern Bruno model to sequences from other genes will reveal the extent to which our conclusions were influenced by the choice of cytochrome b. Incorporating codon bias and more sources heterogeneity into the simulator will be crucial to determining whether the current results are caused by a bias in the current simulation study in favour of nucleotide analyses.

Entities:  

Mesh:

Substances:

Year:  2008        PMID: 18852108      PMCID: PMC2607409          DOI: 10.1098/rstb.2008.0162

Source DB:  PubMed          Journal:  Philos Trans R Soc Lond B Biol Sci        ISSN: 0962-8436            Impact factor:   6.237


  32 in total

1.  Efficient biased estimation of evolutionary distances when substitution rates vary across sites.

Authors:  Stéphane Guindon; Olivier Gascuel
Journal:  Mol Biol Evol       Date:  2002-04       Impact factor: 16.240

2.  Theoretical foundation of the balanced minimum evolution method of phylogenetic inference and its relationship to weighted least-squares tree fitting.

Authors:  Richard Desper; Olivier Gascuel
Journal:  Mol Biol Evol       Date:  2003-12-23       Impact factor: 16.240

3.  Can weighting improve bushy trees? Models of cytochrome b evolution and the molecular systematics of pipits and wagtails (Aves: Motacillidae).

Authors:  G Voelker; S V Edwards
Journal:  Syst Biol       Date:  1998-12       Impact factor: 15.683

4.  A Bayesian mixture model for across-site heterogeneities in the amino-acid replacement process.

Authors:  Nicolas Lartillot; Hervé Philippe
Journal:  Mol Biol Evol       Date:  2004-03-10       Impact factor: 16.240

5.  Exhaustive matching of the entire protein sequence database.

Authors:  G H Gonnet; M A Cohen; S A Benner
Journal:  Science       Date:  1992-06-05       Impact factor: 47.728

6.  The neighbor-joining method: a new method for reconstructing phylogenetic trees.

Authors:  N Saitou; M Nei
Journal:  Mol Biol Evol       Date:  1987-07       Impact factor: 16.240

Review 7.  Construction of phylogenetic trees.

Authors:  W M Fitch; E Margoliash
Journal:  Science       Date:  1967-01-20       Impact factor: 47.728

8.  A simple method for estimating evolutionary rates of base substitutions through comparative studies of nucleotide sequences.

Authors:  M Kimura
Journal:  J Mol Evol       Date:  1980-12       Impact factor: 2.395

9.  A new method for calculating evolutionary substitution rates.

Authors:  C Lanave; G Preparata; C Saccone; G Serio
Journal:  J Mol Evol       Date:  1984       Impact factor: 2.395

10.  Dating of the human-ape splitting by a molecular clock of mitochondrial DNA.

Authors:  M Hasegawa; H Kishino; T Yano
Journal:  J Mol Evol       Date:  1985       Impact factor: 2.395

View more
  22 in total

1.  How to calculate the non-synonymous to synonymous rate ratio of protein-coding genes under the Fisher-Wright mutation-selection framework.

Authors:  Mario Dos Reis
Journal:  Biol Lett       Date:  2015-04       Impact factor: 3.703

2.  Mutation-selection models of coding sequence evolution with site-heterogeneous amino acid fitness profiles.

Authors:  Nicolas Rodrigue; Hervé Philippe; Nicolas Lartillot
Journal:  Proc Natl Acad Sci U S A       Date:  2010-02-22       Impact factor: 11.205

3.  The relationship between dN/dS and scaled selection coefficients.

Authors:  Stephanie J Spielman; Claus O Wilke
Journal:  Mol Biol Evol       Date:  2015-01-08       Impact factor: 16.240

4.  Using Phylogenetic Analysis to Investigate Eukaryotic Gene Origin.

Authors:  Dechun Zhang; Xianzhao Kan; Sarah Elizabeth Huss; Lan Jiang; Li-Qing Chen; Yibing Hu
Journal:  J Vis Exp       Date:  2018-08-14       Impact factor: 1.355

5.  On the statistical interpretation of site-specific variables in phylogeny-based substitution models.

Authors:  Nicolas Rodrigue
Journal:  Genetics       Date:  2012-12-05       Impact factor: 4.562

6.  A penalized-likelihood method to estimate the distribution of selection coefficients from phylogenetic data.

Authors:  Asif U Tamuri; Nick Goldman; Mario dos Reis
Journal:  Genetics       Date:  2014-02-14       Impact factor: 4.562

7.  A Comparison of One-Rate and Two-Rate Inference Frameworks for Site-Specific dN/dS Estimation.

Authors:  Stephanie J Spielman; Suyang Wan; Claus O Wilke
Journal:  Genetics       Date:  2016-08-17       Impact factor: 4.562

8.  Triallelic Population Genomics for Inferring Correlated Fitness Effects of Same Site Nonsynonymous Mutations.

Authors:  Aaron P Ragsdale; Alec J Coffman; PingHsun Hsieh; Travis J Struck; Ryan N Gutenkunst
Journal:  Genetics       Date:  2016-03-30       Impact factor: 4.562

9.  Arthropod relationships revealed by phylogenomic analysis of nuclear protein-coding sequences.

Authors:  Jerome C Regier; Jeffrey W Shultz; Andreas Zwick; April Hussey; Bernard Ball; Regina Wetzer; Joel W Martin; Clifford W Cunningham
Journal:  Nature       Date:  2010-02-10       Impact factor: 49.962

10.  Data mining approach identifies research priorities and data requirements for resolving the red algal tree of life.

Authors:  Heroen Verbruggen; Christine A Maggs; Gary W Saunders; Line Le Gall; Hwan Su Yoon; Olivier De Clerck
Journal:  BMC Evol Biol       Date:  2010-01-20       Impact factor: 3.260

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.