Literature DB >> 18852102

The genetic code can cause systematic bias in simple phylogenetic models.

Simon Whelan1.   

Abstract

Phylogenetic analysis depends on inferential methodology estimating accurately the degree of divergence between sequences. Inaccurate estimates can lead to misleading evolutionary inferences, including incorrect tree topology estimates and poor dating of historical species divergence. Protein coding sequences are ubiquitous in phylogenetic inference, but many of the standard methods commonly used to describe their evolution do not explicitly account for the dependencies between sites in a codon induced by the genetic code. This study evaluates the performance of several standard methods on datasets simulated under a simple substitution model, describing codon evolution under a range of different types of selective pressures. This approach also offers insights into the relative performance of different phylogenetic methods when there are dependencies acting between the sites in the data. Methods based on statistical models performed well when there was no or limited purifying selection in the simulated sequences (low degree of dependency between sites in a codon), although more biologically realistic models tended to outperform simpler models. Phylogenetic methods exhibited greater variability in performance for sequences simulated under strong purifying selection (high degree of the dependencies between sites in a codon). Simple models substantially underestimate the degree of divergence between sequences, and underestimation was more pronounced on the internal branches of the tree. This underestimation resulted in some statistical methods performing poorly and exhibiting evidence for systematic bias in tree inference. Amino acid-based and nucleotide models that contained generic descriptions of spatial and temporal heterogeneity, such as mixture and temporal hidden Markov models, coped notably better, producing more accurate estimates of evolutionary divergence and the tree topology.

Mesh:

Substances:

Year:  2008        PMID: 18852102      PMCID: PMC2607415          DOI: 10.1098/rstb.2008.0171

Source DB:  PubMed          Journal:  Philos Trans R Soc Lond B Biol Sci        ISSN: 0962-8436            Impact factor:   6.237


  24 in total

1.  Codon-substitution models for heterogeneous selection pressure at amino acid sites.

Authors:  Z Yang; R Nielsen; N Goldman; A M Pedersen
Journal:  Genetics       Date:  2000-05       Impact factor: 4.562

2.  Parsimony, likelihood, and the role of models in molecular phylogenetics.

Authors:  M Steel; D Penny
Journal:  Mol Biol Evol       Date:  2000-06       Impact factor: 16.240

3.  A kingdom-level phylogeny of eukaryotes based on combined protein data.

Authors:  S L Baldauf; A J Roger; I Wenk-Siefert; W F Doolittle
Journal:  Science       Date:  2000-11-03       Impact factor: 47.728

4.  Is the Felsenstein zone a fly trap?

Authors:  J P Huelsenbeck
Journal:  Syst Biol       Date:  1997-03       Impact factor: 15.683

5.  On the consistency of maximum likelihood estimation of phylogenetic trees from nucleotide sequences.

Authors:  J S Rogers
Journal:  Syst Biol       Date:  1997-06       Impact factor: 15.683

Review 6.  Molecular phylogenetics: state-of-the-art methods for looking into the past.

Authors:  S Whelan; P Liò; N Goldman
Journal:  Trends Genet       Date:  2001-05       Impact factor: 11.639

7.  Exploring among-site rate variation models in a maximum likelihood framework using empirical data: effects of model assumptions on estimates of topology, branch lengths, and bootstrap support.

Authors:  T R Buckley; C Simon; G K Chambers
Journal:  Syst Biol       Date:  2001-02       Impact factor: 15.683

8.  Performance of maximum parsimony and likelihood phylogenetics when evolution is heterogeneous.

Authors:  Bryan Kolaczkowski; Joseph W Thornton
Journal:  Nature       Date:  2004-10-21       Impact factor: 49.962

Review 9.  AIDS as a zoonosis: scientific and public health implications.

Authors:  B H Hahn; G M Shaw; K M De Cock; P M Sharp
Journal:  Science       Date:  2000-01-28       Impact factor: 47.728

10.  Inferring nonneutral evolution from human-chimp-mouse orthologous gene trios.

Authors:  Andrew G Clark; Stephen Glanowski; Rasmus Nielsen; Paul D Thomas; Anish Kejariwal; Melissa A Todd; David M Tanenbaum; Daniel Civello; Fu Lu; Brian Murphy; Steve Ferriera; Gary Wang; Xianqgun Zheng; Thomas J White; John J Sninsky; Mark D Adams; Michele Cargill
Journal:  Science       Date:  2003-12-12       Impact factor: 47.728

View more
  6 in total

1.  Pervasive cryptic epistasis in molecular evolution.

Authors:  Mark Lunzer; G Brian Golding; Antony M Dean
Journal:  PLoS Genet       Date:  2010-10-21       Impact factor: 5.917

2.  A duplicate gene rooting of seed plants and the phylogenetic position of flowering plants.

Authors:  Sarah Mathews; Mark D Clements; Mark A Beilstein
Journal:  Philos Trans R Soc Lond B Biol Sci       Date:  2010-02-12       Impact factor: 6.237

Review 3.  Shifts in amino acid preferences as proteins evolve: A synthesis of experimental and theoretical work.

Authors:  Noor Youssef; Edward Susko; Andrew J Roger; Joseph P Bielawski
Journal:  Protein Sci       Date:  2021-08-12       Impact factor: 6.993

4.  Introduction. Statistical and computational challenges in molecular phylogenetics and evolution.

Authors:  Nick Goldman; Ziheng Yang
Journal:  Philos Trans R Soc Lond B Biol Sci       Date:  2008-12-27       Impact factor: 6.237

5.  Assessing the state of substitution models describing noncoding RNA evolution.

Authors:  James E Allen; Simon Whelan
Journal:  Genome Biol Evol       Date:  2014-01       Impact factor: 3.416

6.  A generalized mechanistic codon model.

Authors:  Maryam Zaheri; Linda Dib; Nicolas Salamin
Journal:  Mol Biol Evol       Date:  2014-06-23       Impact factor: 16.240

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.