Literature DB >> 25576365

The relationship between dN/dS and scaled selection coefficients.

Stephanie J Spielman1, Claus O Wilke2.   

Abstract

Numerous computational methods exist to assess the mode and strength of natural selection in protein-coding sequences, yet how distinct methods relate to one another remains largely unknown. Here, we elucidate the relationship between two widely used phylogenetic modeling frameworks: dN/dS models and mutation-selection (MutSel) models. We derive a mathematical relationship between dN/dS and scaled selection coefficients, the focal parameters of MutSel models, and use this relationship to gain deeper insight into the behaviors, limitations, and applicabilities of these two modeling frameworks. We prove that, if all synonymous changes are neutral, standard MutSel models correspond to dN/dS ≤ 1. However, if synonymous codons differ in fitness, dN/dS can take on arbitrarily high values even if all selection is purifying. Thus, the MutSel modeling framework cannot necessarily accommodate positive, diversifying selection, while dN/dS cannot distinguish between purifying selection on synonymous codons and positive selection on amino acids. We further propose a new benchmarking strategy of dN/dS inferences against MutSel simulations and demonstrate that the widely used Goldman-Yang-style dN/dS models yield substantially biased dN/dS estimates on realistic sequence data. In contrast, the less frequently used Muse-Gaut-style models display much less bias. Strikingly, the least-biased and most precise dN/dS estimates are never found in the models with the best fit to the data, measured through both AIC and BIC scores. Thus, selecting models based on goodness-of-fit criteria can yield poor parameter estimates if the models considered do not precisely correspond to the underlying mechanism that generated the data. In conclusion, establishing mathematical links among modeling frameworks represents a novel, powerful strategy to pinpoint previously unrecognized model limitations and strengths.
© The Author 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

Entities:  

Keywords:  Markov models of sequence evolution; dN/dS; mutation-selection models; protein evolution; scaled selection coefficients

Mesh:

Substances:

Year:  2015        PMID: 25576365      PMCID: PMC4379412          DOI: 10.1093/molbev/msv003

Source DB:  PubMed          Journal:  Mol Biol Evol        ISSN: 0737-4038            Impact factor:   16.240


  73 in total

1.  Predicting the evolution of human influenza A.

Authors:  R M Bush; C A Bender; K Subbarao; N J Cox; W M Fitch
Journal:  Science       Date:  1999-12-03       Impact factor: 47.728

2.  On the probability of fixation of mutant genes in a population.

Authors:  M KIMURA
Journal:  Genetics       Date:  1962-06       Impact factor: 4.562

3.  Not so different after all: a comparison of methods for detecting amino acid sites under selection.

Authors:  Sergei L Kosakovsky Pond; Simon D W Frost
Journal:  Mol Biol Evol       Date:  2005-02-09       Impact factor: 16.240

4.  A genetic algorithm approach to detecting lineage-specific variation in selection pressure.

Authors:  Sergei L Kosakovsky Pond; Simon D W Frost
Journal:  Mol Biol Evol       Date:  2004-10-27       Impact factor: 16.240

5.  HyPhy: hypothesis testing using phylogenies.

Authors:  Sergei L Kosakovsky Pond; Simon D W Frost; Spencer V Muse
Journal:  Bioinformatics       Date:  2004-10-27       Impact factor: 6.937

6.  The application of statistical physics to evolutionary biology.

Authors:  Guy Sella; Aaron E Hirsh
Journal:  Proc Natl Acad Sci U S A       Date:  2005-06-24       Impact factor: 11.205

7.  Mutational and fitness landscapes of an RNA virus revealed through population sequencing.

Authors:  Ashley Acevedo; Leonid Brodsky; Raul Andino
Journal:  Nature       Date:  2013-11-27       Impact factor: 49.962

8.  Evidence for selection on synonymous mutations affecting stability of mRNA secondary structure in mammals.

Authors:  J V Chamary; Laurence D Hurst
Journal:  Genome Biol       Date:  2005-08-16       Impact factor: 13.583

9.  Site-heterogeneous mutation-selection models within the PhyloBayes-MPI package.

Authors:  Nicolas Rodrigue; Nicolas Lartillot
Journal:  Bioinformatics       Date:  2013-12-18       Impact factor: 6.937

10.  Why time matters: codon evolution and the temporal dynamics of dN/dS.

Authors:  Carina F Mugal; Jochen B W Wolf; Ingemar Kaj
Journal:  Mol Biol Evol       Date:  2013-10-14       Impact factor: 16.240

View more
  40 in total

1.  How to calculate the non-synonymous to synonymous rate ratio of protein-coding genes under the Fisher-Wright mutation-selection framework.

Authors:  Mario Dos Reis
Journal:  Biol Lett       Date:  2015-04       Impact factor: 3.703

2.  Selective Advantages of a Parasexual Cycle for the Yeast Candida albicans.

Authors:  Ningxin Zhang; Beatrice B Magee; Paul T Magee; Barbara R Holland; Ely Rodrigues; Ann R Holmes; Richard D Cannon; Jan Schmid
Journal:  Genetics       Date:  2015-06-10       Impact factor: 4.562

3.  A Comparison of One-Rate and Two-Rate Inference Frameworks for Site-Specific dN/dS Estimation.

Authors:  Stephanie J Spielman; Suyang Wan; Claus O Wilke
Journal:  Genetics       Date:  2016-08-17       Impact factor: 4.562

4.  Extensively Parameterized Mutation-Selection Models Reliably Capture Site-Specific Selective Constraint.

Authors:  Stephanie J Spielman; Claus O Wilke
Journal:  Mol Biol Evol       Date:  2016-08-10       Impact factor: 16.240

5.  Site-Specific Amino Acid Distributions Follow a Universal Shape.

Authors:  Mackenzie M Johnson; Claus O Wilke
Journal:  J Mol Evol       Date:  2020-11-24       Impact factor: 2.395

6.  Calculating site-specific evolutionary rates at the amino-acid or codon level yields similar rate estimates.

Authors:  Dariya K Sydykova; Claus O Wilke
Journal:  PeerJ       Date:  2017-05-30       Impact factor: 2.984

7.  Intragenomic variation in non-adaptive nucleotide biases causes underestimation of selection on synonymous codon usage.

Authors:  Alexander L Cope; Premal Shah
Journal:  PLoS Genet       Date:  2022-06-17       Impact factor: 6.020

8.  Intermediate divergence levels maximize the strength of structure-sequence correlations in enzymes and viral proteins.

Authors:  Eleisha L Jackson; Amir Shahmoradi; Stephanie J Spielman; Benjamin R Jack; Claus O Wilke
Journal:  Protein Sci       Date:  2016-03-24       Impact factor: 6.725

Review 9.  Causes of evolutionary rate variation among protein sites.

Authors:  Julian Echave; Stephanie J Spielman; Claus O Wilke
Journal:  Nat Rev Genet       Date:  2016-01-19       Impact factor: 53.242

Review 10.  Shifts in amino acid preferences as proteins evolve: A synthesis of experimental and theoretical work.

Authors:  Noor Youssef; Edward Susko; Andrew J Roger; Joseph P Bielawski
Journal:  Protein Sci       Date:  2021-08-12       Impact factor: 6.993

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.