Literature DB >> 27535929

A Comparison of One-Rate and Two-Rate Inference Frameworks for Site-Specific dN/dS Estimation.

Stephanie J Spielman1, Suyang Wan2, Claus O Wilke3.   

Abstract

Two broad paradigms exist for inferring [Formula: see text] the ratio of nonsynonymous to synonymous substitution rates, from coding sequences: (i) a one-rate approach, where [Formula: see text] is represented with a single parameter, or (ii) a two-rate approach, where [Formula: see text] and [Formula: see text] are estimated separately. The performances of these two approaches have been well studied in the specific context of proper model specification, i.e., when the inference model matches the simulation model. By contrast, the relative performances of one-rate vs. two-rate parameterizations when applied to data generated according to a different mechanism remain unclear. Here, we compare the relative merits of one-rate and two-rate approaches in the specific context of model misspecification by simulating alignments with mutation-selection models rather than with [Formula: see text]-based models. We find that one-rate frameworks generally infer more accurate [Formula: see text] point estimates, even when [Formula: see text] varies among sites. In other words, modeling [Formula: see text] variation may substantially reduce accuracy of [Formula: see text] point estimates. These results appear to depend on the selective constraint operating at a given site. For sites under strong purifying selection ([Formula: see text]), one-rate and two-rate models show comparable performances. However, one-rate models significantly outperform two-rate models for sites under moderate-to-weak purifying selection. We attribute this distinction to the fact that, for these more quickly evolving sites, a given substitution is more likely to be nonsynonymous than synonymous. The data will therefore be relatively enriched for nonsynonymous changes, and modeling [Formula: see text] contributes excessive noise to [Formula: see text] estimates. We additionally find that high levels of divergence among sequences, rather than the number of sequences in the alignment, are more critical for obtaining precise point estimates.
Copyright © 2016 by the Genetics Society of America.

Keywords:  dN/dS; evolutionary rate; molecular evolution; mutation–selection models; sequence simulation

Mesh:

Year:  2016        PMID: 27535929      PMCID: PMC5068842          DOI: 10.1534/genetics.115.185264

Source DB:  PubMed          Journal:  Genetics        ISSN: 0016-6731            Impact factor:   4.562


  40 in total

1.  Codon-substitution models for heterogeneous selection pressure at amino acid sites.

Authors:  Z Yang; R Nielsen; N Goldman; A M Pedersen
Journal:  Genetics       Date:  2000-05       Impact factor: 4.562

2.  Codon-substitution models to detect adaptive evolution that account for heterogeneous selective pressures among site classes.

Authors:  Ziheng Yang; Willie J Swanson
Journal:  Mol Biol Evol       Date:  2002-01       Impact factor: 16.240

3.  The relationship between dN/dS and scaled selection coefficients.

Authors:  Stephanie J Spielman; Claus O Wilke
Journal:  Mol Biol Evol       Date:  2015-01-08       Impact factor: 16.240

4.  FUBAR: a fast, unconstrained bayesian approximation for inferring selection.

Authors:  Ben Murrell; Sasha Moola; Amandla Mabona; Thomas Weighill; Daniel Sheward; Sergei L Kosakovsky Pond; Konrad Scheffler
Journal:  Mol Biol Evol       Date:  2013-02-18       Impact factor: 16.240

5.  A likelihood approach for comparing synonymous and nonsynonymous nucleotide substitution rates, with application to the chloroplast genome.

Authors:  S V Muse; B S Gaut
Journal:  Mol Biol Evol       Date:  1994-09       Impact factor: 16.240

6.  A codon-based model of nucleotide substitution for protein-coding DNA sequences.

Authors:  N Goldman; Z Yang
Journal:  Mol Biol Evol       Date:  1994-09       Impact factor: 16.240

7.  A counting renaissance: combining stochastic mapping and empirical Bayes to quickly detect amino acid sites under positive selection.

Authors:  Philippe Lemey; Vladimir N Minin; Filip Bielejec; Sergei L Kosakovsky Pond; Marc A Suchard
Journal:  Bioinformatics       Date:  2012-10-12       Impact factor: 6.937

8.  Geometric Constraints Dominate the Antigenic Evolution of Influenza H3N2 Hemagglutinin.

Authors:  Austin G Meyer; Claus O Wilke
Journal:  PLoS Pathog       Date:  2015-05-28       Impact factor: 6.823

9.  On the need for mechanistic models in computational genomics and metagenomics.

Authors:  David A Liberles; Ashley I Teufel; Liang Liu; Tanja Stadler
Journal:  Genome Biol Evol       Date:  2013       Impact factor: 3.416

10.  Dual host-virus arms races shape an essential housekeeping protein.

Authors:  Ann Demogines; Jonathan Abraham; Hyeryun Choe; Michael Farzan; Sara L Sawyer
Journal:  PLoS Biol       Date:  2013-05-28       Impact factor: 8.029

View more
  12 in total

1.  Extensively Parameterized Mutation-Selection Models Reliably Capture Site-Specific Selective Constraint.

Authors:  Stephanie J Spielman; Claus O Wilke
Journal:  Mol Biol Evol       Date:  2016-08-10       Impact factor: 16.240

2.  Calculating site-specific evolutionary rates at the amino-acid or codon level yields similar rate estimates.

Authors:  Dariya K Sydykova; Claus O Wilke
Journal:  PeerJ       Date:  2017-05-30       Impact factor: 2.984

3.  Measuring evolutionary rates of proteins in a structural context.

Authors:  Dariya K Sydykova; Benjamin R Jack; Stephanie J Spielman; Claus O Wilke
Journal:  F1000Res       Date:  2017-10-16

4.  Sequence amplification via cell passaging creates spurious signals of positive adaptation in influenza virus H3N2 hemagglutinin.

Authors:  Claire D McWhite; Austin G Meyer; Claus O Wilke
Journal:  Virus Evol       Date:  2016-10-03

Review 5.  Snake Genome Sequencing: Results and Future Prospects.

Authors:  Harald M I Kerkkamp; R Manjunatha Kini; Alexey S Pospelov; Freek J Vonk; Christiaan V Henkel; Michael K Richardson
Journal:  Toxins (Basel)       Date:  2016-12-01       Impact factor: 4.546

6.  Identification of positive selection in genes is greatly improved by using experimentally informed site-specific models.

Authors:  Jesse D Bloom
Journal:  Biol Direct       Date:  2017-01-17       Impact factor: 4.540

7.  Relative evolutionary rate inference in HyPhy with LEISR.

Authors:  Stephanie J Spielman; Sergei L Kosakovsky Pond
Journal:  PeerJ       Date:  2018-02-05       Impact factor: 2.984

8.  Large-Scale Comparative Analysis of Codon Models Accounting for Protein and Nucleotide Selection.

Authors:  Iakov I Davydov; Nicolas Salamin; Marc Robinson-Rechavi
Journal:  Mol Biol Evol       Date:  2019-06-01       Impact factor: 16.240

9.  Relative Model Fit Does Not Predict Topological Accuracy in Single-Gene Protein Phylogenetics.

Authors:  Stephanie J Spielman
Journal:  Mol Biol Evol       Date:  2020-07-01       Impact factor: 16.240

10.  Improved inference of site-specific positive selection under a generalized parametric codon model when there are multinucleotide mutations and multiple nonsynonymous rates.

Authors:  Katherine A Dunn; Toby Kenney; Hong Gu; Joseph P Bielawski
Journal:  BMC Evol Biol       Date:  2019-01-14       Impact factor: 3.260

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.