Literature DB >> 23064000

A counting renaissance: combining stochastic mapping and empirical Bayes to quickly detect amino acid sites under positive selection.

Philippe Lemey1, Vladimir N Minin, Filip Bielejec, Sergei L Kosakovsky Pond, Marc A Suchard.   

Abstract

MOTIVATION: Statistical methods for comparing relative rates of synonymous and non-synonymous substitutions maintain a central role in detecting positive selection. To identify selection, researchers often estimate the ratio of these relative rates (dN/dS) at individual alignment sites. Fitting a codon substitution model that captures heterogeneity in dN/dS across sites provides a reliable way to perform such estimation, but it remains computationally prohibitive for massive datasets. By using crude estimates of the numbers of synonymous and non-synonymous substitutions at each site, counting approaches scale well to large datasets, but they fail to account for ancestral state reconstruction uncertainty and to provide site-specific dN/dS estimates.
RESULTS: We propose a hybrid solution that borrows the computational strength of counting methods, but augments these methods with empirical Bayes modeling to produce a relatively fast and reliable method capable of estimating site-specific dN/dS values in large datasets. Importantly, our hybrid approach, set in a Bayesian framework, integrates over the posterior distribution of phylogenies and ancestral reconstructions to quantify uncertainty about site-specific dN/dS estimates. Simulations demonstrate that this method competes well with more-principled statistical procedures and, in some cases, even outperforms them. We illustrate the utility of our method using human immunodeficiency virus, feline panleukopenia and canine parvovirus evolution examples.

Entities:  

Mesh:

Substances:

Year:  2012        PMID: 23064000      PMCID: PMC3579240          DOI: 10.1093/bioinformatics/bts580

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  45 in total

1.  Bayesian estimation of positively selected sites.

Authors:  John P Huelsenbeck; Kelly A Dyer
Journal:  J Mol Evol       Date:  2004-06       Impact factor: 2.395

2.  Bayes empirical bayes inference of amino acid sites under positive selection.

Authors:  Ziheng Yang; Wendy S W Wong; Rasmus Nielsen
Journal:  Mol Biol Evol       Date:  2005-02-02       Impact factor: 16.240

3.  Many-core algorithms for statistical phylogenetics.

Authors:  Marc A Suchard; Andrew Rambaut
Journal:  Bioinformatics       Date:  2009-04-15       Impact factor: 6.937

4.  Learning to count: robust estimates for labeled distances between molecular sequences.

Authors:  John D O'Brien; Vladimir N Minin; Marc A Suchard
Journal:  Mol Biol Evol       Date:  2009-01-08       Impact factor: 16.240

5.  Fast, accurate and simulation-free stochastic mapping.

Authors:  Vladimir N Minin; Marc A Suchard
Journal:  Philos Trans R Soc Lond B Biol Sci       Date:  2008-12-27       Impact factor: 6.237

6.  Likelihood models for detecting positively selected amino acid sites and applications to the HIV-1 envelope gene.

Authors:  R Nielsen; Z Yang
Journal:  Genetics       Date:  1998-03       Impact factor: 4.562

7.  A codon-based model of nucleotide substitution for protein-coding DNA sequences.

Authors:  N Goldman; Z Yang
Journal:  Mol Biol Evol       Date:  1994-09       Impact factor: 16.240

8.  Variants other than aspartic acid at codon 69 of the human immunodeficiency virus type 1 reverse transcriptase gene affect susceptibility to nuleoside analogs.

Authors:  M A Winters; T C Merigan
Journal:  Antimicrob Agents Chemother       Date:  2001-08       Impact factor: 5.191

9.  Identification of accessory mutations associated with high-level resistance in HIV-1 reverse transcriptase.

Authors:  Patricia A Cane; Hannah Green; Esther Fearnhill; David Dunn
Journal:  AIDS       Date:  2007-02-19       Impact factor: 4.177

10.  BEAGLE: an application programming interface and high-performance computing library for statistical phylogenetics.

Authors:  Daniel L Ayres; Aaron Darling; Derrick J Zwickl; Peter Beerli; Mark T Holder; Paul O Lewis; John P Huelsenbeck; Fredrik Ronquist; David L Swofford; Michael P Cummings; Andrew Rambaut; Marc A Suchard
Journal:  Syst Biol       Date:  2011-10-01       Impact factor: 15.683

View more
  42 in total

1.  Positive Selection in CD8+ T-Cell Epitopes of Influenza Virus Nucleoprotein Revealed by a Comparative Analysis of Human and Swine Viral Lineages.

Authors:  Heather M Machkovech; Trevor Bedford; Marc A Suchard; Jesse D Bloom
Journal:  J Virol       Date:  2015-08-26       Impact factor: 5.103

2.  The comparative genomics of human respiratory syncytial virus subgroups A and B: genetic variability and molecular evolutionary dynamics.

Authors:  Lydia Tan; Frank E J Coenjaerts; Lieselot Houspie; Marco C Viveen; Grada M van Bleek; Emmanuel J H J Wiertz; Darren P Martin; Philippe Lemey
Journal:  J Virol       Date:  2013-05-22       Impact factor: 5.103

3.  Calculating Higher-Order Moments of Phylogenetic Stochastic Mapping Summaries in Linear Time.

Authors:  Amrit Dhar; Vladimir N Minin
Journal:  J Comput Biol       Date:  2017-02-08       Impact factor: 1.479

4.  A Comparison of One-Rate and Two-Rate Inference Frameworks for Site-Specific dN/dS Estimation.

Authors:  Stephanie J Spielman; Suyang Wan; Claus O Wilke
Journal:  Genetics       Date:  2016-08-17       Impact factor: 4.562

5.  Calculating site-specific evolutionary rates at the amino-acid or codon level yields similar rate estimates.

Authors:  Dariya K Sydykova; Claus O Wilke
Journal:  PeerJ       Date:  2017-05-30       Impact factor: 2.984

6.  Intrinsic disorder mediates hepatitis C virus core-host cell protein interactions.

Authors:  Patrick T Dolan; Andrew P Roth; Bin Xue; Ren Sun; A Keith Dunker; Vladimir N Uversky; Douglas J LaCount
Journal:  Protein Sci       Date:  2014-12-31       Impact factor: 6.725

7.  Testing for the Occurrence of Selective Episodes During the Divergence of Otophysan Fishes: Insights from Mitogenomics.

Authors:  Alejandro D'Anatro; Facundo Giorello; Matías Feijoo; Enrique P Lessa
Journal:  J Mol Evol       Date:  2017-04-04       Impact factor: 2.395

8.  A Darwinian Uncertainty Principle.

Authors:  Olivier Gascuel; Mike Steel
Journal:  Syst Biol       Date:  2020-05-01       Impact factor: 15.683

9.  The emergence and evolution of influenza A (H1α) viruses in swine in Canada and the United States.

Authors:  Martha I Nelson; Marie R Culhane; Nídia S Trovão; Devi P Patnayak; Rebecca A Halpin; Xudong Lin; Meghan H Shilts; Suman R Das; Susan E Detmer
Journal:  J Gen Virol       Date:  2017-10-23       Impact factor: 3.891

Review 10.  Causes of evolutionary rate variation among protein sites.

Authors:  Julian Echave; Stephanie J Spielman; Claus O Wilke
Journal:  Nat Rev Genet       Date:  2016-01-19       Impact factor: 53.242

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.