Literature DB >> 22977116

Integrating sequence variation and protein structure to identify sites under selection.

Austin G Meyer1, Claus O Wilke.   

Abstract

We present a novel method to identify sites under selection in protein-coding genes. Our method combines the traditional Goldman-Yang model of coding-sequence evolution with the information obtained from the 3D structure of the evolving protein, specifically the relative solvent accessibility (RSA) of individual residues. We develop a random-effects likelihood sites model in which rate classes are RSA dependent. The RSA dependence is modeled with linear functions. We demonstrate that our RSA-dependent model provides a significantly better fit to molecular sequence data than does a traditional, RSA-independent model. We further show that our model provides a natural, RSA-dependent neutral baseline for the evolutionary rate ratio ω = dN/dS Sites that deviate from this neutral baseline likely experience selection pressure for function. We apply our method to the influenza proteins hemagglutinin and neuraminidase. For hemagglutinin, our method recovers positively selected sites near the sialic acid-binding site and negatively selected sites that may be important for trimerization. For neuraminidase, our method recovers the oseltamivir resistance site and otherwise suggests that few sites deviate from the neutral baseline. Our method is broadly applicable to any protein sequences for which structural data are available or can be obtained via homology modeling or threading.

Entities:  

Mesh:

Substances:

Year:  2012        PMID: 22977116      PMCID: PMC3525147          DOI: 10.1093/molbev/mss217

Source DB:  PubMed          Journal:  Mol Biol Evol        ISSN: 0737-4038            Impact factor:   16.240


  39 in total

1.  Site-to-site variation of synonymous substitution rates.

Authors:  Sergei Kosakovsky Pond; Spencer V Muse
Journal:  Mol Biol Evol       Date:  2005-08-17       Impact factor: 16.240

2.  Assessing site-interdependent phylogenetic models of sequence evolution.

Authors:  Nicolas Rodrigue; Hervé Philippe; Nicolas Lartillot
Journal:  Mol Biol Evol       Date:  2006-06-20       Impact factor: 16.240

3.  Structural determinants of the rate of protein evolution in yeast.

Authors:  Jesse D Bloom; D Allan Drummond; Frances H Arnold; Claus O Wilke
Journal:  Mol Biol Evol       Date:  2006-06-16       Impact factor: 16.240

4.  A Dirichlet process model for detecting positive selection in protein-coding DNA sequences.

Authors:  John P Huelsenbeck; Sonia Jain; Simon W D Frost; Sergei L Kosakovsky Pond
Journal:  Proc Natl Acad Sci U S A       Date:  2006-04-10       Impact factor: 11.205

5.  RAxML-VI-HPC: maximum likelihood-based phylogenetic analyses with thousands of taxa and mixed models.

Authors:  Alexandros Stamatakis
Journal:  Bioinformatics       Date:  2006-08-23       Impact factor: 6.937

6.  Quantifying the impact of protein tertiary structure on molecular evolution.

Authors:  Sang Chul Choi; Asger Hobolth; Douglas M Robinson; Hirohisa Kishino; Jeffrey L Thorne
Journal:  Mol Biol Evol       Date:  2007-05-23       Impact factor: 16.240

7.  Solvent exposure imparts similar selective pressures across a range of yeast proteins.

Authors:  Gavin C Conant; Peter F Stadler
Journal:  Mol Biol Evol       Date:  2009-02-20       Impact factor: 16.240

Review 8.  Models of coding sequence evolution.

Authors:  Wayne Delport; Konrad Scheffler; Cathal Seoighe
Journal:  Brief Bioinform       Date:  2008-10-29       Impact factor: 11.622

9.  Directionality in the evolution of influenza A haemagglutinin.

Authors:  Sergey Kryazhimskiy; Georgii A Bazykin; Joshua B Plotkin; Joshua Plotkin; Jonathan Dushoff
Journal:  Proc Biol Sci       Date:  2008-11-07       Impact factor: 5.349

10.  Methods for selecting fixed-effect models for heterogeneous codon evolution, with comments on their application to gene and genome data.

Authors:  Le Bao; Hong Gu; Katherine A Dunn; Joseph P Bielawski
Journal:  BMC Evol Biol       Date:  2007-02-08       Impact factor: 3.260

View more
  31 in total

1.  The relationship between dN/dS and scaled selection coefficients.

Authors:  Stephanie J Spielman; Claus O Wilke
Journal:  Mol Biol Evol       Date:  2015-01-08       Impact factor: 16.240

2.  Cross-species comparison of site-specific evolutionary-rate variation in influenza haemagglutinin.

Authors:  Austin G Meyer; Eric T Dawson; Claus O Wilke
Journal:  Philos Trans R Soc Lond B Biol Sci       Date:  2013-02-04       Impact factor: 6.237

3.  The site-wise log-likelihood score is a good predictor of genes under positive selection.

Authors:  Huai-Chun Wang; Edward Susko; Andrew J Roger
Journal:  J Mol Evol       Date:  2013-04-18       Impact factor: 2.395

4.  A new parameter-rich structure-aware mechanistic model for amino acid substitution during evolution.

Authors:  Peter B Chi; Dohyup Kim; Jason K Lai; Nadia Bykova; Claudia C Weber; Jan Kubelka; David A Liberles
Journal:  Proteins       Date:  2017-12-12

5.  Biophysics of protein evolution and evolutionary protein biophysics.

Authors:  Tobias Sikosek; Hue Sun Chan
Journal:  J R Soc Interface       Date:  2014-11-06       Impact factor: 4.118

6.  Limited utility of residue masking for positive-selection inference.

Authors:  Stephanie J Spielman; Eric T Dawson; Claus O Wilke
Journal:  Mol Biol Evol       Date:  2014-06-03       Impact factor: 16.240

7.  Phylogenetic Modeling of Regulatory Element Turnover Based on Epigenomic Data.

Authors:  Noah Dukler; Yi-Fei Huang; Adam Siepel
Journal:  Mol Biol Evol       Date:  2020-07-01       Impact factor: 16.240

8.  Site-Specific Amino Acid Distributions Follow a Universal Shape.

Authors:  Mackenzie M Johnson; Claus O Wilke
Journal:  J Mol Evol       Date:  2020-11-24       Impact factor: 2.395

9.  The utility of protein structure as a predictor of site-wise dN/dS varies widely among HIV-1 proteins.

Authors:  Austin G Meyer; Claus O Wilke
Journal:  J R Soc Interface       Date:  2015-10-06       Impact factor: 4.118

10.  Membrane environment imposes unique selection pressures on transmembrane domains of G protein-coupled receptors.

Authors:  Stephanie J Spielman; Claus O Wilke
Journal:  J Mol Evol       Date:  2013-01-26       Impact factor: 2.395

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.