Literature DB >> 29382650

Beyond Thermodynamic Constraints: Evolutionary Sampling Generates Realistic Protein Sequence Variation.

Qian Jiang1,2, Ashley I Teufel1, Eleisha L Jackson1, Claus O Wilke3.   

Abstract

Biological evolution generates a surprising amount of site-specific variability in protein sequences. Yet, attempts at modeling this process have been only moderately successful, and current models based on protein structural metrics explain, at best, 60% of the observed variation. Surprisingly, simple measures of protein structure, such as solvent accessibility, are often better predictors of site-specific variability than more complex models employing all-atom energy functions and detailed structural modeling. We suggest here that these more complex models perform poorly because they lack consideration of the evolutionary process, which is, in part, captured by the simpler metrics. We compare protein sequences that are computationally designed to sequences that are computationally evolved using the same protein-design energy function and to homologous natural sequences. We find that, by a wide variety of metrics, evolved sequences are much more similar to natural sequences than are designed sequences. In particular, designed sequences are too conserved on the protein surface relative to natural sequences, whereas evolved sequences are not. Our results suggest that evolutionary simulation produces a realistic sampling of sequence space. By contrast, protein design-at least as currently implemented-does not. Existing energy functions seem to be sufficiently accurate to correctly describe the key thermodynamic constraints acting on protein sequences, but they need to be paired with realistic sampling schemes to generate realistic sequence alignments.
Copyright © 2018 by the Genetics Society of America.

Keywords:  protein evolution; protein structure; site specific variability

Mesh:

Substances:

Year:  2018        PMID: 29382650      PMCID: PMC5887137          DOI: 10.1534/genetics.118.300699

Source DB:  PubMed          Journal:  Genetics        ISSN: 0016-6731            Impact factor:   4.562


  47 in total

1.  Quantifying the effect of burial of amino acid residues on protein stability.

Authors:  Hongyi Zhou; Yaoqi Zhou
Journal:  Proteins       Date:  2004-02-01

2.  PyRosetta: a script-based interface for implementing molecular modeling algorithms using Rosetta.

Authors:  Sidhartha Chaudhury; Sergey Lyskov; Jeffrey J Gray
Journal:  Bioinformatics       Date:  2010-01-07       Impact factor: 6.937

3.  Structural determinants of protein evolution are context-sensitive at the residue level.

Authors:  Eric A Franzosa; Yu Xia
Journal:  Mol Biol Evol       Date:  2009-07-13       Impact factor: 16.240

4.  Maximum-Likelihood Phylogenetic Inference with Selection on Protein Folding Stability.

Authors:  Miguel Arenas; Agustin Sánchez-Cobos; Ugo Bastolla
Journal:  Mol Biol Evol       Date:  2015-04-02       Impact factor: 16.240

5.  Accelerated simulation of evolutionary trajectories in origin-fixation models.

Authors:  Ashley I Teufel; Claus O Wilke
Journal:  J R Soc Interface       Date:  2017-02       Impact factor: 4.118

Review 6.  Principles of protein-protein interactions.

Authors:  S Jones; J M Thornton
Journal:  Proc Natl Acad Sci U S A       Date:  1996-01-09       Impact factor: 11.205

7.  Amino-acid site variability among natural and designed proteins.

Authors:  Eleisha L Jackson; Noah Ollikainen; Arthur W Covert; Tanja Kortemme; Claus O Wilke
Journal:  PeerJ       Date:  2013-11-12       Impact factor: 2.984

8.  Local packing density is the main structural determinant of the rate of protein sequence evolution at site level.

Authors:  So-Wei Yeh; Tsun-Tsao Huang; Jen-Wei Liu; Sung-Huan Yu; Chien-Hua Shih; Jenn-Kang Hwang; Julian Echave
Journal:  Biomed Res Int       Date:  2014-07-09       Impact factor: 3.411

9.  Functional Sites Induce Long-Range Evolutionary Constraints in Enzymes.

Authors:  Benjamin R Jack; Austin G Meyer; Julian Echave; Claus O Wilke
Journal:  PLoS Biol       Date:  2016-05-03       Impact factor: 8.029

10.  The tangled bank of amino acids.

Authors:  Richard A Goldstein; David D Pollock
Journal:  Protein Sci       Date:  2016-05-12       Impact factor: 6.725

View more
  2 in total

1.  The Many Nuanced Evolutionary Consequences of Duplicated Genes.

Authors:  Ashley I Teufel; Mackenzie M Johnson; Jon M Laurent; Aashiq H Kachroo; Edward M Marcotte; Claus O Wilke
Journal:  Mol Biol Evol       Date:  2019-02-01       Impact factor: 16.240

2.  Site-Specific Amino Acid Distributions Follow a Universal Shape.

Authors:  Mackenzie M Johnson; Claus O Wilke
Journal:  J Mol Evol       Date:  2020-11-24       Impact factor: 2.395

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.