Literature DB >> 26223372

Constructing sequence-dependent protein models using coevolutionary information.

Ryan R Cheng1, Mohit Raghunathan1,2, Jeffrey K Noel1,2, José N Onuchic1,2.   

Abstract

Recent developments in global statistical methodologies have advanced the analysis of large collections of protein sequences for coevolutionary information. Coevolution between amino acids in a protein arises from compensatory mutations that are needed to maintain the stability or function of a protein over the course of evolution. This gives rise to quantifiable correlations between amino acid sites within the multiple sequence alignment of a protein family. Here, we use the maximum entropy-based approach called mean field Direct Coupling Analysis (mfDCA) to infer a Potts model Hamiltonian governing the correlated mutations in a protein family. We use the inferred pairwise statistical couplings to generate the sequence-dependent heterogeneous interaction energies of a structure-based model (SBM) where only native contacts are considered. Considering the ribosomal S6 protein and its circular permutants as well as the SH3 protein, we demonstrate that these models quantitatively agree with experimental data on folding mechanisms. This work serves as a new framework for generating coevolutionary data-enriched models that can potentially be used to engineer key functional motions and novel interactions in protein systems.
© 2015 The Protein Society.

Keywords:  coarse-grained protein models; coevolutionary information; computational biophysics; statistical inference

Mesh:

Substances:

Year:  2015        PMID: 26223372      PMCID: PMC4815312          DOI: 10.1002/pro.2758

Source DB:  PubMed          Journal:  Protein Sci        ISSN: 0961-8368            Impact factor:   6.725


  64 in total

1.  The Protein Data Bank.

Authors:  H M Berman; J Westbrook; Z Feng; G Gilliland; T N Bhat; H Weissig; I N Shindyalov; P E Bourne
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

2.  Complete change of the protein folding transition state upon circular permutation.

Authors:  Magnus Lindberg; Jeanette Tångrot; Mikael Oliveberg
Journal:  Nat Struct Biol       Date:  2002-11

3.  Transition states for folding of circular-permuted proteins.

Authors:  Jie Chen; Jun Wang; Wei Wang
Journal:  Proteins       Date:  2004-10-01

4.  Weak pairwise correlations imply strongly correlated network states in a neural population.

Authors:  Elad Schneidman; Michael J Berry; Ronen Segev; William Bialek
Journal:  Nature       Date:  2006-04-09       Impact factor: 49.962

5.  Robustness and generalization of structure-based models for protein folding and function.

Authors:  Heiko Lammert; Alexander Schug; José N Onuchic
Journal:  Proteins       Date:  2009-12

Review 6.  Theoretical perspectives on nonnative interactions and intrinsic disorder in protein folding and binding.

Authors:  Tao Chen; Jianhui Song; Hue Sun Chan
Journal:  Curr Opin Struct Biol       Date:  2014-12-24       Impact factor: 6.809

7.  Coevolutionary information, protein folding landscapes, and the thermodynamics of natural selection.

Authors:  Faruck Morcos; Nicholas P Schafer; Ryan R Cheng; José N Onuchic; Peter G Wolynes
Journal:  Proc Natl Acad Sci U S A       Date:  2014-08-11       Impact factor: 11.205

8.  Translating HIV sequences into quantitative fitness landscapes predicts viral vulnerabilities for rational immunogen design.

Authors:  Andrew L Ferguson; Jaclyn K Mann; Saleha Omarjee; Thumbi Ndung'u; Bruce D Walker; Arup K Chakraborty
Journal:  Immunity       Date:  2013-03-21       Impact factor: 31.745

9.  Learning To Fold Proteins Using Energy Landscape Theory.

Authors:  N P Schafer; B L Kim; W Zheng; P G Wolynes
Journal:  Isr J Chem       Date:  2014-08       Impact factor: 3.333

10.  Biomolecular dynamics: order-disorder transitions and energy landscapes.

Authors:  Paul C Whitford; Karissa Y Sanbonmatsu; José N Onuchic
Journal:  Rep Prog Phys       Date:  2012-06-28
View more
  8 in total

1.  Structural propensities of kinase family proteins from a Potts model of residue co-variation.

Authors:  Allan Haldane; William F Flynn; Peng He; R S K Vijayan; Ronald M Levy
Journal:  Protein Sci       Date:  2016-06-26       Impact factor: 6.725

Review 2.  Potts Hamiltonian models of protein co-variation, free energy landscapes, and evolutionary fitness.

Authors:  Ronald M Levy; Allan Haldane; William F Flynn
Journal:  Curr Opin Struct Biol       Date:  2016-11-18       Impact factor: 6.809

3.  Coevolutionary Landscape of Kinase Family Proteins: Sequence Probabilities and Functional Motifs.

Authors:  Allan Haldane; William F Flynn; Peng He; Ronald M Levy
Journal:  Biophys J       Date:  2018-01-09       Impact factor: 4.033

Review 4.  Sequence co-evolutionary information is a natural partner to minimally-frustrated models of biomolecular dynamics.

Authors:  Jeffrey K Noel; Faruck Morcos; Jose N Onuchic
Journal:  F1000Res       Date:  2016-01-26

5.  SMOG 2: A Versatile Software Package for Generating Structure-Based Models.

Authors:  Jeffrey K Noel; Mariana Levi; Mohit Raghunathan; Heiko Lammert; Ryan L Hayes; José N Onuchic; Paul C Whitford
Journal:  PLoS Comput Biol       Date:  2016-03-10       Impact factor: 4.475

6.  Inferring repeat-protein energetics from evolutionary information.

Authors:  Rocío Espada; R Gonzalo Parra; Thierry Mora; Aleksandra M Walczak; Diego U Ferreiro
Journal:  PLoS Comput Biol       Date:  2017-06-15       Impact factor: 4.475

7.  Connecting the Sequence-Space of Bacterial Signaling Proteins to Phenotypes Using Coevolutionary Landscapes.

Authors:  R R Cheng; O Nordesjö; R L Hayes; H Levine; S C Flores; J N Onuchic; F Morcos
Journal:  Mol Biol Evol       Date:  2016-09-07       Impact factor: 16.240

8.  Disease-relevant mutations alter amino acid co-evolution networks in the second nucleotide binding domain of CFTR.

Authors:  Gabrianne Ivey; Robert T Youker
Journal:  PLoS One       Date:  2020-01-24       Impact factor: 3.240

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.