Literature DB >> 27810958

Tertiary alphabet for the observable protein structural universe.

Craig O Mackenzie1, Jianfu Zhou2, Gevorg Grigoryan3,2,4.   

Abstract

Here, we systematically decompose the known protein structural universe into its basic elements, which we dub tertiary structural motifs (TERMs). A TERM is a compact backbone fragment that captures the secondary, tertiary, and quaternary environments around a given residue, comprising one or more disjoint segments (three on average). We seek the set of universal TERMs that capture all structure in the Protein Data Bank (PDB), finding remarkable degeneracy. Only ∼600 TERMs are sufficient to describe 50% of the PDB at sub-Angstrom resolution. However, more rare geometries also exist, and the overall structural coverage grows logarithmically with the number of TERMs. We go on to show that universal TERMs provide an effective mapping between sequence and structure. We demonstrate that TERM-based statistics alone are sufficient to recapitulate close-to-native sequences given either NMR or X-ray backbones. Furthermore, sequence variability predicted from TERM data agrees closely with evolutionary variation. Finally, locations of TERMs in protein chains can be predicted from sequence alone based on sequence signatures emergent from TERM instances in the PDB. For multisegment motifs, this method identifies spatially adjacent fragments that are not contiguous in sequence-a major bottleneck in structure prediction. Although all TERMs recur in diverse proteins, some appear specialized for certain functions, such as interface formation, metal coordination, or even water binding. Structural biology has benefited greatly from previously observed degeneracies in structure. The decomposition of the known structural universe into a finite set of compact TERMs offers exciting opportunities toward better understanding, design, and prediction of protein structure.

Entities:  

Keywords:  protein structural universe; sequence–structure relationships; structural degeneracy; structural modularity; tertiary motif

Mesh:

Substances:

Year:  2016        PMID: 27810958      PMCID: PMC5127300          DOI: 10.1073/pnas.1607178113

Source DB:  PubMed          Journal:  Proc Natl Acad Sci U S A        ISSN: 0027-8424            Impact factor:   11.205


  71 in total

1.  The penultimate rotamer library.

Authors:  S C Lovell; J M Word; J S Richardson; D C Richardson
Journal:  Proteins       Date:  2000-08-15

2.  Global mapping of the protein structure space and application in structure-based inference of protein function.

Authors:  Jingtong Hou; Se-Ran Jun; Chao Zhang; Sung-Hou Kim
Journal:  Proc Natl Acad Sci U S A       Date:  2005-02-10       Impact factor: 11.205

3.  Prediction of EF-hand calcium-binding proteins and analysis of bacterial EF-hand proteins.

Authors:  Yubin Zhou; Wei Yang; Michael Kirberger; Hsiau-Wei Lee; Gayatri Ayalasomayajula; Jenny J Yang
Journal:  Proteins       Date:  2006-11-15

4.  Why are some proteins structures so common?

Authors:  S Govindarajan; R A Goldstein
Journal:  Proc Natl Acad Sci U S A       Date:  1996-04-16       Impact factor: 11.205

5.  Improved tools for biological sequence comparison.

Authors:  W R Pearson; D J Lipman
Journal:  Proc Natl Acad Sci U S A       Date:  1988-04       Impact factor: 11.205

6.  A smoothed backbone-dependent rotamer library for proteins derived from adaptive kernel density estimates and regressions.

Authors:  Maxim V Shapovalov; Roland L Dunbrack
Journal:  Structure       Date:  2011-06-08       Impact factor: 5.006

Review 7.  Evolution of protein kinase signaling from yeast to man.

Authors:  Gerard Manning; Gregory D Plowman; Tony Hunter; Sucha Sudarsanam
Journal:  Trends Biochem Sci       Date:  2002-10       Impact factor: 13.807

8.  BriX: a database of protein building blocks for structural analysis, modeling and design.

Authors:  Peter Vanhee; Erik Verschueren; Lies Baeten; Francois Stricher; Luis Serrano; Frederic Rousseau; Joost Schymkowitz
Journal:  Nucleic Acids Res       Date:  2010-10-23       Impact factor: 16.971

9.  Fast, scalable generation of high-quality protein multiple sequence alignments using Clustal Omega.

Authors:  Fabian Sievers; Andreas Wilm; David Dineen; Toby J Gibson; Kevin Karplus; Weizhong Li; Rodrigo Lopez; Hamish McWilliam; Michael Remmert; Johannes Söding; Julie D Thompson; Desmond G Higgins
Journal:  Mol Syst Biol       Date:  2011-10-11       Impact factor: 11.429

10.  De Novo Evolutionary Emergence of a Symmetrical Protein Is Shaped by Folding Constraints.

Authors:  Robert G Smock; Itamar Yadid; Orly Dym; Jane Clarke; Dan S Tawfik
Journal:  Cell       Date:  2016-01-21       Impact factor: 41.582

View more
  17 in total

1.  Multi-scale structural analysis of proteins by deep semantic segmentation.

Authors:  Raphael R Eguchi; Po-Ssu Huang
Journal:  Bioinformatics       Date:  2020-03-01       Impact factor: 6.937

2.  Diversity and motif conservation in protein 3D structural landscape: exploration by a new multivariate simulation method.

Authors:  Rajani R Joshi
Journal:  J Mol Model       Date:  2018-03-02       Impact factor: 1.810

Review 3.  Protein structural motifs in prediction and design.

Authors:  Craig O Mackenzie; Gevorg Grigoryan
Journal:  Curr Opin Struct Biol       Date:  2017-04-28       Impact factor: 6.809

Review 4.  Computational Tools and Strategies to Develop Peptide-Based Inhibitors of Protein-Protein Interactions.

Authors:  Maxence Delaunay; Tâp Ha-Duong
Journal:  Methods Mol Biol       Date:  2022

Review 5.  De novo protein design, a retrospective.

Authors:  Ivan V Korendovych; William F DeGrado
Journal:  Q Rev Biophys       Date:  2020-02-11       Impact factor: 5.318

6.  Structure-conditioned amino-acid couplings: How contact geometry affects pairwise sequence preferences.

Authors:  Jack Holland; Gevorg Grigoryan
Journal:  Protein Sci       Date:  2022-02-15       Impact factor: 6.725

Review 7.  Evolution, folding, and design of TIM barrels and related proteins.

Authors:  Sergio Romero-Romero; Sina Kordes; Florian Michel; Birte Höcker
Journal:  Curr Opin Struct Biol       Date:  2021-01-13       Impact factor: 6.809

Review 8.  Data-driven computational protein design.

Authors:  Vincent Frappier; Amy E Keating
Journal:  Curr Opin Struct Biol       Date:  2021-04-25       Impact factor: 7.786

9.  Design of protein-binding proteins from the target structure alone.

Authors:  Longxing Cao; Brian Coventry; Inna Goreshnik; Buwei Huang; William Sheffler; Joon Sung Park; Kevin M Jude; Iva Marković; Rameshwar U Kadam; Koen H G Verschueren; Kenneth Verstraete; Scott Thomas Russell Walsh; Nathaniel Bennett; Ashish Phal; Aerin Yang; Lisa Kozodoy; Michelle DeWitt; Lora Picton; Lauren Miller; Eva-Maria Strauch; Nicholas D DeBouver; Allison Pires; Asim K Bera; Samer Halabiya; Bradley Hammerson; Wei Yang; Steffen Bernard; Lance Stewart; Ian A Wilson; Hannele Ruohola-Baker; Joseph Schlessinger; Sangwon Lee; Savvas N Savvides; K Christopher Garcia; David Baker
Journal:  Nature       Date:  2022-03-24       Impact factor: 69.504

10.  Sequence statistics of tertiary structural motifs reflect protein stability.

Authors:  Fan Zheng; Gevorg Grigoryan
Journal:  PLoS One       Date:  2017-05-26       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.