Literature DB >> 19025980

Local descriptors of protein structure: a systematic analysis of the sequence-structure relationship in proteins using short- and long-range interactions.

Torgeir R Hvidsten1, Andriy Kryshtafovych, Krzysztof Fidelis.   

Abstract

Local protein structure representations that incorporate long-range contacts between residues are often considered in protein structure comparison but have found relatively little use in structure prediction where assembly from single backbone fragments dominates. Here, we introduce the concept of local descriptors of protein structure to characterize local neighborhoods of amino acids including short- and long-range interactions. We build a library of recurring local descriptors and show that this library is general enough to allow assembly of unseen protein structures. The library could on average re-assemble 83% of 119 unseen structures, and showed little or no performance decrease between homologous targets and targets with folds not represented among domains used to build it. We then systematically evaluate the descriptor library to establish the level of the sequence signal in sets of protein fragments of similar geometrical conformation. In particular, we test whether that signal is strong enough to facilitate correct assignment and alignment of these local geometries to new sequences. We use the signal to assign descriptors to a test set of 479 sequences with less than 40% sequence identity to any domain used to build the library, and show that on average more than 50% of the backbone fragments constituting descriptors can be correctly aligned. We also use the assigned descriptors to infer SCOP folds, and show that correct predictions can be made in many of the 151 cases where PSI-BLAST was unable to detect significant sequence similarity to proteins in the library. Although the combinatorial problem of simultaneously aligning several fragments to sequence is a major bottleneck compared with single fragment methods, the advantage of the current approach is that correct alignments imply correct long range distance constraints. The lack of these constraints is most likely the major reason why structure prediction methods fail to consistently produce adequate models when good templates are unavailable or undetectable. Thus, we believe that the current study offers new and valuable insight into the prediction of sequence-structure relationships in proteins. Copyright 2008 Wiley-Liss, Inc.

Entities:  

Mesh:

Substances:

Year:  2009        PMID: 19025980      PMCID: PMC5479691          DOI: 10.1002/prot.22296

Source DB:  PubMed          Journal:  Proteins        ISSN: 0887-3585


  75 in total

1.  Ab initio protein structure prediction of CASP III targets using ROSETTA.

Authors:  K T Simons; R Bonneau; I Ruczinski; D Baker
Journal:  Proteins       Date:  1999

2.  Bayesian probabilistic approach for predicting backbone structures in terms of protein blocks.

Authors:  A G de Brevern; C Etchebest; S Hazout
Journal:  Proteins       Date:  2000-11-15

3.  Improved protein loop prediction from sequence alone.

Authors:  D F Burke; C M Deane
Journal:  Protein Eng       Date:  2001-07

4.  Assessment of fold recognition predictions in CASP6.

Authors:  Guoli Wang; Yumi Jin; Roland L Dunbrack
Journal:  Proteins       Date:  2005

5.  Using multi-data hidden Markov models trained on local neighborhoods of protein structure to predict residue-residue contacts.

Authors:  Patrik Björkholm; Pawel Daniluk; Andriy Kryshtafovych; Krzysztof Fidelis; Robin Andersson; Torgeir R Hvidsten
Journal:  Bioinformatics       Date:  2009-03-16       Impact factor: 6.937

6.  Automatic definition of recurrent local structure motifs in proteins.

Authors:  M J Rooman; J Rodriguez; S J Wodak
Journal:  J Mol Biol       Date:  1990-05-20       Impact factor: 5.469

7.  Structure-based functional motif identifies a potential disulfide oxidoreductase active site in the serine/threonine protein phosphatase-1 subfamily.

Authors:  J S Fetrow; N Siew; J Skolnick
Journal:  FASEB J       Date:  1999-10       Impact factor: 5.191

8.  A 3D building blocks approach to analyzing and predicting structure of proteins.

Authors:  R Unger; D Harel; S Wherland; J L Sussman
Journal:  Proteins       Date:  1989

9.  A graph-theoretic approach to the identification of three-dimensional patterns of amino acid side-chains in protein structures.

Authors:  P J Artymiuk; A R Poirrette; H M Grindley; D W Rice; P Willett
Journal:  J Mol Biol       Date:  1994-10-21       Impact factor: 5.469

10.  Three-dimensional, sequence order-independent structural comparison of a serine protease against the crystallographic database reveals active site similarities: potential implications to evolution and to protein folding.

Authors:  D Fischer; H Wolfson; S L Lin; R Nussinov
Journal:  Protein Sci       Date:  1994-05       Impact factor: 6.725

View more
  8 in total

1.  Using multi-data hidden Markov models trained on local neighborhoods of protein structure to predict residue-residue contacts.

Authors:  Patrik Björkholm; Pawel Daniluk; Andriy Kryshtafovych; Krzysztof Fidelis; Robin Andersson; Torgeir R Hvidsten
Journal:  Bioinformatics       Date:  2009-03-16       Impact factor: 6.937

2.  Detecting local residue environment similarity for recognizing near-native structure models.

Authors:  Hyungrae Kim; Daisuke Kihara
Journal:  Proteins       Date:  2014-10-30

3.  Recognizing protein substructure similarity using segmental threading.

Authors:  Sitao Wu; Yang Zhang
Journal:  Structure       Date:  2010-07-14       Impact factor: 5.006

4.  A novel method to compare protein structures using local descriptors.

Authors:  Paweł Daniluk; Bogdan Lesyng
Journal:  BMC Bioinformatics       Date:  2011-08-17       Impact factor: 3.169

5.  Structural alignment of protein descriptors - a combinatorial model.

Authors:  Maciej Antczak; Marta Kasprzak; Piotr Lukasiak; Jacek Blazewicz
Journal:  BMC Bioinformatics       Date:  2016-09-17       Impact factor: 3.169

6.  DAMA-a method for computing multiple alignments of protein structures using local structure descriptors.

Authors:  Paweł Daniluk; Tymoteusz Oleniecki; Bogdan Lesyng
Journal:  Bioinformatics       Date:  2021-08-16       Impact factor: 6.937

7.  A comprehensive analysis of the structure-function relationship in proteins based on local structure similarity.

Authors:  Torgeir R Hvidsten; Astrid Laegreid; Andriy Kryshtafovych; Gunnar Andersson; Krzysztof Fidelis; Jan Komorowski
Journal:  PLoS One       Date:  2009-07-15       Impact factor: 3.240

8.  A chemogenomics view on protein-ligand spaces.

Authors:  Helena Strömbergsson; Gerard J Kleywegt
Journal:  BMC Bioinformatics       Date:  2009-06-16       Impact factor: 3.169

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.