Literature DB >> 1409573

Selection of a representative set of structures from Brookhaven Protein Data Bank.

J Boberg1, T Salakoski, M Vihinen.   

Abstract

Reliable structural and statistical analyses of three dimensional protein structures should be based on unbiased data. The Protein Data Bank is highly redundant, containing several entries for identical or very similar sequences. A technique was developed for clustering the known structures based on their sequences and contents of alpha- and beta-structures. First, sequences were aligned pairwise. A representative sample of sequences was then obtained by grouping similar sequences together, and selecting a typical representative from each group. The similarity significance threshold needed in the clustering method was found by analyzing similarities of random sequences. Because three dimensional structures for proteins of same structural class are generally more conserved than their sequences, the proteins were clustered also according to their contents of secondary structural elements. The results of these clusterings indicate conservation of alpha- and beta-structures even when sequence similarity is relatively low. An unbiased sample of 103 high resolution structures, representing a wide variety of proteins, was chosen based on the suggestions made by the clustering algorithm. The proteins were divided into structural classes according to their contents and ratios of secondary structural elements. Previous classifications have suffered from subjective view of secondary structures, whereas here the classification was based on backbone geometry. The concise view lead to reclassification of some structures. The representative set of structures facilitates unbiased analyses of relationships between protein sequence, function, and structure as well as of structural characteristics.

Mesh:

Year:  1992        PMID: 1409573     DOI: 10.1002/prot.340140212

Source DB:  PubMed          Journal:  Proteins        ISSN: 0887-3585


  11 in total

1.  Global mapping of the protein structure space and application in structure-based inference of protein function.

Authors:  Jingtong Hou; Se-Ran Jun; Chao Zhang; Sung-Hou Kim
Journal:  Proc Natl Acad Sci U S A       Date:  2005-02-10       Impact factor: 11.205

2.  Modelling of peptide and protein structures.

Authors:  S Fraga; J M Parker
Journal:  Amino Acids       Date:  1994-06       Impact factor: 3.520

3.  Predicting protein folding rates from geometric contact and amino acid sequence.

Authors:  Zheng Ouyang; Jie Liang
Journal:  Protein Sci       Date:  2008-04-23       Impact factor: 6.725

4.  Identification of csk tyrosine phosphorylation sites and a tyrosine residue important for kinase domain structure.

Authors:  V Joukov; M Vihinen; S Vainikka; J M Sowadski; K Alitalo; M Bergman
Journal:  Biochem J       Date:  1997-03-15       Impact factor: 3.857

5.  Optimization of the electrostatic interactions in proteins of different functional and folding type.

Authors:  V Z Spassov; A D Karshikoff; R Ladenstein
Journal:  Protein Sci       Date:  1994-09       Impact factor: 6.725

6.  Common prevalence of alanine and glycine in mobile reactive centre loops of serpins and viral fusion peptides: do prions possess a fusion peptide?

Authors:  I Callebaut; A Tasso; R Brasseur; A Burny; D Portetelle; J P Mornon
Journal:  J Comput Aided Mol Des       Date:  1994-04       Impact factor: 3.686

7.  Structural basis for chromosome X-linked agammaglobulinemia: a tyrosine kinase disease.

Authors:  M Vihinen; D Vetrie; H S Maniar; H D Ochs; Q Zhu; I Vorechovský; A D Webster; L D Notarangelo; L Nilsson; J M Sowadski
Journal:  Proc Natl Acad Sci U S A       Date:  1994-12-20       Impact factor: 11.205

8.  Verification of protein structures: patterns of nonbonded atomic interactions.

Authors:  C Colovos; T O Yeates
Journal:  Protein Sci       Date:  1993-09       Impact factor: 6.725

9.  Hierarchical classification of protein folds using a novel ensemble classifier.

Authors:  Chen Lin; Ying Zou; Ji Qin; Xiangrong Liu; Yi Jiang; Caihuan Ke; Quan Zou
Journal:  PLoS One       Date:  2013-02-20       Impact factor: 3.240

10.  Comparing models of evolution for ordered and disordered proteins.

Authors:  Celeste J Brown; Audra K Johnson; Gary W Daughdrill
Journal:  Mol Biol Evol       Date:  2009-11-18       Impact factor: 16.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.