Literature DB >> 14636603

The PDB is a covering set of small protein structures.

Daisuke Kihara1, Jeffrey Skolnick.   

Abstract

Structure comparisons of all representative proteins have been done. Employing the relative root mean square deviation (RMSD) from native enables the assessment of the statistical significance of structure alignments of different lengths in terms of a Z-score. Two conclusions emerge: first, proteins with their native fold can be distinguished by their Z-score. Second and somewhat surprising, all small proteins up to 100 residues in length have significant structure alignments to other proteins in a different secondary structure and fold class; i.e. 24.0% of them have 60% coverage by a template protein with a RMSD below 3.5A and 6.0% have 70% coverage. If the restriction that we align proteins only having different secondary structure types is removed, then in a representative benchmark set of proteins of 200 residues or smaller, 93% can be aligned to a single template structure (with average sequence identity of 9.8%), with a RMSD less than 4A, and 79% average coverage. In this sense, the current Protein Data Bank (PDB) is almost a covering set of small protein structures. The length of the aligned region (relative to the whole protein length) does not differ among the top hit proteins, indicating that protein structure space is highly dense. For larger proteins, non-related proteins can cover a significant portion of the structure. Moreover, these top hit proteins are aligned to different parts of the target protein, so that almost the entire molecule can be covered when combined. The number of proteins required to cover a target protein is very small, e.g. the top ten hit proteins can give 90% coverage below a RMSD of 3.5A for proteins up to 320 residues long. These results give a new view of the nature of protein structure space, and its implications for protein structure prediction are discussed.

Mesh:

Year:  2003        PMID: 14636603     DOI: 10.1016/j.jmb.2003.10.027

Source DB:  PubMed          Journal:  J Mol Biol        ISSN: 0022-2836            Impact factor:   5.469


  49 in total

1.  Automated structure prediction of weakly homologous proteins on a genomic scale.

Authors:  Yang Zhang; Jeffrey Skolnick
Journal:  Proc Natl Acad Sci U S A       Date:  2004-05-04       Impact factor: 11.205

2.  CONTSOR--a new knowledge-based fold recognition potential, based on side chain orientation and contacts between residue terminal groups.

Authors:  Boris Vishnepolsky; Malak Pirtskhalava
Journal:  Protein Sci       Date:  2011-11-23       Impact factor: 6.725

3.  Template-based structure prediction and classification of transcription factors in Arabidopsis thaliana.

Authors:  Tao Lu; Yuedong Yang; Bo Yao; Song Liu; Yaoqi Zhou; Chi Zhang
Journal:  Protein Sci       Date:  2012-05-01       Impact factor: 6.725

4.  iAlign: a method for the structural comparison of protein-protein interfaces.

Authors:  Mu Gao; Jeffrey Skolnick
Journal:  Bioinformatics       Date:  2010-07-11       Impact factor: 6.937

5.  Structural space of protein-protein interfaces is degenerate, close to complete, and highly connected.

Authors:  Mu Gao; Jeffrey Skolnick
Journal:  Proc Natl Acad Sci U S A       Date:  2010-12-13       Impact factor: 11.205

6.  Are protein-protein interfaces special regions on a protein's surface?

Authors:  Sam Tonddast-Navaei; Jeffrey Skolnick
Journal:  J Chem Phys       Date:  2015-12-28       Impact factor: 3.488

7.  Further evidence for the likely completeness of the library of solved single domain protein structures.

Authors:  Jeffrey Skolnick; Hongyi Zhou; Michal Brylinski
Journal:  J Phys Chem B       Date:  2012-02-13       Impact factor: 2.991

8.  The protein structure prediction problem could be solved using the current PDB library.

Authors:  Yang Zhang; Jeffrey Skolnick
Journal:  Proc Natl Acad Sci U S A       Date:  2005-01-14       Impact factor: 11.205

9.  Comprehensive evaluation of protein structure alignment methods: scoring by geometric measures.

Authors:  Rachel Kolodny; Patrice Koehl; Michael Levitt
Journal:  J Mol Biol       Date:  2005-01-16       Impact factor: 5.469

10.  Putting the pathway back into protein folding.

Authors:  Jeffrey Skolnick
Journal:  Proc Natl Acad Sci U S A       Date:  2005-02-09       Impact factor: 11.205

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.