Literature DB >> 17360626

Growth of novel protein structural data.

Michael Levitt1.   

Abstract

Contrary to popular assumption, the rate of growth of structural data has slowed, and the Protein Data Bank (PDB) has not been growing exponentially since 1995. Reaching such a dramatic conclusion requires careful measurement of growth of novel structures, which can be achieved by clustering entry sequences, or by using a novel index to down-weight entries with a higher number of sequence neighbors. These measures agree, and growth rates are very similar for entire PDB files, clusters, and weighted chains. The overall sizes of Structural Classification of Proteins (SCOP) categories (number of families, superfamilies, and folds) appear to be directly proportional to the number of deposited PDB files. Using our weighted chain count, which is most correlated to the change in the size of each SCOP category in any time period, shows that the rate of increase of SCOP categories is actually slowing down. This enables the final size of each of these SCOP categories to be predicted without examining or comparing protein structures. In the last 3 years, structures solved by structural genomics (SG) initiatives, especially the United States National Institutes of Health Protein Structure Initiative, have begun to redress the slowing growth of the PDB. Structures solved by SG are 3.8 times less sequence-redundant than typical PDB structures. Since mid-2004, SG programs have contributed half the novel structures measured by weighted chain counts. Our analysis does not rely on visual inspection of coordinate sets: it is done automatically, providing an accurate, up-to-date measure of the growth of novel protein structural data.

Mesh:

Substances:

Year:  2007        PMID: 17360626      PMCID: PMC1802002          DOI: 10.1073/pnas.0611678104

Source DB:  PubMed          Journal:  Proc Natl Acad Sci U S A        ISSN: 0027-8424            Impact factor:   11.205


  38 in total

1.  Support for structural genomics and synchrotrons.

Authors:  M Cassman; J C Norvell
Journal:  Science       Date:  1999-10-08       Impact factor: 47.728

2.  Estimating the number of protein folds and families from complete genome data.

Authors:  Y I Wolf; N V Grishin; E V Koonin
Journal:  J Mol Biol       Date:  2000-06-16       Impact factor: 5.469

3.  A unifold, mesofold, and superfold model of protein fold use.

Authors:  Andrew F W Coulson; John Moult
Journal:  Proteins       Date:  2002-01-01

4.  Assessment of homology-based predictions in CASP5.

Authors:  Anna Tramontano; Veronica Morea
Journal:  Proteins       Date:  2003

5.  The number of protein folds and their distribution over families in nature.

Authors:  Xinsheng Liu; Ke Fan; Wei Wang
Journal:  Proteins       Date:  2004-02-15

6.  Rapid protein domain assignment from amino acid sequence using predicted secondary structure.

Authors:  Russell L Marsden; Liam J McGuffin; David T Jones
Journal:  Protein Sci       Date:  2002-12       Impact factor: 6.725

7.  A re-estimation for the total numbers of protein folds and superfamilies.

Authors:  Z X Wang
Journal:  Protein Eng       Date:  1998-08

8.  Structural genomics projects in Japan.

Authors:  S Yokoyama; H Hirota; T Kigawa; T Yabuki; M Shirouzu; T Terada; Y Ito; Y Matsuo; Y Kuroda; Y Nishimura; Y Kyogoku; K Miki; R Masui; S Kuramitsu
Journal:  Nat Struct Biol       Date:  2000-11

9.  SCOP database in 2004: refinements integrate structure and sequence family data.

Authors:  Antonina Andreeva; Dave Howorth; Steven E Brenner; Tim J P Hubbard; Cyrus Chothia; Alexey G Murzin
Journal:  Nucleic Acids Res       Date:  2004-01-01       Impact factor: 16.971

10.  Progress towards mapping the universe of protein folds.

Authors:  Alastair Grant; David Lee; Christine Orengo
Journal:  Genome Biol       Date:  2004-04-29       Impact factor: 13.583

View more
  60 in total

1.  Sub-AQUA: real-value quality assessment of protein structure models.

Authors:  Yifeng David Yang; Preston Spratt; Hao Chen; Changsoon Park; Daisuke Kihara
Journal:  Protein Eng Des Sel       Date:  2010-06-04       Impact factor: 1.650

Review 2.  Protein folds and protein folding.

Authors:  R Dustin Schaeffer; Valerie Daggett
Journal:  Protein Eng Des Sel       Date:  2010-11-03       Impact factor: 1.650

Review 3.  Protein crystallography for non-crystallographers, or how to get the best (but not more) from published macromolecular structures.

Authors:  Alexander Wlodawer; Wladek Minor; Zbigniew Dauter; Mariusz Jaskolski
Journal:  FEBS J       Date:  2007-11-23       Impact factor: 5.542

Review 4.  Structural genomics: keeping up with expanding knowledge of the protein universe.

Authors:  Marek Grabowski; Andrzej Joachimiak; Zbyszek Otwinowski; Wladek Minor
Journal:  Curr Opin Struct Biol       Date:  2007-06-22       Impact factor: 6.809

Review 5.  Chemical genomics: a challenge for de novo drug design.

Authors:  P M Dean
Journal:  Mol Biotechnol       Date:  2007-06-30       Impact factor: 2.695

6.  Contributions to the NIH-NIGMS Protein Structure Initiative from the PSI Production Centers.

Authors:  Stephen K Burley; Andrzej Joachimiak; Gaetano T Montelione; Ian A Wilson
Journal:  Structure       Date:  2008-01       Impact factor: 5.006

7.  Solvent dramatically affects protein structure refinement.

Authors:  Gaurav Chopra; Christopher M Summa; Michael Levitt
Journal:  Proc Natl Acad Sci U S A       Date:  2008-12-10       Impact factor: 11.205

8.  Structural genomics is the largest contributor of novel structural leverage.

Authors:  Rajesh Nair; Jinfeng Liu; Ta-Tsen Soong; Thomas B Acton; John K Everett; Andrei Kouranov; Andras Fiser; Adam Godzik; Lukasz Jaroszewski; Christine Orengo; Gaetano T Montelione; Burkhard Rost
Journal:  J Struct Funct Genomics       Date:  2009-02-05

9.  Target domain definition and classification in CASP8.

Authors:  Michael L Tress; Iakes Ezkurdia; Jane S Richardson
Journal:  Proteins       Date:  2009

10.  Improving structure-based function prediction using molecular dynamics.

Authors:  Dariya S Glazer; Randall J Radmer; Russ B Altman
Journal:  Structure       Date:  2009-07-15       Impact factor: 5.006

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.