Literature DB >> 10705434

An analysis of the Protein Data Bank in search of temporal and global trends.

H Weissig1, P E Bourne.   

Abstract

MOTIVATION: Biological databases, with their rapidly expanding contents, are indispensable tools in the quest to understand more about biological function. However, a serious user of a database that comprises a large collection of data, collected over a long period, will likely be struck by the inconsistency in reporting individual items of data. This paper takes a critical look at the Protein Data Bank (PDB) to explore the seriousness of the problem in one particular data set and to explore the implications to those actively engaged in comparative analysis of these data.
RESULTS: Averaged over the complete corpus, the stereochemical quality of atomic models has, in the past few years, moved towards ideal values. At the same time, there are inconsistencies in how data are reported. Water content is not reported consistently and the percent of data collected when reporting the high-resolution shell varies, detracting from the value of resolution as a yardstick for assessing the quality of a structure. A more detailed analysis of these inconsistencies is hampered by the lack of machine-readable experimental data. To the user of macromolecular structure data, this suggests that structural details beyond the standard quality measures of resolution and R value should be considered when using coordinate sets for further derivation or in inferring biological function. To the curators of the PDB, this suggests the need to capture more of the experimental data associated with the experiment in a way that permits straightforward parsing.

Entities:  

Mesh:

Substances:

Year:  1999        PMID: 10705434     DOI: 10.1093/bioinformatics/15.10.807

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  2 in total

1.  The macro-ethics of genomics to health: the physiome project.

Authors:  James B Bassingthwaighte
Journal:  C R Biol       Date:  2003 Oct-Nov       Impact factor: 1.583

2.  Current awareness.

Authors:  R Drysdale; L Bayraktaroglu
Journal:  Yeast       Date:  2000-06-30       Impact factor: 3.239

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.