| Literature DB >> 22477769 |
Robbie P Joosten, Jean Salzemann, Vincent Bloch, Heinz Stockinger, Ann-Charlott Berglund, Christophe Blanchet, Erik Bongcam-Rudloff, Christophe Combet, Ana L Da Costa, Gilbert Deleage, Matteo Diarena, Roberto Fabbretti, Géraldine Fettahi, Volker Flegel, Andreas Gisel, Vinod Kasam, Timo Kervinen, Eija Korpelainen, Kimmo Mattila, Marco Pagni, Matthieu Reichstadt, Vincent Breton, Ian J Tickle, Gert Vriend.
Abstract
Structural biology, homology modelling and rational drug design require accurate three-dimensional macromolecular coordinates. However, the coordinates in the Protein Data Bank (PDB) have not all been obtained using the latest experimental and computational methods. In this study a method is presented for automated re-refinement of existing structure models in the PDB. A large-scale benchmark with 16 807 PDB entries showed that they can be improved in terms of fit to the deposited experimental X-ray data as well as in terms of geometric quality. The re-refinement protocol uses TLS models to describe concerted atom movement. The resulting structure models are made available through the PDB_REDO databank (http://www.cmbi.ru.nl/pdb_redo/). Grid computing techniques were used to overcome the computational requirements of this endeavour.Entities:
Year: 2009 PMID: 22477769 PMCID: PMC3246819 DOI: 10.1107/S0021889809008784
Source DB: PubMed Journal: J Appl Crystallogr ISSN: 0021-8898 Impact factor: 3.304
Data set selection and re-refinement
| PDB entries (January 2007) | 41277 |
| X-ray structure models | 35003 |
| X-ray + 2.70 resolution | 29541 |
| X-ray + experimental data (SF) | 20889 |
| X-ray + SF + | 16877 |
| X-ray + SF + usable | 16807 |
| Re-refined structure models | 15034 |
| Improved structure models | 10046 |
Status flag schemes for R-free set selection encountered in deposited reflection files at the wwPDB
| Scheme | Working set |
| Example PDB entry |
|---|---|---|---|
| 1 | o | f | 1aa6 (1dzi) |
| 2 | 0 | 1 | 101m (1a4i) |
| 3 | 1 | 1 | 1a8d |
| 4 | Positive integer | 0 | 1b7d |
| 5 | Positive real number | 0.00 | 1c3c |
| 6 | 1.0 | 0.0 | 1a27 |
PDB identifiers in parentheses are examples of reversed usage of the scheme.
Figure 1R-free values extracted from the PDB header (diamonds) and values recalculated with Refmac (Winn et al., 2001 ▶) using the deposited experimental data (squares) plotted against the experimental data resolution. The values are averages for all structure models in 0.1 Å bins. The recalculated R values (not shown) follow the same pattern.
Figure 2R-free values extracted from the PDB header (diamonds) and values obtained after re-refinement in Refmac (Winn et al., 2001 ▶) with TLS models (squares) plotted against the experimental data resolution. The values are averages for all structure models in 0.1 Å bins. The effect of the TLS parameterization is clearly shown by the results of the re-refinement without TLS models (dotted line). For all but the highest resolution bins, refinement with TLS gives lower average R-free values.
Figure 3WHAT_CHECK Ramachandran plot appearance Z-scores (Hooft et al., 1997 ▶) for original (diamonds) and TLS-refined structure models (squares) as a function of resolution. The values are averages for all structure models in 0.1 Å bins.
Figure 4Atomic overlaps (bumps) per structure model as detected by WHAT_CHECK for original (diamonds) and TLS-refined structure models (squares) as a function of resolution. The values are averages for all structure models in 0.1 Å bins.
Figure 5Bond-length r.m.s. Z-score per structure model as calculated by WHAT_CHECK for original (diamonds) and TLS-refined structure models (squares) as a function of resolution. The values are averages for all structure models in 0.1 Å bins.
Figure 6Bond-angle r.m.s. Z-score per structure model as calculated by WHAT_CHECK for original (diamonds) and TLS-refined structure models (squares) as a function of resolution. The values are averages for all structure models in 0.1 Å bins.
Figure 7Percentage of structure models that improve in terms of R-free after TLS refinement plotted as a function of the year of deposition. The percentage of all evaluated structures (diamonds) decreases from 90% for 1995 to 62% for 2006. The percentage of structures previously refined with TLS (squares) varies between 21 and 32%.