| Literature DB >> 20602353 |
Wolfgang Rieping1, Wim F Vranken.
Abstract
The public archives containing protein information in the form of NMR chemical shift data at the BioMagResBank (BMRB) and of 3D structure coordinates at the Protein Data Bank are continuously expanding. The quality of the data contained in these archives, however, varies. The main issue for chemical shift values is that they are determined relative to a reference frequency. When this reference frequency is set incorrectly, all related chemical shift values are systematically offset. Such wrongly referenced chemical shift values, as well as other problems such as chemical shift values that are assigned to the wrong atom, are not easily distinguished from correct values and effectively reduce the usefulness of the archive. We describe a new method to correct and validate protein chemical shift values in relation to their 3D structure coordinates. This method classifies atoms using two parameters: the per-atom solvent accessible surface area (as calculated from the coordinates) and the secondary structure of the parent amino acid. Through the use of Gaussian statistics based on a large database of 3220 BMRB entries, we obtain per-entry chemical shift corrections as well as Z scores for the individual chemical shift values. In addition, information on the error of the correction value itself is available, and the method can retain only dependable correction values. We provide an online resource with chemical shift, atom exposure, and secondary structure information for all relevant BMRB entries (http://www.ebi.ac.uk/pdbe/nmr/vasco) and hope this data will aid the development of new chemical shift-based methods in NMR. 2010 Wiley-Liss, Inc.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20602353 PMCID: PMC2970900 DOI: 10.1002/prot.22756
Source DB: PubMed Journal: Proteins ISSN: 0887-3585
Root-Mean-Square of the Difference Between the Chemical Shift Corrections from VASCO and Previously Published Methods
| Atom name | Talos | LACS | RefDb | CheckShift | Intermethod (all) | Intermethod (shared) |
|---|---|---|---|---|---|---|
| N | 0.57 | n/a | 0.67 | 0.63 | 0.70–0.79 | 0.66–0.79 |
| H | 0.07 | n/a | 0.13 | n/a | 0.09 | 0.09 |
| Hα | 0.04 | 0.06 | 0.05 | n/a | 0.05–0.08 | 0.05–0.08 |
| Cα | 0.15 | 0.19 | 0.21 | 0.55 | 0.19–0.52 | 0.18–0.45 |
| Cβ | 0.15 | 0.19 | 0.21 | 0.37 | 0.19–0.34 | 0.18–0.34 |
| C | 0.36 | 0.35 | 0.25 | 0.54 | 0.32–0.60 | 0.34–0.45 |
All values in ppm.
The range of the rms between the previously published methods is shown in the intermethod column for all possible combinations (all) and the subset of entries shared between the different methods for that atom (shared).
Figure 1Chemical shift corrections from the TALOS database compared with the VASCO—calculated correction for Cα atoms (top left), C atoms (top right), N atoms (bottom left), and Hα atoms (bottom right).
Figure 2Histogram showing the distribution of the chemical shift corrections for aliphatic carbon atoms.
Figure 3Chemical shift corrections for the aliphatic carbon atoms for selected NMR laboratories. The (median, average) corrections are listed behind the laboratory identifier.
Figure 4Variation of proton chemical shift correction for BMRB entry 7014 with increasing numbers of chemical shifts removed.
Number of Samples (out of 10,000 for Each Step) Where a Valid Chemical Shift Correction Was Erroneously Found After Removing an Increasing Number of Chemical Shifts for BMRB Entry 7014
| Percentage of chemical shifts removed | |||||||||
|---|---|---|---|---|---|---|---|---|---|
| Atom class | 10% | 20% | 30% | 40% | 50% | 60% | 70% | 80% | 90% |
| N | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 2 | 7 |
| H | 0 | 0 | 1 | 2 | 2 | 5 | 5 | 4 | 17 |
| Cali | 0 | 0 | 0 | 0 | 1 | 5 | 2 | 8 | 9 |
| Caro | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| CnoH | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |