| Literature DB >> 22986687 |
Jurgen F Doreleijers1, Alan W Sousa da Silva, Elmar Krieger, Sander B Nabuurs, Christian A E M Spronk, Tim J Stevens, Wim F Vranken, Gert Vriend, Geerten W Vuister.
Abstract
We present a suite of programs, named CING for Common Interface for NMR Structure Generation that provides for a residue-based, integrated validation of the structural NMR ensemble in conjunction with the experimental restraints and other input data. External validation programs and new internal validation routines compare the NMR-derived models with empirical data, measured chemical shifts, distance- and dihedral restraints and the results are visualized in a dynamic Web 2.0 report. A red-orange-green score is used for residues and restraints to direct the user to those critiques that warrant further investigation. Overall green scores below ~20 % accompanied by red scores over ~50 % are strongly indicative of poorly modelled structures. The publically accessible, secure iCing webserver ( https://nmr.le.ac.uk ) allows individual users to upload the NMR data and run a CING validation analysis.Entities:
Mesh:
Substances:
Year: 2012 PMID: 22986687 PMCID: PMC3483101 DOI: 10.1007/s10858-012-9669-7
Source DB: PubMed Journal: J Biomol NMR ISSN: 0925-2738 Impact factor: 2.835
Fig. 1Schematic outline showing the data flow and software components involved in CING (yellow boxes). External programs interfaced to CING, CCPN services/APIs and wwPDB repositories are represented in brown-shades, green and dark-orange, respectively. External programs (olive green) can access the iCing web user interface (purple) through the dedicated iCing robot
External programs interfaced by CING
| Program | O | Version | References | Url (http) |
|---|---|---|---|---|
| CING | − | r1136 | N.a. |
|
| Cython | − | 0.15 | N.a. |
|
| Ghostscript | − | 9.04 | N.a. |
|
| ImageMagick | − | 6.7.3-1 | N.a. |
|
| MatplotLib | − | 1.0.1 | Hunter ( |
|
| Python | − | 2.7 | Lutz ( |
|
| Analysis | + | 2.1.5 | Vranken et al. ( |
|
| AQUA | + | 3.2 (r15) | Laskowski et al. ( |
|
| CCPN | + | r6249 | Vranken et al. ( |
|
| DSSP | + | 2010-04-01 | Hooft et al. ( |
|
| MolMol | + | 2K.2 | Koradi et al. ( | No longer supported |
| PROCHECK-NMR | + | 3.5.4 | Laskowski et al. ( |
|
| PyMol | + | 1.2r1 | DeLano and Bromberg ( |
|
| Queeny | + | r1076 | Nabuurs et al. ( |
|
| SHIFTX | + | 1.1.0 | Zhang et al. ( |
|
| Talos+ | + | 1.01 | Shen et al. ( |
|
| VASCO | + | r6249 | Rieping and Vranken ( |
|
| Wattos | + | r154 | Doreleijers et al. ( |
|
| WHAT_CHECK | + | 2010-08-16 | Hooft et al. ( |
|
| Xplor-NIH | + | 2.26 | Schwieters et al. ( |
|
| YASARA | + | 11.6.1 | Joosten et al. ( |
|
The maximum version of any instalment is listed. For example, the Python version for CING is 2.7 although version 2.5 is also supported. The optional status (second column labelled O) is based on the most basic CING functionality excluding plugins but including plotting features
ROG scoring criteria
| Entity | Program | Property | Orange | Red |
|---|---|---|---|---|
| Molecule | CING | ROGa | %G ≤ 20 + %R | %G ≤ %R − 20 |
| Chain | CING | Propagated from residue entitiesb | ||
| Residue | CING | Omega deviation (°)c | 9.4 | 14.1 |
| Residue | CING | D1D2 plot (Z)d | −1.0 | −0.8 |
| Residue | WHAT_CHECK | Ramachandran (Z)d | −1.0 | −1.3 |
| Residue | WHAT_CHECK | Janin plot (Z)d | −0.9 | −1.2 |
| Residue | PROCHECK_NMR | G-factore | −1.0 | −1.3 |
| Atom | CING | Propagated from CS entitiesb | ||
| Peak-list | CING | Propagated from peak entitiesb | ||
| Peak | CING | Linkingf | Unassigned atom | |
| Peak | CING | CS assignment (Z)g | 1 | 2 |
| CS-list | CING | Propagated from CS entitiesb | ||
| CS | CING | CS assignment (Z)h | 3 | |
| CS | CING | No coordinates | No coordinates | |
| CS | CING | Leucine side chaini | Inconsistency | |
| CS | CING | Proline omegaj | Inconsistency | |
| CS | CING | Assignment issuesf | Various | |
| DR-list | CING | Propagated from DR entitiesb | ||
| DR | CING | Max. violation (Å)k,l | 0.3 | 0.5 |
| DR | CING | RMS violation (Å)l | 0.15 | 0.3 |
| DR | CING | Presence atoms | No coordinates | |
| DH-list | CING | Propagated from DH entitiesb | ||
| DH | TALOS+/CING | Max. violation (°)l | 3 | 10 |
| DH | CING | RMS violation (°)l | 3 | 5 |
| DH | CING | Presence atoms | No coordinates | |
| RDC-list | CING | –m | ||
| RDC | CING | –m |
ROG scoring criteria at each entity level. The Program column denotes the program used for the assessment. See footnotes for the rationales of the cut off criteria values used. CS denotes chemical shift, DR distance restraint, DH dihedral angle restraint and RDC residual dipolar coupling restraint
aThe residue critiques (line items 3 through 7) propagate to the molecule level by evaluating the listed inequalities for orange and red scoring, using percentages of residues with a red (%R) and green (%G) ROG score. Only the well-defined residues, as determined by the CV-criterion (see methods) were included in the Molecule criterion. Results from data entities are not included in the Molecule ROG score
bEntity obtains the worst propagated ROG score. The residue critiques are the only items that cascade up to the Molecule level, in other words, e.g. the experimental data critiques do currently not affect the overall ROG score
cThe omega deviation is calculated as an average over the ensemble with the references values for cis and trans peptide bonds values taken from (Wilson et al. 1998). The cut offs are 3 and 4 SD removed from those averages
dThe unit for these criteria is the number of standard deviations denoted Z. The cut off were determined by manually examining a large number of examples. See text for a short introduction to the D1 and D2 dihedrals
eManually determined cut off
fVarious assignment issues are scored orange for e.g. the presence of multiple assignments, missing assignments and invalid stereospecific assignments
gThe standard deviation for the chemical shift assignment of peaks was assumed to be a uniform 0.01 ppm for 1H and 0.15 ppm for all other nucleii 15N, 13C, and 31P. The uncertainties on the observed CS have not been considered
hThe CS are flagged with respect to the BMRB derived database values
iThe Leucine CS are compared for consistency with the side chain conformation (Doreleijers et al. 2011; Mulder 2009)
jThe Proline CS are compared for consistency with the peptide bond conformation as described in the text as based on (Shen and Bax 2009)
kThe maximum restraint violation in any member of the ensemble
lCommonly used cut off, e.g. in Xplor-NIH analysis scripts
mRDCs are currently not validated and do not receive a ROG score
Fig. 2Overview of the CING analysis for PDB entry 2kq3 (Wang et al. 2010). a Project page of entry 2kq3. b Ribbon representation of the first conformer of the 2kq3 ensemble, colour-coded according to residue ROG score. Residues not included by the range selection are coloured grey. c Per-residue circular variance and positional RMSD values as function of residue number. DSSP-derived secondary structure analysis is shown on top. Coloured bars indicate relative solvent accessibility of each residue, as calculated by the WHAT_CHECK’s INOCHK routine, where red indicates “more exposed than usual” and blue means “more buried than usual”. Residues not included by the range selection are coloured grey
Fig. 3Residue analysis of 2kq3 residue Leu14. a The CING Residue page for Leu14. Structure analysis is displayed on the left and shown in detail in panels (b)–(d); experimental data involving Leu14 shown on the right. b Ramachandran plot of Leu14. Residue specific background colouring derived from the WHAT_CHECK protein reference database for helical (blue); β-sheet (yellow) and coil (green) regions as determined by DSSP. Grey areas define low-density transition regions. Experimental ϕ/ψ restraints are indicated by the transparent orange box. Experimental ϕ/ψ values of the individual members of the ensemble are indicated by green plus signs; its average value by an open circle. Open square box denotes an automatically Talos+ derived ϕ/ψ region on the basis of the experimental chemical shifts. c Janin plot of Leu14; colouring as in (b). D) D1D2 plot (see text) of Leu14; colouring as in (b)
Fig. 4CING analysis of experimental data. Expandable, searchable, tabular displays are used, that can be sorted on any column. Table entries are directly hyperlinked to the corresponding Residue and Atom pages. a ‘Distance restraints’ page. Check boxes allow for additional information to be displayed. Only critiqued restraints are shown. b Atoms page. Only critiqued atoms are shown
Fig. 5Comparison of the fraction of well-defined residues with a CING ROG score green (good) versus the fraction of residues that fall in the most favoured region of the Ramachandran plot according to PROCHECK-NMR. The horizontal line at 20 % indicates the CING cut off for the minimum percentage of green residues, beyond which the molecule as a whole is flagged red regardless of the allocation of orange and red residues (see Table 2). The plot shows a significant number of entries on the bottom right for which a high percentage of ‘PC core’ residues are not ‘green’ in CING. Data is shown for the 6,383 NMR PDB entries selected from NRG-CING to have at least ten models and 30 or more amino acid residues. Indicated with an open circle (at 77.9 %) is PDB entry 2kq3 that is discussed as an example in the text