| Literature DB >> 23180575 |
Pieter M S Hendrickx1, Aleksandras Gutmanas, Gerard J Kleywegt.
Abstract
We describe Vivaldi (VIsualization and VALidation DIsplay; http://pdbe.org/vivaldi), a web-based service for the analysis, visualization, and validation of NMR structures in the Protein Data Bank (PDB). Vivaldi provides access to model coordinates and several types of experimentalEntities:
Mesh:
Substances:
Year: 2013 PMID: 23180575 PMCID: PMC3618379 DOI: 10.1002/prot.24213
Source DB: PubMed Journal: Proteins ISSN: 0887-3585
Data Coverage
| Source | PDB archive | NMR entries | |
|---|---|---|---|
| Total number of entries | PDB | 84,508 | 9616 |
| Unique UniProt | SIFTS | 28,104 | 4822 |
| Pfam | SIFTS | 6143 | 2455 |
| CATH | SIFTS | 2549 | 660 |
| SCOP | SIFTS | 4191 | 1150 |
| Cluster analysis | OLDERADO | − | 7289 |
| Chemical shift analysis | VASCO | − | 3361 |
| Distance constraints | BMRB-NRG | − | 5978 |
| Dihedral constraints | BMRB-NRG | − | 3850 |
| RDCs | BMRB-NRG | − | 602 |
| Validation scores | NRG-CING | − | 9491 |
Coverage of the protein universe in the PDB, and data available in Vivaldi for NMR entries (as of September 12, 2012). Up to date data coverage statistics can be found at http://pdbe.org/nmrstats/.
Figure 1Layout of a Vivaldi page showing the default view for protein PA1076 from Pseudomonas aeruginosa (PDB entry 2k4v).15 The page contains a header including PDBprints,9 an interactive 3D viewer (OpenAstexViewer),48 a graph and a textual information section. The most representative model according to OLDERADO14 cluster analysis is displayed in the 3D viewer and the graph shows the NRG-CING47 red-orange-green (ROG) scores for this protein.
Figure 2Vivaldi visualization of experimental data for protein PA1076 from Pseudomonas aeruginosa (PDB entry 2k4v).15 (a,b) Presentation of VASCO28 analysis of chemical shifts. (a) Nuclei with unusual chemical shifts are shown as spheres in the 3D viewer; aromatic residues are shown as sticks, to remind the user that the effect of aromatic ring currents is not explicitly accounted for in the VASCO analysis. (b) The same information is displayed in an interactive graph vs. residue number. (c,d) Analysis of distance constraints. (c) The most representative model of the ensemble is shown, colored by rigid-body domains as determined by OLDERADO.14 All long-range distance constraints (five or more residues apart in sequence) are shown as green sticks in the 3D viewer. (d) Graph of the number of long-range distance constraints per residue. The shaded area of the graph corresponds to helix 91–105 (UniProt numbering). (e) The chemical groups, for example N–HN, for which RDCs are available are shown as balls and sticks. Principal axes of the alignment tensor are also displayed. (f) Correlation plot of calculated vs. experimental RDCs. For each datapoint, the middle bar shows the calculated RDC averaged over the ensemble, and the box represents the standard deviation, while the whiskers are the minimum and maximum values calculated from the ensemble.
Figure 3Comparison of two NMR solution structures for protein HP_0495 from Helicobacter pylori (PDB codes 2joq, left, and 2h9z, right). (a,c) The 3D viewers show the most representative model of each ensemble, colored by the NRG-CING red-orange-green scores, which are calculated for the entire ensemble. (b) Superposition of the most representative models from the two structures, highlighting the differences in the conformation of helix I and adjacent loops. Image created in VMD54 (d,e). Graphs of per-residue WHATIF scores reporting unusual short distances in the entire ensemble. (f,g) Graphs of the number of long-range distance constraints for each residue. Long-range constraints are defined as connecting two atoms that are five or more residues apart in the amino-acid sequence.