| Literature DB >> 27710932 |
Wouter G Touw1, Bart van Beusekom2, Jochem M G Evers1, Gert Vriend1, Robbie P Joosten2.
Abstract
Many crystal structures in the Protein Data Bank contain zinc ions in a geometrically distorted tetrahedral complex with four Cys and/or His ligands. A method is presented to automatically validate and correct these zinc complexes. Analysis of the corrected zinc complexes shows that the average Zn-Cys distances and Cys-Zn-Cys angles are a function of the number of cysteines and histidines involved. The observed trends can be used to develop more context-sensitive targets for model validation and refinement.Entities:
Keywords: geometric restraints; protein zinc-binding site; refinement; validation; zinc metal-site geometry
Mesh:
Substances:
Year: 2016 PMID: 27710932 PMCID: PMC5053137 DOI: 10.1107/S2059798316013036
Source DB: PubMed Journal: Acta Crystallogr D Struct Biol ISSN: 2059-7983 Impact factor: 7.652
Figure 1R.m.s.Z for the five possible ZnCysHis site types. The scales on the two axes are different; black lines indicate the situation where the r.m.s.Z is the same for complexes in the PDB and after Zen remediation and re-refinement in PDB_REDO. Ligand atoms and site counts are indicated in the legend.
Figure 2Box-and-whisker plots of the Z-scores characterizing ZnCys3His1 complexes in PDB_REDO with Zen remediation (blue), PDB_REDO without Zen remediation (green) and original PDB (red) structure models. The whiskers extend to the nearest value that is within 1.5 times the inter-quartile range; outliers are marked as dots. The Z score for ‘Zn position’ indicates the deviation from the expected Zn position in the tetrahedron. 1411 outliers with a Z-score outside (−15, +15) are not shown for clarity. 891 of these outliers are from PDB structure models, while 476 and 44 outliers are from PDB_REDO entries without and with Zen remediation, respectively.
Figure 3ZnCysHis complexes before (left) and after PDB_REDO without (middle) and with (right) Zen remediation. Side chains are coloured by atom type; grey spheres are Zn ions. Figures were prepared with CCP4mg (McNicholas et al., 2011 ▸). Electron-density maps were omitted for clarity and are available from the PDB_REDO databank. (a) Zn300, chain A, from the 8-oxoguanine DNA glycosylase MutM (PDB entry 1l1z; 1.7 Å; Fromme & Verdine, 2002 ▸). Cys252 points away from the Zn ion. The LINK between Cys252 and Zn was not annotated in the PDB model. In the PDB_REDO models Cys252 Sγ has moved 2.7 Å. Arg251 was refitted to a more plausible conformation only after Zen detected the ZnCys4 site. (b) Zn203, chain I, from the RNA polymerase II–transcription factor IIB complex (PDB entry 1r5u; 4.5 Å; Bushnell et al., 2004 ▸). Zn203 is modelled far away from the centre of the four Sγ ligands. The presence of a LINK record between Zn and Cδ2 of Tyr34 and the absence of three Sγ—Zn LINK records in the PDB file precludes complex formation in a standard (re-)refinement. Correction of the Zn site required the Zn to move more than 5 Å. (c) Zn313, chain B, from aspartate transcarbamoylase (PDB entry 3d7s; 2.8 Å; Stieglitz et al., 2009 ▸). Several types of cysteine-bridge problems exist in the PDB (Evers et al., 2015 ▸), and the four cysteines next to Zn313 form an extreme example. Only three of the four necessary LINK records are specified in the original PDB file and at the same time superfluous SSBOND records are present for three of the six bridges shown. The cysteine clashes are almost resolved even without Zen processing thanks to the adaptations that were made to REFMAC as a result of our work. The additional restraints generated by Zen were necessary to refine the Zn position correctly. (d) Zn4001, chain D, from the DDB1–Cul4A–Rbx1–SV5V complex (PDB entry 2hye; 3.1 Å; Angers et al., 2006 ▸). The three cysteines and the histidine are not arranged tetrahedrally around Zn4001 and the three cysteines appear to form one big cysteine bridge. Without Zen remediation the r.m.s.Z is 9.69. The correct Cys42 rotamer was found during re-refinement after processing with Zen, allowing better refinement of the Zn and ligand positions (final r.m.s.Z of 1.09). The Zn4003 site is located close to the Zn4001 site and has a tetrahedral conformation. In the PDB entry the distance from the Cβ atom of Cys53 to Zn4001 is 4.38 Å, whereas the distance to Zn4003 is 4.20 Å. Zen detected correctly that Cys53 only coordinates Zn4003. (e) Zn61, chain B, from the box H/ACA ribonucleoprotein protein particle–RNA complex (PDB entry 3lwq; 2.7 Å; Zhou et al., 2010 ▸). Four cysteines are tightly connected near the Zn. In the PDB entry SSBOND records are present for these cysteines, while LINK records for the Zn are found to the backbone N atoms of Gly12 and Lys10. Normal ZnCys4 geometry is obtained in the Zen-processed PDB_REDO model. The ion has moved 3.5 Å. (f) Zn6, chain C, of the Simian virus 40 large T-antigen–human p53 complex (PDB entry 2h1l; 3.2 Å; Lilyestrom et al., 2006 ▸). For 12 of the 24 chains in the PDB model SSBOND records are specified between Cys302 and Cys305, while these two residues actually coordinate the Zn together with two histidines. The complex was refined correctly with and without processing by Zen. (g) Zn4, chain B, from the catalytic domain of human AMSH (PDB entry 3rzu; 2.5 Å; Davies et al., 2011 ▸). The coordination distances are too large. The distances in the PDB_REDO models were closer to the expected values.
Figure 4Zn—Sγ distance (top) and Sγ—Zn—Sγ angle (bottom) distributions as a function of the number of cysteines and histidines in ZnCysHis complexes determined at 1.6 Å resolution or better. The contours of the violin plots are kernel density estimates and the box plots are shown as in Fig. 2 ▸. The light grey background areas show one standard deviation around the refinement targets for the Zn—Sγ distance (2.340 ± 0.020 Å) and the Sγ—Zn—Sγ angle (109.5 ± 3.0°). The difference between the types of ZnCysHis complexes is significant (see Table 1 ▸). When Zn is coordinated by Nδ in ZnCys3His1 complexes, the Sγ—Zn—Sγ angle distribution is somewhat bimodal and partly depends on the rotameric state and backbone conformation of the cysteines.
Figure 5Zn1702, chain B, from jumonji H3K27 demethylase (PDB entry 4eyu; Kruidenier et al., 2012 ▸). mF o − DF c difference electron-density maps after a PDB_REDO run with (a) an isotropic B factor for Zn2+ (grey sphere) or (b) an anisotropic B factor for Zn2+ (grey thermal ellipsoid). The maps (positive, green mesh; negative, red mesh) are contoured at 3σ, are rendered with a grid size of 0.77 Å and for clarity are shown only in the vicinity of the Zn. The largest atomic displacement between any atom in this ZnCys4 complex between (a) and (b) is 0.16 Å.
Suggested refinement targets for the five possible ZnCysHis complex types
The targets have been derived from crystallographic structures determined at a resolution of 1.6 Å or better and are listed as mean ± standard deviation. Numbers in parentheses indicate the number of observations. For all targets a significant difference between means was observed across the types of ZnCysHis complexes [one-way ANOVA with a Welch correction for nonhomogeneity of variances (Welch, 1951 ▸): Zn—Sγ distance, F (3, 49.5) = 50.7, p = 4.1 × 10−15; Sγ—Zn—Sγ angle, F (2, 100.3) = 124.7, p << 10−15; Zn—N distance, F (2, 86.9) = 45.5, p = 3.1 × 10−14; N—Zn—N angle, F (1, 71.6) = 16.6, p = 1.2 × 10−4]. The same parameters derived from crystallographic structures determined at a resolution of 2.5 Å or better are given in Supplementary Table S2.
| Zn—Sγ (Å) | Sγ—Zn—Sγ (°) | Zn—N (Å) | N—Zn—N (°) | ZnCys |
|---|---|---|---|---|
| 2.330 ± 0.029 (1033) | 109.45 ± 5.46 (1553) | n/a | n/a | Cys4 |
| 2.318 ± 0.027 (912) | 112.15 ± 3.96 (912) | 2.074 ± 0.056 (303) | n/a | Cys3His1 |
| 2.306 ± 0.029 (76) | 116.23 ± 4.58 (38) | 2.040 ± 0.050 (65) | 102.38 ± 5.44 (38) | Cys2His2 |
| 2.298 ± 0.017 (12) | n/a | 2.002 ± 0.045 (36) | 107.23 ± 4.78 (36) | Cys1His3 |
| n/a | n/a | Insufficient data | Insufficient data | His4 |