| Literature DB >> 28009542 |
Charles S Bury1, Ian Carmichael2, Elspeth F Garman1.
Abstract
During macromolecular X-ray crystallography experiments, protein crystals held at 100 K have been widely reported to exhibit reproducible bond scission events at doses on the order of several MGy. With the objective to mitigate the impact of radiation damage events on valid structure determination, it is essential to correctly understand the radiation chemistry mechanisms at play. OH-cleavage from tyrosine residues is regularly cited as amongst the most available damage pathways in protein crystals at 100 K, despite a lack of widespread reports of this phenomenon in protein crystal radiation damage studies. Furthermore, no clear mechanism for phenolic C-O bond cleavage in tyrosine has been reported, with the tyrosyl radical known to be relatively robust and long-lived in both aqueous solutions and the solid state. Here, the initial findings of Tyr -OH group damage in a myrosinase protein crystal have been reviewed. Consistent with that study, at increasing doses, clear electron density loss was detectable local to Tyr -OH groups. A systematic investigation performed on a range of protein crystal damage series deposited in the Protein Data Bank has established that Tyr -OH electron density loss is not generally a dominant damage pathway in protein crystals at 100 K. Full Tyr aromatic ring displacement is here proposed to account for instances of observable Tyr -OH electron density loss, with the original myrosinase data shown to be consistent with such a damage model. Systematic analysis of the effects of other environmental factors, including solvent accessibility and proximity to disulfide bonds or hydrogen bond interactions, is also presented. Residues in known active sites showed enhanced sensitivity to radiation-induced disordering, as has previously been reported.Entities:
Keywords: Fourier difference maps; electron density loss; radiation chemistry; specific damage; tyrosine
Mesh:
Substances:
Year: 2017 PMID: 28009542 PMCID: PMC5182017 DOI: 10.1107/S1600577516016775
Source DB: PubMed Journal: J Synchrotron Radiat ISSN: 0909-0495 Impact factor: 2.616
Selection of MX damage series (all conducted at 100 K) previously deposited in the PDB, complemented with one novel GH7 family protein damage series (deposited in conjunction with this study) and one unpublished insulin data set. See §2 of the supporting information for data collection and refinement of the GH7 series
For series in which intermediate datasets fall below the 1 MGy selection criterion, PDB accession codes are only provided for datasets included in the current analysis.
| Protein | PDB accession codes | Resolution for first dataset (Å) | Quoted total molecular weight (kDa) | Unique Tyr residues in asymmetrical unit | Publication |
|---|---|---|---|---|---|
| Acetylcholinesterase ( |
| 2.05 | 60.7 | 17 | Weik |
| Thermolysin ( |
| 2.2 | 34.6 | 28 | Juers & Weik (2011 |
| Myrosinase ( |
| 2.0 | 62.4 | 37 | Burmeister (2000 |
| Malate dehydrogenase ( |
| 2.0 | 131.6 | 32 | Fioravanti |
| Chicken egg white lysozyme ( |
| 2.0 | 14.6 | 3 | De la Mora |
| Chicken egg white lysozyme ( |
| 1.2 | 14.5 | 3 | Sutton |
| Pancreatic elastase ( |
| 1.2 | 26.1 | 11 | Petrova |
| Pancreatic elastase ( |
| 1.33 | 26.2 | 11 | Nanao |
| Insulin ( |
| 1.4 | 5.8 | 4 | Nanao |
| Lysozyme ( |
| 1.4 | 14.7 | 3 | Nanao |
| Ribonuclease A ( |
| 1.4 | 13.7 | 6 | Nanao |
| Thaumatin ( |
| 1.4 | 22.3 | 8 | Nanao |
| Trypsin ( |
| 1.2 | 24.0 | 10 | Nanao |
| C-protein-DNA complex ( |
| 2.8 | 59.7 | 8 | Bury |
|
|
| 1.98 | 204.1 | 22 | Bury |
| Phosphoserine aminotransferase |
| 1.68 | 81.8 | 28 | Dubnovitsky |
| Insulin ( | Undeposited damage series | 1.38 | 5.8 | 4 | Current publication |
| GH7 family cellobiohydrolase |
| 2.0 | 96.6 | 34 | Unpublished |
Figure 1Specific radiation damage to Tyr-330 and Glu-409 in the myrosinase damage series. F obs(5) − F obs(1) difference map, contoured at ±5σ in green/red, overlaid on the initial coordinate model in white (PDB accession code: 1dwa). Coordinate model and structure factor amplitudes were retrieved from PDB_REDO, and F obs(5) − F obs(1) maps were generated by FFT within the RIDL pipeline. Electron density loss is observable around a glycerol oxygen, presumably also due to the dislocation of the glycerol upon decarboxylation of Glu-409.
Figure 2Bdamage change for each Tyr –O atom in the highest dose myrosinase structure (PDB accession code: 1dwi) relative to the 1dwa structure. For each atom a, the relative change is calculated as [Bdamage(a 1dwi) − Bdamage(a 1dwa)]/Bdamage(a 1dwa). To account for non-unity occupancies in the originally deposited data, all atomic occupancies in the 1dwa and 1dwi coordinate models were set to 1 and a further round of isotropic B-factor refinement was performed in phenix.refine prior to calculating Bdamage.
Figure 3Density loss ranking for each damage series for distinct side-chain oxygen atom types occurring within each protein structure. Blank squares represent atom types not present within a given protein. The ranking scheme starts at 0 for the most damaged atom species within each protein, thus a large ranking number indicates low D loss. For clarity, each square is coloured by rank, with the corresponding colour key provided to the right of the grid. Superscripted proteins correspond to the following studies: †Nanao et al. (2005 ▸), ††Petrova et al. (2010 ▸), ‡Sutton et al. (2013 ▸). *Data collected from GH7 and insulin crystals (distinct from PDB_REDO collated series) are additionally included as part of the test set (see Table 1 ▸ for details).
Figure 4Cα-normalized D loss damage signature histogram plots over all atoms of specific residue types for (a, b) myrosinase, (c, d) malate dehydrogenase, (e, f) TRAP and (g, h) GH7. The normalization scheme is: Cα-normalized D loss(atom) = [ − ]/, where is the average calculated over the set of Cα atoms. Two residue types are directly compared in each plot: (a, c, e, g) Tyr and Phe, and (b, d, f, h) Asp and Asn. Gaussian kernel density estimates, which are non-parametric approximations of the probability density function underlying each D loss distribution, are overlaid on the histogram plots. Highest dose datasets for each series are displayed; other doses within each series exhibit qualitatively similar behaviour.
Figure 5Kolmogorov–Smirnov (KS) test statistics comparing similarity of different residue types for large protein structures (>60 kDa). For each damage series, box plots illustrate the distribution in KS statistic values across all the datasets in that series. A higher KS statistic indicates less similarity between the distribution of D loss values for a specified residue pairing.
Figure 6(a) Specific radiation damage to the myrosinase crystal as reported by Burmeister (2000 ▸). F obs(4) − F obs(1) difference map, contoured at ±3σ, overlaid on the initial coordinate model (PDB accession code: 1dwa). A secondary Tyr-330 side-chain conformation (occupancy: 50%) has been built into the positive F obs(4) − F obs(1) difference density manually using COOT. (b) Occupancy of primary and secondary Tyr-330 conformation after occupancy refinement (in phenix.refine, with atomic B-factors held constant) of the model pertaining to each successive dataset in the damage series. Doses were originally quoted in units of photons mm−2; diffraction weighted doses (DWD) (Zeldin, Brockhauser et al., 2013 ▸) have been calculated in RADDOSE-3D (Zeldin, Gerstel et al., 2013 ▸) for this damage series using crystal composition (heavy atom content, crystal size) and beam characteristics (energy, flux, exposure time and collimation) as supplied by Burmeister (2000 ▸).
Figure 7D loss(atom) metric as a function of dose for Tyr –OH atoms in (a) TRAP and (b) lysozyme (De la Mora et al., 2011 ▸). Tyr –OH residues exhibiting hydrogen bond interactions to Glu or Asp carboxyl groups are coloured blue. Plots for other tested protein series are provided in Fig. S1.4.
Figure 8Linear R-squared coefficients calculated for each MX damage series for correlation between Tyr –OH D loss and (a) solvent accessibility and (b) distance to closest disulfide bond. Box plots represent distribution in R-squared values for series containing multiple datasets. Only structures containing eight or more Tyr residues per asymmetric unit are included, and in (b) only structures containing disulfide bonds are included.