| Literature DB >> 28368276 |
Raghurama P Hegde1, Alexander A Fedorov2, J Michael Sauder3, Stephen K Burley4, Steven C Almo2, Udupi A Ramagopal1.
Abstract
Single-wavelength anomalous dispersion (SAD) utilizing anomalous signal from native S atoms, or other atoms with Z ≤ 20, generally requires highly redundant data collected using relatively long-wavelength X-rays. Here, the results from two proteins are presented where the anomalous signal from serendipitously acquired surface-bound Ca atoms with an anomalous data multiplicity of around 10 was utilized to drive de novo structure determination. In both cases, the Ca atoms were acquired from the crystallization solution, and the data-collection strategy was not optimized to exploit the anomalous signal from these scatterers. The X-ray data were collected at 0.98 Å wavelength in one case and at 1.74 Å in the other (the wavelength was optimized for sulfur, but the anomalous signal from calcium was exploited for structure solution). Similarly, using a test case, it is shown that data collected at ∼1.0 Å wavelength, where the f'' value for sulfur is 0.28 e, are sufficient for structure determination using intrinsic S atoms from a strongly diffracting crystal. Interestingly, it was also observed that SHELXD was capable of generating a substructure solution from high-exposure data with a completeness of 70% for low-resolution reflections extending to 3.5 Å resolution with relatively low anomalous multiplicity. Considering the fact that many crystallization conditions contain anomalous scatterers such as Cl, Ca, Mn etc., checking for the presence of fortuitous anomalous signal in data from well diffracting crystals could prove useful in either determining the structure de novo or in accurately assigning surface-bound atoms.Entities:
Keywords: SAD phasing; weak anomalous signal
Mesh:
Substances:
Year: 2017 PMID: 28368276 PMCID: PMC5379167 DOI: 10.1107/S2053230X17002680
Source DB: PubMed Journal: Acta Crystallogr F Struct Biol Commun ISSN: 2053-230X Impact factor: 1.056
Diffraction data
Unless otherwise mentioned, values in parentheses are for the highest resolution shell.
| Protein | PSPTO | PTO | HEWL |
|---|---|---|---|
| Beamline | X4A | X4A | X29A |
| No. of amino acids in asymmetric unit | 145 | 726 | 129 |
| No. of non-H atoms in asymmetric unit | 1190 | 5814 | 1001 |
| Anomalous scatterers | Ca (2), S (4) | Ca (7.3), S (30) | S (10) |
| Space group |
|
|
|
| Unit-cell parameters | |||
|
| 47.43 | 90.76 | 78.81 |
|
| 47.43 | 143.49 | 78.81 |
|
| 122.49 | 129.80 | 37.04 |
| Wavelength (Å)/energy (keV) | 0.979/12.664 | 1.743/7.113 | 1.075/11.533 |
| Resolution (Å) | 40.0–1.60 (1.63–1.60) | 50–2.08 (2.12–2.08) | 50–1.46 (1.49–1.46) |
| Total rotation range (°) | 270 | 360 | 357 |
| Anomalous multiplicity | 10.4 (6.2) | 6.3 (2.3) | 12.2 (4.7) |
| Multiplicity | 18.9 (11.6) | 12.0 (4.2) | 22.7 (8.8) |
| Completeness (%) | 99.2 (92.5) | 96.7 (67.0) | 99.7 (96.4) |
|
| 3.6 (21.6) | 5.0 (22.1) | 4.0 (14.8) |
|
| 2.15 | 4.03 | 2.0 |
|
| 1.2 (10.3) | 2.0 (17.2) | 1.2 (7.3) |
| CC1/2 in highest resolution shell | 0.959 | 0.900 | 0.977 |
| 〈 | 53.64 (8.14) | 34.07 (4.06) | 57.64 (13.04) |
|
| 13.8 | 21.5 | 12.9 |
| Solvent content | 0.38 | 0.48 | 0.35 |
|
| 0.56 (Ca), 0.24 (S) | 1.6 (Ca), 0.70 (S) | 0.28 (S) |
| 〈Δ | 0.57 | 1.6 | 0.61 |
For data set HEWLAll (see Table 3 ▸).
With six molecules in the asymmetric unit.
The value in parentheses is the number of anomalous scattering atoms in the asymmetric unit. In lysozyme there are eight cysteines and two methionines.
Multiplicity with Friedel pairs kept separate.
f′′ values were obtained from http://skuld.bmsc.washington.edu/scatter/AS_periodic.html.
Calculated using the formula 〈ΔF anom〉/〈F〉 = , where N A is the number of and f′′A is the imaginary scattering contribution of an anomalous scatterer of type i, N T is the total number of non-H atoms in the molecule and Z eff is the effective number of electrons of the ‘average’ protein atom (6.7; Wang et al., 2006 ▸).
Absorption edges of the anomalous scatterers observed in this study and the difference between the energy at the absorption edge and the energy at the wavelength used for data collection
The difference between f′′ values (Δf′′) at the absorption edge and the wavelength used for data collection is also included.
| Absorption edges | Difference in energy and | ||||||
|---|---|---|---|---|---|---|---|
| Atom | Wavelength (Å) | Energy (keV) |
| Protein | Energy used (keV) | Difference from absorption edge (keV) | Δ |
| Ca | 3.070 | 4.038 | 4.05 | PSPTO | 12.664 | 8.626 (Ca) | 3.49 (Ca) |
| S | 5.016 | 2.472 | 4.1 | PTO | 7.113 | 4.043 (Ca) | 2.45 (Ca) |
| HEWL | 11.533 | 9.061 (S) | 3.82 (S) | ||||
f′′ values were obtained from http://skuld.bmsc.washington.edu/scatter/AS_periodic.html.
Comparison of HEWL data sets
Values in parentheses are for the highest resolution shell.
| HEWLAll | Data set 1 | Data set 2 | Data set 3 | Data set 4 | Data set 5 | |
|---|---|---|---|---|---|---|
| Total rotation range (°) | 357 | 200 | 300 | 200 | 200 | 157 |
| Resolution (Å) | 50–1.46 (1.49–1.46) | 50–1.46 (1.49–1.46) | 50–1.46 (1.49–1.46) | 50–1.46 (1.49–1.46) | 50–1.46 (1.49–1.46) | 50–1.46(1.49–1.46) |
| Anomalous multiplicity | 12.2 (4.7) | 6.9 (2.6) | 10.3 (4.2) | 7.2 (3.5) | 7.1 (2.7) | 5.2 (2.2) |
| Completeness (%) | 99.7 (96.4) | 99.6 (95.9) | 99.0 (89.3) | 96.3 (74.3) | 99.7 (96.3) | 96.6 (89.1) |
|
| 4.0 (14.8) | 4.1 (13.3) | 4.0 (13.7) | 3.7 (12.5) | 3.6 (16.3) | 3.5 (7.3) |
|
| 2.04 | 2.21 | 2.33 | 2.50 | 2.22 | 2.91 |
| CC1/2 in highest resolution shell | 0.977 | 0.968 | 0.975 | 0.978 | 0.957 | 0.989 |
| 〈 | 57.6 (13.0) | 40.6 (8.8) | 50.0 (14.8) | 40.6 (14.2) | 47.3 (5.3) | 39.2 (13.8) |
| Wilson | 12.9 | 13.6 | 13.0 | 13.0 | 12.5 | 12.0 |
| Successful phasing | Yes | Yes | Yes | Yes | Yes | Yes |
| Average phase error before d.m. | 64.3 | 63.8 | 66.4 | 67.3 | 69.8 | 74.8 |
| Average phase error after 20 cycles of d.m. | 49.2 | 51.7 | 54.3 | 60.5 | 57.7 | 67.7 |
| Average phase error after d.m. | N/A | N/A | N/A | 48.7 | 45.3 | 42.9 |
| Map correlation before d.m. | 0.408 | 0.408 | 0.392 | 0.294 | 0.296 | 0.163 |
| Map correlation after 20 cycles of d.m. | 0.655 | 0.611 | 0.590 | 0.471 | 0.541 | 0.378 |
| Map correlation after d.m. | N/A | N/A | N/A | 0.647 | 0.714 | 0.725 |
The first 100 frames from the high-intensity data merged with the last 100 frames from the low-intensity data.
The first 150 frames from both data sets merged together.
The first 100 frames from both data sets merged together.
All 200 frames of the low-intensity data.
All 157 frames of the high-intensity data.
Multiplicity with Friedel pairs kept separate.
An electron-density map from which a near-complete model could be built was generated.
d.m. indicates density modification in SHELXE.
Tracing was not necessary for phasing and hence was not performed.
Figure 1CCall/CCweak from a SHELXD run of 1000 cycles representing 999 incorrect substructure solutions and one correct solution that resulted in successful phasing of data set 5 for HEWL, which has a data completeness of only ∼70% to 3.5 Å resolution owing to overloads, with an average multiplicity as low as 5.2.
Figure 2χ2 versus resolution plots for (a) PSPTO, (b) PTO and (c) HEWL. The blue and red lines represent χ2 and R factor, respectively, for the data with Friedel mates separated; orange and green lines represent χ2 and R factor, respectively, for the data with Friedel mates merged.
Figure 3A plot of 〈d′′/sig〉 versus resolution for the three proteins discussed in this paper along with representative selenomethionine data collected at the Se edge of a putative glucarate dehydratase from Acidaminococcus sp. D21 (PDB entry 4hyr). The values for HEWL correspond to the data set HEWLAll. The purple line is drawn at 〈d′′/sig〉 = 0.8, at which the signal is considered to be negligible. All three proteins show weak anomalous signal throughout the resolution range.
Figure 4A plot of CC1/2(anom) versus resolution for the three proteins discussed in this paper with representative selenomethionine data collected at the Se edge of a putative glucarate dehydratase from Acidaminococcus sp. D21 (PDB entry 4hyr). For HEWL, CC1/2(anom) versus resolution is plotted for both the data set with high-intensity and low-intensity data merged (HEWLAll) and for the weak-intensity data alone (HEWLweak, corresponding to data set 4 in Table 3 ▸).
Map correlation coefficients and phase errors
| Map correlation | Average phase error (°) | ||||
|---|---|---|---|---|---|
| Protein | Before d.m. | After d.m. | Before d.m. | After d.m. | Solvent content |
| PSPTO | 0.382 | 0.539 | 71.4 | 61.1 | 0.38 |
| PTO | 0.391 | 0.596 (0.730) | 70.6 | 61.5 (49.8) | 0.48 |
| HEWL | 0.396 | 0.6 | 66.8 | 52.7 | 0.35 |
Comparison between initial phases before density modification in SHELXE and the final phases (from the refined model).
Comparison between phases after density modification in SHELXE and the final phases.
Autotracing in SHELXE was required before these could be phased successfully. The value in parentheses is the phase error/map correlation after three cycles of autotracing with 20 cycles of density modification before each cycle of autotracing in SHELXE.
Corresponds to the HEWLAll data set.
Figure 5Anomalous maps around the sulfurs and chlorines in HEWL shown at (a) 3σ, (b) 5σ and (c) 7σ. The maps were calculated using the HEWLAll data set. The S atoms in the side chains are in yellow and the Cl atoms are in cyan. An anomalous difference Fourier map at 7σ is not seen around most Cl atoms. This figure was produced using PyMOL (Schrödinger).