| Literature DB >> 30575586 |
Max T B Clabbers1, Tim Gruene2, Eric van Genderen2, Jan Pieter Abrahams1.
Abstract
Compared with X-rays, electron diffraction faces a crucial challenge: dynamical electron scattering compromises structure solution and its effects can only be modelled in specific cases. Dynamical scattering can be reduced experimentally by decreasing crystal size but not without a penalty, as it also reduces the overall diffracted intensity. In this article it is shown that nanometre-sized crystals from organic pharmaceuticals allow positional refinement of the hydrogen atoms, even whilst ignoring the effects of dynamical scattering during refinement. To boost the very weak diffraction data, a highly sensitive hybrid pixel detector was employed. A general likelihood-based computational approach was also introduced for further reducing the adverse effects of dynamic scattering, which significantly improved model accuracy, even for protein crystal data at substantially lower resolution. open access.Entities:
Keywords: dynamical scattering; electron diffraction; hybrid pixel detector; hydrogen atoms; nanocrystals
Mesh:
Substances:
Year: 2019 PMID: 30575586 PMCID: PMC6302931 DOI: 10.1107/S2053273318013918
Source DB: PubMed Journal: Acta Crystallogr A Found Adv ISSN: 2053-2733 Impact factor: 2.290
Data-merging statistics
Reference values of unit-cell dimensions for IRELOH were obtained from Dai et al. (2010 ▸) and from Deffieu et al. (1977 ▸) for EPICZA. Values in parentheses correspond to the highest resolution shell, and data were truncated at approximately I/σ(I) > 1.0 and CC 1/2 > 50% (Karplus & Diederichs, 2012 ▸; Diederichs & Karplus, 2013 ▸), see also Fig. S4 in the supporting information.
| IRELOH | EPICZA | |
|---|---|---|
| Literature | ||
| Chemical formula | C16H18O5 | C18H16N2O6S2·H2O |
| Space group |
|
|
| Unit-cell dimensions | ||
|
| 8.06, 10.00, 17.73 | 11.11, 12.61, 13.49 |
| α, β, γ (°) | 90.00, 90.00, 90.00 | 90.00, 90.00, 90.00 |
| Data integration | ||
| Number of crystals | 3 | 4 |
| Space group |
|
|
| Unit-cell dimensions | ||
|
| 8.090 (2), 9.940 (2), 17.700 (4) | 11.080 (2), 12.580 (2), 13.440 (3) |
| α, β, γ (°) | 90.00, 90.00, 90.00 | 90.00, 90.00, 90.00 |
| Resolution (Å) | 17.60–0.82 (0.85–0.82) | 12.63–0.83 (0.85–0.83) |
|
| 6.47 (2.50) | 5.93 (1.45) |
|
| 98.2 (46.2) | 98.6 (67.7) |
|
| 16.6 (50.0) | 20.6 (28.6) |
|
| 18.5 (61.2) | 22.2 (39.8) |
| Completeness (%) | 84.6 (78.9) | 89.6 (36.4) |
| Reflections | 6096 (352) | 12039 (70) |
| Unique observations | 1332 (135) | 1761 (59) |
See Tables S2 and S3 in the supporting information for the data-processing statistics of the individual crystal data sets.
Figure 1Structure determination of IRELOH. (a) The model after phasing using the direct methods program SHELXT. Small icosahedra represent maxima in the difference map. Peak heights are colour coded in rainbow colours (purple = high, red = low). Icosahedra that indicate the positions of individual hydrogen atoms are encircled. (b) Automated placement of the hydrogen atoms by SHELXLE confirms the quality of the model’s geometry, placing all hydrogen atoms correctly except for H5. (c) Final model after manual building and anisotropic refinement (RIGU, XNPD 0.001) in SHELXL. (d) Unconstrained positional refinement of the hydrogen atoms. (e) ORTEP plot with the numbering for the non-hydrogen atoms of the final model. Parts (a)–(d) were drawn using SHELXLE with default contour levels of 2.7σ for the − difference map and 1.2σ for the map. The atomic displacement ellipsoids are colour coded black for carbon and red for oxygen, while the hydrogen atoms are represented in white.
Model building and refinement statistics without refining hydrogen atoms
| IRELOH | EPICZA | |
|---|---|---|
| Space group |
|
|
| Unit-cell dimensions | ||
|
| 8.015 (2), 10.015 (2), 17.703 (4) | 10.996 (2), 12.452 (2), 13.218 (3) |
| α, β, γ (°) | 90.00, 90.00, 90.00 | 90.00, 90.00, 90.00 |
| Parameters | 196 | 259 |
| Restraints | 156 | 267 |
| Reflections | 6096 | 12039 |
| Unique | 1332 | 1761 |
| Refinement | ||
|
| 16.8 (15.0) | 17.2 (15.4) |
|
| 19.7 | 21.0 |
|
| 36.9 | 39.0 |
| GooF | 1.100 | 1.109 |
| R.m.s.d. bond lengths (Å) | 0.022 (18) | 0.027 (19) |
| Dynamical corrections | ||
|
| 13.2 (12.2) | 12.7 (11.9) |
|
| 15.1 | 14.3 |
|
| 29.5 | 29.1 |
| GooF | 0.944 | 0.879 |
| R.m.s.d. bond lengths (Å) | 0.022 (17) | 0.025 (13) |
New unit-cell dimensions after lattice refinement, see also Tables S9 and S10 in the supporting information.
Enhanced rigid-bond restraints (RIGU) were applied for refinement in SHELXL (Thorn et al., 2012 ▸).
= − where the sum is over all reflections, values in parentheses show R1 for reflections .
R complete was calculated over all reflections with a 0.2% test set size as a robust and unbiased validation tool (Luebben & Gruene, 2015 ▸); since all data are included, R work is equivalent to R1.
R.m.s.d. for all non-hydrogen atoms, calculated against reference values from high-resolution X-ray models (Dai et al., 2010 ▸; Deffieu et al., 1977 ▸); see also Tables S5 and S7 in the supporting information.
Model building and refinement statistics after unconstrained refinement of hydrogen atoms
| IRELOH | EPICZA | |
|---|---|---|
| Space group |
|
|
| Unit-cell dimensions | ||
|
| 8.015 (2), 10.015 (2), 17.703 (4) | 10.996 (2), 12.452 (2), 13.218 (3) |
| α, β, γ (°) | 90.00, 90.00, 90.00 | 90.00, 90.00, 90.00 |
| Parameters | 245 | 305 |
| Restraints | 156 | 267 |
| Reflections | 6096 | 12039 |
| Unique | 1332 | 1761 |
| Refinement | ||
|
| 15.7 (13.9) | 16.6 (14.7) |
|
| 19.9 | 21.5 |
|
| 34.6 | 37.1 |
| GooF | 1.031 | 1.051 |
| R.m.s.d. bond lengths (Å) | 0.024 (18) | 0.030 (20) |
| R.m.s.d. hydrogen-bond lengths (Å) | 0.180 (72) | 0.259 (80) |
| Dynamical corrections | ||
|
| 12.5 (11.5) | 12.2 (11.4) |
|
| 15.2 | 14.5 |
|
| 28.1 | 28.0 |
| GooF | 0.907 | 0.851 |
| R.m.s.d. bond lengths (Å) | 0.022 (13) | 0.026 (13) |
| R.m.s.d. hydrogen-bond lengths (Å) | 0.073 (52) | 0.110 (56) |
New unit-cell dimensions after lattice refinement, see also Tables S9 and S10 in the supporting information.
Enhanced rigid-bond restraints (RIGU) were applied for refinement in SHELXL (Thorn et al., 2012 ▸).
= − where the sum is over all reflections, values in parentheses show R1 for reflections .
R complete was calculated over all reflections with a 0.2% test set size as a robust and unbiased validation tool (Luebben & Gruene, 2015 ▸); since all data are included, R work is equivalent to R1.
R.m.s.d. for all non-hydrogen atoms, calculated against reference values from high-resolution X-ray models (Dai et al., 2010 ▸; Deffieu et al., 1977 ▸).
R.m.s.d. for the idealized hydrogen-bond lengths after unconstrained refinement of the hydrogen positions (Gruene et al., 2014 ▸; Sheldrick, 2015a ▸); see also Tables S6 and S8 in the supporting information.
Figure 2Structure determination of EPICZA. (a) The model after phasing using the direct methods program SHELXT. Small icosahedra represent maxima in the difference map. Peak heights are colour coded in rainbow colours (purple = high, red = low). Icosahedra that indicate the positions of individual hydrogen atoms are encircled. (b) Automated placement of the hydrogen atoms by SHELXLE showing the quality of the model’s geometry, placing all hydrogen atoms apart from H3, H5 and H5B. (c) Final model after building and anisotropic refinement (RIGU, XNPD 0.001) in SHELXL. (d) Unconstrained positional refinement of the hydrogen atoms where the positions of H7 and H10 are unstable and move out of bounds. (e) ORTEP plot with the numbering for the non-hydrogen atoms of the final model. Parts (a)–(d) were drawn using SHELXLE with default contour levels of 2.7σ for the − difference map and 1.2σ for the map. The atomic displacement ellipsoids are colour coded black for carbon, red for oxygen, blue for nitrogen and yellow for sulfur, while the hydrogen atoms are represented in white.
Figure 3Likelihood-based corrections for dynamical scattering. (a) versus plot for IRELOH where is a hyperbolic function with an expected dynamical scattering error term of = 1.51 as determined by least-squares fitting, and where shows a perfect linear correlation. (b) Zoomed-in version on the lower intensity reflections, as marked with a grey box in (a). (c) as a function of the resolution, where we fitted an exponential curve with parameters a = 3.64, b = 0.058, c = −24.67 and d = −2.38. (d) versus plot after applying the likelihood-based corrections shows an improved correlation between and with an error of = 0.33.
Figure 4Likelihood-based corrections for dynamical scattering. (a) versus plot for EPICZA where is a hyperbolic function with an expected dynamical scattering error term of = 2.20 as determined by least-squares fitting, and where shows a perfect linear correlation. (b) Zoomed-in version of the same plot on the lower intensity reflections, as marked with a grey box in (a). (c) as a function of the resolution, where we fitted an linear model with parameters a = 8.61 and b = −6.98. (d) versus plot after applying the likelihood-based corrections shows an improved correlation between and with an error of = 0.43.
Figure 5Likelihood-based corrections for dynamical scattering for low-resolution protein data. (a) versus plot for hen egg-white lysozyme where is a hyperbolic function with an expected dynamical scattering error term of = 768.13 as determined by least-squares fitting, and where shows a perfect linear correlation. (b) as function of resolution. (c) Mean structure-factor amplitude as a function of the resolution; the same resolution bins were used as in (b) and error bars indicate the standard deviation. (d) versus plot after applying discrete likelihood-based corrections, showing an improved correlation between and with an error of = 53.3.
Refinement statistics and dynamical scattering corrections of low-resolution protein data from seven lysozyme data sets recorded and solved previously (Clabbers et al., 2017 ▸)
Experimental data are available online at https://doi.org/10.5281/zenodo.1250447. Values in parentheses correspond to the highest resolution shell, and the data were truncated at approximately I/σ(I) > 1.0 and CC 1/2 > 50% (Diederichs & Karplus, 2013 ▸).
| Lysozyme | |
|---|---|
| Data integration | |
| Space group |
|
| Unit-cell dimensions | |
|
| 104.56, 68.05, 32.05 |
| α, β, γ (°) | 90.0, 90.0, 90.0 |
| Number of crystals | 7 |
| Resolution (Å) | 57.03–2.11 (2.17–2.11) |
|
| 42.1 (57.2) |
|
| 90.4 (60.3) |
|
| 2.7 (1.0) |
| Completeness (%) | 62.1 (49.8) |
| Reflections | 41191 (1462) |
| Unique reflections | 8560 (545) |
| Refinement | |
| Reflections | 8503 |
|
| 24.4 |
|
| 29.1 |
| 〈 | 33.02 |
| R.m.s.d. bond lengths (Å) | 0.074 |
| R.m.s.d. bond angles (°) | 1.0706 |
| Ramachandran | |
| Favoured, allowed, outliers (%) | 98.4, 1.6, 0.0 |
| Dynamical corrections | |
| Reflections | 8503 |
|
| 24.3 |
|
| 26.2 |
| 〈 | 41.09 |
| R.m.s.d. bond lengths (Å) | 0.066 |
| R.m.s.d. bond angles (°) | 1.0072 |
| Ramachandran | |
| Favoured, allowed, outliers (%) | 98.0, 2.0 0.0 |
We present R1 and R complete instead of R work and R free. With less than 10 000 unique reflections R complete is preferred over R free since it is calculated from all reflections (Brünger, 1997 ▸; Luebben & Gruene, 2015 ▸). Since all structure factors are used in turn, this leads to a more robust calculation than R free. With this validation method, the actual refinement uses all reflections, hence R work is equivalent to R1.