| Literature DB >> 25478847 |
Nicholas K Sauter1, Johan Hattne1, Aaron S Brewster1, Nathaniel Echols1, Petrus H Zwart1, Paul D Adams1.
Abstract
X-ray diffraction patterns from still crystals are inherently difficult to process because the crystal orientation is not uniquely determined by measuring the Bragg spot positions. Only one of the three rotational degrees of freedom is directly coupled to spot positions; the other two rotations move Bragg spots in and out of the reflecting condition but do not change the direction of the diffracted rays. This hinders the ability to recover accurate structure factors from experiments that are dependent on single-shot exposures, such as femtosecond diffract-and-destroy protocols at X-ray free-electron lasers (XFELs). Here, additional methods are introduced to optimally model the diffraction. The best orientation is obtained by requiring, for the brightest observed spots, that each reciprocal-lattice point be placed into the exact reflecting condition implied by Bragg's law with a minimal rotation. This approach reduces the experimental uncertainties in noisy XFEL data, improving the crystallographic R factors and sharpening anomalous differences that are near the level of the noise.Entities:
Keywords: X-ray free-electron lasers; single-shot exposures
Mesh:
Year: 2014 PMID: 25478847 PMCID: PMC4257623 DOI: 10.1107/S1399004714024134
Source DB: PubMed Journal: Acta Crystallogr D Biol Crystallogr ISSN: 0907-4449
Figure 1Relation between the observation and prediction of Bragg spots. (a) The aim of data processing is to exactly predict the Bragg spots that are actually recorded. (b) Definition of the laboratory coordinate system, with incident X-rays traveling in the −z direction and rotations R and R along the three principal axes. Only the rotation R has a direct effect on Bragg spot positions.
Figure 2The relationship between mosaicity and mosaic block size. Ewald sphere diagram identifying the reciprocal-lattice points (gold) actually observed in the diffraction pattern. A goal of modeling is to adjust the effective mosaicity (a) and effective mosaic domain size (b) together so as to bring all the observed points into contact with the sphere of reflection of radius 1/λ (where λ is the wavelength), but not the unobserved (blue) points. (a) Mutual rotation of mosaic blocks spreads the points into concentric arcs (spherical caps in three dimensions) subtending a constant angle η at the reciprocal-lattice origin O, with η interpreted as the full-width effective mosaicity. A lattice point diffracts if its centroid (midpoint) can be brought onto the sphere of reflection with a rotation Δψ ≤ η/2. (b) Expansion of the reciprocal-lattice points into constant-sized spheres, reflecting the finite size of mosaic blocks (Nave, 1998 ▶) or, equivalently, the domain-size broadening (Scherrer, 1918 ▶). The sphere diameter α is inversely proportional to the effective mosaic block size D eff. The sphere size illustrated in (b) falls short of that needed to completely model the observed reflections.
Figure 3The construction of Δψ. As stated in the text, the sign of the rotation Δψ bringing Q onto the Ewald sphere is considered to be negative if Q is outside the sphere (as shown) or positive if it is inside.
Figure 4Probability of the observation Δψ given the model Δψmodel.
Figure 5cctbx.xfel data-processing workflow. Steps leading to integrated data are listed in black, while choices that are under user control are listed in red. Program parameters controlling these choices are given in a tutorial at http://cci.lbl.gov/xfel.
Processing outcome on measured XFEL still shots from thermolysin
| Previous work | Equation (1) | Equations (1) | Best practice | |
|---|---|---|---|---|
| Protocol | NM | 4 | 5 | 6 |
| Refinement target | ||||
| Initial triclinic cell | Equation (1) | Equation (1) | Equation (2) | Equation (2) |
| Constrained hexagonal cell | Equation (2) | Equation (1) | Equation (1) | Equation (2) |
| Fitting of mosaicity and block size | Least squares | Maximum likelihood | Maximum likelihood | Maximum likelihood |
| Indexing results | ||||
| Total No. of hits with >15 Bragg spots | 14041 | 14041 | 14041 | 14041 |
| No. of integrated and merged lattices | 11151 | 12097 | 11958 | 12551 |
| Model accuracy | ||||
| Half-width mosaicity () | 0.471 | 0.292 | 0.286 | 0.168 |
| Mosaic block size () | 2920 | 4320 | 4320 | 4220 |
| Integrated data results | ||||
| Individual image CC (%) | 32.5 | 32.0 | 32.3 | 40.2 |
| No. of measurements, 512.2 | 5793963 | 6605566 | 6538120 | 5036076 |
| Positive measurements, 512.2 | 3893827 | 4297065 | 4265829 | 3626262 |
| Negative measurements (%) | 33 | 35 | 35 | 28 |
| Structure-factor merging | ||||
| Unique Miller indices, 512.2 | 17156 | 17198 | 17193 | 17297 |
| Multiplicity of observation | 222 | 245 | 243 | 207 |
| Completeness (%) | 97.9 | 98.2 | 98.2 | 98.8 |
|
| 41.1 | 36.1 | 36.7 | 56.7 |
| CCiso
| 90.1 | 86.8 | 86.6 | 94.7 |
|
| 22.5 | 23.6 | 23.4 | 18.0 |
| Structure-factor quality tests | ||||
| | | 0.340 | 0.302 | 0.304 | 0.376 |
|
| 0.169 | 0.137 | 0.138 | 0.202 |
|
| 0.159 | 0.201 | 0.196 | 0.121 |
|
| 0.238 | 0.271 | 0.265 | 0.198 |
| Quality of refined structure | ||||
|
| 21.9 | 24.5 | 24.2 | 20.6% |
|
| 27.9 | 29.6 | 29.8 | 26.0 |
| Zn2+ anomalous difference map peak height () | 3.5 | 2.9 | 3.0 | 5.9 |
This column replicates the method in our previous publication (Hattne et al., 2014 ▶) used to derive the thermolysin structure (PDB entry 4ow3).
For the thermolysin data analysis, candidate Bragg spots were chosen with a minimum spot area of two square pixels.
Processing outcome on simulated PSI data with different protocols
| Non-optimal spotfinding and indexing | Equation (1) | Equations (1) | Best practice | |||
|---|---|---|---|---|---|---|
| Protocol | 1 | 2 | 3 | 4 | 5 | 6 |
| Refinement target | ||||||
| Initial triclinic cell | Equation (1) | Equation (1) | Equation (1) | Equation (1) | Equation (2) | Equation (2) |
| Constrained hexagonal cell | Equation (1) | Equation (1) | Equation (1) | Equation (1) | Equation (1) | Equation (2) |
| Indexing practices | ||||||
| Spotfinder spot area (pixels) | 2 | 1 | 1 | 1 | 1 | 1 |
| Spotfinder method 2 cutoff (%) | 5 | 20 | 5 | 5 | 5 | 5 |
| Target unit cell | Provided | Provided | Not given | Provided | Provided | Provided |
| Indexing results | ||||||
| Total No. of images | 20000 | 20000 | 20000 | 20000 | 20000 | 20000 |
| No. of integrated and merged images | 19706 | 19490 | 18926 | 19608 | 19998 | 19984 |
| Model accuracy | ||||||
| R.m.s. | 0.039 | 0.041 | 0.040 | 0.017 | 0.017 | 0.017 |
| R.m.s. | 0.379 | 0.586 | 1.108 | 0.584 | 0.083 | 0.031 |
| R.m.s. total angular misorientation () | 0.381 | 0.588 | 1.109 | 0.584 | 0.085 | 0.035 |
| Median total angular misorientation () | 0.130 | 0.087 | 0.134 | 0.078 | 0.054 | 0.021 |
| No. of outliers >0.1 misoriented | 13085 | 9264 | 13136 | 7971 | 4566 | 172 |
| False Bragg predictions, 153.5 (%) | 65.0 | 54.3 | 64.7 | 51.6 | 40.0 | 9.1 |
| Unmodeled Bragg spots, 153.5 (%) | 15.4 | 10.9 | 13.3 | 10.4 | 10.0 | 8.0 |
| Half-width mosaicity | 0.101 | 0.080 | 0.098 | 0.050 | 0.025 | 0.000 |
| Mosaic block size | 5160 | 4660 | 4960 | 4660 | 4780 | 5100 |
| Integrated data results | ||||||
| Individual image CC (%) | 51.6 | 54.1 | 50.6 | 56.3 | 61.2 | 70.1 |
| No. of measurements, 153.5 | 38398465 | 35264088 | 38755029 | 36948917 | 32210606 | 22817281 |
| Positive measurements, 153.5 | 26322687 | 25323211 | 26836439 | 26779348 | 24458975 | 19303707 |
| Negative measurements (%) | 31 | 28 | 31 | 28 | 24 | 15 |
| Structure-factor merging | ||||||
| Unique Miller indices, 153.5 | 92204 | 92204 | 92204 | 92204 | 92204 | 92204 |
| Multiplicity of observation | 286 | 275 | 291 | 290 | 265 | 209 |
| Completeness (%) | 100 | 100 | 100 | 100 | 100 | 100 |
|
| 33.4 | 36.0 | 33.8 | 38.0 | 40.9 | 46.0 |
| CCiso
| 96.6 | 96.6 | 95.8 | 96.8 | 97.5 | 99.0 |
|
| 36.7 | 33.0 | 35.5 | 32.1 | 27.7 | 18.1 |
| Structure-factor quality tests | ||||||
| | | 0.270 | 0.299 | 0.282 | 0.301 | 0.320 | 0.358 |
|
| 0.109 | 0.131 | 0.118 | 0.132 | 0.148 | 0.182 |
|
| 0.263 | 0.223 | 0.245 | 0.222 | 0.197 | 0.154 |
|
| 0.364 | 0.332 | 0.341 | 0.330 | 0.306 | 0.243 |
cctbx.xfel now runs protocol 6 by default, while the other protocols may be accessed by changing the program parameters described at http://cci.lbl.gov/xfel.
Half-width mosaicity and mosaic block size were fitted by the maximum-likelihood approach outlined in Appendix B. The values reported here are D eff and 1/, respectively, where is the average over all merged images.
Figure 6Agreement of model and data. ‘True knowledge’ of the system used for the PSI simulation gives the fraction of Bragg spots falsely predicted (a) and the fraction of Bragg spots in the simulation that remain unmodeled (b) for each of the six protocols listed in Table 1 ▶. Integrated data from all simulated images are grouped into reciprocal-space shells of equal volume ranging from 15 to 3.5 Å.
Figure 7Two methods for fitting the mosaicity and mosaic block size. Δψ values for bright spots from a single thermolysin still image (blue circles) are plotted as a function of the diffraction angle 2θ, which is inversely related to the resolution d by Bragg’s law. A wider spread of Δψ values is observed at low 2θ. The best integrated intensities are obtained by finding the function Δψmodel (green curve) that minimally envelopes the spots. (16) breaks Δψmodel into a resolution-dependent term containing the effective mosaic block size D eff (inner red curve) and a peripheral zone of constant width determined by the effective mosaic spread η. Alternate algorithms determine these parameters either by (a) least-squares fit of the |Δψ|max values determined for resolution bins or (b) maximum-likelihood treatment of all of the data. Approach (b) consistently gives more realistic fits with smaller η and larger D eff values. Plots reflect refinement results from protocol 6 (Table 2 ▶).