| Literature DB >> 30821701 |
Abstract
A synthetic data set demonstrating a particularly challenging case of indexing ambiguity in the context of radiation damage was generated. This set shall serve as a standard benchmark and reference point for the ongoing development of new methods and new approaches to robust structure solution when single-crystal methods are insufficient. Of the 100 short wedges of data, only the first 36 are currently necessary to solve the structure by `cheating', or using the correct reference structure as a guide. The total wall-clock time and number of crystals required to solve the structure without cheating is proposed as a metric for the efficacy and efficiency of a given multi-crystal automation pipeline. open access.Entities:
Keywords: multi-microcrystallography; phasing; protein; radiation damage; simulation
Mesh:
Substances:
Year: 2019 PMID: 30821701 PMCID: PMC6400260 DOI: 10.1107/S2059798319001426
Source DB: PubMed Journal: Acta Crystallogr D Struct Biol ISSN: 2059-7983 Impact factor: 7.652
Simulated crystal volumes (µm3)
The true scale factor of the spots from each simulated data set is directly proportional to the simulated crystal volume, which was chosen randomly for each crystal. The actual values used in the simulation are listed here and may be used to check the accuracy of scaling programs as in Section 3.2 because no other variables such as the X-ray beam flux or even the structure factors were varied from crystal to crystal. The only remaining correction after this is the resolution-dependent scale factor of the simulated radiation damage described in Section 3.3.
| Crystal | Volume | Crystal | Volume | Crystal | Volume | Crystal | Volume | Crystal | Volume |
|---|---|---|---|---|---|---|---|---|---|
| 001 | 225 | 021 | 139 | 041 | 132 | 061 | 50.3 | 081 | 105 |
| 002 | 56.3 | 022 | 232 | 042 | 234 | 062 | 99.5 | 082 | 230 |
| 003 | 63.9 | 023 | 155 | 043 | 46.9 | 063 | 196 | 083 | 171 |
| 004 | 220 | 024 | 114 | 044 | 75.9 | 064 | 102 | 084 | 122 |
| 005 | 186 | 025 | 38.4 | 045 | 51.6 | 065 | 229 | 085 | 56.8 |
| 006 | 89.2 | 026 | 155 | 046 | 89.1 | 066 | 161 | 086 | 90.5 |
| 007 | 52.2 | 027 | 46.7 | 047 | 230 | 067 | 72.4 | 087 | 90.2 |
| 008 | 249 | 028 | 60.7 | 048 | 56.7 | 068 | 14.5 | 088 | 171 |
| 009 | 185 | 029 | 70.7 | 049 | 97.8 | 069 | 131 | 089 | 186 |
| 010 | 110 | 030 | 166 | 050 | 153 | 070 | 37.5 | 090 | 128 |
| 011 | 166 | 031 | 143 | 051 | 237 | 071 | 207 | 091 | 42.2 |
| 012 | 121 | 032 | 132 | 052 | 87.4 | 072 | 159 | 092 | 295 |
| 013 | 160 | 033 | 213 | 053 | 130 | 073 | 88.4 | 093 | 240 |
| 014 | 60.4 | 034 | 27.8 | 054 | 128 | 074 | 60.2 | 094 | 148 |
| 015 | 189 | 035 | 210 | 055 | 86.4 | 075 | 190 | 095 | 51.5 |
| 016 | 39.4 | 036 | 100 | 056 | 127 | 076 | 39.2 | 096 | 134 |
| 017 | 47.6 | 037 | 12.5 | 057 | 52.8 | 077 | 186 | 097 | 46.3 |
| 018 | 123 | 038 | 228 | 058 | 104 | 078 | 78.5 | 098 | 15.8 |
| 019 | 277 | 039 | 210 | 059 | 146 | 079 | 108 | 099 | 201 |
| 020 | 71.4 | 040 | 83.7 | 060 | 102 | 080 | 31.2 | 100 | 111 |
Figure 1Enlarged sections of diffraction patterns from simulated crystal 016. Six lunes are apparent on image 001, but indexing this wedge still proved problematic. The resolution-dependent exponential fading of spots with dose is exemplified by the rapid loss of high-angle data and the relative persistence of low-angle features. Despite perfect isomorphism, images 004 and higher degraded the overall anomalous signal and images 002 and higher degraded the overall resolution of the final data set.
Figure 2BLEND (Foadi et al., 2013 ▸) dendrogram of unit cells obtained from XDS (Kabsch, 2010 ▸) processing. Although the clustering suggests groups of related crystals, the true underlying unit cells and structure factors were identical for all 100 wedges. The unit-cell variation shown here is therefore entirely owing to the impact of random noise on indexing and cell refinement.
Figure 3Graph of the relative error (R right) between the correct structure factor (F right) and the structure factor obtained from scaling and merging the first N images from all 100 simulated crystals (F sim). Also shown are R work and R free from refinement to convergence of the correct starting model against F sim from N-image data. Despite perfect isomorphism, fewer images resulted in better agreement. The y axis also represents the maximum peak height found in the phased anomalous difference Fourier (dashed line). Phases were obtained by removing all Se atoms before refining to convergence against F sim. The phasing signal is maximized at N = 3.