| Literature DB >> 35414146 |
Henry J Kirkwood1, Raphael de Wijn2, Grant Mills2, Romain Letrun2, Marco Kloos2, Mohammad Vakili2, Mikhail Karnevskiy2, Karim Ahmed2, Richard J Bean2, Johan Bielecki2, Fabio Dall'Antonia2, Yoonhee Kim2, Chan Kim2, Jayanath Koliyadu2, Adam Round2,3, Tokushi Sato2, Marcin Sikorski2, Patrik Vagovič2, Jolanta Sztuk-Dambietz2, Adrian P Mancuso2,4.
Abstract
Serial femtosecond crystallography is a rapidly developing method for determining the structure of biomolecules for samples which have proven challenging with conventional X-ray crystallography, such as for membrane proteins and microcrystals, or for time-resolved studies. The European XFEL, the first high repetition rate hard X-ray free electron laser, provides the ability to record diffraction data at more than an order of magnitude faster than previously achievable, putting increased demand on sample delivery and data processing. This work describes a publicly available serial femtosecond crystallography dataset collected at the SPB/SFX instrument at the European XFEL. This dataset contains information suitable for algorithmic development for detector calibration, image classification and structure determination, as well as testing and training for future users of the European XFEL and other XFELs.Entities:
Year: 2022 PMID: 35414146 PMCID: PMC9005607 DOI: 10.1038/s41597-022-01266-w
Source DB: PubMed Journal: Sci Data ISSN: 2052-4463 Impact factor: 6.444
Description of sample delivery conditions and corresponding run number.
| Run number | Duration (mins) | Flow rate ( | He pressure (psi) | He flow rate (mg/min) | Jet velocity (m/s) | Frame count | Indexed count | Indexing rate (%) |
|---|---|---|---|---|---|---|---|---|
| 79 | 10.1 | 30 | 450 | 34.0 | 50.8 | 2,129,547 | 47,129 | 2.21 |
| 80 | 12.1 | 30 | 450 | 34.0 | 50.8 | 2,564,884 | 20,156 | 0.79 |
| 84 | 10.0 | 80 | 400 | 26.0 | 37.4 | 2,116,224 | 20,479 | 0.97 |
| 85 | 10.0 | 80 | 400 | 26.0 | 37.4 | 2,116,224 | 30,401 | 1.44 |
| 95 | 10.0 | 60 | 450 | 34.0 | 44.0 | 2,113,760 | 31,697 | 1.5 |
| 96 | 10.0 | 60 | 450 | 34.0 | 44.0 | 2,113,760 | 40,239 | 1.9 |
| 98 | 10.0 | 60 | 300 | 14.0 | 31.2 | 2,115,520 | 40,802 | 1.93 |
| 99 | 10.2 | 60 | 300 | 14.0 | 31.2 | 2,154,241 | 74,437 | 3.46 |
Fig. 1Example of single crystal diffraction data measured by AGIPD (left). Off-axis microscope for monitoring the overlap of the liquid jet and X-ray beam (right). The image was acquired with a single 800 nm wavelength, 65 fs duration laser pulse from the SASE1 pump-probe laser system, 110 ns after the first X-ray pulse in the train.
Calibration constants and corresponding file addresses and data dimensions used for calibrating the raw data from each of the 16 AGIPD modules at SPB/SFX.
| Parameter | HDF5 key | Data dimensions |
|---|---|---|
| BadPixelsDark | /BadPixelsDark/0/data | ( |
| BadPixelsFF | /BadPixelsFF/0/data | ( |
| BadPixelsPC | /BadPixelsPC/0/data | ( |
| Noise | /Noise/0/data | ( |
| Offset | /Offset/0/data | ( |
| SlopesFF | /SlopesFF/0/data | ( |
| SlopesPC | /SlopesPC/0/data | ( |
| ThresholdsDark | /ThresholdsDark/0/data | ( |
Individual data quality statistics and figures of merit for Run 79, Run 80 (50.8 m/s) and Run 95, Run 96 (44.0 m/s).
| Data Collection | Run 79 | Run 80 | Run 95 | Run 96 |
|---|---|---|---|---|
| space group | P 43 21 2 | P 43 21 2 | P 43 21 2 | P 43 21 2 |
| cell dimensions (Å) | 79.73, 79.77, 38.60 | 79.73, 79.77, 38.60 | 79.73, 79.77, 38.60 | 79.78, 79.77, 38.61 |
| cell dimensions (°) | 90, 90, 90 | 90, 90, 90 | 90, 90, 90 | 90.02, 90.02, 90.03 |
| Resolution | 27.73 - 2.00 (2.07 - 2.00) | 27.73 - 2.00 (2.07 - 2.00) | 27.73 - 2.00 (2.07 - 2.00) | 27.74 - 2.00 (2.05 - 2.00) |
| Rsplit | 12.49 (111.33) | 17.93 (141.74) | 15.89 (173.28) | 14.43 (175.84) |
| CC1/2 (%) | 98.38 (50.57) | 96.38 (31.59) | 98.00 (27.67) | 97.99 (23.48) |
| CC* (%) | 99.59 (81.96) | 99.07 (69.29) | 99.49 (65.84) | 99.49 (61.67) |
| SNR | 6.75 (1.31) | 4.61 (1.07) | 5.46 (0.97) | 6.18 (0.98) |
| Completeness | 100 (100) | 100 (100) | 100 (100) | 100 (100) |
| Multiplicity | 421.3 (271.2) | 174.1 (111.0) | 246.6 (156.9) | 342.8 (220.8) |
| No. reflections | 16102 | 16102 | 16109 | 16190 |
| Rwork/Rfree | 0.1982/0.2279 | 0.1838/0.2092 | 0.1935/0.2327 | 0.1877/0.2318 |
| Bond lengths (Å) | 0.002 | 0.004 | 0.004 | 0.007 |
| Bond angles (°) | 0.49 | 0.63 | 0.58 | 0.79 |
Data quality statistics and figures of merit for runs combined by jet velocities.
| Data Collection | Combined 31.2 m/s | Combined 37.4 m/s | Combined 44.0 m/s | Combined 50.8 m/s |
|---|---|---|---|---|
| space group | P 43 21 2 | P 43 21 2 | P 43 21 2 | P 43 21 2 |
| cell dimensions (Å) | 79.75, 79.75, 38.60 | 79.75, 79.75, 38.60 | 79.75, 79.75, 38.60 | 79.73, 79.77, 38.60 |
| cell dimensions (°) | 90, 90, 90 | 90, 90, 90 | 90, 90, 90 | 90, 90, 90 |
| Resolution | 27.73 - 2.00 (2.07 - 2.00) | 27.73 - 2.00 (2.07 - 2.00) | 27.73 - 2.00(2.07 - 2.00) | 27.73 - 2.00 (2.07 - 2.00) |
| Rsplit | 9.43 (56.12) | 12.45 (52.03) | 14.21 (83.80) | 12.79 (37.65) |
| CC1/2 | 99.1 (82.2) | 98.9 (86.3) | 98.5 (74.9) | 97.9 (82.3) |
| CC* | 99.7 (93.6) | 99.5 (92.7) | 99.5 (85.3) | 99.5 (95.6) |
| SNR | 7.44 (1.42) | 6.09 (1.76) | 4.52 (0.98) | 6.44 (2.25) |
| Completeness | 99.8 (100) | 99.9 (100) | 99.9 (100) | 99.9 (100) |
| Multiplicity | 549.81 (371.5) | 269.3 (180.6) | 160.4 (106.8) | 301.8 (202.1) |
| No. reflections | 15537 | 15554 | 15554 | 15548 |
| Rwork/Rfree | 0.1978 / 0.2346 | 0.1845 / 0.2296 | 0.2039 / 0.2232 | 0.1859 / 0.2300 |
| Bond lengths (Å) | 0.003 | 0.002 | 0.002 | 0.002 |
| Bond angles (°) | 0.53 | 0.44 | 0.41 | 0.44 |
Relevant data sources and corresponding addresses within the deposited raw HDF5 data files.
| HDF5 key (raw data) | Description & file |
|---|---|
| INSTRUMENT SPB_DET_AGIPD1M-1 DET 0CH0:xtdf image data | AGIPD raw intensity and gain bit for module 0. File: RAW-RXXXX-AGIPD00-SXXXXX.h5 |
| INSTRUMENT SPB_EXP_ZYLA CAM 1:daqOutput data image pixels | off-axis microscope, monitoring the interaction region at 10 Hz File:: RAW-RXXXX-DA03-SXXXXX.h5 |
| INSTRUMENT SPB_XTD9_XGM XGM DOOCS:output data intensitySa1TD | X-ray pulse energy measured upstream of the instrument ( |
| INSTRUMENT SA1_XTD2_XGM XGM DOOCS:output data intensitySa1TD | X-ray pulse energy measured downstream of the SASE1 undulator ( |
| CONTROL SPB_IRU_AGIPD1M MOTOR Z_STEPPER actualPosition value | AGIPD positioner stage readback value (mm). File: RAW-RXXXX-DA03-SXXXXX.h5 |
| CONTROL ACC_SYS_DOOCS CTRL BEAMCONDITIONS kParameter value | SASE1 undulator k-parameter (). File: RAW-RXXXX-DA01-SXXXXX.h5 |
Relevant data sources and corresponding addresses within the deposited calibrated HDF5 data files.
| HDF5 key (calibrated data) | Description |
|---|---|
| INSTRUMENT SPB_DET_AGIPD1M-1 DET 0CH0:xtdf image data | AGIPD raw intensity and gain bit for module 0 |
| INSTRUMENT SPB_DET_AGIPD1M-1 DET 0CH0:xtdf image gain | AGIPD gain state for module 0 |
| INSTRUMENT SPB_DET_AGIPD1M-1 DET 0CH0:xtdf image mask | AGIPD pixel mask for module 0 |
| Measurement(s) | lysozyme measurement |
| Technology Type(s) | X-ray crystallography |
Individual data quality statistics and figures of merit for Run 84, Run 85 (37.4 m/s) and Run 98, Run 99 (31.2 m/s).
| Data Collection | Run 84 | Run 85 | Run 98 | Run 99 |
|---|---|---|---|---|
| space group | P 43 21 2 | P 43 21 2 | P 43 21 2 | P 43 21 2 |
| cell dimensions (Å) | 79.75, 79.75, 38.60 | 79.75, 79.75, 38.60 | 79.75, 79.75, 38.60 | 79.75, 79.75, 38.60 |
| cell dimensions (°) | 90, 90, 90 | 90, 90, 90 | 90, 90, 90 | 90, 90, 90 |
| Resolution | 27.73 - 2.00 (2.07 - 2.00) | 26.20 - 2.00 (2.07 - 2.00) | 27.73 - 2.00 (2.07 - 2.00) | 27.73 - 2.00 (2.07 - 2.00) |
| Rsplit | 21.44 (305.52) | 16.60 (245.94) | 15.79 (505.80) | 11.00 (173.59) |
| CC1/2 (%) | 96.47 (5.42) | 97.95 (25.98) | 98.01 (12.41) | 99.01 (46.56) |
| CC* (%) | 99.09 (32.07) | 99.48 (64.22) | 99.49 (46.98) | 99.75 (79.71) |
| SNR | 3.84 (0.55) | 4.83 (0.71) | 5.81 (0.46) | 7.58 (0.85) |
| Completeness | 100 (100) | 100 (100) | 100 (100) | 100 (100) |
| Multiplicity | 149.71 (95.09) | 149.71 (95.09) | 416.59 (263.80) | 668.4 (431.5) |
| No. reflections | 16109 | 16112 | 16079 | 16081 |
| Rwork/Rfree | 0.2005/0.2383 | 0.1874/0.2378 | 0.1973/0.2410 | 0.1950/0.2367 |
| Bond lengths (Å) | 0.002 | 0.004 | 0.004 | 0.003 |
| Bond angles (°) | 0.44 | 0.58 | 0.59 | 0.55 |