Literature DB >> 27479754

Single-shot diffraction data from the Mimivirus particle using an X-ray free-electron laser.

Tomas Ekeberg1,2, Martin Svenda1, M Marvin Seibert1, Chantal Abergel3, Filipe R N C Maia1, Virginie Seltzer3, Daniel P DePonte2,4, Andrew Aquila4,5, Jakob Andreasson1, Bianca Iwan1,6, Olof Jönsson1,7, Daniel Westphal1, Duško Odić1, Inger Andersson1, Anton Barty2, Meng Liang2,4, Andrew V Martin2,8, Lars Gumprecht2, Holger Fleckenstein2, Saša Bajt9, Miriam Barthelmess9, Nicola Coppola2, Jean-Michel Claverie3, N Duane Loh10,11, Christoph Bostedt4, John D Bozek12, Jacek Krzywinski4, Marc Messerschmidt4, Michael J Bogan10, Christina Y Hampton10, Raymond G Sierra10, Matthias Frank13, Robert L Shoeman14, Lukas Lomb14, Lutz Foucar14,15, Sascha W Epp15,16, Daniel Rolles14,15,17, Artem Rudenko15,16,17, Robert Hartmann18, Andreas Hartmann18, Nils Kimmel19,20, Peter Holl18, Georg Weidenspointner19,20, Benedikt Rudek15,16, Benjamin Erk15,16, Stephan Kassemeyer14, Ilme Schlichting14,15, Lothar Strüder18,21, Joachim Ullrich15,16,22, Carlo Schmidt15,16, Faton Krasniqi14,15, Günter Hauser19,20, Christian Reich18, Heike Soltau18, Sebastian Schorb23, Helmut Hirsemann9, Cornelia Wunderer9, Heinz Graafsma9, Henry Chapman2,24, Janos Hajdu1,5.   

Abstract

Free-electron lasers (FEL) hold the potential to revolutionize structural biology by producing X-ray pules short enough to outrun radiation damage, thus allowing imaging of biological samples without the limitation from radiation damage. Thus, a major part of the scientific case for the first FELs was three-dimensional (3D) reconstruction of non-crystalline biological objects. In a recent publication we demonstrated the first 3D reconstruction of a biological object from an X-ray FEL using this technique. The sample was the giant Mimivirus, which is one of the largest known viruses with a diameter of 450 nm. Here we present the dataset used for this successful reconstruction. Data-analysis methods for single-particle imaging at FELs are undergoing heavy development but data collection relies on very limited time available through a highly competitive proposal process. This dataset provides experimental data to the entire community and could boost algorithm development and provide a benchmark dataset for new algorithms.

Entities:  

Mesh:

Year:  2016        PMID: 27479754      PMCID: PMC4968188          DOI: 10.1038/sdata.2016.60

Source DB:  PubMed          Journal:  Sci Data        ISSN: 2052-4463            Impact factor:   6.444


Background & Summary

Free-electron lasers (FEL) provide ultra short and extremely bright pulses of coherent X-rays[1]. It has been predicted that such pulses could enable structure determination without crystallization by outrunning radiation damage and thus capturing diffraction data before the particle has time to respond and eventually be destroyed by the deposited energy[2]. Experimental verification of this ‘diffraction-before-destruction’ principle has been demonstrated several times for resolution down to 10 nm (refs 3,4). Using many such diffraction patterns from multiple copies of a reproducible sample, the patterns could be assembled into a 3D diffraction space from which the 3D structure could be derived[5,6]. This promise was a main part of the scientific case for building free-electron lasers[7]. Several examples of 2D reconstructions from biological samples at X-ray FEL have been demonstrated[4,8,9] but 3D reconstructions have remained elusive. A single diffraction pattern represents a curved slice through the Fourier transform of the electron density of the object. For successful 3D reconstruction many diffraction patterns from identical particles need to be assembled into the complete 3D Fourier transform of the particle. This is difficult since the orientation of the injected particles is unknown and has to be recovered from the diffraction data alone. A recent paper[10] demonstrates this, using a modified version of the expand, maximize and compress algorithm[5] (EMC) on the Mimivirus particle. Here we describe the data collection, data preprocessing and the dataset that was used for this reconstruction. Today, beam time at free-electron lasers is scarce as there are few facilities and they serve a multitude of scientific disciplines. Furthermore, applications such as 3D imaging require a large amount of effort in algorithm development and testing. Several groups around the world are active in this development but the majority of them don’t have regular beam time access. This dataset can thus serve as a benchmark for algorithm testing and give many more groups access to experimental data. For any new method, validation tools are of crucial importance. Therefore, together with the 3D reconstruction of the Mimivirus we also presented two new validation methods[10]. Further development of these methods, and the development of new ones will therefore benefit from being applied to this dataset in particular. The sample in this dataset is the Mimivirus (Acanthamoeba polyphaga mimivirus)[11,12]. Mimivirus is part of a recently discovered class of giant DNA viruses. Viral capsid is pseudo-icosahedral with a corner-to-corner diameter of 500 nm and a face-to-face diameter of 400 nm (ref. 13). The virus is covered by fibres with a length of 125 nm giving it a total diameter of 750 nm (ref. 14).

Methods

These methods were described in ref. 10. The description here is more detailed with regards to data collection and on-line data analysis.

Sample injection

Purified Mimivirus particles[15] were transferred into a volatile buffer (250 mM ammonium acetate, pH 7.5) and the suspension was aerosolized with helium in a gas dynamic nebulizer[16]. The aerosol of hydrated and adiabatically cooled particles entered a differentially pumped aerodynamic lens[17].

Data collection

Experiments were performed at the Atomic Molecular Optics (AMO) beam line[18] of the Linac Coherent Light Source (LCLS) hard X-ray laser[1], using the CAMP[19] instrument[20,21]. The experiment was part of an experiment running from June 17 to June 21 of 2010 with proposal number L150. Diffraction data were recorded on a pair of pnCCD detectors[19] at a repetition rate of 60 Hz matching the repetition rate of LCLS. The two detectors were placed at a distance of 740 mm from the interaction region with a gap between them of 2.1 mm to let the direct beam through. The pixel size is 75 μm and each detector in the pair has 512×1,024 pixels giving the full setup a pixel count of 1,024×1,024 pixels. The photon energy was 1.2 keV corresponding to a wavelength of 1.03 nm. At this wavelength the full-period resolution at the edge of the detector is 19.9 nm. The electron bunch used to create the X-ray pulse was 70 fs long (full duration at half maximum) and the X-ray pulse is believed to be shorter than this[22]. The focus was ~10 μm (full width at half maximum) at the interaction point, giving a power density of ~3.4 1015 W cm−2 or 1012 photons per pulse. The experiment was performed at a pressure of 10−6 mbar to reduce background scattering. Some of the most important experiment parameters are summarized in Table 1.
Table 1

Summary of experimental parameters.

Parameter Value
Photon energy1,200 keV
Detector distance0.74 m
Pixel size75 μm
Number of pixels1,024×1,024
Focal size10 μm2

On-line data analysis

On-line hit-identification provided real-time statistics that guided injector alignment and tuning. Hits with a high scattering strength were identified by counting the number of pixels that measured a value above a threshold. Diffraction patterns with more than 500 pixels with a value above 170 ADU were defined as a hit. See ref. 21 for a detailed description.

Data Records

Two datasets are provided: the full record of all collected data and a smaller preprocessed dataset. Both data sets are available in the same CXIDB entry (Data Citation 1).

Full data record

We provide all data collected from the Mimivirus at this beamtime before any preprocessing or sorting. This dataset is in the extended tagged container (XTC) file format. This can be converted to HDF5 format using programs such as CASS[23] or Cheetah[24]. This conversion normally also involves preprocessing and the data is therefore provided in the untouched XTC format. The record contains 19 LCLS runs. 14 of these had the sample injector and X-ray laser turned on while the remaining 5 runs only collected detector background noise. These so called ‘dark runs’ can be used for to better subtract the background from the actual data. Table 2 shows a list of all 19 runs.
Table 2

List of experimental runs.

Run number Sample Number of frames Number of frames selected for analysis in ref. 10
Runs labeled as dark had the X-ray beam turned off and are included to allow for detector calibration. The lack of good hits before run 90 was possibly fixed by a changed injection nozzle at this point.   
73Dark651N/A
80Mimivirus14,2730
81Mimivirus14,4500
82Mimivirus11,3710
83Dark15,636N/A
84Mimivirus65,5940
87Mimivirus93,8400
89Dark3,821N/A
90Mimivirus77,64430
91Mimivirus4,9436
92Mimivirus33,72139
93Mimivirus43,67924
94Mimivirus58,93140
95Mimivirus42,08310
97Mimivirus36,89933
98Dark7,794N/A
152Dark2,498N/A
156Mimivirus64,9773
157Mimivirus90,40313
Initial hit finding showed that 0.3% of the frames contained diffraction that was stronger than the background. The rest were misses, i.e., frames that were read out when the pulse did not hit any particle. In addition to hits from single Mimivirus particles the hits also include droplets of buffer, clusters of viruses and a few particles that were injected earlier and had stayed in the injection system.

Preprocessed and filtered data

This dataset contains the 198 preprocessed (see ref. 19) diffraction patterns that were used in ref. 10 to recover the 3D structure of the Mimivirus particle. This data is in the CXIDB format described in ref. 25. The pixel values in the data are in arbitrary detector units (ADU). The conversion factor from ADU to number of photons is 7 ADU per photon. Some areas of the detector were unreadable and some scattering angles were not covered by the detector, such as the gap between the two detector halves that lets the direct beam through. These areas are identified by a mask entry in the CXIDB file format. The data was filtered in three steps. (1) Hits were distinguished from blanks using methods described in refs 21 and 24. This yielded 1,600 hits. (2) 307 Diffraction patterns that correspond to a Mimivirus particle were selected by hand. An icosahedral or pseudo-icosahedral particle will in most orientations yield a distinctive type of pattern showing six outwards-going streaks. This feature and particle size, determined from the fringe spacing, was used for this selection. (3) When the detector is exposed to high intensities, the intensity can spill over from a pixel to neighboring pixels. We start seeing these effects at intensities above 750 photons per pixel. In the final dataset, diffraction patterns suffering from this effect were filtered out, resulting in 198 unsaturated diffraction patterns. A subset of this dataset is shown in Fig. 1.
Figure 1

The first 24 of the 198 diffraction patterns in this dataset.

The color scale is logarithmic and ranges from 1 to 820 photons per pixel. This is a modification of a figure previously presented in ref. 10.

Technical Validation

3D reconstruction

The 198 diffraction patterns were successfully assembled in a 3D Fourier volume and subsequent phase retrieval provided the full 3D electron density of the virus with a full-period resolution of 125 nm[10]. This indicates that the Mimivirus is reproducible to at least this resolution.

Validation of the 3D reconstruction

In cryo-electron microscopy (cryo-EM), data is routinely split prior to analysis and the analysis is performed in parallel on the two sets[26]. Our first method for validating the 3D reconstruction is an adaptation of this technique. The diffraction patterns are randomly split in two sets of equal size. The recovery of the 3D alignment is performed independently on the two sets using the same parameters but independent random starting points. Phase retrieval is also performed independently using the same parameters. The standard practice in the field is to repeat the reconstruction at least 100 times and the results are then averaged to average out effects of the random starting point[4,27]. The EMC algorithm recovers the relative orientation of the particles from the diffracted data alone but the arbitrary rotation of the entire system can be different comparing the two resulting 3D electron density maps. In order to compare the two recovered electron densities we therefore have to rotate one of the two data sets to best match the other. This is done using brute force by interpolating one of the maps at a regular array corresponding to the tested rotation. To compare the two maps, the Pearson correlation coefficient is calculated. The rotation with the highest Pearson correlation is assumed to correspond to the proper relative orientation. The two aligned electron density maps are then compared using the Fourier shell correlation (FSC)[26], which provides a measure of the similarity as a function of resolution. The threshold for what is regarded as an acceptable fit ranges between 0.14 and 0.5 in cryo-EM literature[28,29]. In X-ray crystallography some Bragg spots are usually excluded from the analysis and instead that information is used to verify the recovered expected strength of the respective Bragg spot[30]. Using the same idea, 10% of the diffraction patterns were selected to be used for validation only. In the EMC scheme these patterns are excluded from the analysis but are still compared to the recovered Fourier transform of the particle. The measure for determining whether the recovered model agrees with the excluded diffraction patterns is the likelihood function used internally in EMC. The analogy with the Rfree value in crystallography should not be over emphasized. Bragg peaks are linearly independent parameters and ther is no suitable analogy in the continuous diffraction case. Furthermore, in the case described here, the process that is validated is that of pattern alignment and not phase retrieval. It is therefore natural to choose individual diffraction patterns as the information unit to exclude rather than i.e., individual pixels or regions of pixels. These validation methods were previously described in ref. 10.

Usage Notes

Data was stored in the CXIDB[25] data format which uses the HDF5 format. HDF5 files are readable in many computing environments including python using the h5py module and MATLAB using e.g., the h5read function. Convenient functions for accessing the CXIDB data file exist in the libspimage package for C and python[31]. For visualizing data the CXIDB file browser Owl (https://github.com/FilipeMaia/owl) is recommended.

Additional Information

How to cite this article: Ekeberg, T et al. Single-shot diffraction data from the Mimivirus particle using an X-ray free-electron laser. Sci. Data 3:160060 doi: 10.1038/sdata.2016.60 (2016).
  20 in total

1.  Potential for biomolecular imaging with femtosecond X-ray pulses.

Authors:  R Neutze; R Wouts; D van der Spoel; E Weckert; J Hajdu
Journal:  Nature       Date:  2000-08-17       Impact factor: 49.962

2.  Optimal determination of particle orientation, absolute hand, and contrast loss in single-particle electron cryomicroscopy.

Authors:  Peter B Rosenthal; Richard Henderson
Journal:  J Mol Biol       Date:  2003-10-31       Impact factor: 5.469

3.  The 1.2-megabase genome sequence of Mimivirus.

Authors:  Didier Raoult; Stéphane Audic; Catherine Robert; Chantal Abergel; Patricia Renesto; Hiroyuki Ogata; Bernard La Scola; Marie Suzan; Jean-Michel Claverie
Journal:  Science       Date:  2004-10-14       Impact factor: 47.728

4.  High-resolution ab initio three-dimensional x-ray diffraction microscopy.

Authors:  Henry N Chapman; Anton Barty; Stefano Marchesini; Aleksandr Noy; Stefan P Hau-Riege; Congwu Cui; Malcolm R Howells; Rachel Rosen; Haifeng He; John C H Spence; Uwe Weierstall; Tobias Beetz; Chris Jacobsen; David Shapiro
Journal:  J Opt Soc Am A Opt Image Sci Vis       Date:  2006-05       Impact factor: 2.129

5.  The polyadenylation site of Mimivirus transcripts obeys a stringent 'hairpin rule'.

Authors:  Deborah Byrne; Renata Grzela; Audrey Lartigue; Stéphane Audic; Sabine Chenivesse; Stéphanie Encinas; Jean-Michel Claverie; Chantal Abergel
Journal:  Genome Res       Date:  2009-04-29       Impact factor: 9.043

6.  Reconstruction algorithm for single-particle diffraction imaging experiments.

Authors:  Ne-Te Duane Loh; Veit Elser
Journal:  Phys Rev E Stat Nonlin Soft Matter Phys       Date:  2009-08-24

7.  Single mimivirus particles intercepted and imaged with an X-ray laser.

Authors:  M Marvin Seibert; Tomas Ekeberg; Filipe R N C Maia; Martin Svenda; Jakob Andreasson; Olof Jönsson; Duško Odić; Bianca Iwan; Andrea Rocker; Daniel Westphal; Max Hantke; Daniel P DePonte; Anton Barty; Joachim Schulz; Lars Gumprecht; Nicola Coppola; Andrew Aquila; Mengning Liang; Thomas A White; Andrew Martin; Carl Caleman; Stephan Stern; Chantal Abergel; Virginie Seltzer; Jean-Michel Claverie; Christoph Bostedt; John D Bozek; Sébastien Boutet; A Alan Miahnahri; Marc Messerschmidt; Jacek Krzywinski; Garth Williams; Keith O Hodgson; Michael J Bogan; Christina Y Hampton; Raymond G Sierra; Dmitri Starodub; Inger Andersson; Saša Bajt; Miriam Barthelmess; John C H Spence; Petra Fromme; Uwe Weierstall; Richard Kirian; Mark Hunter; R Bruce Doak; Stefano Marchesini; Stefan P Hau-Riege; Matthias Frank; Robert L Shoeman; Lukas Lomb; Sascha W Epp; Robert Hartmann; Daniel Rolles; Artem Rudenko; Carlo Schmidt; Lutz Foucar; Nils Kimmel; Peter Holl; Benedikt Rudek; Benjamin Erk; André Hömke; Christian Reich; Daniel Pietschner; Georg Weidenspointner; Lothar Strüder; Günter Hauser; Hubert Gorke; Joachim Ullrich; Ilme Schlichting; Sven Herrmann; Gerhard Schaller; Florian Schopper; Heike Soltau; Kai-Uwe Kühnel; Robert Andritschke; Claus-Dieter Schröter; Faton Krasniqi; Mario Bott; Sebastian Schorb; Daniela Rupp; Marcus Adolph; Tais Gorkhover; Helmut Hirsemann; Guillaume Potdevin; Heinz Graafsma; Björn Nilsson; Henry N Chapman; Janos Hajdu
Journal:  Nature       Date:  2011-02-03       Impact factor: 49.962

8.  The Coherent X-ray Imaging Data Bank.

Authors:  Filipe R N C Maia
Journal:  Nat Methods       Date:  2012-09       Impact factor: 28.547

9.  Determination of the fold of the core protein of hepatitis B virus by electron cryomicroscopy.

Authors:  B Böttcher; S A Wynne; R A Crowther
Journal:  Nature       Date:  1997-03-06       Impact factor: 49.962

10.  Structural studies of the giant mimivirus.

Authors:  Chuan Xiao; Yurii G Kuznetsov; Siyang Sun; Susan L Hafenstein; Victor A Kostyuchenko; Paul R Chipman; Marie Suzan-Monti; Didier Raoult; Alexander McPherson; Michael G Rossmann
Journal:  PLoS Biol       Date:  2009-04-28       Impact factor: 8.029

View more
  6 in total

1.  A science education model for large collaborative centers.

Authors:  William J Bauer; Sarah B Woodruff
Journal:  Struct Dyn       Date:  2021-03-05       Impact factor: 2.920

2.  Biological single-particle imaging using XFELs - towards the next resolution revolution.

Authors:  Dominik Oberthür
Journal:  IUCrJ       Date:  2018-10-31       Impact factor: 4.769

3.  Kinetics of Mimivirus Infection Stages Quantified Using Image Flow Cytometry.

Authors:  Liran Ben Yaakov; Yael Mutsafi; Ziv Porat; Tali Dadosh; Abraham Minsky
Journal:  Cytometry A       Date:  2019-04-24       Impact factor: 4.355

4.  Structural Heterogeneity in Single Particle Imaging Using X-ray Lasers.

Authors:  Thomas Mandl; Christofer Östlin; Ibrahim E Dawod; Maxim N Brodmerkel; Erik G Marklund; Andrew V Martin; Nicusor Timneanu; Carl Caleman
Journal:  J Phys Chem Lett       Date:  2020-07-16       Impact factor: 6.475

5.  Femtosecond X-ray coherent diffraction of aligned amyloid fibrils on low background graphene.

Authors:  Carolin Seuring; Kartik Ayyer; Eleftheria Filippaki; Miriam Barthelmess; Jean-Nicolas Longchamp; Philippe Ringler; Tommaso Pardini; David H Wojtas; Matthew A Coleman; Katerina Dörner; Silje Fuglerud; Greger Hammarin; Birgit Habenstein; Annette E Langkilde; Antoine Loquet; Alke Meents; Roland Riek; Henning Stahlberg; Sébastien Boutet; Mark S Hunter; Jason Koglin; Mengning Liang; Helen M Ginn; Rick P Millane; Matthias Frank; Anton Barty; Henry N Chapman
Journal:  Nat Commun       Date:  2018-05-09       Impact factor: 14.919

6.  Diffraction data from aerosolized Coliphage PR772 virus particles imaged with the Linac Coherent Light Source.

Authors:  Haoyuan Li; Reza Nazari; Brian Abbey; Roberto Alvarez; Andrew Aquila; Kartik Ayyer; Anton Barty; Peter Berntsen; Johan Bielecki; Alberto Pietrini; Maximilian Bucher; Gabriella Carini; Henry N Chapman; Alice Contreras; Benedikt J Daurer; Hasan DeMirci; Leonie Flűckiger; Matthias Frank; Janos Hajdu; Max F Hantke; Brenda G Hogue; Ahmad Hosseinizadeh; Mark S Hunter; H Olof Jönsson; Richard A Kirian; Ruslan P Kurta; Duane Loh; Filipe R N C Maia; Adrian P Mancuso; Andrew J Morgan; Matthew McFadden; Kerstin Muehlig; Anna Munke; Hemanth Kumar Narayana Reddy; Carl Nettelblad; Abbas Ourmazd; Max Rose; Peter Schwander; M Marvin Seibert; Jonas A Sellberg; Raymond G Sierra; Zhibin Sun; Martin Svenda; Ivan A Vartanyants; Peter Walter; Daniel Westphal; Garth Williams; P Lourdu Xavier; Chun Hong Yoon; Sahba Zaare
Journal:  Sci Data       Date:  2020-11-19       Impact factor: 6.444

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.