| Literature DB >> 29951483 |
Laura Y Kim1, William J Rice1, Edward T Eng1, Mykhailo Kopylov1, Anchi Cheng1, Ashleigh M Raczkowski1, Kelsey D Jordan1, Daija Bobe1, Clinton S Potter1, Bridget Carragher1.
Abstract
Cryo electron microscopy facilities running multiple instruments and serving users with varying skill levels need a robust and reliable method for benchmarking both the hardware and software components of their single particle analysis workflow. The workflow is complex, with many bottlenecks existing at the specimen preparation, data collection and image analysis steps; the samples and grid preparation can be of unpredictable quality, there are many different protocols for microscope and camera settings, and there is a myriad of software programs for analysis that can depend on dozens of settings chosen by the user. For this reason, we believe it is important to benchmark the entire workflow, using a standard sample and standard operating procedures, on a regular basis. This provides confidence that all aspects of the pipeline are capable of producing maps to high resolution. Here we describe benchmarking procedures using a test sample, rabbit muscle aldolase.Entities:
Keywords: alignment; benchmarking; cryo-electron microscopy; resolution; single particle workflow; structural biology
Year: 2018 PMID: 29951483 PMCID: PMC6009202 DOI: 10.3389/fmolb.2018.00050
Source DB: PubMed Journal: Front Mol Biosci ISSN: 2296-889X
Data collection statistics.
| Counting vs. super-resolution | Counting | Super-resolution | Super-resolution |
| Exposure time (ms) | 6,600 | 6,000 | 6,000 |
| Total dose (e−/Å2) | 68 | 63 | 63 |
| Duration of data collection | 18 h | 52 h | 40 h |
| Micrographs collected (per hour) | ~38 | ~31 | ~40 |
| # images total | 699 | 1,635 | 1,614 |
All datasets were collected on the same Titan Krios equipped with an energy filter and spherical aberration corrector at 300 keV accelerating voltage. Micrographs were collected on the K2 Summit DED, at either counting or super-resolution mode, at a nominal magnification of 130,000x, equivalent to a 0.85 Å pixel size at a dose rate of 8.0 e.
Figure 1Comparing images from thick vs. thin ice. Exemplary images from (A) 17sep21j (#1), (E) 17nov02c (#2), and (I) 17dec27a (#3) datasets. Quantitative metrics such as the estimation of resolution from CTFFindV4 (B,F,J) and qualitative metrics such as the presence/absence of the water diffraction ring around the 3 Å mark (C,G,K), and ice thickness measurements of the micrographs (D,H,L), should be monitored during data collection. A CTFFindV4 resolution estimation worse than 4 Å and the presence of a strong water diffraction ring are both indicative of thick ice, and areas like this should be avoided. All images were acquired with ~1.5 mm defocus. Ice thickness measurements provide a useful metric for data quality (D,H,L). Datasets 17sep21j and 17dec27a both contain a majority of images where ice thickness is in the range 0–20 nm. The majority of 17nov02c images have thickness in the range 0–10 nm (ice that is too thin or completely absent) or very thick ice in the range 100–250 nm. The dimensions of aldolase are ~100 Å so this thick ice is more than 20 times more than the longest dimension of the particle.
Processing and reconstruction statistics.
| Sorting method | All img | <25 nm ice thickness | 1st 500 img | All img | <25 nm ice thickness | 1st 700 img | All img | <25 nm ice thickness | 1st 382 img |
| Dataset number | #1 | #1a | #1b | #2 | #2a | #2b | #3 | #3a | #3b |
| Duration of data collection | 18.0 h | 18.0 h | 13.5 h | 52.0 h | 52.0 h | 24.0 h | 40.0 h | 40.0 h | 10.0 h |
| # images total | 699 | 699 | 699 | 1635 | 1635 | 1635 | 1614 | 1614 | 1614 |
| # images used | 699 | 535 | 500 | 1635 | 63 | 700 | 1614 | 1108 | 382 |
| # picks | 642 K | 491 K | 256 K | 1,380 K | 60 K | 685 K | 1,214 K | 975 K | 234 K |
| Duration of 2D classification | 0.8 h | 0.4 h | 0.3 h | 1.3 h | 0.3 h | 0.4 h | 1.1 h | 1.1 h | 0.4 h |
| # particles after 2D | 373 K | 374 K | 133 K | 464 K | 26 K | 198 K | 498,000 | 369 K | 87 K |
| % particles after 2D | 58% | 76% | 52% | 34% | 43% | 29% | 41% | 38% | 37% |
| Duration of 3D classification | 1.2 h | 1.7 h | 0.9 h | 3.9 h | 0.5 h | 0.5 h | 6.3 h | 5.1 h | 0.8 h |
| # particles into refinement | 219 K | 124 K | 62 K | 204 K | 22 K | 75 K | 205 K | 187 K | 87 K |
| % particles into refinement | 59% | 33% | 47% | 44% | 85% | 38% | 41% | 52% | 100% |
| Duration of refinement | 0.9 h | 0.9 h | 0.9 h | 1.9 h | 0.5 h | 0.5 h | 1.5 h | 1.4 h | 0.7 h |
| Ice thickness range | 10–20 nm | 10–20 nm | 10–20 nm | 100–250 nm | 100–250 nm | 100–250 nm | 10–20 nm | 10–20 nm | 10–20 nm |
| Total processing time | 2.9 h | 3.0 h | 2.0 h | 7.1 h | 1.2 h | 1.4 h | 8.9 h | 7.6 h | 1.9 h |
| Resolution (global) | 2.5 Å | 2.5 Å | 2.8 Å | 3.0 Å | 3.5 Å | 4.6 Å | 2.4 Å | 2.4 Å | 2.8 Å |
Datasets were sorted based on three methods, using all micrographs, using micrographs with <25 nm ice thickness and using only the first few hundred micrographs. This is to test whether it is more important to collect and process data based on the quantity of data (all micrographs), quality of data (<25 nm ice thickness), or time spent on data collection and processing (first few hundred micrographs). All 2D and 3D processing was performed using Cryosparc.
Figure 2Comparing 3D reconstructions from thick vs. thin ice 2D and 3D processing results from 17sep21j (dataset #1) and 17nov02c (dataset #2) which yielded maps at 2.5 and 3.0 Å resolution, respectively. Dataset #1 has thinner ice in the raw micrographs, ranging from 10 to 20 nm thick, whereas dataset #2 has thicker ice, ranging from 100 to 250 nm thick. Data to assess include raw micrographs (A,G), 2D classifications (B,J), FSC plots (C,H), sphericity plots (D,I), 3D maps (E,K), and local resolution maps (F,L). Both datasets have about 200,000 particles contributing to the final refinement but dataset #1 is both qualitatively and quantitatively better than dataset #2.
Ranking of datasets based on data collection and processing time and resolution.
| 1 | #3b | 11.9 | 2.8 | 87 | 10–20 |
| 2 | #1b | 15.5 | 2.8 | 62 | 10–20 |
| 3 | #1 | 20.9 | 2.5 | 219 | 10–20 |
| 4 | #1a | 21.0 | 2.5 | 124 | 10–20 |
| 5 | #3a | 47.6 | 2.4 | 186 | 10–20 |
| 6 | #3 | 48.9 | 2.4 | 205 | 10–20 |
| 7 | #2 | 59.1 | 3.0 | 204 | 100–250 |
| 8 | #2a | 53.2 | 3.5 | 22 | 100–250 |
| 9 | #2b | 25.4 | 4.6 | 75 | 100–250 |
Datasets were ranked primarily on total data collection + processing time and secondarily on nominal resolution. Six out of nine datasets went to <3 Å. We find that all <3 Å reconstructions come from datasets with 10–20 nm ice thickness and that more than half of those <3 Å datasets were acquired in under 24 h. Datasets that do not go <3 Å had ice thickness measurements ranging from 100–250 nm.
| 1 | 17sep21j—all images | 7,616 |
| 1a | 17sep21j—<25 nm ice thickness | 7,617 |
| 1b | 17sep21j—1st 500 images | 7,614 |
| 2 | 17nov02c—all images | 7,551 |
| 2a | 17nov02c—<25 nm ice thickness | 7,562 |
| 2b | 17nov02c—1st 700 images | 7,615 |
| 3 | 17dec27a—all images | 7,541 |
| 3a | 17dec27a—<25 nm ice thickness | 7,550 |
| 3b | 17dec27a—1st 382 images | 7,528 |