Literature DB >> 25707029

De novo protein structure determination from near-atomic-resolution cryo-EM maps.

Ray Yu-Ruei Wang¹, Mikhail Kudryashev², Xueming Li³, Edward H Egelman⁴, Marek Basler⁵, Yifan Cheng³, David Baker⁶, Frank DiMaio⁷.

Abstract

We present a de novo model-building approach that combines predicted backbone conformations with side-chain fit to density to accurately assign sequence into density maps. This method yielded accurate models for six of nine experimental maps at 3.3- to 4.8-Å resolution and produced a nearly complete model for an unsolved map containing a 660-residue heterodimeric protein. This method should enable rapid and reliable protein structure determination from near-atomic-resolution cryo-electron microscopy (cryo-EM) maps.

Entities: Chemical Disease Gene Species

Mesh：

Substances：
Proteins

Year: 2015 PMID： 25707029 PMCID： PMC4435692 DOI： 10.1038/nmeth.3287

Source DB: PubMed Journal: Nat Methods ISSN： 1548-7091 Impact factor: 28.547

Model building is a key step in macromolecular structure determination. While most atomic-resolution structures are solved using X-ray crystallography, single-particle cryo-electron microscopy (cryo-EM) has emerged as a powerful tool in determining electron density maps of large and high-symmetry particles to near-atomic resolution (3–5 Å)[1-5]. Recent advances even allow it to reach these resolutions from smaller particles with low or no symmetry[6-12]. Despite these developments, little progress has been made in de novo model building into near-atomic resolution cryo-EM density maps. Structural interpretation of cryo-EM maps typically starts with fitting an atomic X-ray or NMR structure into the map[13-15]. Previous work has shown that atomic resolution models are achievable from near-atomic resolution cryo-EM density, starting from a homologous structure of the correct topology[16]. However, when there are no previously solved structures of homologous proteins, de novo model building must be carried out. Currently, such structure determination requires manually building a backbone model into density and assigning sequence[6-9]. While tracing the backbone into density at this resolution is generally straightforward, manually assigning sequence remains time consuming and error-prone. Automated protein model-building tools developed for X-ray crystallography[18-20] are widely used in structure determination from maps with resolution better than 3 Å. These methods separate backbone tracing and side-chain assignment, with density features largely guiding side-chain identification. Consequently, at resolutions worse than 3 Å, where side-chain density is mostly indiscernible, these approaches fail. Several de novo model building methods targeted to cryo-EM have been developed for maps with resolution ranging from near-atomic (3–5 Å) to medium resolution (5–10 Å)[21-23]. Although these methods are powerful in identifying the protein topology given a map, they have poor recovery, often <50%, of correct sequence registration[21,22]. Here, we describe a novel de novo model-building approach for cryo-EM maps at 3–5 Å resolution. Our approach combines sequence-derived backbone conformations with side-chain fit-to-density to assign sequence into the maps. On a benchmark set of nine experimental cryo-EM maps at near-atomic resolution with previously determined structure and a previously unsolved map for the 660-residue contractile sheath protein of the type VI secretion system from Vibrio cholera, we show that high-accuracy models can be obtained without knowledge of detectable structural homologues. Our method should streamline the protein structure determination process from cryo-EM maps at near-atomic resolution. Our approach for de novo interpretation of near-atomic resolution density maps consists of three steps: (1) matching sequence-based local backbone conformations into the density map; (2) identification of a maximally consistent subset of these fragment matches and assembly into a partial model, and (3) completion of the partial model using density-guided sampling and all-atom refinement (Fig. 1a). In the first step, for overlapping 9-residue windows of amino-acid sequence, we identify segments (or fragments) of solved protein structures with similar local sequences and predicted secondary structures[24], analogous to the fragments used in Rosetta de novo structure prediction[25]. For each fragment, a translation/rotation search identifies placements with good map agreement after optimizing side-chain conformations; only a small subset of these placements are located near the native position (RMSd < 2.5 Å). To identify these correct placements, we search a mutually compatible subset of fragment placements, using a score function that favors fragment pairs with: (a) the same residue in the same place, (b) residues nearby in sequence nearby in space, and (c) no two residues occupying the same space. Simulated annealing Monte Carlo (SA-MC) guided by this score function finds the maximally consistent subset of fragment placements from this larger set. Fragment matching and SA-MC assembly are applied iteratively until >70% of the sequence has been assigned into density. Each iteration places fragments from unassigned sequence positions of the sequence into unoccupied regions in density (Fig. 1b). Finally, the partial model from the final iteration is completed through rebuilding and all-atom refinement using RosettaCM[26] guided by the experimental density data.

Figure 1

Protocol Overview

(a) First, for a 9-residue window centered on each position in the sequence, representative backbone conformations (fragments) are collected and docked into the density map. Second, the resulting fragment placements are then evaluated using a score function consisting of four terms: a density correlation term assessing the agreement of fragment and map; an overlap term favoring fragment pairs assigning the same residue to the same location; a closability term favoring fragment pairs close in sequence that are close in space; and a clash term preventing two residues from occupying the same place. Third, from the candidate placements (square green blocks), simulated annealing Monte Carlo finds a set of fragments (square orange blocks) optimizing the score function; a null placement (empty blocks) may be assigned in positions where no good placements have been identified. Fourth, a partial model is assembled by combining fragment placements from multiple Monte Carlo trajectories. Steps 1–4 are carried out iteratively until ~70% of sequence is covered. Finally, unassigned regions in the final partial model are completed using density-guided loop sampling followed by all-atom refinement. (b) Model building for the 20S α-subunti in a 4.8 Å resolution cryo-EM map required three iterations, illustrated in the three rows in the figure. In leftmost column, the density map used for the corresponding iteration, after masking out density from the previous round’s partial model. In column 2, the assembled partial models after Monte Carlo sampling (colored blue at the N-terminus to red at the C-terminus). In column 3, fragment placement results after translation and rotation search. The x-axis covers the sequence of the protein, and each black point represents a single fragment placement; the y-axis indicates the distance of the fragment placement to the native conformation. Pink points indicate fragments chosen to assemble the partial model, and the grey shading shows residues covered in the partial model. Secondary structural elements in the native are indicated above the plot, where H indicates helix and S indicates strand. In rightmost column, convergence of Monte Carlo trajectories. Each point represents the fragment assignment of an independent search trajectory, colored by number of total fragments placed. The X-axis indicates the percentage of fragments placed within 2.5 Å RMSd to the native configuration, while the Y-axis shows the score with the fragment compatibility function. The horizontal dashed line shows the score cut used for partial model generation.

We tested our method on a benchmark set of 9 proteins. These proteins range in size from 155 to 397 residues, include different fold types, and have experimental cryo-EM maps varying in resolution range from 3.3 to 4.8 Å (Supplementary Table 1). For each map, a single subunit was first segmented from the entire density map. Fragments from proteins with similar topology or sequence were excluded while constructing the fragment libraries. In 7 out of the 9 cases, partial models from the final iteration of the de novo building step are within 1.1–2.3 Å Cα RMSD from the experimental structures (Supplementary Table 1 and Fig. 2), 6 of which are more than 70% complete. These partial models were then completed and refined using RosettaCM, yielding models with 1.3–2.2 Å Cα RMSd (2.0–3.1 Å all-atom RMSd) from the experimentally determined structures. In contrast, Buccaneer[18], a widely used model building method from X-ray crystallography – while able to trace portions of the backbone for all targets – only correctly identifies more than 5% of the sequence in 3 cases, and never identifies more than 50% (Supplementary Table 2).

Figure 2

High-accuracy model building in near-atomic resolution cryo-EM maps

(Leftmost column) The density maps used for de novo model building on 20S-α at 4.8 Å, TRPV1-TM at 3.4 Å, FrhB at 3.4 Å, and FrhA at 3.4 Å (Row 1 to 4, respectively). (Column2) The partial model at the final iteration. (Column 3 and 4) Full-length RosettaCM models (red) are superimposed with the native structure (blue). Each sub-figure shows the lowest-RMSd structure from 10 lowest-electron-density-score models (left) with a close-up of the core showing that native core packing is recovered (right).

Among the proteins in the benchmark set, TRPV1[8] and FrhB[7,27] were proteins with new folds solved recently by manually building models into cryo-EM density. Our method automatically obtained completed models with 1.4 Å Cα RMSd model for TRPV1 and 1.7 Å Cα RMSd for FrhB. To test the resolution limit at which de novo structure determination is possible, a previously unpublished 4.8 Å resolution map from the 20S proteasome α-subunit (20S-α) was used. At this resolution, the α-helix pitch is somewhat visible, however, β-strand separation is only barely resolved (Fig. 1b). Using our approach, the final partial model had 196 out of 221 residues placed, with just 1.3 Å RMSd to the crystal structure (Figs. 1 and 2, and Supplementary Table 1). Using RosettaCM to build a completed model, we obtained a 1.2 Å Cα RMSd model (2.0 Å all-atom RMSd). Despite the lack of side-chain density details, side-chains in the core of the protein show very good agreement to the deposited crystal structure (Fig. 2). Even though one or more of the predicted fragments adopt near-native conformations for 78% of the sequence (Supplementary Table 3), near-native placements of some fragments do not score well enough initially to be carried over to SA-MC assembly. Therefore, we iterate fragment matching and assembly, in each round assembling a consensus assignment, only containing fragments placed in similar locations in all low-scoring SA-MC trajectories. These regions are locked, the corresponding density is masked out, and another round of fragment search and SA-MC is carried out. In all cases except one (TMV), more than one iteration was required to obtain a partial model with at least 70% of the sequence placed (Supplementary Table 1). For example (Fig. 1b and Supplementary Table 1), 20S-α took three rounds to reach this level of coverage; the partial model after one round only had 34% of the sequence placed. Sequence positions at S3, S6 and S7 were correctly traced only in the second round, and S1, S2, S5, S9 and S10 only in the third (Fig. 1b). There are three cases (BBP1, STIV and VP6) where our approach was unable to automatically determine accurate full-length models. This is clearly identifiable by the poor coverage of the models after a single round of modeling (Supplementary Table 1). There are two main reasons for such failures. If a large portion of the protein does not have sufficiently accurate fragments, it is not possible to accurately assign positions for these residues into the map. BPP1 is one such case, where almost half the sequence has no accurate fragments (Supplementary Table 2). Secondly, β-sheet assembly from fragments is difficult due to the conformational variability of sheets compared to helices. STIV and VP6 are such cases where we fail to accurately build sheets (Supplementary Fig. 2). These failures suggest possibilities for future method improvement. We applied our method on a recently reconstructed cryo-EM map of the contractile sheath proteins of type VI secretion system (EMD-2699) at 3.5 Å resolution, with no detectable homologues of known structure. The asymmetric unit contained a hetero-dimer with 660 residues total. After manually segmenting the map, eight iterations of our protocol generated a partial model with 466 residues placed. In parallel, the map was manually traced with the aid of Buccaneer in the lab of co-author Basler. There is good overall agreement between two models: over 394 residues, the Cα RMSd is 1.1 Å. However, there are 35 residues where sequence registration is shifted by six positions between the two models (Fig. 3a–b). The segment is flanked by disordered residues; this combined with the poor local resolution makes sequence assignment particularly difficult. The sequence assignment made by our method shows better agreement with the density map than the hand-traced model in this region (Supplementary Fig. 3). We used RosettaCM to assemble full-length structures starting from both configurations. Among the low-energy models RosettaCM generated, the segment assigned by the automated method was exclusively chosen, suggesting our assignment is more energetically favorable and hence correct. Additionally, our approach was able to assign sequence in regions where the manual model did not (Fig. 3c). Combining our model with the manual model in RosettaCM, we were able to build a full-length model for the hetero-dimer complex (Fig. 3d and Kudryashev et al, in review). The blind case shows that our approach is tolerant to errors in segmentation; although our manual segmentation was imperfect, structure determination was still successful.

Figure 3

Blind structure determination

An error in the manually traced model (pink, a) is corrected by our method (green, b). The arrows in black show the positions of two residues in both models (F95 and F101), highlighting the six-residue registration shift between the models. Orange and blue arrows in (a) and (b) indicate the beginning and end of the region with the sequence registration discrepancy. (c) A partial trace generated by our method in a region where manual tracing was impossible. (d) The full-length RosettaCM model at the same region shows good agreement with the map.

We have developed a method for automatic de novo protein structure determination from near-atomic resolution cryo-EM data, and demonstrated its applicability to a wide range of datasets. Our method uses predicted backbone conformation to aid in sequence assignment, allowing determination of structure to atomic-level accuracy without requiring prior knowledge of protein topology from homologous structures or manually traced models[15,16]. The key concept introduced in this paper, that certain local sequences have preferences for certain backbone conformations, has previously been used to predict structures of small proteins (< 100 residues)[28] de novo, and larger proteins using sparse backbone-only NMR data[29-31]. However, no previous approach in protein structure modeling has used this concept in conjunction with experimentally determined local Cartesian-space restraints to restrict conformational space. The method described here should provide a general framework for the use of these types of sparse experimental constraints in protein structure determination. Several improvements will increase both the applicability and accuracy of our approach. Our tests assumed a map where the asymmetric unit was segmented. While manual segmentation is often straightforward (as in the blind case), it may prove difficult in highly intertwined structures. Further improvements of the method on all-β proteins are also necessary: strand-pairing bonuses in the scoring function combined with more aggressive fragment optimization into density should improve accuracy with all-β proteins. Our approach is amenable to incorporate additional structural information: known structures of components are easily incorporated, experimentally derived pairwise distance restraints may guide conformational sampling, and Cα traces provided by users. Our method should contribute to the determination of high accuracy models from near-atomic resolution maps, reducing human effort and errors due to human biases.

Methods

Map preparation

For all benchmark targets, the cryo-EM maps were segmented into single-subunit guided by native structures using UCSF Chimera’s “zone” tool at a distance of 4 Å. The cryo-EM maps and the corresponding deposited native structures used are listed in Table 1.

Matching fragments into density

For each 9-residue window of amino-acid sequence, we used the standard Rosetta fragment picker[24] to collect libraries of representative backbone conformations from proteins of known structure based on similar sequence and predicted secondary structure. Fragments from proteins of known structure homology (PSI- BLAST e-value < 0.05) to the benchmark proteins were excluded while constructing the fragment libraries. A sequence-derived fragment library given a protein sequence was curated with 25 backbone conformations per sequence position. We used backbone information given a fragment to first identify the likely locations and orientations in the density map using 6-D translation/rotation search. The density map was subdivided into a regular three-dimensional grid and the search fragment was translated to each grid point in turn. At each grid point, the spherical harmonic decomposition of model and map density was used to rapidly search all rotations of a backbone fragment against regions of experimental density[32]. To further speed up matching, this rotation search was only carried out at regions of high density (mean density Z score > 1 in a sphere around each grid point). For each fragment, the top 2000 placements were collected using the approximated correlation score between backbone configurations and density[33], giving 50000 candidate placements per sequence position. Side-chain information was then used to further refine the placements and identify the most likely placements where both backbone and physically realistic side-chain conformations have good agreement to the local density. At each sequence position, the 50000 backbone placements were then further refined with rotamer optimization and rigid-body minimization using Rosetta. After this optimization, 2500 placements for each sequence position are selected for each sequence position using the Rosetta full-atom density correlation score[33]. These fragments were clustered (with 2 Å RMSd cluster radius), and the lowest density score member was taken from each cluster. Finally, if there were more than 50 clusters, only 50 models were carried over to model assembly.

Evaluating compatible set of fragments

From these fragment placements, we next want to select a mutually compatible set. We assessed this compatibility using a scoring function with four terms: The term score measures the fit of a fragment to density, and is based on the density correlation between the fragment after side-chain rotamer optimization and the experimental map[33]. The other three terms, score, score, and score, assess the compatibility of a pair of fragments: The term score gives a bonus to pairs of fragments that place the same residue nearby, with a larger bonus for more overlapping residues; score penalizes pairs of fragments that put residues close in the sequence further apart than maxdist, the maximum observed distance of residues at a particular sequence separation; finally, score penalizes fragment pairs with two residues occupying the same place.

Simulated annealing Monte Carlo sampling

Simulated annealing Monte Carlo sampling (SA-MC) was used to search for a set of fragments that are mutually compatible. Each sequence position is initially assigned one random (out of 50 possible) fragment placements or a “null placement” which handles the possibility that there may be no good fragment placements at a particular sequence position. Each step in the trajectory replaces the fragment at a particular position subject to the Metropolis criterion using the score. For pairwise score terms, precomputing all pairwise scores allows for fast score evaluation of a fragment assignment. To control precision versus coverage, we assign a density score, dens, to the null placement; lower values lead to reduced coverage but more precision in fragment placement. All experiments in the paper used dens = −150. Finally, simulated annealing was carried out by slowly reducing the temperature from 500 to 1 in 200 increments with 5000 moves each. Total runtime was approximately 10 minutes per trajectory.

Iterative assembly of models

In many cases, there are a few similar fragment assignments with roughly equivalent scores. To identify all of these alternate models, we run 2000 SA-MC trajectories. We use this ensemble to find a high-confidence partial model to carry into the next round. From the lowest scoring 5% of trajectories, we assemble a backbone model by identifying all residues that are placed in the same position (with 3 Å tolerance) and taking the average backbone coordinate at each residue position. If fewer than 70% of backbone residues have been assigned, we iterate fragment matching and SA-MC assembly. The subsequent iteration of fragment matching was carried out by first masking out density which has been assigned in the backbone model from the previous iteration, then placing fragments only from sequence not yet assigned into density.

Completing models with RosettaCM

The final step in our approach is to rebuild the final set of unassigned residue positions in the partial models using RosettaCM[26], a comparative modeling method. Unassigned sequence positions in each partial model are rebuilt in the same manner as unaligned regions in comparative modeling. RosettaCM is guided by the cryo-EM density maps in completing partial models, by adding a score term assessing agreement of a model to experimental density during model-building and refinement with Rosetta’s physically realistic all-atom energy function[34]. For each partial model, 1000 full-length models are generated with RosettaCM. The best 20% by Rosetta energy are selected, and of those, the 10 models with best fit to the density are selected. In four of the cases from our testset, this leads to models that have similar or slightly higher RMSds than the partial model from the final iteration, which is expected since the unbuilt parts are mostly loops or regions with less resolved density. However, in two cases – FrhA and 20S-α – we see an improvement in overall RMS. For FrhA, this improvement is particularly striking: the Cα RMSd decreases from 2.3 Å to 1.3 Å. Supplementary Fig. 1 illustrates some improvements in the structure: RosettaCM corrected several loop residues incorrectly placed into density from the previous SA-MC assembly step. As indicated in Supplemtary Table 1, this rebuilding is consistent and robust, with minimal structural deviation over the 10 lowest scoring models.

Model-building with Buccaneer

Model building with Buccaneer[18] used the same segmented maps and was provided the same sequences as was our approach. Reflection data was computed from the cryo-EM maps using phenix.map_to_structure_factors[35]. SIGF was set to F/10 for all reflections using SFTOOLS from the CCP4 Program Suite v6.4.0[36]. A map padding of 5 Å was added to the border to ensure no effects from periodicity on model-building. We ran Buccaneer from the CCP4 Program Suite v6.4.0 with mostly default setting: five cycles of building/refinement were carried out using the correlation target function during model building, with “use R-free” disabled.

20S map reconstruction

T. acidophilum 20S proteasome was expressed and purified from E. coli according to the established protocols[37]. A drop of 2μL of purified 20S proteasome at a concentration of ~0.9 μM sample was applied to glow-discharged Quantifoil holey carbon grids (Quantifoil, Micro Tools GmbH, Germany), and plunge-frozen by using a Vitrobot Mark III (FEI company, USA). Grids of frozen hydrated samples were imaged using a FEI TF30 Polara electron microscope (FEI Company, Hillsborough, OR) equipped with a field mission electron source and operated at an accelerating voltage of 300kV. Images were recorded at a nominal magnification 20kX using a Gatan K2 Summit™ camera (Gatan Inc., Pleasanton, CA) operated at super resolution counting mode with a calibrated physical pixel size of 1.96 Å at 20kX. A 10 sec exposure time at a dose rate of ~10 counts/pixel/sec leads to a total dose ~30 e−/Å2. The defocus was in the range of 0.8~1.9 μm. The CTFFIND3 (Mindell and Grigorieff, 2003) was used to determine the defocus values. Half number of images with substantial drift and bad Thon rings are discarded. Side view particles of 20S proteasome were picked automatically by using FindEM[38]. All picked particles were first subject to standard procedure of multiple rounds of multi-reference alignment and classification[39]. Particles within bad 2D classes were removed. All remaining particles were subject to further manual inspection and more bad particles were removed. The final dataset contains 79,801 particles from 157 images of 20kX magnification. GeFREALIGN[40] was used to refine and determine the 3D reconstructions with a D7 symmetry following a frequency-limited refinement procedure[6,41]. Note that no motion correction was carried out for this dataset. Atomic structure of archaeal 20S proteasome (pdb code: 2C92) filtered to 15Å was used as initial model. The final 3D reconstruction has a resolution of ~4.8 Å using Fourier Shell Correlation 0.143 criteria[42]. This resolution is beyond 80% of camera’s physical Nyquist limit. Structure features in the amplitude-sharpened map confirm this claimed resolution.

Method availability

The de novo protein structure determination method described here is freely available for academic use, and will be distributed in an upcoming weekly release of Rosetta modeling suite. A preliminary version of the software described in this manuscript, including all scripts, executable files, and a documented example can be downloaded from: https://www.dropbox.com/s/5jngzlivdtz4zn0/Wang_etal_TRPV1.tar.gz?dl=0

42 in total

1. Situs: A package for docking crystal structures into low-resolution maps from electron microscopy.

Authors: W Wriggers; R A Milligan; J A McCammon
Journal: J Struct Biol Date: 1999 Apr-May Impact factor: 2.867

2. Optimal determination of particle orientation, absolute hand, and contrast loss in single-particle electron cryomicroscopy.

Authors: Peter B Rosenthal; Richard Henderson
Journal: J Mol Biol Date: 2003-10-31 Impact factor: 5.469

3. Combining electron microscopic with x-ray crystallographic structures.

Authors: M G Rossmann; R Bernal; S V Pletnev
Journal: J Struct Biol Date: 2001-12 Impact factor: 2.867

4. Toward high-resolution de novo structure prediction for small proteins.

Authors: Philip Bradley; Kira M S Misura; David Baker
Journal: Science Date: 2005-09-16 Impact factor: 47.728

5. Near-atomic resolution using electron cryomicroscopy and single-particle reconstruction.

Authors: Xing Zhang; Ethan Settembre; Chen Xu; Philip R Dormitzer; Richard Bellamy; Stephen C Harrison; Nikolaus Grigorieff
Journal: Proc Natl Acad Sci U S A Date: 2008-01-31 Impact factor: 11.205

6. Structure of the yeast mitochondrial large ribosomal subunit.

Authors: Alexey Amunts; Alan Brown; Xiao-Chen Bai; Jose L Llácer; Tanweer Hussain; Paul Emsley; Fei Long; Garib Murshudov; Sjors H W Scheres; V Ramakrishnan
Journal: Science Date: 2014-03-28 Impact factor: 47.728

7. EM-fold: De novo folding of alpha-helical proteins guided by intermediate-resolution electron microscopy density maps.

Authors: Steffen Lindert; René Staritzbichler; Nils Wötzel; Mert Karakaş; Phoebe L Stewart; Jens Meiler
Journal: Structure Date: 2009-07-15 Impact factor: 5.006

8. Atomic structure of the 75 MDa extremophile Sulfolobus turreted icosahedral virus determined by CryoEM and X-ray crystallography.

Authors: David Veesler; Thiam-Seng Ng; Anoop K Sendamarai; Brian J Eilers; C Martin Lawrence; Shee-Mei Lok; Mark J Young; John E Johnson; Chi-yu Fu
Journal: Proc Natl Acad Sci U S A Date: 2013-03-21 Impact factor: 11.205

De novo protein structure determination from near-atomic-resolution cryo-EM maps.

Methods

Map preparation

Matching fragments into density

Evaluating compatible set of fragments

Simulated annealing Monte Carlo sampling

Iterative assembly of models

Completing models with RosettaCM

Model-building with Buccaneer

20S map reconstruction

Method availability

1. Situs: A package for docking crystal structures into low-resolution maps from electron microscopy.

2. Optimal determination of particle orientation, absolute hand, and contrast loss in single-particle electron cryomicroscopy.

3. Combining electron microscopic with x-ray crystallographic structures.

4. Toward high-resolution de novo structure prediction for small proteins.

5. Near-atomic resolution using electron cryomicroscopy and single-particle reconstruction.

6. Structure of the yeast mitochondrial large ribosomal subunit.

7. EM-fold: De novo folding of alpha-helical proteins guided by intermediate-resolution electron microscopy density maps.

8. Atomic structure of the 75 MDa extremophile Sulfolobus turreted icosahedral virus determined by CryoEM and X-ray crystallography.

9. Atomic-accuracy models from 4.5-Å cryo-electron microscopy data with density-guided iterative local refinement.

10. Iterative model building, structure refinement and density modification with the PHENIX AutoBuild wizard.

1. Variability of Protein Structure Models from Electron Microscopy.

Review 2. X-ray crystallography over the past decade for novel drug discovery - where are we heading next?

3. What Could Go Wrong? A Practical Guide to Single-Particle Cryo-EM: From Biochemistry to Atomic Models.

Review 4. Hybrid methods for combined experimental and computational determination of protein structure.

5. Refined Cryo-EM Structure of the T4 Tail Tube: Exploring the Lowest Dose Limit.

6. Flexible Fitting of Atomic Models into Cryo-EM Density Maps Guided by Helix Correspondences.

7. cryoem-cloud-tools: A software platform to deploy and manage cryo-EM jobs in the cloud.

Review 8. Cryo-EM in drug discovery: achievements, limitations and prospects.

9. Map segmentation, automated model-building and their application to the Cryo-EM Model Challenge.

10. Structure of the DASH/Dam1 complex shows its role at the yeast kinetochore-microtubule interface.