| Literature DB >> 25707029 |
Ray Yu-Ruei Wang1, Mikhail Kudryashev2, Xueming Li3, Edward H Egelman4, Marek Basler5, Yifan Cheng3, David Baker6, Frank DiMaio7.
Abstract
We present a de novo model-building approach that combines predicted backbone conformations with side-chain fit to density to accurately assign sequence into density maps. This method yielded accurate models for six of nine experimental maps at 3.3- to 4.8-Å resolution and produced a nearly complete model for an unsolved map containing a 660-residue heterodimeric protein. This method should enable rapid and reliable protein structure determination from near-atomic-resolution cryo-electron microscopy (cryo-EM) maps.Entities:
Mesh:
Substances:
Year: 2015 PMID: 25707029 PMCID: PMC4435692 DOI: 10.1038/nmeth.3287
Source DB: PubMed Journal: Nat Methods ISSN: 1548-7091 Impact factor: 28.547
Figure 1Protocol Overview
(a) First, for a 9-residue window centered on each position in the sequence, representative backbone conformations (fragments) are collected and docked into the density map. Second, the resulting fragment placements are then evaluated using a score function consisting of four terms: a density correlation term assessing the agreement of fragment and map; an overlap term favoring fragment pairs assigning the same residue to the same location; a closability term favoring fragment pairs close in sequence that are close in space; and a clash term preventing two residues from occupying the same place. Third, from the candidate placements (square green blocks), simulated annealing Monte Carlo finds a set of fragments (square orange blocks) optimizing the score function; a null placement (empty blocks) may be assigned in positions where no good placements have been identified. Fourth, a partial model is assembled by combining fragment placements from multiple Monte Carlo trajectories. Steps 1–4 are carried out iteratively until ~70% of sequence is covered. Finally, unassigned regions in the final partial model are completed using density-guided loop sampling followed by all-atom refinement. (b) Model building for the 20S α-subunti in a 4.8 Å resolution cryo-EM map required three iterations, illustrated in the three rows in the figure. In leftmost column, the density map used for the corresponding iteration, after masking out density from the previous round’s partial model. In column 2, the assembled partial models after Monte Carlo sampling (colored blue at the N-terminus to red at the C-terminus). In column 3, fragment placement results after translation and rotation search. The x-axis covers the sequence of the protein, and each black point represents a single fragment placement; the y-axis indicates the distance of the fragment placement to the native conformation. Pink points indicate fragments chosen to assemble the partial model, and the grey shading shows residues covered in the partial model. Secondary structural elements in the native are indicated above the plot, where H indicates helix and S indicates strand. In rightmost column, convergence of Monte Carlo trajectories. Each point represents the fragment assignment of an independent search trajectory, colored by number of total fragments placed. The X-axis indicates the percentage of fragments placed within 2.5 Å RMSd to the native configuration, while the Y-axis shows the score with the fragment compatibility function. The horizontal dashed line shows the score cut used for partial model generation.
Figure 2High-accuracy model building in near-atomic resolution cryo-EM maps
(Leftmost column) The density maps used for de novo model building on 20S-α at 4.8 Å, TRPV1-TM at 3.4 Å, FrhB at 3.4 Å, and FrhA at 3.4 Å (Row 1 to 4, respectively). (Column2) The partial model at the final iteration. (Column 3 and 4) Full-length RosettaCM models (red) are superimposed with the native structure (blue). Each sub-figure shows the lowest-RMSd structure from 10 lowest-electron-density-score models (left) with a close-up of the core showing that native core packing is recovered (right).
Figure 3Blind structure determination
An error in the manually traced model (pink, a) is corrected by our method (green, b). The arrows in black show the positions of two residues in both models (F95 and F101), highlighting the six-residue registration shift between the models. Orange and blue arrows in (a) and (b) indicate the beginning and end of the region with the sequence registration discrepancy. (c) A partial trace generated by our method in a region where manual tracing was impossible. (d) The full-length RosettaCM model at the same region shows good agreement with the map.