Literature DB >> 32042190

Measurement of atom resolvability in cryo-EM maps with Q-scores.

Grigore Pintilie1, Kaiming Zhang2, Zhaoming Su2, Shanshan Li2, Michael F Schmid3, Wah Chiu4,5.   

Abstract

Cryogenic electron microscopy (cryo-EM) maps are now at the point where resolvability of individual atoms can be achieved. However, resolvability is not necessarily uniform throughout the map. We introduce a quantitative parameter to characterize the resolvability of individual atoms in cryo-EM maps, the map Q-score. Q-scores can be calculated for atoms in proteins, nucleic acids, water, ligands and other solvent atoms, using models fitted to or derived from cryo-EM maps. Q-scores can also be averaged to represent larger features such as entire residues and nucleotides. Averaged over entire models, Q-scores correlate very well with the estimated resolution of cryo-EM maps for both protein and RNA. Assuming the models they are calculated from are well fitted to the map, Q-scores can be used as a measure of resolvability in cryo-EM maps at various scales, from entire macromolecules down to individual atoms. Q-score analysis of multiple cryo-EM maps of the same proteins derived from different laboratories confirms the reproducibility of structural features from side chains down to water and ion atoms.

Entities:  

Mesh:

Substances:

Year:  2020        PMID: 32042190      PMCID: PMC7446556          DOI: 10.1038/s41592-020-0731-1

Source DB:  PubMed          Journal:  Nat Methods        ISSN: 1548-7091            Impact factor:   28.547


Introduction

CryoEM single-particle methods strive to create accurate, high-resolution 3D maps of macromolecules. Depending on many factors including imaging apparatus, detector, reconstruction method, structure flexibility, sample heterogeneity, and differential radiation damage, resulting maps have varying degrees of resolvability. Accurate quantification of resolvability in cryoEM maps has been a challenge in the field[1]. This task is very important as it can affect the interpretation of such maps. For every cryoEM map, a resolution is estimated from a Fourier shell correlation (FSC) plot between two independent reconstructions, each reconstruction stemming from a separate half of the data set[2]. It is well recognized that cryoEM maps usually do not have isotropic resolution throughout, and thus local resolution is typically estimated, e.g. with ResMap[3], Bsoft[4], or MonoRes[5]. However such loclal resolutions do not easily translate to particular features of interest such as side chains or individual atoms. Atomic models can be either fitted or built directly into cryoEM maps[6,7]. Map-model scores are then calculated to assess how well the model fits the map[8]. Real-space refinement[9] or flexible fitting[10,11] can be applied, making sure to not overfit to noise[12,13]. The latter is accomplished through stereochemical restraints, e.g. bond lengths, angles, dihedrals, preferred rotamers and van-der Waals distances, and additional secondary-structure constraints, e.g. in the form of hydrogen bonds[9,11,14,15]. Once an atomic model has been fitted to or derived from a cryoEM map, it can then be used to assess the map itself. This can be done in several ways, including a map-model FSC curve, which requires that the model first be converted to a cryoEM-like map at the same resolution as the original map. Such an FSC plot reflects the entire map volume. Proper masking may be used to assess smaller features such as individual protein chains[12], however it is impractical to assess even smaller features such as side chains or individual atoms using this approach. Other methods that assess smaller features in a cryoEM map using a fitted model include EMRinger[16] and Z-scores[17]. EMRinger considers map values near carbon-β atoms, while Z-scores can be applied to secondary structure elements (such as α-helices and β-sheets) or side chains. These scores were shown to correlate with map resolution when averaged over entire maps and models. Moreover, they can also identify features in the model (e.g. secondary structure elements or side chains) which are not well-resolved or not fitted properly to the map. CryoEM maps have reached resolutions nearer to atomic-dimensions, for example apoferritin at 1.54Å (EMD:9865), 1.62Å (EMD:0144)[18], 1.65Å (EMD:9599), and 1.75Å (EMD: 20026). At such resolutions, we may start to assess the resolvability of individual atoms. In crystallography, B-factors or atomic displacement parameters (ADPs) reflect the uncertainty in the position of any atom, and are refined from diffraction data[19-21]. ADPs can also be calculated in cryoEM maps[22]. However, since ADPs are typically refined with restraints, they are not dependent only on the map values around the atom. Other ways to measure positional uncertainties include multi-model refinement[23] and molecular dynamics[12,24]; these also assume various restraints on atoms and hence do not reflect map values alone. In this paper, we introduce Q-scores, which are calculated directly from map values around an atom’s position. A similar score is EDIA, which was applied to high-resolution X-ray maps. The EDIA method considers map values within each atom’s radius, which is parameterized for different elements and resolutions. In contrast, Q-scores are calculated independently of element type or map resolution. We apply Q-scores to measure resolvability of individual atoms, including solvent atoms, and also of groups of atoms such as side chains in proteins and bases in nucleic acids.

Results

Atomic Map Profiles

The basis of the Q-score is the atomic map profile. Atomic map profiles are calculated by averaging map values at increasing radial distances from an atom’s position. The radial distances range from 0Å to 2.0Å, and only points that are closer to the atom in question than to any other atoms in the model are considered. Figure 1A shows example atomic profiles in our two new maps of Apoferritin with resolutions of 1.75Å and 2.32Å, now deposited as EMD:20026, and EMD:20027.
Figure 1.

Atomic map profiles in cryoEM two maps of Apoferritin. (A) The residue Leu26 in the fitted model (PDB:3ajo) is shown, along with contour surface of the cryoEM map around this residue. Spherical shells of points centered on the CD2 atom are shown at increasing radial distances. Only points that are closer to the CD2 atom than to any other atom in the model are used to calculate an average map value at each radial distance. (B) Plots of average map value vs. radial distance; these are the atomic map profiles. The dotted lines represent Gaussian functions which are fitted to each profile.

When calculating the profile for an atom, map values at N points are used to calculate the average at a particular distance, r. The N points are distributed evenly across the part of the sphere (centered at the atom, with radius r) that is closer to the atom and not any other atom in the model. At r=0 or the atom center, the map value is duplicated N times, so that N is the same at each radial distance. In all calculations used here, we use N=8. Larger values of N typically create smoother profiles, however have only minor effects on Q-scores described below. The model in Figure 1 is the X-ray model of Apoferritin, (PDB:3ajo), which was first rigidly fitted to the cryoEM map, and then further refined into each cryoEM map using Phenix real-space refinement[9]. In the examples, atomic profiles have Gaussian-like contours. We consider a Gaussian equation of the form: Gaussian functions of the form in Eqn.1, where x is the radial distance and y the average map value, fit well to the atomic profiles shown in Figure 1 up to a distance of 2Å, with a mean error of 2.4%. For higher resolution data, e.g. from X-ray crystallography, multiple Gaussians are used to closely represent atomic form factors[25], however we do not consider that here. Past 2Å from the atom, map profiles observed in these and other similar resolution cryoEM maps become noisy and start to increase. This is likely due to effects from other nearby atoms and/or solvent. When the model is well-fitted to the map, the width of the Gaussian function (Eqn.1) fitted to the profile, , may be considered to be proportional to factors such as the resolution of the map and the overall mobility of the atom. Regardless of the cause, in this paper we assume that the profile seen in the map indicates to what degree the atom is resolved: narrower profiles indicate the atom is better resolved, while wider profiles indicate the atom is less well resolved.

Q-score

The Q-score measures how similar the map profile of an atom is to a Gaussian-like function we would see if the atom is well-resolved. Thus, to calculate it, the atomic map profile is compared to a ‘reference Gaussian’ as given by Eqn. 1, with the following parameters: In the above, the mean, μ, is set to 0, as the reference Gaussian is centered at the atom’s position. The parameters and are obtained using the mean/average across all values in the entire map, avg, and the standard deviation of all values around this mean, σ. The width of the reference Gaussian is set as σ=0.6. These parameters were chosen to make the reference Gaussian roughly match the atomic profile of a well-resolved atom in the 1.54Å cryoEM map as shown in Figure 2B.
Figure 2.

Calculation of Q-scores for an atom in 6 maps at different resolutions, including an X-ray map (PDB:3ajo). The atom is CD2 from Leu 26 in the X-ray model PDB:3ajo fitted to each map. The atomic profile in each map is marked with the letter , while the reference Gaussian is marked with .

The Q-score is then calculated as a correlation between values in the atomic profile obtained from the map, , by trilinear interpolation to nearest 8 grid points, and values obtained from the reference Gaussian, . The following normalized about-the-mean cross-correlation formula is used: Several atomic profiles and reference Gaussians are illustrated in Figure 2. At resolutions close to 1.5Å, the atomic profiles are more similar to the reference Gaussian, and hence Q-scores are higher. At lower resolutions, the atomic profiles of the same atom are wider than the reference Gaussian, hence Q-scores are lower. Q-scores would also be low for atomic profiles that are mostly noise (e.g. random values or a sharp peak). In some cases when the atom is not well-placed in the map, the Q-score can be negative if the atomic profile has a shape that increases away from the atom’s position. Q-scores are low when the entire model is placed incorrectly in the map, e.g. during a global search. They can increase if the model-map fit is improved by local refinement (Supplementary Figure 1). Q-scores begin to decrease as resolutions of the map increase beyond 1.30Å, as atomic profiles begin to be much narrower than the reference Gaussian (Supplementary Figure 2). This effect may be useful in cryoEM maps to give very sharp peaks, which are more likely to be noise, lower Q-scores. Calculating Q-scores is similar to calculating a cross-correlation between the model and a cryoEM map, using a simulated map of the model blurred using a Gaussian function with the parameters in Eqns. 2–5. The main difference is that with Q-scores, the cross-correlation is performed atom-by-atom, separating out parts of the density that are closest to each atom. The cross-correlation about the mean is used so that the Q-scores decrease as resolution also decreases. When not subtracting the mean, this effect would not be ensured, as shown previously[17] and also in Supplementary Figure 3. We tested the effect of several factors on Q-scores. First, using the cross-correlation about the mean makes the Q-scores insensitive to the height and vertical offset of the reference Gaussian (Supplementary Figure 3). This means that as long as map values are decreasing around an atom, regardless of their relative magnitude in the map, the Q-score for the atom could still be high. Second, small changes in grid step and placement do not affect the Q-score; however if the grid step is too large relative to the resolution of the map, resolvability and also Q-scores can start to decrease (Supplementary Figure 4). Finally, sharpening can increase the visible detail in the map along with Q-scores, but Q-scores start to decrease if excessive sharpening is applied (Supplementary Figure 5).

Q-scores of Atoms in Proteins

Figure 3 shows Q-scores for atoms taken from maps of Apoferritin at various resolutions. One of the maps is an X-ray map at 1.52Å resolution (2fo-fc, PDB:3ajo) as a reference; another is a recent high-resolution map at 1.54Å (EMD:9599). The other three are new maps we reconstructed to 1.75Å (EMD:20026), 2.3Å (EMD:20027), and 3.1Å (EMD:20028) with different numbers of particle images from the same data set. For the cryoEM maps, the X-ray model PDB:3ajo was fitted to the density and refined using Phenix real-space refinement[9].
Figure 3.

Atom Q-scores for three residues taken from Apoferritin maps at various resolutions. Atom Q-scores are shown close to each atom, and the average Q-score is shown under each residue.

In Figure 3, Q-scores for each atom correlate well with visual resolvability at the contour level used in each case, i.e. the more resolvable an atom, the higher the Q-score. However, in some cases, the Q-score for an atom can be relatively high even if there is no map contour around it; this is due to the effect mentioned previously that even if the map values around an atom are low, the Q-score can still be high if they are decreasing away from the atom. Resolvability and Q-scores can decrease for some residues faster than others as a function of resolution. For example, in Figure 3, the Q-score for ASP126 drops more than for ASN25 from 1.52Å to 3.9Å. This effect may be due to several reasons. First, some residue types may be more susceptible to radiation damage (as previously shown using EMRinger[16]). Also, certain residue types may be more conformationally dynamic, or occur in environments that are more dynamic (e.g. solvent accessible), and hence may not resolve as well with fewer number of particles. Finally, the interaction of the electron beam with negatively charged side chains may have a weakening effect on map values around them[22].

Q-scores for Atoms in Nucleic Acids

Q-scores can also be calculated for atoms in nucleic acids. In Figure 4, we used several maps and models containing RNA from the EMDB at resolutions ranging from 2.5Å to 4.0Å. Q-scores were averaged over atoms in bases (labeled with Qbase), phosphate-sugar backbones (labeled with Qbb), and entire nucleotides. As with proteins, Q-scores decrease with resolvability and estimated map resolution. Figure 4 also illustrates a general trend that at ~4Å and lower resolutions, stacked bases from adjacent nucleotides are typically not separable in cryoEM maps, whereas at higher than 4Å resolutions, they usually do become separate at some contour levels.
Figure 4.

Q-scores averaged over nucleotides (Qnt) in cryoEM maps and models of ribosomes from the EMDB at four different resolutions. Q-scores are also averaged for base (Qbase) and phosphate-sugar backbone (Qbb) atoms.

It is also interesting to note that for the examples in Figure 4, at high resolutions (~2.5Å), the difference in Q-score or resolvability of individual bases is higher than that of the backbone (0.84 for base vs. 0.73 for backbone). Going towards lower resolutions in this example, bases become less resolvable (0.45 for bases vs 0.56 for backbone). This may be counter-intuitive as bases can have higher values in the map (i.e. appear first at a high contour level). However, these contours may have overall less detail as adjacent stacked bases are not fully separable at any contour level.

Q-score vs. Resolution

Q-scores can also be averaged across an entire model to represent an average resolvability measure for the entire map. Such average Q-scores were plotted as a function of reported resolution for a number of maps and models obtained from the EMDB. Figure 5 shows these plots for two sets of maps and models, one set using only atoms in proteins, and the other set only atoms in nucleic acids. The full sets are listed in Tables 1 and 2. In both cases, the average Q-score correlates very strongly to reported resolution. This strong correlation indicates that Q-scores closely capture the resolvability of atomic features in cryoEM maps, much as the estimated resolution of a map does. However, Q-scores are useful in quantifying resolvability of small features within each map down to individual atoms.
Figure 5.

Average Q-scores vs. reported resolution for maps and models obtained from EMDB. (A) Q-scores averaged over only protein atoms in maps and models listed in Table 1. (B) Q-scores averaged over only nucleic acid atoms in maps and models listed in Table 2. Linear functions fitted to the points are drawn with a dotted line in both plots; equations and r2 value are inset.

Table 1.

Maps from EMDB for which Q-scores of protein components are calculated for the plot in Figure 5A. The entries marked with * were also in the original EMRinger analysis[16]. All others are maps of Apoferritin and β-galactosidase at resolutions up to 1.54Å.

EMD IDPDBResolution (Å)Q-score# Protein Atoms
198653ajo1.540.851,473
295993wnw1.620.871,433
31443ajo1.650.851,473
4200263ajo1.750.811,473
5101016s611.840.902,799
601535a1a1.890.7232,828
798903ajo1.90.821,473
877705a1a1.90.7132,828
999143wnw2.010.841,433
1049056rjh2.10.831,364
1141165a1a2.20.691,364
1244155a1a2.20.6932,828
1389085a1a2.20.6932,828
1429845a1a2.20.6232,828
15200273ajo2.320.751,473
1644145a1a2.40.6832,828
1768405a1a2.60.6432,828
1847013wnw2.70.671,433
19202273ajo2.850.481,473
20200283ajo3.080.601,473
215256*3izx3.10.5732,209
2238543ajo3.150.661,473
235160*3iyl3.20.5680,835
245623*3j9i3.20.6046,228
255995*3j7h3.20.5832,824
2659955a1a3.20.5432,828
275778*3j5p3.270.3718,424
282513*4ci03.360.606,867
292762*3j7y3.40.5260,863
302787*4v19,4v1a3.40.5166,810
312278*3j2v3.50.474,629
325764*3j4u3.50.5524,653
336035*3j7w3.50.5017,829
345925*3j6j3.60.436,344
352764*3j803.750.4239,871
362773*4uy83.80.3426,960
375830*3j633.80.4210,590
386000*3j7l3.80.523,613
3901403ajo3.90.481,473
402763*3j8140.3943,848
415600*3j3i4.10.377,515
4228245a1a4.20.3832,828
432364*4btg4.40.3411,840
442273*3zif4.50.3094,377
452677*4upc4.50.283,127
465678*3j404.50.3924,066
475645*3j3x4.60.2161,264
482788*4v1w4.70.3632,736
495646*3j3x4.70.1761,264
505895*3j6e4.70.2960,318
515391*3j1b4.90.2462,992
525886*3jbd50.377,560
535896*3j6f50.2760,318
546187*3j8x50.219,235
556188*3j8y50.209,343
Table 2.

Maps from EMDB containing RNA for which Q-scores vs. resolution are plotted in Figure 5B.

EMD IDPDB FileResolution (Å)Q-score# Nucleic Acid Atoms
1101294udv1.90.8167
2101304udv20.8067
3100776s0z2.30.6497,227
4100766s0x2.430.5764,722
570256az3-pdb-bundle12.50.7034,068
670256az3-pdb-bundle22.50.7039,212
783615t5h-pdb-bundle12.540.6860,092
802436hma2.650.6663,217
970246az12.70.6642,699
1065833jcs-pdb-bundle12.80.5772,130
11201736ore-pdb-bundle12.90.6297,294
1246386qul30.6562,760
1306006ole-pdb-bundle330.6280,776
1402336hiz-pdb-bundle13.080.6631,798
1545606qik-pdb-bundle13.10.613,030
16100686rzz-pdb-bundle13.20.5867,292
1701016gzq-pdb-bundle13.280.5667,292
1841255lze-pdb-bundle13.50.5065,324
1941255lze-pdb-bundle23.50.5464,391
2029384ug0-pdb-bundle13.60.5437,311
2129384ug0-pdb-bundle23.60.5038,504
2265593jcj-pdb-bundle13.70.4734,577
2365593jcj-pdb-bundle23.70.4263,932
2486205uyq-pdb-bundle13.80.4233,012
2586205uyq-pdb-bundle23.80.4370,155
2600766gwt-pdb-bundle13.80.4234,656
2700766gwt-pdb-bundle23.80.4136,969
2801926hcf-pdb-bundle13.90.5264,900
2901926hcf-pdb-bundle23.90.5183,585
3001926hcf-pdb-bundle33.90.412,109
3182795kps-pdb-bundle13.90.4333,016
3282795kps-pdb-bundle23.90.4468,569
3386185uyn-pdb-bundle140.3833,012
3486185uyn-pdb-bundle240.3970,133
3540805lmu40.4334,527
3627633j8140.4039,828
3743506g514.10.4319,905
3882805kpv-pdb-bundle14.10.4433,016
3982805kpv-pdb-bundle24.10.4370,236
4006436o7k4.20.4034,777
41201886ost-pdb-bundle14.20.4097,110
4243826gc74.30.3440,850
4300836gxp-pdb-bundle14.40.3364,749
4443496g4w4.50.3118,753
4531335ady4.50.3612,104
4643516g534.50.3419,905
4701046gzx-pdb-bundle14.570.3665,324
4840835lmv4.90.2334,527
4935535mrf-pdb-bundle14.970.3557,598
5084735tzs5.10.1813,410
5136615no25.160.3332,930
5236625no35.160.3132,930
5341225lzb-pdb-bundle15.30.2837,309
5444276i7o-pdb-bundle15.30.2972,803
5540755lmp5.350.2832,964
The linear plots in Figure 5 show that average Q-scores drop toward 0 at ~6–7 Å, however an analysis using simulated maps indicates that they taper off and decrease slowly toward 0 at lower resolutions (Supplementary Figure 6). Negative Q-scores would only be expected if atoms are not placed on peaks, such that map values increase away from their position. Nevertheless, due to the change in rate of decrease, we expect that Q-scores are most useful at resolutions better than 5–6Å.

Q-scores vs. B-factors and ADPs

B-factors and atomic displacement parameters (ADPs) are used in X-ray crystallography to convey the positional uncertainty of atoms[19-21]. They are also dependent to some degree on resolution[27] (Supplementary Figure 7). When refining B-factors and ADPs, various restraints, parameters and initial values can be used, hence the results in each map may vary. Comparisons of B-factors/ADPs to Q-scores show that they correlate only weakly (Supplementary Figures 8,9). Hence they likely convey somewhat different information.

Q-scores of Solvent Atoms

The X-ray Apoferritin model (PDB:3ajo) contains one protein chain, 229 oxygen (O) atoms (from water) and 12 Mg atoms. A closeup on the 2Fo-Fc map and model with two Mg and three O atoms is shown in Figure 6A. Figure 6 B,C,D shows cryoEM maps at near-atomic resolutions (1.54Å, 1.65Å, and 1.75Å). The model used all cases comes from the X-ray map. It is reassuring to see that some of the solvent atoms in the X-ray structure can also be observed in the cryoEM maps (e.g. Mg183, O280, O236). However, some of the solvent atoms (e.g. Mg184), are not seen equally well in all three maps; for example, in the 1.54Å and 1.65Å maps, Mg184 has low Q-score (0.12 and 0.03 respectively). Such differences may be due to different affinities at some sites and/or different biochemical conditions across the different data sets.
Figure 6.

A close up in Apoferritin models showing solvent atoms (Mg and O from water), along with calculated Q-scores in purple under each atom and nearby residue. The initial model comes from the X-ray map (PDB:3ajo) shown in A. It was further refined into each of the three cryoEM maps, B–F.

Supplementary Figure 10A shows distributions of Q-scores for solvent atoms in the X-ray map (PDB:3ajo). Most solvent atoms have very high Q-scores of 0.9 and higher. Visual inspection confirmed that all these solvent atoms can be seen in the X-ray map (2fo-fc), e.g. as shown in Figure 6A. Supplementary Figure 10B,C shows Q-score distribution plots for the same model rigidly fitted to, and also refined in, the cryoEM maps at 1.54Å and 1.75Å resolution. The model was refined in the cryoEM maps including solvent atoms, using Phenix real-space refine[9]. For the rigidly fitted model, Q-scores of the solvent atoms are considerably lower than in the X-ray map (Supplementary Figure 10B). For example, in the 1.75Å cryoEM map, only 44 of the 229 O atoms from water have Q-scores of 0.8 and higher. In the 1.54Å map, 68 have Q-scores of 0.8 and higher. Thus some of the solvent atoms in the X-ray structure may not be resolvable in the cryoEM maps or potentially be in different positions. To explore whether solvent atoms may have different positions in the cryoEM maps, Q-scores of the solvent atoms were also calculated in the X-ray structure after real-space refinement with Phenix[9]. The distributions in the Q-scores for solvent atoms after this procedure are plotted in Supplementary Figure 10B, C for the two cryoEM maps. Q-scores are now higher; 142 water atoms in the 1.54Å map and 145 atoms in the 1.75Å map have Q-scores of 0.8 and higher, compared to 225 water atoms in the X-ray map with Q-scores of 0.8 and higher. We further consider water atoms with Q-scores of 0.8 and higher after refinement, which can be considered to be resolved in the cryoEM maps. In the 1.54Å map, the 142 water atoms with Q-scores 0.8 and higher moved between 0.1Å and 2.2Å, on average 0.54Å. In the 1.75Å map, the 145 water atoms with Q-scores of 0.8 and higher moved between 0.1Å and 1.6Å, on average 0.67Å. Radial distance plots in Supplementary Figure 11 show sharp peaks at ~2.8Å for water-water and water-protein distances in X-ray maps, but more diffuse peaks around the same distance in cryoEM maps. Although it is difficult to assess the exact cause of these relatively small distance variations between X-ray and cryoEM structures, it is reasonable to conclude that many of the waters in the X-ray structure are also resolved and near the same positions in cryoEM maps. Water networks have been shown to be important in ligand binding affinities and to vary due to structural differences even in X-ray structures[28]. Further studies with more cryoEM maps at similar resolutions may further elucidate and characterize such variations. In the above analysis, solvent atom positions were based on those originally observed in the X-ray structure. If one studies a de novo map, the identification of solvent atoms would require a protocol used in modeling software[30]. In addition to such a protocol, Q-scores may be useful as an additional parameter to assist in the finding of such solvent atoms.

Q-scores of Solvent Atoms at Different Resolutions

Finally, we looked at the resolvability and Q-scores of solvent atoms in cryoEM maps of Apoferritin at lower resolutions, as shown in Figure 6 E,F. The locations of the solvent atoms are again taken from the X-ray model (PDB:3ajo). Mg183 appears resolved at both 1.75Å and 2.3Å, with separable contours in both maps and high Q-scores (0.93 and 0.80). In the 3.1Å map, the contour is no longer separable from that of the nearby His65 residue, and the Q-score is also considerably lower (0.60). The water atoms are similarly resolved in the 1.75Å and 2.3Å maps and contours around them can be seen, however at 3.1Å they can no longer be seen and Q-scores become very low. At 3.1Å resolution, both Mg atoms still have relatively high Q-scores, and they are inside the map contour at lower threshold. Thus even at such lower resolutions, it appears ions can significantly influence cryoEM map values. Thus even at these resolutions, solvent atoms perhaps may be considered in the model, particularly if known structures of the same complex at higher resolutions also contain such atoms. Consequently, this may improve the accuracy of side chain positions and rotameric configurations during refinement.

Discussion

Q-scores measure the resolvability of individual atoms in a cryoEM map, using an atomic model fitted to or built into the map. It should be noted that nothing is assumed about the model itself, e.g. whether it has good stereochemistry; this could be deduced with other scores such as the Molprobity score[3131]. Q-scores averaged over entire models correlate very closely to the reported resolution of the maps in which they are calculated. The score can also be useful to analyze the map and its resolvability in different regions, and also test whether the model may need further refinement in some areas as indicated by low Q-scores. Here, Q-scores were also applied to various maps at different resolutions to show quantifiable trends across different side chains in proteins, bases in nucleic acids, and also to assess the resolvability of solvent atoms and ions. Q-scores should continue to be a useful metric in the analysis of cryoEM maps and models.

Online Methods

CryoEM

Human apoferritin samples were provided by F. Sun and X.J. Huang (Institute of Biophysics, CAS). Images of the sample were collected in Titan Krios electron microscope (Thermo Fisher) at 300 keV, equipped with BioQuantum energy filter and K2 director detector (Gatan). A total of 1,100 images were recorded in movie mode. Motion correction was performed with MotionCor2[1] (v1.1.0). Particles were picked using the EMAN2 neural network particle picker[2] (EMAN2 v2.22). 3D reconstruction was performed using Relion[3] (v3.0). Map resolution was estimated from two independently reconstructed maps. Three maps of apoferritin were reconstructed using different number of particles: 1.75Å using 70,648 particles, 2.3Å using 9,600 particles, and 3.1Å using 495 particles. All three maps were reconstruction with octahedral symmetry.

Models

The X-ray model PDB:3ajo of human apoferritin was rigidly fitted to each new apoferritin cryoEM map using the Segger[4] plugin in UCSF Chimera[5], (v2.3), and refined using Phenix real-space refinement[6] (v1.14 build 3260). Q-score calculations were performed with the MapQ plugin to UCSF Chimera (v1.2).

Statistical Analysis

The Pearson correlation (r) values for Q-scores vs. reported resolution (plotted in Figure 5) were calculated using python and the scipy.stats.linregress function. The reported r_value was squared to obtain r in each case. In these figures, the number of data points is the number of entries in the respective table (Table 1 for Figure 5A an Table 2 for Figure 5B). For all figures, since the methods used are deterministic, the measurements were only performed once to obtain the displayed values.
  57 in total

1.  Data-guided Multi-Map variables for ensemble refinement of molecular movies.

Authors:  John W Vant; Daipayan Sarkar; Ellen Streitwieser; Giacomo Fiorin; Robert Skeel; Josh V Vermaas; Abhishek Singharoy
Journal:  J Chem Phys       Date:  2020-12-07       Impact factor: 3.488

2.  Structural basis for strand-transfer inhibitor binding to HIV intasomes.

Authors:  Dario Oliveira Passos; Min Li; Ilona K Jóźwik; Xue Zhi Zhao; Diogo Santos-Martins; Renbin Yang; Steven J Smith; Youngmin Jeon; Stefano Forli; Stephen H Hughes; Terrence R Burke; Robert Craigie; Dmitry Lyumkis
Journal:  Science       Date:  2020-01-30       Impact factor: 47.728

3.  Near-Atomic-Resolution Cryo-Electron Microscopy Structures of Cucumber Leaf Spot Virus and Red Clover Necrotic Mosaic Virus: Evolutionary Divergence at the Icosahedral Three-Fold Axes.

Authors:  Michael B Sherman; Richard Guenther; Ron Reade; D'Ann Rochon; Tim Sit; Thomas J Smith
Journal:  J Virol       Date:  2020-01-06       Impact factor: 5.103

4.  Simplified quality assessment for small-molecule ligands in the Protein Data Bank.

Authors:  Chenghua Shao; John D Westbrook; Changpeng Lu; Charmi Bhikadiya; Ezra Peisach; Jasmine Y Young; Jose M Duarte; Robert Lowe; Sijian Wang; Yana Rose; Zukang Feng; Stephen K Burley
Journal:  Structure       Date:  2022-01-12       Impact factor: 5.006

5.  The landscape of translational stall sites in bacteria revealed by monosome and disome profiling.

Authors:  Tomoya Fujita; Takeshi Yokoyama; Mikako Shirouzu; Hideki Taguchi; Takuhiro Ito; Shintaro Iwasaki
Journal:  RNA       Date:  2021-12-14       Impact factor: 4.942

6.  Cryo-EM Structures of Human Drosha and DGCR8 in Complex with Primary MicroRNA.

Authors:  Alexander C Partin; Kaiming Zhang; Byung-Cheon Jeong; Emily Herrell; Shanshan Li; Wah Chiu; Yunsun Nam
Journal:  Mol Cell       Date:  2020-03-27       Impact factor: 17.970

7.  Structures of ABCB4 provide insight into phosphatidylcholine translocation.

Authors:  Kamil Nosol; Rose Bang-Sørensen; Rossitza N Irobalieva; Satchal K Erramilli; Bruno Stieger; Anthony A Kossiakoff; Kaspar P Locher
Journal:  Proc Natl Acad Sci U S A       Date:  2021-08-17       Impact factor: 11.205

8.  Cryo-EM structures of full-length Tetrahymena ribozyme at 3.1 Å resolution.

Authors:  Zhaoming Su; Kaiming Zhang; Kalli Kappel; Shanshan Li; Michael Z Palo; Grigore D Pintilie; Ramya Rangan; Bingnan Luo; Yuquan Wei; Rhiju Das; Wah Chiu
Journal:  Nature       Date:  2021-08-11       Impact factor: 49.962

9.  VESPER: global and local cryo-EM map alignment using local density vectors.

Authors:  Xusi Han; Genki Terashi; Charles Christoffer; Siyang Chen; Daisuke Kihara
Journal:  Nat Commun       Date:  2021-04-07       Impact factor: 14.919

10.  Cryo-EM Map-Based Model Validation Using the False Discovery Rate Approach.

Authors:  Mateusz Olek; Agnel Praveen Joseph
Journal:  Front Mol Biosci       Date:  2021-05-18
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.