| Literature DB >> 29652252 |
Robert D Oeffner1, Pavel V Afonine2, Claudia Millán3, Massimo Sammito3, Isabel Usón3, Randy J Read1, Airlie J McCoy1.
Abstract
Molecular-replacement phasing of macromolecular crystal structures is often fast, but if a molecular-replacement solution is not immediately obtained the crystallographer must judge whether to pursue molecular replacement or to attempt experimental phasing as the quickest path to structure solution. The introduction of the expected log-likelihood gain [eLLG; McCoy et al. (2017), Proc. Natl Acad. Sci. USA, 114, 3637-3641] has given the crystallographer a powerful new tool to aid in making this decision. The eLLG is the log-likelihood gain on intensity [LLGI; Read & McCoy (2016), Acta Cryst. D72, 375-387] expected from a correctly placed model. It is calculated as a sum over the reflections of a function dependent on the fraction of the scattering for which the model accounts, the estimated model coordinate error and the measurement errors in the data. It is shown how the eLLG may be uEntities:
Keywords: LLGI; Phaser; eLLG; log-likelihood gain; maximum likelihood; molecular replacement
Mesh:
Substances:
Year: 2018 PMID: 29652252 PMCID: PMC5892874 DOI: 10.1107/S2059798318004357
Source DB: PubMed Journal: Acta Crystallogr D Struct Biol ISSN: 2059-7983 Impact factor: 7.652
Guidance for the outcome of molecular replacement in Phaser for the placement of the first model in nonpolar space groups, showing the relationship between the translation-function Z-score TFZ and the LLGI (TFZ ≃ LLGI1/2)
| Solved? | TFZ | LLGI |
|---|---|---|
| No | <5 | <25 |
| Unlikely | 5–6 | 25–36 |
| Possibly | 6–7 | 36–49 |
| Probably | 7–8 | 49–64 |
| Definitely | >8 | >64 |
TFZ and LLGI are significant at lower values for the first model in polar space groups: TFZ = 7 and LLGI = 50 for the first model in polar space groups, and TFZ = 5.5 and LLGI = 30 for the first model in space group P1.
Figure 1Confidence in the molecular-replacement solution for the placement of two components in the asymmetric unit. The increase in the final refined LLGI score (ΔLLGI = LLGI2 − LLGI1, where LLGI1 is for the placement of the first component and LLGI2 is for the placement of both components) provides a clear diagnostic for success in molecular replacement (8762 trials).
ROP1 (Glykos & Kokkinidis, 2003 ▸) solved with a 25-residue polyalanine helix
The LLGI values achieved in the search follow the eLLG values predicted from an Δm of 0.3 Å, an appropriate value for a helix of this length, and a scattering fraction of 0.14. The TFZ exceeds 8 for the placement of the third helix, when the increase in the LLGI (ΔLLGI) is 84.
| Helix number | eLLG | LLGI | ΔLLGI | TFZ |
|---|---|---|---|---|
| 1 | 20 | 63 | — | 4.5 |
| (1 +) 2 | 79 | 113 | 50 | 7.0 |
| (1 + 2 +) 3 | 177 | 197 | 84 | 11.6 |
| (1 + 2 + 3 +) 4 | 315 | 281 | 84 | 9.9 |
Figure 2Increase in the eLLG with resolution (orange line) for a model with Δm = 1.0 Å and a data set with 10 000 reflections to 2.0 Å resolution. An eLLG of 64 (greater than the solved-LLG) is achieved at 5.8 Å resolution. A contrasting case (blue line) shows the increase in the eLLG for a model with Δm = 1.7 Å. The eLLG will at best be 40.4 (less than the solved-LLG); however, this value is reached asympotically and including data with resolution higher than 3.0 Å (1.8 × Δm) will not increase the eLLG significantly.
Figure 3Average Phaser runtime for structure solution for 331 successful molecular-replacement test cases versus the eLLG used to determine the resolution of the data used for molecular replacement. The optimal-target-eLLG for minimizing the total Phaser runtime was 225.
Figure 4Single-atom molecular replacement for Shisa3 (PDB entry 5m0w) (McCoy et al., 2017 ▸). The eLLG for a single S atom depends on how well ordered it is, as measured by the difference between its B factor and the Wilson B factor.
Figure 5ΔRSCC (7) for 8966 successful molecular-replacement test cases consisting of 1526 targets and between one and 33 models per target (with an average of six models per target). The mean and standard deviation of the distribution of ΔRSCC was calculated in narrow windows of ΔeLLG. The mean (orange line) and one standard deviation either side of the mean (yellow lines) are indicated.
Figure 6Likelihood-guided occupancy refinement for PDB entry 2hh6 solved by molecular replacement with CASP7 model T0283TS020_2. (a) The structure 2hh6 (reference copy, ribbon representation, colour ramp from blue to red from the N-terminus to the C-terminus), the N-terminal helix of a symmetry-related copy of 2hh6 (worm representation, colour ramp as for the reference copy) and the result of likelihood-guided occupancy refinement of the placed model showing occupancies per residue ranging from 1 (black) to 0 (purple). The regions where 2hh6 and the model diverge are the regions where the refined occupancies are close to 0 (the model is shown in purple), and conversely where they coincide the refined occupancies are close to 1 (the model is shown in black). The window size for occupancy refinement was five residues, determined by the optimal-target-ΔeLLG. This figure was produced with CCP4mg (McNicholas et al., 2011 ▸). (b) The difference between the LLGI for the placed model T0283TS020_2 before and after removing five residues centred on each residue along the chain (blue line). The RSCC per residue is shown between the placed model T0283TS020_2 and the ‘true’ map (see text; orange line) and between the placed model T0283TS020_2 and the ‘model’ map (see text; dotted green line). The RSCC of the model to the ‘true’ map is better predicted by the change in LLGI (orange line versus blue line) than by the RSCC to the ‘model’ map (orange line versus dotted green line).
Figure 7The molecular-replacement solution of PDB entry 1xhb (grey) solved with PDB entry 2d7i (red and blue) after likelihood-guided pruning of the placed 2d7i model, where blue indicates an occupancy of 1 and red indicates an occupancy of 0. The symmetry-related copy of 1xhb that clashes with the model after initial molecular replacement is shown in gold.
Figure 8The LLGI as a function of the twin fraction for calculated data, showing that the LLGI of the molecular-replacement solution decreases in proportion to the twin fraction. (a) The LLGI as a function of the hemihedral twin fraction for calculated data for the test case of the β-lactamase (BETA)–β-lactamase inhibitor (BLIP) complex (Strynadka et al., 1996 ▸). (b) The LLGI as a function of tetartohedral twinning for calculated data for the test case of human complement factor 1 (Roversi et al., 2012 ▸).