| Literature DB >> 29533236 |
Isabel Usón1, George M Sheldrick2.
Abstract
For the purpose of this article, experimental phasing is understood to mean the determination of macromolecular structures by exploiting small intensity differences of Friedel opposites and possibly of reflections measured at different wavelengths or for heavy-atom derivatives, without the use of specific structural models. The SHELX programs provide a robust and efficient route for routine structure solution by the SAD, MAD and related methods, but involve a number of simplifying assumptions that may limit their applicability in borderline cases. The substructure atoms (i.e. those with significant anomalous scattering) are first located by direct methods, and the experimental data are then used to estimate phase shifts that are added to the substructure phases to obtain starting phases for the native reflections. These are then improved by density modification and, if the resolution of the data and the type of structure permit, polyalanine tracing. A number of extensions to the tracing algorithm are discussed; these are designed to improve its performance at low resolution. Given native data to 2.5 Å resolution or better, a correlation coefficient greater than 25% between the structure factors calculated from such a trace and the native data is usually a good indication that the structure has been solved.Entities:
Keywords: MAD; SAD; SHELX; autotracing; density modification; direct methods; experimental phasing
Mesh:
Substances:
Year: 2018 PMID: 29533236 PMCID: PMC5947774 DOI: 10.1107/S2059798317015121
Source DB: PubMed Journal: Acta Crystallogr D Struct Biol ISSN: 2059-7983 Impact factor: 7.652
Figure 1Resolution dependence of (a) mean divided by its standard deviation and (b) CC1/2(ano) for viscotoxin A1 (Pal et al., 2008 ▸).
Figure 2Issues in map tracing and additional constraints. (a) The trace corresponding to an electron-density map of the coiled coil autophagy-related protein 38 at 2.4 Å resolution illustrates how the helical geometry is degraded in the regions where the map is poor. (b) shows how an originally placed seed (lime) continuing a correct seed (red) is refined by the unrestrained simplex into the density of a neighbouring helix (purple). The neighbouring helix happens to be in the reversed direction. (c) Seed coverage in the first autotracing cycle of fibronectin for the unrestrained versus (d) restrained simplex refinement of β-sheets. In (c) and (d) the sequence numbers increase from left to right and the r.m.s.d. of the seeds versus the correct structure is represented in blue (<0.36 Å), green (0.5 Å) and yellow (<1 Å). Restraining the simplex refinement of the seeds (d) renders a better and more accurate coverage than in (c).
Figure 3Three examples of tracing evolution: (a, b) apoferritin, (c, d) autophagy-related protein 38 and (e, f) titin A168-A169. (a), (c) and (e) show standard polyalanine tracing, (b) shows restrained helical tracing with 14-residue seeds, (d) with 12-residue helical seeds extended by sliding and (f) with two-stranded antiparallel β seeds. The full sequence is shown on the left, with blue for Cα atoms within 0.3 Å, green for those at 0.6–0.3 Å, yellow for those at 1.0–0.6 Å, orange for those at 2.0–1.0 Å and red for no match. Incorrectly traced residues are shown on the same scale on the right.
Summary of density-modification and tracing statistics for test structures
CC(HA) is the substructure CC for the heavy atoms against the anomalous data from SHELXE, n is the first tracing-cycle number with CC > 30%, CC is the best CC within three cycles of cycle n, %CA/1 Å is the percentage of Cα atoms within 1 Å of their correct positions and wMPE is the weighted mean phase error for this trace. The -z option was not used (since it is not very reliable). The SHELXE parameters given are -s, solvent fraction; -m, number of density-modification cycles; -h, number of heavy-atom sites to use. -a for autotracing was used in all cases.
| Structure and source of data |
| Space group |
| CC(HA) |
| α/β | CC (%) | %CA/1 Å | wMPE (°) |
|---|---|---|---|---|---|---|---|---|---|
| Apoferritin |
|
| 2.00 | 29.0 | 4 | 77/7 | 47.6 | 91.2 | 29.8 |
| hARH3 |
|
| 1.82 | 13.2 | 4 | 67/6 | 39.2 | 81.9 | 34.9 |
| Insulin |
|
| 1.80 | 30.8 | 1 | 47/20 | 56.5 | 96.1 | 23.3 |
| Lysozyme pH 4.5 |
|
| 1.84 | 40.6 | 1 | 44/13 | 36.7 | 69.8 | 33.9 |
| Lysozyme pH 8.0 |
|
| 1.84 | 26.6 | 2 | 43/13 | 41.7 | 82.2 | 35.2 |
| NBR1 PB1 |
|
| 2.15 | 15.3 | 2 | 30/33 | 50.6 | 88.4 | 38.2 |
| PPE-Ca |
|
| 1.84 | 29.7 | 1 | 15/44 | 45.2 | 92.1 | 36.4 |
| PPE-Na |
|
| 2.15 | 41.1 | 3 | 15/43 | 35.4 | 70.1 | 42.2 |
| Proteinase K |
|
| 1.95 | 34.3 | 5 | 32/26 | 34.3 | 68.5 | 42.8 |
| Thaumatin |
|
| 2.00 | 24.2 | 2 | 13/44 | 39.2 | 81.2 | 29.a |
| Thermolysin |
|
| 1.98 | 26.7 | 1 | 42/17 | 42.3 | 84.9 | 33.6 |
| Trypsin |
|
| 1.82 | 35.5 | 1 | 14/40 | 41.8 | 82.1 | 34.3 |
| Trypsin |
|
| 1.20 | 19.9 | 1 | 11/36 | 41.2 | 85.3 | 28.0 |
| Elastase |
|
| 1.37 | 15.9 | 1 | 15/43 | 41.5 | 96.2 | 31.5 |
| Viscotoxin A1 |
|
| 1.70 | 23.5 | 1 | 45/22 | 52.4 | 100.0 | 24.1 |
| Viscotoxin A3 |
|
| 2.20 | 28.3 | 4 | 45/19 | 37.6 | 65.2 | 44.4 |
| Titin A168-A169 |
|
| 2.20 | 12.4 | 15 | 5/64 | 34.9 | 70.6 | 36.2 |
| SYCP3 |
|
| 2.41 | 13.1 | — | 96/1 | 23.6 | 28.8 | 62.3 |
| Autophagy-related protein 38 |
|
| 2.44 | 7.3 | 19 | 87/0 | 32.5 | 47.2 | 43.7 |
| Fibronectin |
|
| 1.60 | 19.8 | 11 | 4/54 | 39.3 | 34.0 | 48.6 |
| MG491 |
|
| 3.00 | 42.9 | 21 | 79/4 | 34.6 | 21.1 | 63.4 |
Using data from Mueller-Dieckmann et al. (2007 ▸).
Using data from Debreczeni, Bunkóczi et al. (2003 ▸).
Using data from Pal et al. (2008 ▸).
Using data from Debreczeni, Girmann et al. (2003 ▸).
Using data from Syrjanen et al. (2014 ▸).
Using data from Ohashi et al. (2016 ▸).
Using data from Rudiño-Piñera et al. (2007 ▸).
Using data from Martinelli et al. (2015 ▸).
Summary of density modification and autotracing with SHELXE for six test structures
The SHELXE parameter line applies to the case for which results are summarized: -m, number of density-modification cycles; -h, number of heavy-atom sites to use; -s, solvent fraction; -a, autotracing; -q, length of helix template; -Q, use helical extension in tracing; -B1, use antiparallel β templates for tracing.
| Standard tracing | Restrained tracing | |||||
|---|---|---|---|---|---|---|
| ID | Space group |
| CASE |
| wMPE (°)/NTRACE/CC (%) | wMPE (°)/NTRACE/CC (%) |
| Apoferritin |
| 171 | SAD, 8 Cd | 2.0 |
|
|
| 33.4/123/33.0 (8.2) | 31.0/160/50.4 (1.2) | |||||
| Autophagy-related protein 38 |
| 518 | SAD, 10 Se | 2.5 |
|
|
| 53.3/275/24.6 (11.0) | 42.3/327/32.3 (7.3) | |||||
|
| ||||||
| 42.0/311/33.6 (5.8) | ||||||
| Human synaptonemal complex protein 3 |
| 1150 | SAD, 15 I | 2.4 |
|
|
| 62.0/613/25.4 (7.3) | 55.5/775/34.6 (2.4) | |||||
|
| ||||||
| 56.3/762/33.5 (2.5) | ||||||
| Terminal organelle protein MG491 |
| 556 | SAD, 4 Se | 3.0 |
|
|
| 65.6/272/33.4 (12.2) | 56.4/299/39.5 (4.8) | |||||
|
| ||||||
| 58.9/282/38.0 (5.2) | ||||||
| Fibronectin |
| 90 | SAD, 9 S | 1.6 |
|
|
| 38.5/68/34.8 (4.3) | 38.7/63/35.4 (1.1) | |||||
| Titin protein A168-A169 |
| 195 | SAD, 4 S | 2.2 |
|
|
| 39.4/124/28.9 (5.6) | 37.1/132/30.6 (2.0) | |||||
| Kgp prodomain |
| 254 | SAD, 5 I | 2.6 |
|
|
| 68.6/107/17.6 (5.3) | 64.2/115/20.4 (5.0) |
Figure 4Phasing results for 20 runs comparing standard (blue) versus restrained tracing from new seeds (long, tethered helices or two-stranded β seeds in green; sliding of helical seeds in red) in the case of apoferritin with 14-residue helical seeds, autophagy-related protein 38 (atg38) with helix sliding to extend the 12-residue helical seeds, synaptonemal complex SYCP3 with both types of helical tracing, fibronectin (fibro) with two-stranded β seeds, titin protein A168-A169 (a16) with two-stranded β-sheets and Kgp prodomain (KGP-pro).
Figure 5Percentage of the main chain traced correctly within 1 Å r.m.s.d. (green), within 0.5 Å r.m.s.d. (blue) or incorrectly traced (red) for (a) apoferritin with 14-residue helical seeds, (b) autophagy-related protein 38 with helix sliding to extend the 12-residue helical seeds, (c) synaptonemal complex SYCP3 with both types of helical tracing, (d) fibronectin with two-stranded β seeds, (e) titin protein A168-A169 with two-stranded β-sheets and (f) Kgp prodomain.