| Literature DB >> 19002604 |
Jeffrey J Headd1, Robert M Immormino, Daniel A Keedy, Paul Emsley, David C Richardson, Jane S Richardson.
Abstract
Misfit sidechains in protein crystal structures are a stumbling block in using those structures to direct further scientific inference. Problems due to surfaceEntities:
Mesh:
Substances:
Year: 2008 PMID: 19002604 PMCID: PMC2704614 DOI: 10.1007/s10969-008-9045-8
Source DB: PubMed Journal: J Struct Funct Genomics ISSN: 1345-711X
Fig. 1Example Autofix correction of a Leu decoy rotamer from the 945-file dataset: Leu D 427 from 1A0E (Thermotoga neapolitana xylose isomerase) at 2.7 Å resolution. a (original) Leu D 427 in its deposited conformation, which is a rotamer outlier with an eclipsed χ angle and a clash with Leu D 430. b (both) Overlay, in stereo, of proposed corrected Leu rotamer (green) over the deposited conformation (pink). c (fixed) Corrected Leu D 427, in a favored mt rotamer. The clash with Leu D 430 has been alleviated and the bond angle idealized, with a somewhat better fit to the density. Images in Figs. 1, 2 and 4 were generated using KING [3]
Results of Autofix correction of Leu, Val, Thr, and Arg in 945 PDB files
| Database composition | Autofix results | |||||
|---|---|---|---|---|---|---|
| Total | Outliers | Outlier rate | Flipped | Rejected | Correction rate | |
| Leucine | ||||||
| All | 53,104 | 4,660 | 0.09 | 2,037 | 2,623 | 0.44 |
| <2.0 | 15,046 | 497 | 0.03 | 345 | 152 | 0.69 |
| ≥2.0 & <2.5 | 19,494 | 1,657 | 0.09 | 1,025 | 632 | 0.62 |
| ≥ 2.5 & <3.0 | 14,498 | 1,622 | 0.11 | 534 | 1,088 | 0.33 |
| ≥3.0 | 6,570 | 884 | 0.13 | 133 | 751 | 0.15 |
| Valine | ||||||
| All | 43,380 | 1,377 | 0.03 | 577 | 800 | 0.42 |
| <2.0 | 12,178 | 66 | 0.01 | 33 | 33 | 0.50 |
| ≥2.0 & <2.5 | 15,271 | 419 | 0.03 | 249 | 170 | 0.59 |
| ≥2.5 & <3.0 | 11,032 | 561 | 0.05 | 243 | 318 | 0.43 |
| ≥3.0 | 4,899 | 331 | 0.07 | 52 | 279 | 0.16 |
| Threonine | ||||||
| All | 32,762 | 1,764 | 0.05 | 570 | 1,194 | 0.32 |
| <2.0 | 9,037 | 86 | 0.01 | 43 | 43 | 0.50 |
| ≥2.0 & <2.5 | 11,305 | 432 | 0.04 | 196 | 236 | 0.45 |
| ≥2.5 & <3.0 | 8,761 | 698 | 0.08 | 242 | 456 | 0.35 |
| ≥3.0 | 3,659 | 548 | 0.15 | 89 | 459 | 0.16 |
| Arginine | ||||||
| All | 29,843 | 3,059 | 0.10 | 465 | 2,594 | 0.15 |
| <2.0 | 8,029 | 375 | 0.05 | 52 | 323 | 0.14 |
| ≥2.0 & <2.5 | 10,473 | 967 | 0.09 | 195 | 772 | 0.20 |
| ≥2.5 & <3.0 | 7,575 | 1,026 | 0.14 | 173 | 853 | 0.17 |
| ≥3.0 | 3,766 | 691 | 0.18 | 45 | 646 | 0.07 |
Fig. 2Example Autofix correction from the 50S ribosome: a Thr rotamer outlier, from protein L18e in the 1YHQ archaeal large ribosomal subunit (2.4 Å) [15], before and after correction. a (original) Thr O 3 in its deposited orientation, with fairly good fit to the density, but a serious clash with RNA backbone (Thr methyl to G 0 656 H5′), no H-bond, and a rotamer outlier. b (both) Overlay, in stereo, of proposed corrected Thr rotamer (green) over the original position (pink). c (fixed) Corrected Thr O 3, with equivalent fit to the density, no steric clashes, an excellent p rotamer, and now a strong H-bond from Thr OG1 to the 2′OH of G 0 655. C atoms are gray or black balls; O atoms are larger red balls. Steric clashes are shown as clusters of hot pink spikes, H-bonds as lenses of pale green dots
Fig. 4Before and after χ1–χ2 plots of the 2,037 accepted Leu corrections, for those identified as rotamer outliers (<1%) in our 945-file dataset and successfully corrected by Autofix. Contours are taken from the Top500 Leu set [1], with decoys removed; black lines are the 1% contours and gray lines are the 10% contours of rotamer score. a Before: χ1–χ2 plot for the original conformation of each corrected Leu outlier (thus outside the 1% contours). b After: χ1–χ2 plot of the final χ values for each successfully corrected Leu outlier (now inside the contours). Data points are color-coded by which rotamer they ended up in after correction: mt green, tp blue, tt red, mp brown, pp purple, tm yellow, mm hot pink, and pt orange. Note that for most rotamers, the corrected examples came from a well-defined decoy cluster approximately 180° distant
Fig. 3Summary of Autofix results on 1YHQ 50S ribosomal subunit. Bar chart summary of correction results on Leu, Thr, Val, and Arg residues in 1YHQ. Gray bars represent the total number of each residue type in the file. Red represents the number of candidate outliers (<1% rotamer score). Blue represents the number of successfully corrected residues of each type: 7 Leu, 8 Val, 8 Thr, and 7 Arg, which are 63, 57, 67, and 25% of the outliers, respectively
Fig. 5Summary of real-space correlation coefficients (RSCC) for corrected outlier residues before (gray) and after (black) Autofix, showing improvement for all 4 amino acid types. Median RSCC values are indicated by a vertical line. The box around the median spans the 25th to the 75th percentile. Whiskers end at the 1st or 99th percentile