| Literature DB >> 20179340 |
Abstract
A method for the rapid tracing of polypeptide backbones has been developed. The method creates an approximate chain tracing that is useful for visual evaluation of whether a structure has been solved and for use in scoring the quality of electron-density maps. The essence of the method is to (i) sample candidate C(alpha) positions at spacings of approximately 0.6 A along ridgelines of high electron density, (ii) list all possible nonapeptides that satisfy simple geometric and density criteria using these candidate C(alpha) positions, (iii) score the nonapeptides and choose the highest scoring ones, and (iv) find the longest chains that can be made by connecting nonamers. An indexing and storage scheme that allows a single calculation of most distances and density values is used to speed up the process. The method was applied to 42 density-modified electron-density maps at resolutions from 1.5 to 3.8 A. A total of 21 428 residues in these maps were traced in 24 CPU min with an overall r.m.s.d. of 1.61 A for C(alpha) atoms compared with the known refined structures. The method appears to be suitable for rapid evaluation of electron-density map quality.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20179340 PMCID: PMC2827349 DOI: 10.1107/S0907444910000272
Source DB: PubMed Journal: Acta Crystallogr D Biol Crystallogr ISSN: 0907-4449
Figure 1Finding potential Cα positions based on the density-modified electron-density map for S-hydrolase (see text). (a) Initial high-density points. (b) Points moved to the highest nearby location on the ridgeline. (c) Points in moderate density (in red) along lines connecting points in high density. (d) Potential Cα positions. These figures were created with PyMOL (DeLano, 2002 ▶).
Figure 2Tracing chains using potential Cα positions from Fig. 1 ▶ (see text). (a) Scoring of potential Cα–Cα pairs. (b) Scoring of trimers. (c) Final connected chain (red) with refined Cα positions (green). These figures were created with PyMOL (DeLano, 2002 ▶).
Figure 3Schematic of the message-passing technique. The blue lines represent nonamers and the dotted red lines indicate connections, so that nonamer A is connected on the right to nonamers B1, B2 and B3. In the first stage of message passing, each nonamer receives, from each nonamer connected to its right, the identity of that nonamer (e.g. nonamer C receives the identity ‘d’ from nonamer D). In subsequent iterations, each nonamer receives, from each nonamer to the right, the identity (if any) that it has been passed from its connection to the right (e.g. nonamer B1 receives from C the identity ‘d’ in the second cycle and nonamer A receives from B1 the identity ‘d’ in the third cycle). The process is complete when no further messages are received. If a nonamer receives its own identity then the connection is ignored (e.g. nonamer A receives from nonaner B3 the identity ‘a’ in the second cycle so this circular reference is ignored).
Chain-tracing in experimental electron-density maps
| Structure | Map quality (CC to model map using data to 3 Å) | Residues | Residues built | Cα r.m.s.d. (Å) | Residues in secondary structure (%) | CPU time (s) | |
|---|---|---|---|---|---|---|---|
| RNase P ( | 1.5 | 0.53 | 416 | 284 | 2.53 | 8 | 14 |
| 1063B ( | 1.7 | 0.68 | 243 | 132 | 1.98 | 17 | 7 |
| Epsin ( | 1.8 | 0.89 | 149 | 132 | 1.31 | 43 | 6 |
| Isocitrate lyase ( | 1.8 | 0.65 | 836 | 754 | 1.59 | 42 | 81 |
| MBP ( | 1.8 | 0.89 | 227 | 194 | 1.41 | 69 | 9 |
| P9 ( | 1.8 | 0.81 | 136 | 128 | 1.61 | 76 | 9 |
| Penicillopepsin ( | 1.8 | 0.84 | 323 | 279 | 1.58 | 41 | 10 |
| Myoglobin (Ana Gonzales, personal communication) | 1.9 | 0.73 | 154 | 139 | 1.96 | 5 | 10 |
| ROP ( | 1.9 | 0.84 | 108 | 107 | 2.07 | 60 | 4 |
| 1167B ( | 2.0 | 0.72 | 370 | 254 | 1.77 | 42 | 10 |
| CobD ( | 2.0 | 0.80 | 355 | 331 | 1.73 | 32 | 18 |
| NSF-N ( | 2.0 | 0.80 | 195 | 162 | 1.57 | 40 | 8 |
| Synapsin ( | 2.0 | 0.78 | 585 | 421 | 1.71 | 60 | 24 |
| Tryparedoxin ( | 2.0 | 0.79 | 143 | 142 | 1.67 | 47 | 6 |
| PDZ ( | 2.1 | 0.67 | 174 | 130 | 1.65 | 50 | 7 |
| Fusion complex ( | 2.3 | 0.73 | 867 | 643 | 1.98 | 14 | 141 |
| GPATase ( | 2.3 | 0.82 | 992 | 901 | 1.49 | 71 | 50 |
| Granulocyte ( | 2.3 | 0.62 | 241 | 141 | 1.80 | 16 | 8 |
| VMP ( | 2.3 | 0.76 | 1141 | 833 | 1.42 | 41 | 37 |
| Armadillo ( | 2.4 | 0.86 | 457 | 369 | 1.21 | 59 | 23 |
| Cyanase ( | 2.4 | 0.82 | 1560 | 1506 | 1.71 | 55 | 62 |
| Mev kinase ( | 2.4 | 0.83 | 317 | 302 | 1.38 | 66 | 9 |
| NSF D2 ( | 2.4 | 0.84 | 247 | 243 | 1.59 | 49 | 11 |
| 1102B ( | 2.5 | 0.78 | 344 | 308 | 1.45 | 56 | 22 |
| AEP transaminase ( | 2.5 | 0.81 | 2169 | 2045 | 1.32 | 71 | 95 |
| FLR ( | 2.5 | 0.77 | 460 | 401 | 2.01 | 39 | 13 |
| P32 ( | 2.5 | 0.86 | 529 | 475 | 1.38 | 71 | 13 |
| PSD-95 ( | 2.5 | 0.76 | 264 | 231 | 1.46 | 53 | 13 |
| QAPRTase ( | 2.5 | 0.71 | 1704 | 1209 | 1.53 | 35 | 69 |
| RNase S ( | 2.5 | 0.65 | 192 | 133 | 2.06 | 42 | 7 |
| Gene V ( | 2.6 | 0.74 | 86 | 74 | 1.52 | 65 | 4 |
| Rab3A ( | 2.6 | 0.82 | 301 | 262 | 1.55 | 41 | 19 |
| GerE ( | 2.7 | 0.70 | 384 | 317 | 1.41 | 27 | 14 |
| CP synthase ( | 2.8 | 0.75 | 534 | 253 | 1.53 | 40 | 18 |
| Rh dehalogenase ( | 2.8 | 0.78 | 291 | 270 | 1.42 | 56 | 9 |
| S-hydrolase ( | 2.8 | 0.81 | 861 | 813 | 1.62 | 41 | 43 |
| UT synthase ( | 2.8 | 0.78 | 990 | 867 | 1.53 | 60 | 48 |
| 1029B ( | 3.0 | 0.73 | 1130 | 1016 | 1.57 | 61 | 37 |
| 1038B ( | 3.0 | 0.71 | 1432 | 1308 | 1.39 | 70 | 114 |
| 1071B ( | 3.0 | 0.65 | 801 | 760 | 1.63 | 67 | 62 |
| Synaptotagmin ( | 3.2 | 0.67 | 275 | 199 | 2.55 | 19 | 29 |
| GroEL ( | 3.8 | 0.55 | 3668 | 1960 | 1.98 | 14 | 247 |
Figure 4Backbone diagrams of chain tracings. (a) Mevalonate kinase (PDB entry 1kkh; Yang et al., 2002 ▶). (b) Structural genomics target 1038B (PDB entry 1lql; Choi et al., 2003 ▶). (c) Armadillo repeat region of murine β-catenin (PDB entry 3bct; Huber et al., 1997 ▶). Red tracings are from the present method; green tracings are from the deposited structures. These figures were created with Coot (Emsley & Cowtan, 2004 ▶).
Comparison of model-building procedures
| Method | Residues built (of 26651 possible residues in 42 experimental density-modified maps) | R.m.s.d. (Å) | Time (s) | Residues per second |
|---|---|---|---|---|
| trace_chain | 21428 | 1.61 | 1441 | 14.9 |
| Helices–strands | 12322 | 1.24 | 5331 | 2.3 |
| 19037 | 1.16 | 16933 | 1.1 | |
| 20601 | 0.95 | 155767 | 0.1 |
trace_chain is the method in this paper (without optional assembly steps) with phenix.find_helices_strands and trace_chain=True. The r.m.s.d. is for Cα atoms only.
Helices–strands is a combination of finding α-helices and β-strands with phenix.find_helices_strands and trace_chain=False (Terwilliger, 2010a, ▶ b ▶).
RESOLVE is the superquick option for model building in RESOLVE (Terwilliger, 2003a ▶).
phenix.autobuild is the standard model-building procedure in PHENIX and includes several cycles of model building with RESOLVE alternating with atomic refinement with phenix.refine (Afonine et al., 2005 ▶).
Figure 5Secondary structure identified in models compared with map quality (see text). Map quality is the correlation of the map with one based on the refined structure. Secondary structure is the percentage of residues in α-helices or β-sheets in these models identified as described in the text.
Figure 6Number of residues built as function of the target ratio of nonamers to atoms.