| Literature DB >> 25279564 |
Hyojung Ryu1, Tae-Rae Kim2, SeonJoo Ahn3, Sunyoung Ji1, Jinhyuk Lee1.
Abstract
The refinement of low-quality structures is an important challenge in protein structure prediction. Many studies have been conducted on protein structure refinement; the refinement of structures derived from NMR spectroscopy has been especially intensively studied. In this study, we generated flat-bottom distance potential instead of NOE data because NOE data have ambiguity and uncertainty. The potential was derived from distance information from given structures and prevented structural dislocation during the refinement process. A simulated annealing protocol was used to minimize the potential energy of the structure. The protocol was tested on 134 NMR structures in the Protein Data Bank (PDB) that also have X-ray structures. Among them, 50 structures were used as a training set to find the optimal "width" parameter in the flat-bottom distance potential functions. In the validation set (the other 84 structures), most of the 12 quality assessment scores of the refined structures were significantly improved (total score increased from 1.215 to 2.044). Moreover, the secondary structure similarity of the refined structure was improved over that of the original structure. Finally, we demonstrate that the combination of two energy potentials, statistical torsion angle potential (STAP) and the flat-bottom distance potential, can drive the refinement of NMR structures.Entities:
Mesh:
Substances:
Year: 2014 PMID: 25279564 PMCID: PMC4184813 DOI: 10.1371/journal.pone.0108888
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Two flat-bottom distance potential functions: the same equilibrium distance (d) of 5 Å and two flat-bottom widths (w), 0 (blue) and 4 (red line).
The used parameters are defined in Method section.
Figure 2Total score change in the training set (A) considering all quality assessment scores (TM-score, NOE violations and “protein-like” scores), and (B) considering only “protein-like” scores.
The optimal width for (A) is 4.0 Å, while it is gradually improved in (B).
Comparison of refined structures using the optimal width with original NMR structures.
| Score | TM-score | NOE Violations (0.5/1.0/2.0) | nDOPE | dDFIRE | Clash | Rama (MOL) | Rama (PRO) | 1st Packing | 2nd Packing | Rama (WHAT) | Rotamer | Backbone | Total score | RMSD | GDT-TS | GDT-HA |
| Refined structures (Optimal width) |
| 0.335 (87/41/12) |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| Original NMR Structures | 0.782 |
| −0.8232 | −206.894 | 53.68 | 81.81 | 74.71 | −2.511 | −2.424 | −4.667 | −5.59 | −1.020 | 1.081 | 3.113 | 0.757 | 0.562 |
Four Structural similarities are measured by TM-score program (reference atom: Cα): TM-score, RMSD, GDT-TS, and GDT-HA. The used reference structure is X-ray structure (Table S2).
Bold font numbers indicate better scores.
Number of violated NOE distances over 0.5, 1.0, and 2.0 Å. The NOE violations are measured with the experimental NOE data obtained from BMRB (Biological Magnetic Resonance Bank).
Ramachandran appearance measured using MolProbity.
Ramachandran appearance measured using PROCHECK.
Ramachandran appearance measured using WHAT_CHECK.
Figure 3Frequency of the best structures in the (A) training and (B) test sets as a function of flat-bottom width from 0 to 10 Å.
Comparison of refined structures using best width with original NMR structuresa.
| Score | TM-score | NOE Violations (0.5/1.0/2.0) | nDOPE | dDFIRE | Clash | Rama (MOL) | Rama (PRO) | 1st Packing | 2nd Packing | Rama (WHAT) | Rotamer | Backbone | Total | RMSD | GDT-TS | GDT-HA |
| Refined structures (Best width) |
| 0.321 (35/21/8) |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| Original NMR structures | 0.795 |
| −0.8838 | −209.13 | 47.22 | 83.78 | 76.5 | −2.369 | −2.312 | −4.423 | −5.221 | −0.854 | 1.215 | 3.113 | 0.757 | 0.562 |
See the footnotes in Table 1.
Figure 4Comparison of quality assessment scores for each of the best structures.
The shaded yellow color indicates the region where the best refined structures (Y-axis) are better than the original structures (X-axis).
Figure 5Two examples of our refinement on 1KOT and 1FA4.
The structures are drawn as cartoons using Chimera [47]. The refined and original structures (blue color cartoons) are superimposed with respect to their reference structures: X-ray structures (red color cartoons; PDB ID: 3D32 for sub-figures A and B, and PDB ID: 2CJ3 for sub-figures C and D). Dashed circles in the structures represent the apparent secondary structure regions improved by our method. The backbone accuracies with regard to the reference structure are calculated with the TM-score, the GDT-HA score, and the RMSD, where those scores are measured using the TM-score program.
Figure 6Secondary structure schemes of three conformations (original, refined, and native (X-ray)) of PDB (A) 1KOT and (B) 1FA4.
The black dashed lines indicate the refined regions.
Comparison between our refinement and the re-refinement structuresa , b.
| Score | Ours | AMBER & RECOORD |
| NOE violation | 0.464 |
|
| nDOPE |
| −1.0911 |
| dDFIRE |
| −177.856 |
| Clash |
| 6.31 |
| Rama (MOL) |
| 82.94 |
| Rama (PRO) |
| 74.26 |
| 1st packing |
| −3.16 |
| 2nd packing |
| −2.326 |
| Rama (WHAT) |
| −4.180 |
| Rotamer |
| −5.037 |
| Backbone |
| −1.306 |
| Total |
| 0.6870 |
See the footnotes in Table 1.
A total of 24 NMR structures were used (lists are in Table S9). Because no corresponding X-ray structures exist, the TM-score cannot be measured.