| Literature DB >> 23442792 |
Xuefeng Cui1, Shuai Cheng Li, Dongbo Bu, Babak Alipanahi, Ming Li.
Abstract
: Previous studies show that the same type of bond lengths and angles fit Gaussian distributions well with small standard deviations on high resolution protein structure data. The mean values of these Gaussian distributions have been widely used as ideal bond lengths and angles in bioinformatics. However, we are not aware of any research done to evaluate how accurately we can model protein structures with dihedral angles and ideal bond lengths and angles.Here, we introduce the protein structure idealization problem. We focus on the protein backbone structure idealization. We describe a fast O(nm/ε) dynamic programming algorithm to find an idealized protein backbone structure that is approximately optimal according to our scoring function. The scoring function evaluates not only the free energy, but also the similarity with the target structure. Thus, the idealized protein structures found by our algorithm are guaranteed to be protein-like and close to the target protein structure.We have implemented our protein structure idealization algorithm and idealized the high resolution protein structures with low sequence identities of the CULLPDB_PC30_RES1.6_R0.25 data set. We demonstrate that idealized backbone structures always exist with small changes and significantly better free energy. We also applied our algorithm to refine protein pseudo-structures determined in NMR experiments.Entities:
Year: 2013 PMID: 23442792 PMCID: PMC3655034 DOI: 10.1186/1748-7188-8-5
Source DB: PubMed Journal: Algorithms Mol Biol ISSN: 1748-7188 Impact factor: 1.405
Figure 1-RMSD.
Figure 2-RMSD of all regions v.s. -helix regions.
Figure 3-RMSD of all regions v.s. -sheet regions.
Figure 4-RMSD of inner regions v.s. outer regions.
Figure 5All-atom RMSD.
Figure 6Protein backbone free energy (calculated by dDFIRE).
Figure 7Protein all-atom free energy (calculated by dDFIRE).
DSSP hydrogen bond differences before and after idealization
| Parallel Bridge | 9 | 0.04% |
| Antiparallel Bridge | -211 | -0.37% |
| 27 Helix | 7080 | 26.46% |
| 310 Helix | -1018 | -2.35% |
| -1644 | -1.48% | |
| -82 | -1.27% | |
| All | 5183 | 1.85% |
The percentages of the favored dihedral angles of NMR protein structures before and after idealization
| 1SSK | 44.6% | 71.9% | 27.3% | 2LBN | 59.7% | 77.6% | 17.9% |
| 2KQP | 62.9% | 80.0% | 17.1% | 1WPI | 64.4% | 81.4% | 17.0% |
| 1EXE | 60.5% | 76.7% | 16.2% | 2LNV | 58.6% | 72.4% | 13.8% |
| 1X6F | 64.1% | 73.1% | 9.0% | 2L6B | 72.2% | 81.1% | 8.9% |
| 2GFU | 72.3% | 80.4% | 8.1% | 1PC2 | 79.3% | 87.4% | 8.1% |
| 2LMR | 79.7% | 87.0% | 7.3% | 2KA0 | 72.6% | 78.3% | 5.7% |
| 2L3O | 71.3% | 76.9% | 5.6% | 1O1W | 67.2% | 72.1% | 4.9% |
| 2CQ9 | 78.3% | 82.6% | 4.3% | 2RQA | 72.0% | 75.4% | 3.4% |
| 2D86 | 89.0% | 92.1% | 3.1% | 1NTC | 80.5% | 83.1% | 2.6% |
| 2JZT | 76.6% | 79.0% | 2.4% | 2CZN | 76.5% | 76.5% | 0.0% |
| 1RCH | 75.4% | 74.6% | -0.8% | 2JU1 | 77.1% | 75.9% | -1.2% |
| 2KV7 | 85.5% | 84.2% | -1.3% | 2JT2 | 83.6% | 81.5% | -2.1% |
| 2KYW | 83.8% | 81.1% | -2.7% | 2OSR | 82.7% | 80.0% | -2.7% |
| 2L6M | 81.7% | 78.5% | -3.2% | 2CU1 | 81.1% | 77.8% | -3.3% |
| 1AJ3 | 93.3% | 88.8% | -4.5% | 1WI5 | 84.0% | 78.0% | -6.0% |
| 1NMW | 85.0% | 78.0% | -7.0% | 2LBV | 83.9% | 74.8% | -9.1% |
Figure 8Ramachandran plot of the native NMR protein structure 1WPI.
Figure 9Ramachandran plot of the idealized NMR protein structure 1WPI.