| Literature DB >> 15571634 |
Alessandro Dal Palù1, Agostino Dovier, Federico Fogolari.
Abstract
BACKGROUND: The protein structure prediction problem is one of the most challenging problems in biological sciences. Many approaches have been proposed using database information and/or simplified protein models. The protein structure prediction problem can be cast in the form of an optimization problem. Notwithstanding its importance, the problem has very seldom been tackled by Constraint Logic Programming, a declarative programming paradigm suitable for solving combinatorial optimization problems.Entities:
Mesh:
Year: 2004 PMID: 15571634 PMCID: PMC539352 DOI: 10.1186/1471-2105-5-186
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Main program predicate.
Figure 2Code for stating that consecutive aminoacids must be in adjacent lattice points.
Experimental results
| Name | N | Time | Energy | RMSD |
| 1LE0 | 12 | 1.3 s | -9040 | 2.8 / 2.6 (2–11) |
| 1KVG | 12 | 7.3 s | -14409 | 2.7 / 2.4 (3–11) |
| 1LE3 | 16 | 2.3 s | -13653 | 3.0 / 2.7 (2–1 5) |
| 1EDP | 17 | 20.4 s | -19389 | 4.3 / 1.1 (9–15) |
| 1PG1 | 18 | 14.6 s | -10126 | 6.0 / 5.2 (4–17) |
| 1ZDD | 34 | 300 s (limit) | -20412 | 5.6 / 5.6 (5–34) |
| 17 m25 s | -22350 | 4.0 / 4.0 (5–34) | ||
| 1VII | 36 | 300 s (limit) | -20860 | 10.4 / 6.7 (4–32) |
| 1000 s (limit) | -22377 | 9.1 /6.3 (4–32) | ||
| 7 h42 m | -26408 | 10.2 / 7.8 (4–32) | ||
| CF = 0.3 | 3 h58 m | -28710 | 8.0 / 7.4 (4–32) | |
| 1VII(*) | 36 | 300 s (limit) | -17948 | 9.2 / 7.3 (4–32) |
| 1000 s (limit) | -17948 | 9.2 /7.3 (4–32) | ||
| 3 h20 m | -21211 | 10.3 / 6.9 (4–32) | ||
| 1E0M | 37 | 300 s (limit) | -13830 | 6.5 / 5.8 (8–29) |
| 1200 s (limit) | -24613 | 8.4 / 3.6 (8–29) | ||
| 10 h (limit) | -26592 | 8.8 / 3.4(8–29) | ||
| 24 h (limit) | -30163 | 8.9 / 4.4 (8–29) | ||
| 2GP8 | 40 | 300 s (limit) | -10303 | 10.5 / 8.9 (6–38) |
| 1000 s (limit) | -24748 | 4.1 / 3.5 (6–38) | ||
| 10 h (limit) | -26196 | 4.9 / 3.5 (6–38) | ||
| 10 h39 m | -26196 | 4.9 / 3.5 (6–38) | ||
| 1ED0 | 46 | 300 s (limit) | -29970 | 7.3 / 4.1 (3–40) |
| 1000 s (limit) | -32369 | 8.6 / 7.1 (3–40) | ||
| 9 h38 m | -38218 | 8.0 / 7.2 (3–40) | ||
| 1ENH | 54 | 300 s (limit) | -12480 | 10.4 / 8.9 (8–52) |
| 1000 s (limit) | -12480 | 10.4 / 8.9 (8–52) | ||
| 10 h(limit) | -23373 | 9.9 / 8.6 (8–52) | ||
| 24 h (limit) | -23373 | 9.9 / 8.6 (8–52) | ||
| 6PTI | 58 | 300 s (limit) | no sol. | |
| 1000 s (limit) | -29709 | 10.0 / 9.7 (3–55) | ||
| 10 h (limit) | -37837 | 10.0 / 9.7 (3–55) | ||
| 24 h (limit) | -37837 | 10.0 / 9.7 (3–55) | ||
| CF = 0.25 | 48 h (limit) | -42096 | 9.7 / 9.4 (3–55) | |
| CF= 0.18 | 24 h (limit) | -47451 | 10.9 / 10.7 (3–55) | |
| 2IGD | 60 | 300 s (limit) | -24158 | 19.3 / 16.3 (6–59) |
| 1000 s (limit) | -29178 | 19.0 /16.2 (6–59) | ||
| 10 h (limit) | -37479 | 16.9 / 15.0(6–59) | ||
| 24 h (limit) | -37479 | 16.9 / 15.0 (6–59) | ||
| CF = 0.17 | 4 h 59 m | -40588 | 12.6 / 11.5 (6–59) | |
| 2ERA | 61 | 300 s (limit) | -28993 | 11.6 / 10.6 (3–55) |
| 9 m28 s, | -30746 | 12. 3/ 12.1 (3–55) | ||
| ss = 5 | 15 m13 s | -31692 | 12.7/11.6 (3–55) | |
| CF = 0.25, ss = 5 | 15 m12 s | -33693 | 10.9/9.3 (3–55) | |
| CF = 0.25, ss = 4 | 1000 s (limit) | -32985 | 12.3/12.4 (3–55) | |
| CF = 0.19, ss = 5 | 1000 s (limit) | -34275 | 10.6/8.9 (3–55) | |
| CF = 0.19, ss = 4 | 1000 s (limit) | -38138 | 11.6/10.6 (3–55) | |
| 1SN1 | 63 | 300 s (limit) | no sol. | |
| 1000 s (limit) | -53888 | 13.0 / 10.5 (2–51) | ||
| 10 h (limit) | -57615 | 11.9/ 9.6 (2–51) | ||
| CF = 0.25, ss = 5 | 10 h (limit) | -47121 | 8.6 / 8.1 (2–51) | |
| 1YPA | 63 | 300 s (limit) | -36626 | 16.1 / 9.4 (12–52) |
| 1000 s (limit) | -33886 | 17.1 / 10.9 (12–52) | ||
| 10 h (limit) | -33886 | 17.1 / 10.9 (12–52) | ||
| CF = 0.17 | 100 s (limit) | -26297 | 12.5 / 10.5 (12–52) | |
| CF = 0.17 | 10 h (limit) | -60244 | 12.9 / 9.8 (12–52) |
Summary of molecular dynamics results
| Name | N | RMSD (Å) | Time | RMSD ( |
| 1VII | 36 | 5.3 / 4.7 (4–32) | 17.8 h (2 ns) | 5.8 / 4.8 (4–32) |
| 1E0M | 37 | 5.5 / 4.0 (7–30) | 26.3 h (4 ns) | 8.7 / 3.6 (7–30) |
| 2GP8 | 40 | 5.9 / 3.8 (6–38) | 37.7 h (4 ns) | 3.9 / 2.3 (6–38) |
| 1ENH | 54 | 5.9 / 5.0 (8–52) / 3.7 (8–36) | 29.4 h (2 ns) | 11.2 / 10.7 (8–52) / 4.7 (8–36) |
| 2IGD | 61 | 5.7 / 4.1 (6–59) | 48.6 h (4 ns) | 12.9 / 11.5 (6–59) |
| 1YPA | 64 | 9.2 / 7.1 (12–52) | 116.9 h (8 ns) | 11.8 / 9.4 (12–52) |
Figure 3Native (yellow) and model after all-atom reconstruction and optimization (red) for WW domain (PDB id. 1E0M). The trace of core residues (7–30) is shown.
Comparison with Rosetta predictions
| Name | N | Rosetta Time | Rosetta RMSD | ||
| 1ZDD | 34 | 17 m.25 s. | 4.0 | 5 m.35 s. | 3.5 |
| 1VII | 36 | 3 h.58 m. | 7.4 (4–32) | 5 m.35 s. | 4.2 |
| 1E0M | 37 | 10 h. | 3.4 (8–29) | 6 m.35 s. | 7.7 |
| 2GP8 | 40 | 10 h. | 3.5 (6–38) | 6 m.35 s. | 6.4 |
| 1ED0 | 46 | 10 h. | 7.2 (3–40) | 7 m.23 s. | 8.9 |
Figure 4Program execution modes.
Figure 5Protein sequence and secondary structure representation