| Literature DB >> 26151847 |
Jyh-Jong Tsay1, Shih-Chieh Su2, Chin-Sheng Yu3.
Abstract
Protein structure prediction (PSP) is concerned with the prediction of protein tertiary structure from primary structure and is a challenging calculation problem. After decades of research effort, numerous solutions have been proposed for optimisation methods based on energy models. However, further investigation and improvement is still needed to increase the accuracy and similarity of structures. This study presents a novel backbone angle preference factor, which is one of the factors inducing protein folding. The proposed multiobjective optimisation approach simultaneously considers energy models and backbone angle preferences to solve the ab initio PSP. To prove the effectiveness of the multiobjective optimisation approach based on the energy models and backbone angle preferences, 75 amino acid sequences with lengths ranging from 22 to 88 amino acids were selected from the CB513 data set to be the benchmarks. The data sets were highly dissimilar, therefore indicating that they are meaningful. The experimental results showed that the root-mean-square deviation (RMSD) of the multiobjective optimization approach based on energy model and backbone angle preferences was superior to those of typical energy models, indicating that the proposed approach can facilitate the ab initio PSP.Entities:
Keywords: backbone angle preferences; face-centered cubic; multiobjective optimization; protein structure
Mesh:
Year: 2015 PMID: 26151847 PMCID: PMC4519891 DOI: 10.3390/ijms160715136
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
Figure 1(a) Root-mean-square deviation (RMSD) comparison among four objective functions; (b) Comparison of running time complexity among four objective functions.
Three different types of energy model for visual comparison.
| No. | PDB-ID | Native | |||
|---|---|---|---|---|---|
| 01 | 1EDN Len.21 | ||||
| 02 | 1COI Len.29 | ||||
| 03 | 1MRT Len.31 | ||||
| 04 | 2ERL Len.40 | ||||
| 05 | 1CRN Len.46 | ||||
| 06 | 1RPO Len.61 |
In the figure green balls indicate hydrophobic amino acids while the grey balls indicate the hydrophilic amino acids. The results were input into the HPview tool (http://cpsp.informatik.uni-freiburg.de:8080/ViewJSP.jsp) to obtain the backbone structure; PDB-ID: A 4-character PDB identification (ID) code is assigned to each new structure at the time of deposition. They serve as the unique, immutable identifier of each entry in the Protein Data Bank.
Figure 2Comparison of the proposed approach with the QUARK.
Figure 3The FCC lattice model: Each lattice point has 12 neighbours schematic.
Figure 4(κ, α) pair backbone angle schematic.
(κ, α) pair backbone angle preferences matrix.
| α Angle | |||||||
|---|---|---|---|---|---|---|---|
| 0 | 0 | 0 | 0 | 0 | 0 | ||
| 0 | 0.05 | 0.01 | 0 | 0 | 0 | ||
| 0.21 | 0.73 | 0.18 | 0.06 | 0 | 0.05 | ||
| 1.24 | 1.5 | 0.34 | 0.49 | 0 | 0.38 | ||
| 17.87 | 1.76 | 0.14 | 0.55 | 0 | 0.95 | ||
| 11.08 | 0.59 | 0.21 | 0.51 | 0.02 | 1.65 | ||
| 1.28 | 0.58 | 0.31 | 0.64 | 0.25 | 2.44 | ||
| 0.87 | 0.68 | 0.39 | 0.98 | 0.56 | 2.98 | ||
| 0.72 | 0.70 | 0.47 | 1.29 | 0.96 | 2.14 | ||
| 0.28 | 0.56 | 0.57 | 1.22 | 1.78 | 1.04 | ||
| 0.08 | 0.24 | 0.47 | 1.39 | 2.08 | 0.57 | ||
| 0.04 | 0.10 | 0.40 | 1.99 | 2.03 | 0.43 | ||
| 0.02 | 0.02 | 0.21 | 7.43 | 17.46 | 0.60 | ||
60° probability value represent the sum of DSSP value ranging from 130° to 140°, 80° represent that from 150° to 190°, 180° represent that from 200° to 260°, −130° represent that from −100° to −50°, −110° represent that from −40° to −10° and 10° to 40°, −10° represent that from 50° to 120°.
Figure 5(a) Square-based rotation schematic; and (b) Triangle-hexagon-based rotation schematic.
Parameter setting for the four evolutionary algorithms used in experiments.
| Operations/Parameters | Setting |
|---|---|
| Representation | 1–12 Represents 12 vertex coordinates |
| Population size | Equal to substring_length |
| Selection | Tournament selection |
| Crossover rate | 0.85 |
| Mutation rate | 1/(substring_length) |
| 3 | |
| Termination | Substring_length *2 generations |