| Literature DB >> 20522256 |
Xiaolong Zhang1, Ting Wang, Huiping Luo, Jack Y Yang, Youping Deng, Jinshan Tang, Mary Qu Yang.
Abstract
BACKGROUND: Protein structure prediction (PSP) has important applications in different fields, such as drug design, disease prediction, and so on. In protein structure prediction, there are two important issues. The first one is the design of the structure model and the second one is the design of the optimization technology. Because of the complexity of the realistic protein structure, the structure model adopted in this paper is a simplified model, which is called off-lattice AB model. After the structure model is assumed, optimization technology is needed for searching the best conformation of a protein sequence based on the assumed structure model. However, PSP is an NP-hard problem even if the simplest model is assumed. Thus, many algorithms have been developed to solve the global optimization problem. In this paper, a hybrid algorithm, which combines genetic algorithm (GA) and tabu search (TS) algorithm, is developed to complete this task.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20522256 PMCID: PMC2880412 DOI: 10.1186/1752-0509-4-S1-S6
Source DB: PubMed Journal: BMC Syst Biol ISSN: 1752-0509
Figure 1TSM process TSM is a search process. With this strategy, the potential energy functional in equation (1) is used as the evaluation function to compute offspring’s energy values, and then these offspring and their energies are combined with the tabu list to determine the output offspring.
Figure 2Genetic tabu search algorithm The hybrid algorithm combines genetic algorithm and tabu search algorithm and can deal with multi-extremum and multi-parameter problems.
Lowest energies for Fibonacci sequences obtained by the previous algorithms and the proposed GATS algorithm
| N | SEQUENCES | |||||
|---|---|---|---|---|---|---|
| 13 | ABBABBABABBAB | -4.9746 | -4.967 | -4.9746 | -6.5687 | -6.9539 |
| 21 | BABABBABABBABBABABBAB | -12.0617 | -12.316 | -12.3266 | -13.4151 | -14.7974 |
| 34 | ABBABBABABBABBABABBABABBABBABABBAB | -23.0441 | -25.476 | -25.5113 | -27.9903 | -27.9897 |
| 55 | BABABBABABBABBABABBABABBABBABABBABBABABBABABBABBABABBAB | -38.1977 | -42.428 | -42.3418 | -41.5098 | -42.4746 |
The (n-2) bond angles and (n-3) torsional angles at the global minimum energies of four Fibonacci sequences by GATS
| N | bond angles( | torsional angles ( |
|---|---|---|
| 13 | 0.11611, -1.44697, 0.32017, 0.01576,0.50275, -0.86862, 0.12051, -0.58378,-0.45150, -0.89987, -0.00446 | 0.00102, -0.67737, -2.14380, 2.65948, -0.12802, 2.11709, 0.24980, 1.82580, -2.99295, 1.46672 |
| 21 | -0.08637, -1.24407, -0.02388, 0.71560, 0.20713, 1.77167, 0.25962, -0.31786, 1.57686, -0.95041, 0.04461, 0.03119, 0.54498, -0.82860, 0.37745, -0.62814, -0.98493, -0.38993, -0.23650 | 2.97004, 0.58008, 2.15906, -2.64461, 0.40657, -2.00601, 2.47012, 2.63691, -1.09615, 3.11625, -1.56743, -0.03975, -2.96520, 0.06719, -1.86684, -0.32537, 1.10542, -0.70935 |
| 34 | 0.23573, -1.43459, 0.29165, -1.84341, 0.14631, -1.61093, -0.35038, 1.67799, 0.17042, 1.60574, -0.40332, -1.26662, -0.27629, 1.20980, 0.21014, 1.29636, 0.00999, -1.69196, -0.35294, 0.72788, -0.62924, 0.17575, -0.34728, -0.97787, -0.16688, -0.91012, -0.07292, 1.03188, 2.48730, -0.16685, -0.69786, 0.68108 | -0.14964, 0.63562, 0.75934, -2.74033, 0.28633, -2.61899, -2.91781, -0.44565, -0.12498, -0.84551, -2.50502, 2.53977, -2.54137, 1.87607, -2.82187, 2.47530, 2.87212, -2.10783, 0.15378, -2.88144, 1.93040, 0.43629, 2.41795, 0.72296, -0.58509, 1.13745, -2.92674, 0.68888, -2.37150, -0.79336, 2.98078 |
| 55 | 0.57994, -0.54747, -1.63058, 0.32512, 1.29499, 0.94198, 0.15639, 0.54547, 0.51204, 2.42050, 0.42994, 0.04798, 0.53466, 0.72372, -2.84018, -0.25987, -0.88420, 0.50741, 0.31571, -0.38491, -0.36698, -0.85173, 0.13171, -0.28528, 1.24401, -0.44344, 0.32826, -0.71533, -0.52747, -0.08801, -0.44238, -0.05707, -0.08495, -0.62277, 0.07570, -0.90285, -0.24254, 0.16364, -0.47504, -0.50923, -0.37872, 0.57320, 1.66339, 0.32637, -0.74187, 0.43684, -0.15112, 1.46664, 0.34051, -0.72797, -0.07620, -0.73615, -0.79086 | -2.62178, 2.15436, 2.79679, -2.21273, -2.62632, -0.37956, -2.18578, -1.26221, -0.51001, 2.21957, -2.55211, -1.00242, -2.74164, -2.46270, -2.53201, 2.51849, 0.55237, -1.22255, 0.70861, -1.09153, 0.34246, 2.12777, 0.25911, 0.39082, -2.89463, -0.93970, 3.02711, -1.82971, -1.76602, -2.81629, 1.66725, 1.77810, 0.81533, 2.01598, -0.19887, 1.65355, 1.11533, 0.46418, -1.38864, 0.55938, -1.12062, 0.29809, 1.89867, -2.71331, -2.06007, -1.76112, 0.24818, -1.91570, -2.01395, -0.43327, 0.97151, -0.82385 |
Figure 3The lowest energy conformations for the four Fibonacci sequences obtained by GATS algorithm Solid dots indicate hydrophobic monomers A, and open dots indicate hydrophilic monomers B.
Minimum energies for three real proteins obtained by TS and GATS algorithm using off-lattice AB model in three dimensions.
| PDB ID | SEQUENCES | ||
|---|---|---|---|
| 1BXL | GQVGRQLAIIGDDINR | -15.7164 | -15.8246 |
| 1EDP | CSCSSLMDKECVYFCHL | -12.8392 | -13.7769 |
| 1AGT | GVPINVSCTGSPQCIKPCKDQGMRFGKCMNRKCHCTPK | -44.2656 | -46.0842 |
The PDB ID is unique identifier of a protein in the database, representing its amino acid sequences
Figure 4The lowest energy conformations for the three real protein sequences obtained by GATS algorithm Solid dots indicate hydrophobic monomers I, V, L, P, C, M, A, G, and open dots indicate hydrophilic monomers D, E, F, H, K, N, Q, R, S, T, W, Y.