| Literature DB >> 16689690 |
Abstract
A branch and bound algorithm is proposed for the two-dimensional protein folding problem in the HP lattice model. In this algorithm, the benefit of each possible location of hydrophobic monomers is evaluated and only promising nodes are kept for further branching at each level. The proposed algorithm is compared with other well-known methods for 10 benchmark sequences with lengths ranging from 20 to 100 monomers. The results indicate that our method is a very efficient and promising tool for the protein folding problem.Entities:
Mesh:
Substances:
Year: 2005 PMID: 16689690 PMCID: PMC5172541 DOI: 10.1016/s1672-0229(05)03031-7
Source DB: PubMed Journal: Genomics Proteomics Bioinformatics ISSN: 1672-0229 Impact factor: 7.691
Fig. 1The lowest energy conformation with E = —4 of sequence HPPHPPHPHPPHP. Black point particle: hydrophobic (H); White point particle: polar (P).
Fig. 2The three possible positions for Monomer 5.
Fig. 3A representation of the search tree.
Fig. 4The pseudo-code of the subroutine in the branch and bound algorithm.
The 10 Benchmark Sequences for Algorithm Evaluation
| Length | Sequence |
|---|---|
| 20 | HPHPPHHPHPPHPHHPPHPH |
| 24 | HHPPHPPHPPHPPHPPHPPHPPHH |
| 25 | PPHPPHHPPPPHHPPPPHHPPPPHH |
| 36 | PPPHHPPHHPPPPPHHHHHHHPPHHPPPPHHPPHPP |
| 48 | PPHPPHHPPHHPPPPPHHHHHHHHHHPPPPPPHHPPHHPPHPPHHHHH |
| 50 | PPHPPHPHPHHHHPHPPPHPPPHPPPPHPPPHPPPHPHHHHPHPHPHPHH |
| 60 | PPHHHPHHHHHHHHPPPHHHHHHHHHHPHPPPHHHHHHHHHHHHPPPPHHHHHHPHHPHP |
| 64 | HHHHHHHHHHHHPHPHPPHHPPHHPPHPPHHPPHHPPHPPHHPPHHPPHPHPHHHHHHHHHHHH |
| 85 | HHHHPPPPHHHHHHHHHHHHPPPPPPHHHHHHHHHHHHPPPHHHHHHHHHHHHPPPHHHHHHHHHHHHPPPHPPHHPPHHPPHPH |
| 100 | PPPHHPPHHHHPPHHHPHHPHHPHHHHPPPPPPPPHHHHHHPPHHHHHHPPPPPPPPPHPHHPHHHHHHHHHHHPPHHHPHHPHPPHPHHHPPPPPPHHH |
Performance Comparison of the Four Algorithms *
| Length | Optimal | MC | GA | MS | BB |
|---|---|---|---|---|---|
| 20 | −9 | −9 | −9 | −9 | −9 |
| 24 | −9 | −9 | −9 | −9 | −9 |
| 25 | −8 | −7 | −8 | −8 | −8 |
| 36 | −14 | −12 | −14 | −14 | −14 |
| 48 | −23 | −18 | −22 | −22 | −22 |
| 50 | −21 | −19 | −21 | −21 | −21 |
| 60 | −36 | −31 | −34 | −34 | −36 |
| 64 | −42 | −31 | −37 | −38 | −38 |
| 85 | −53 | N/A | N/A | N/A | −52 |
| 100 | −50 | N/A | N/A | N/A | −48 |
Performance comparison on finding the lowest energy conformations of the four algorithms, including Monte Carlo (MC), genetic algorithm (GA), mixed search (MS), and branch and bound (BB).
Fig. 5The lowest energy states of the sequences with length n = 24, 36, 60, 85, and 100, respectively.