| Literature DB >> 31874607 |
Hongjie Wu1, Ru Yang1, Qiming Fu2,3, Jianping Chen1,4, Weizhong Lu1, Haiou Li1.
Abstract
BACKGROUND: Protein structure prediction has always been an important issue in bioinformatics. Prediction of the two-dimensional structure of proteins based on the hydrophobic polarity model is a typical non-deterministic polynomial hard problem. Currently reported hydrophobic polarity model optimization methods, greedy method, brute-force method, and genetic algorithm usually cannot converge robustly to the lowest energy conformations. Reinforcement learning with the advantages of continuous Markov optimal decision-making and maximizing global cumulative return is especially suitable for solving global optimization problems of biological sequences.Entities:
Keywords: HP model; Reinforcement learning; Structure prediction
Mesh:
Substances:
Year: 2019 PMID: 31874607 PMCID: PMC6929271 DOI: 10.1186/s12859-019-3259-6
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Fig. 1A framework for 2D-HP protein folding based on reinforcement learning with full states
HP model training algorithm based on reinforcement learning with Q-learning
HP sequence set for testing
| Sequence No. | HP Sequence | Length | Known lowest energy | Rigid criterion | Flexible criterion | Greedy algorithm | Partial state space |
|---|---|---|---|---|---|---|---|
| 1 | HPPHHPH [ | 7 | −2 | −2 | − 2 | -2 | -2 |
| 2 | HPHHHPHHPH [ | 10 | −4 | − 4 | − 4 | − 4 | − 4 |
| 3 | HPPHPPPPHPPHP [ | 13 | −4 | − 4 | −2 | − 4 | −3 |
| 4 | HHPHPPHPHPHHPH [ | 14 | −6 | − 6 | − 5 | −6 | − 6 |
| 5 | HPHHHHHHHHHPHH | 14 | −7 | − 7 | − 7 | − 7 | − 7 |
| 6 | HHHPPHHHHHPHHH | 14 | − 7 | − 7 | −7 | − 7 | − 6 |
| 7 | HHHHHPPHHHHPHH | 14 | −7 | −7 | −6 | − 7 | − 6 |
| 8 | HPHHPPPHHHHHHH | 14 | −6 | − 6 | −5 | − 6 | − 6 |
| 9 | HHHPHHPPPHHPHH | 14 | −6 | −6 | −5 | − 6 | −6 |
| 10 | HHPHHHHHPPPPPH | 14 | −4 | −4 | − 4 | − 4 | − 4 |
| 11 | HHPPHHHPHPPHPH | 14 | −6 | −6 | −5 | − 6 | − 4 |
| 12 | HHHPPPPHPHHPHH | 14 | −5 | − 5 | − 5 | −5 | − 5 |
| 13 | HPHPPHHPHPPHPHHPPHPH [ | 20 | − 9 | − 9 | − 4 | −8 | − 6 |
| 14 | HHHPPHPHPHPPHPHPHPPH [ | 20 | −10 | − 10 | −7 | − 9 | − 8 |
| 15 | HHHHHPHHPHHHHPPHHHHHH | 21 | − 12 | − 12 | −9 | − 11 | − 11 |
| 16 | PHPPHPHHHPHPPHPHHHPPH | 21 | −9 | − 9 | −4 | − 9 | − 7 |
Comparison of convergence required number of sequences under two criteria (unit: / ten thousand)
| Sequence No. | Criterions | 1 | 2 | 3 | 4 | 5 | AVG |
|---|---|---|---|---|---|---|---|
| 1 | Rigid | 3 | 3 | 3 | 3 | 2 | 3 |
| Flexible | 8 | 9 | 3 | 2 | 7 | 6 | |
| 2 | Rigid | 29 | 28 | 31 | 9 | 10 | 21 |
| Flexible | 37 | 142 | 43 | 22 | 41 | 57 | |
| 3 | Rigid | 439 | 345 | 186 | 200 | 418 | 318 |
| Flexible | – | – | – | – | – | – | |
| 4 | Rigid | 238 | 380 | 256 | 339 | 114 | 265 |
| Flexible | – | – | – | – | – | – |
The number of successfully folding to the lowest energy conformations in the last 100 episodes
| Sequence No. | Methods | 1 | 2 | 3 | 4 | 5 | AVG |
|---|---|---|---|---|---|---|---|
| 5 | Full states | 8 | 10 | 8 | 7 | 6 | |
| Greedy algorithm | 7 | 7 | 8 | 5 | 7 | 7 | |
| Partial states | 0 | 0 | 0 | 0 | 0 | 0 | |
| 6 | Full states | 3 | 8 | 3 | 1 | 6 | 4 |
| Greedy algorithm | 9 | 11 | 6 | 3 | 10 | 8 | |
| Partial states | 0 | 0 | 0 | 0 | 0 | 0 | |
| 7 | Full states | 7 | 4 | 3 | 3 | 3 | 4 |
| Greedy algorithm | 7 | 5 | 7 | 9 | 9 | 7 | |
| Partial states | 0 | 0 | 0 | 0 | 0 | 0 | |
| 8 | Full states | 7 | 8 | 8 | 11 | 5 | |
| Greedy algorithm | 2 | 3 | 2 | 6 | 2 | 3 | |
| Partial states | 0 | 0 | 0 | 1 | 0 | 0 | |
| 9 | Full states | 5 | 4 | 1 | 3 | 2 | |
| Greedy algorithm | 0 | 0 | 0 | 1 | 1 | 0 | |
| Partial states | 0 | 0 | 0 | 0 | 0 | 0 | |
| 10 | Full states | 14 | 12 | 11 | 15 | 12 | |
| Greedy algorithm | 12 | 9 | 9 | 17 | 8 | 11 | |
| Partial states | 1 | 1 | 1 | 2 | 0 | 1 | |
| 11 | Full states | 3 | 7 | 2 | 3 | 4 | |
| Greedy algorithm | 4 | 0 | 0 | 2 | 1 | 1 | |
| Partial states | 0 | 0 | 0 | 0 | 0 | 0 | |
| 12 | Full states | 9 | 7 | 8 | 6 | 8 | |
| Greedy algorithm | 2 | 5 | 2 | 5 | 2 | 3 | |
| Partial states | 0 | 0 | 1 | 0 | 0 | 0 | |
| 13 | Full states | 0 | 2 | 2 | 3 | 4 | |
| Greedy algorithm | 0 | 0 | 0 | 0 | 0 | 0 | |
| Partial states | 0 | 0 | 0 | 0 | 0 | 0 | |
| 14 | Full states | 2 | 0 | 1 | 1 | 2 | |
| Greedy algorithm | 0 | 0 | 0 | 0 | 0 | 0 | |
| Partial states | 0 | 0 | 0 | 0 | 0 | 0 | |
| 15 | Full states | 2 | 4 | 5 | 2 | 2 | |
| Greedy algorithm | 0 | 0 | 0 | 0 | 0 | 0 | |
| Partial states | 0 | 0 | 0 | 0 | 0 | 0 | |
| 16 | Full states | 1 | 2 | 4 | 5 | 1 | |
| Greedy algorithm | 1 | 0 | 0 | 0 | 0 | 0 | |
| Partial states | 0 | 0 | 0 | 0 | 0 | 0 |
The data in bold and italic indicates the average number of successfully folding to the lowest energy conformations by the reinforcement learning with full states is more than the other two methods
Fig. 2The optimal 2D conformations of sequence no.12 under the rigid criterion. a First optimal structure. b Second optimal structure. c Third optimal structure
Fig. 3The last 100 samplings of the training process of three methods. a Training process sampling of full state space. b Training process sampling of greedy algorithm. c Training process sampling of partial state space
Fig. 4Comparison of testing process between full state space and partial state space. a Testing process of full state space. b Testing process of partial state space