| Literature DB >> 28905320 |
Boxin Guan1, Changsheng Zhang2, Jiaxu Ning1.
Abstract
Protein-ligand docking plays an important role in computer-aided pharmaceutical development. Protein-ligand docking can be defined as a search algorithm with a scoring function, whose aim is to determine the conformation of the ligand and the receptor with the lowest energy. Hence, to improve an efficient algorithm has become a very significant challenge. In this paper, a novel search algorithm based on crossover elitist preservation mechanism (CEP) for solving protein-ligand docking problems is proposed. The proposed algorithm, namely genetic algorithm with crossover elitist preservation (CEPGA), employ the CEP to keep the elite individuals of the last generation and make the crossover more efficient and robust. The performance of CEPGA is tested on sixteen molecular docking complexes from RCSB protein data bank. In comparison with GA, LGA and SODOCK in the aspects of lowest energy and highest accuracy, the results of which indicate that the CEPGA is a reliable and successful method for protein-ligand docking problems.Entities:
Keywords: AutoDock; Crossover elitist preservation; Genetic algorithm; Pharmaceutical development; Protein–ligand docking
Year: 2017 PMID: 28905320 PMCID: PMC5597564 DOI: 10.1186/s13568-017-0476-0
Source DB: PubMed Journal: AMB Express ISSN: 2191-0855 Impact factor: 3.298
Fig. 1Diagram of CEP. (1) The individuals of the previous generation pair and cross to generate the individuals of the next generation. The optimum individual 2a serves as an elitist individual, and its parents are preserved. (2) The adjusted individuals of the next generation after CEP
Pseudo-code of CEPGA
Fig. 2Block diagram of CEPGA. The basic process of CEPGA is showed in the figure. CEP was applied after crossover so that the genes of the new population are excellent
Lowest energy and smallest RMSD results of five compared algorithms
| CEPGA | LGA | GA | SODOCK | ABC | |||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| PDB | Ligand (torsions) | Energy | RMSD | Energy | RMSD | Energy | RMSD | Energy | RMSD | Energy | RMSD |
| 3ptb | ben (0) | −11.72 | 1.90 | −11.46 | 1.92 | −10.31 | 1.66 | −11.57 | 2.00 | −10.90 | 1.97 |
| 1aha | ade (1) | −15.32 | 0.89 | −16.10 | 0.45 | −15.16 | 1.28 | −14.95 | 1.44 | −13.90 | 1.80 |
| 3hvt | nvp (2) | −17.90 | 0.30 | −17.22 | 0.33 | −15.73 | 0.43 | −16.78 | 0.58 | −15.60 | 0.55 |
| 1phg | hem (3) | −9.32 | 0.64 | −8.56 | 0.80 | −7.46 | 1.20 | −8.95 | 1.54 | −7.95 | 1.67 |
| 2mcp | pc (4) | −9.10 | 1.20 | −8.22 | 1.33 | −7.76 | 1.46 | −7.72 | 1.42 | −7.80 | 1.54 |
| 1stp | btn (5) | −13.57 | 0.90 | −13.37 | 1.65 | −11.03 | 1.84 | −13.52 | 1.00 | −13.17 | 1.68 |
| 6rnt | ca (6) | −9.32 | 0.58 | −9.13 | 0.70 | −8.58 | 0.69 | −9.12 | 1.95 | −8.90 | 1.55 |
| 4dfr | mtx (7) | −12.12 | 1.90 | −11.44 | 1.23 | −10.01 | 0.95 | −11.34 | 1.60 | −10.21 | 1.97 |
| 1ett | 4qq (8) | −14.21 | 1.29 | −13.89 | 1.38 | −11.42 | 1.62 | −12.06 | 1.56 | −12.70 | 1.70 |
| 1hri | s57 (9) | −10.89 | 1.38 | −10.21 | 1.87 | −9.67 | 1.80 | −10.31 | 1.68 | −10.13 | 1.67 |
| 1hvr | xk2 (10) | −31.06 | 0.64 | −30.85 | 0.62 | −21.95 | 1.68 | −29.29 | 0.68 | −28.64 | 0.85 |
| 4hmg | sia (11) | −10.32 | 1.89 | −10.09 | 1.70 | −8.44 | 1.69 | −10.08 | 1.36 | −9.80 | 1.54 |
| 1cdg | mol (12) | −8.70 | 1.45 | −8.22 | 1.94 | −7.32 | 1.69 | −8.45 | 1.80 | −7.13 | 1.12 |
| 1htf | g26 (13) | −21.48 | 1.27 | −20.69 | 1.33 | −18.86 | 1.46 | −21.79 | 1.42 | −19.17 | 1.96 |
| 1glq | gtb (14) | −9.46 | 1.38 | −9.27 | 1.87 | −7.97 | 1.87 | −8.83 | 1.90 | −9.13 | 1.60 |
| 1tmn | nas (15) | −10.29 | 0.85 | −10.11 | 1.20 | −9.68 | 1.11 | −10.62 | 1.95 | −9.37 | 0.60 |
Fig. 3Convergence diagrams of five algorithms in different PDB. The energy is used as the vertical axis and the number of energy evaluations Neval is used as the horizontal axis. The energy values of different Neval are recorded
Fig. 4Box plots of five algorithms in different PDB. The energy is used as the vertical axis and the five compared algorithms are used as the horizontal axis. The energy values of the upper edge, the upper quartile, the median, median, the lower four quantile, and the lower edge are recorded, respectively
Hypothesis test result
| CEPGA | LGA | GA | SODOCK | ABC | |
|---|---|---|---|---|---|
| 3ptb | |||||
| CEPGA | – | 0.012 | 0.004 | 0.028 | 0.006 |
| LGA | 0.988 | – | 0.008 | 0.563 | 0.225 |
| GA | 0.996 | 0.992 | – | 0.995 | 0.688 |
| SODOCK | 0.972 | 0.437 | 0.005 | – | 0.100 |
| ABC | 0.994 | 0.775 | 0.312 | 0.900 | – |
| 1aha | |||||
| CEPGA | – | 0.519 | 0.402 | 0.124 | 0.105 |
| LGA | 0.481 | – | 0.036 | 0.017 | 0.004 |
| GA | 0.598 | 0.964 | – | 0.342 | 0.260 |
| SODOCK | 0.976 | 0.983 | 0.658 | – | 0.470 |
| ABC | 0.895 | 0.996 | 0.740 | 0.530 | – |
| 3hvt | |||||
| CEPGA | – | 0.035 | 0.007 | 0.023 | 0.013 |
| LGA | 0.965 | – | 0.205 | 0.324 | 0.150 |
| GA | 0.993 | 0.795 | – | 0.778 | 0.437 |
| SODOCK | 0.977 | 0.676 | 0.222 | – | 0.215 |
| ABC | 0.987 | 0.850 | 0.463 | 0.585 | – |
| 1phg | |||||
| CEPGA | – | 0.036 | 0.008 | 0.041 | 0.017 |
| LGA | 0.964 | – | 0.016 | 0.624 | 0.450 |
| GA | 0.992 | 0.984 | – | 0.988 | 0.537 |
| SODOCK | 0.959 | 0.376 | 0.012 | – | 0.215 |
| ABC | 0.983 | 0.550 | 0.463 | 0.785 | – |
| 2mcp | |||||
| CEPGA | – | 0.013 | 0.004 | 0.002 | 0.003 |
| LGA | 0.987 | – | 0.203 | 0.182 | 0.224 |
| GA | 0.996 | 0.797 | – | 0.492 | 0.610 |
| SODOCK | 0.997 | 0.818 | 0.508 | – | 0.640 |
| ABC | 0.995 | 0.776 | 0.390 | 0.360 | – |
| 1stp | |||||
| CEPGA | – | 0.034 | 0.007 | 0.042 | 0.013 |
| LGA | 0.964 | – | 0.009 | 0.624 | 0.450 |
| GA | 0.993 | 0.991 | – | 0.992 | 0.487 |
| SODOCK | 0.958 | 0.376 | 0.008 | – | 0.215 |
| ABC | 0.987 | 0.550 | 0.513 | 0.785 | – |
| 6rnt | |||||
| CEPGA | – | 0.035 | 0.008 | 0.029 | 0.010 |
| LGA | 0.965 | – | 0.018 | 0.368 | 0.127 |
| GA | 0.992 | 0.982 | – | 0.695 | 0.588 |
| SODOCK | 0.971 | 0.632 | 0.305 | – | 0.404 |
| ABC | 0.990 | 0.873 | 0.412 | 0.496 | – |
| 4dfr | |||||
| CEPGA | – | 0.015 | 0.005 | 0.011 | 0.008 |
| LGA | 0.985 | – | 0.009 | 0.337 | 0.115 |
| GA | 0.995 | 0.991 | – | 0.986 | 0.685 |
| SODOCK | 0.989 | 0.663 | 0.014 | – | 0.142 |
| ABC | 0.992 | 0.885 | 0.315 | 0.858 | – |
| 1ets | |||||
| CEPGA | – | 0.025 | 0.009 | 0.015 | 0.018 |
| LGA | 0.975 | – | 0.018 | 0.063 | 0.127 |
| GA | 0.991 | 0.982 | – | 0.595 | 0.688 |
| SODOCK | 0.985 | 0.937 | 0.405 | – | 0.504 |
| ABC | 0.982 | 0.873 | 0.312 | 0.496 | – |
| 1hri | |||||
| CEPGA | – | 0.038 | 0.002 | 0.040 | 0.015 |
| LGA | 0.962 | – | 0.014 | 0.723 | 0.151 |
| GA | 0.998 | 0.986 | – | 0.982 | 0.637 |
| SODOCK | 0.960 | 0.277 | 0.012 | – | 0.020 |
| ABC | 0.985 | 0.849 | 0.363 | 0.980 | – |
| 1hvr | |||||
| CEPGA | – | 0.043 | 0.005 | 0.011 | 0.009 |
| LGA | 0.957 | – | 0.038 | 0.177 | 0.044 |
| GA | 0.995 | 0.962 | – | 0.942 | 0.565 |
| SODOCK | 0.989 | 0.823 | 0.058 | – | 0.168 |
| ABC | 0.991 | 0.956 | 0.435 | 0.832 | – |
| 4hmg | |||||
| CEPGA | – | 0.020 | 0.005 | 0.017 | 0.010 |
| LGA | 0.980 | – | 0.008 | 0.417 | 0.214 |
| GA | 0.995 | 0.992 | – | 0.988 | 0.900 |
| SODOCK | 0.983 | 0.583 | 0.012 | – | 0.240 |
| ABC | 0.990 | 0.786 | 0.100 | 0.760 | – |
| 1cdg | |||||
| CEPGA | – | 0.017 | 0.006 | 0.044 | 0.005 |
| LGA | 0.983 | – | 0.117 | 0.763 | 0.105 |
| GA | 0.994 | 0.883 | – | 0.985 | 0.408 |
| SODOCK | 0.956 | 0.237 | 0.015 | – | 0.012 |
| ABC | 0.995 | 0.895 | 0.592 | 0.988 | – |
| 1htf | |||||
| CEPGA | – | 0.148 | 0.023 | 0.640 | 0.015 |
| LGA | 0.852 | – | 0.027 | 0.883 | 0.151 |
| GA | 0.977 | 0.973 | – | 0.987 | 0.637 |
| SODOCK | 0.360 | 0.127 | 0.013 | – | 0.017 |
| ABC | 0.985 | 0.849 | 0.363 | 0.983 | – |
| 1glq | |||||
| CEPGA | – | 0.045 | 0.009 | 0.049 | 0.042 |
| LGA | 0.955 | – | 0.018 | 0.163 | 0.227 |
| GA | 0.991 | 0.982 | – | 0.695 | 0.788 |
| SODOCK | 0.951 | 0.837 | 0.305 | – | 0.704 |
| ABC | 0.958 | 0.773 | 0.212 | 0.296 | – |
| 1tmn | |||||
| CEPGA | – | 0.317 | 0.008 | 0.744 | 0.095 |
| LGA | 0.683 | – | 0.217 | 0.763 | 0.105 |
| GA | 0.992 | 0.783 | – | 0.905 | 0.408 |
| SODOCK | 0.256 | 0.237 | 0.005 | – | 0.012 |
| ABC | 0.995 | 0.895 | 0.592 | 0.988 | – |