| Literature DB >> 32397151 |
Abstract
Novel machine learning and molecular modelling filtering procedures for drug repurposing have been carried out for the recognition of the novel fungicide targets of Cyp51 and Erg2. Classification and regression approaches on molecular descriptors have been performed using stepwise multilinear regression (FS-MLR), uninformative-variable elimination partial-least square regression, and a non-linear method called Forward Stepwise Limited Correlation Random Forest (FS-LM-RF). Altogether, 112 prediction models from two different approaches have been built for the descriptor recognition of fungicide hit compounds. Aiming at the fungal targets of sterol biosynthesis in membranes, antifungal hit compounds have been selected for docking experiments from the Drugbank database using the Autodock4 molecular docking program. The results were verified by Gold Protein-Ligand Docking Software. The best-docked conformation, for each high-scored ligand considered, was submitted to quantum mechanics/molecular mechanics (QM/MM) gradient optimization with final single point calculations taking into account both the basis set superposition error and thermal corrections (with frequency calculations). Finally, seven Drugbank lead compounds were selected based on their high QM/MM scores for the Cyp51 target, and three were selected for the Erg2 target. These lead compounds could be recommended for further in vitro studies.Entities:
Keywords: Fe-N(R)C angle; QM/MM; classification; docking; drug repurposing; regression
Mesh:
Substances:
Year: 2020 PMID: 32397151 PMCID: PMC7249108 DOI: 10.3390/molecules25092198
Source DB: PubMed Journal: Molecules ISSN: 1420-3049 Impact factor: 4.411
Scheme 1Approach I. In silico drug repurposing process for each compound from the Drugbank database, using the criteria set above. From the molecular descriptors and mode of action working (MOAW) training set, we established models for QM//MM calculations in the target protein. In addition, see approach II (differing only in the first three steps from approach I).
Scheme 2Approach II. In silico drug repurposing process, alternative machine learning steps (1–3). The rest of the steps, from step 4, are the same as in Scheme 1.
Figure 1Definition of the Fe-N(R)C angle. ′(R)C′ is the in-plane ring coordinates spanned by the vector sum of the two neighboring atoms to the N coordinated atom. In the specific case of nitrile bonded ligand (Fe-N≡C), the angle is the Fe-N-C angle (without ′(R)′).
Results of Uninformative Variable Elimination Partial-Least Squares UVE-PLS, UVE-PLS-RF, stepwise multilinear regression (FS-MLR), and UVE-PLS-FS-MLR for mode of action MOA classification and the number of selected compounds from Drugbank. The model accuracy by the percent of correct classification (PCC) of cross-validation ‘(CV)’ on the training set and the test set ‘te’.
| No Class. | Method | Nvar | PCC(CV) | PCCte | Passed Step II (III) | No. Hit (Fung) * |
|---|---|---|---|---|---|---|
| 11 | UVE-PLS-DA | 185 | 66.2 | 67.1 | 75 (35) | 34 (13) |
| 11 | UVE-PLS-RF | 185 | 65.0 | 76.8 | 62 (49) | 34 (16) |
| 4 | UVE-PLS-DA | 324 | 77.3 | 77.6 | 636 (2) | 1 (0) |
| 4 | UVE-PLS-RF | 324 | 76.1 | 82.4 | 412 (2) | 2 (2) |
| 3 | UVE-PLS-DA | 115 | 82.8 | 84.1 | 273 (24) | 14 (5) |
| 3 | UVE-PLS-RF | 115 | 81.6 | 86.6 | 334 (20) | 17 (7) |
| 11 | UVE-PLS-FS-MLR | 21 | 68.7 | 63.4 | 64 (34) | 28 (10) |
| 11 | FS-MLR | 30 | 71.8 | 65.9 | 24 (24) | 10 (2) |
| 4 | UVE-PLS-FS-MLR | 6 | 65.7 | 60.8 | 0 (0) | 0 (0) |
| 4 | FS-MLR | 14 | 76.9 | 66.4 | 106 (51) | 24 (5) |
| 3 | UVE-PLS-FS-MLR | 11 | 84.0 | 73.2 | 3 (3) | 0 (0) |
| 3 | FS-MLR | 20 | 91.4 | 79.3 | 100 (18) | 8 (1) |
* “No. hits” denotes the number of Drugbank hit compounds that passed step 4 (number in brackets denotes the already literature-established fungicides among these hits).
Forward Stepwise Limited Correlation Random Forest (FS-LM-RF) results.
| No Class. | Corr Gap | Nvar | PCC(CV) | PCCte | Passed Step II (III) | No. Hits (Fung) * |
|---|---|---|---|---|---|---|
| 11 | 0.5 | 16 | 68.1 | 65.9 | 25 (24) | 21 (7) |
| 11 | 0.6 | 14 | 64.4 | 72.0 | 57 (44) | 36 (12) |
| 11 | 0.7 | 22 | 71.8 | 68.3 | 62 (40) | 27 (8) |
| 11 | 0.8 | 17 | 65.6 | 70.7 | 97 (22) | 14 (4) |
| 11 | 0.9 | 14 | 69.9 | 76.8 | 67 (35) | 24 (11) |
| 4 | 0.5 | 17 | 72.9 | 75.2 | 402 (11) | 8 (4) |
| 4 | 0.6 | 13 | 80.1 | 80.0 | 437 (13) | 7 (6) |
| 4 | 0.7 | 18 | 79.7 | 80.0 | 468 (11) | 5 (3) |
| 4 | 0.8 | 17 | 78.5 | 73.6 | 296 (26) | 21 (13) |
| 4 | 0.9 | 18 | 78.5 | 72.8 | 328 (10) | 4 (3) |
| 3 | 0.5 | 15 | 83.4 | 80.5 | 79 (28) | 21 (6) |
| 3 | 0.6 | 19 | 79.8 | 85.4 | 314 (46) | 35 (13) |
| 3 | 0.7 | 12 | 81.6 | 86.6 | 94 (18) | 11 (8) |
| 3 | 0.8 | 18 | 85.3 | 86.6 | 218 (2) | 1 (0) |
| 3 | 0.9 | 16 | 83.4 | 80.5 | 176 (13) | 13 (5) |
* “No. hits” denotes the number of Drugbank hit compounds that passed step 4 (number in brackets denotes the already literature-established fungicides among these hits).
Results of the two-class regression. Columns 2–5 all denote the average PCC among a certain number of experiments. PCCTP(te) denotes the percent of correct classification of the true positives for the test set (details in Table S6b).
| Method | PCC(CV) | PCC(te) | PCCTP(te) | PCC (27 Fung) | Comp. Sel. (Hits) * | Total No. Rep. |
|---|---|---|---|---|---|---|
| UVE-PLS | 98.14 | 98.5 | 71.33 | 77.14 | 42 | 365 |
| UVE-RF | 96.93 | 98.54 | 70.0 | 91.85 | 26 | 148 |
| RF-LM-FS | 98.40 | 98.37 | 66.55 | 90.37 | 78 | 300 |
| All | 97.95 | 98.46 | 69.33 | 85.27 | 100 (37) | 813 |
* Number in brackets is the total number of hit compounds selected with at least 3 selections. Other numbers in the sixth column denote the numbers of the selected compounds from Drugbank with only one selection.
Analysis of the 100 docked conformations of the known fungicides (ΔG in kcal/mol) taking into account both the energy and geometry of the docked complex (see also Table S7).
| Δ | Δ | α (c) | Δ | Δ | α (c) | ||
|---|---|---|---|---|---|---|---|
| albaconazole | −9.78 | −9.08 | 92.9 | flutriafol | −7.89 | −7.43 | 164.9 |
| bifonazole | −10.02 | −10.02 | 97.0 | fosfluconazole | −10.82 | −7.18 | 157.4 |
| bitertanol | −9.57 | −8.75 | 119.9 | lanoconazole | −8.93 | −8.52 | 106.7 |
| bromuconazole | −8.99 | −8.28 | 168.9 | levoketoconazole | −12.88 | −12.56 | 179.3 |
| butoconazole | −10.14 | −9.51 | 148.10 | luliconazole | −8.97 | −8.88 | 117.1 |
| climbazol | −7.92 | −7.54 | 160.4 | mefentrifluconazole | −9.38 | −8.58 | 115.2 |
| clotrimazole | −9.87 | −9.77 | 157.9 | metconazole | −8.84 | −8.83 | 169.8 |
| croconazole | −9.28 | −8.77 | 170.1 | miconazole | −10.09 | −9.94 | 167.0 |
| eberconazole | −9.98 | −9.98 | 142.6 | myclobutanil | −8.7 | −8.70 | 134.3 (d) |
| econazole | −9.9 | −9.90 | 169.7 | neticonazole | −7.61 | −7.28 | 166.0 |
| efinaconazole | −8.15 | −7.66 | 160.3 | nuarimol | −8.58 | −8.53 | 147.2 |
| epoxiconazole | −9.4 | −8.88 | 164.1 | omoconazole | −10.18 | −10.18 | 169.7 |
| etaconazole | −8.51 | −7.78 | 163.4 | oxiconazole | −10.78 | −10.78 | 168.8 |
| fenarimol | −9.28 | −8.98 | 154.3 | oxpoconazole | −10.17 | −9.99 | 174.4 |
| fenbuconazole | −9.94 | −9.83 | 159.5 | pefurazoate | −7.81 | −7.16 | 110.8 |
| fenticonazole | −11.97 | −11.97 | 164.7 | penconazole | −7.76 | −7.26 | 166.8 |
| fluconazole | −6.76 | −6.76 | 170.4 | posaconazole | −13.81 | −13.04 | 140.9 |
| fluquinconazole | −8.98 | −8.54 | 143.3 | prochloraz | −8.14 | −7.77 | 122.6 |
| furconazole-cis | -8.15 | −8.15 | 117.1 | propiconazole | −8.64 | −8.40 | 148.0 |
| hexaconazole | −7.68 | −6.98 | 174.6 | prothioconazole | −8.12 | −7.98 | 150.2 (e) |
| imazalil | −7.73 | −7.73 | 178.7 | ravuconazole | −10.69 | −9.97 | 171.9 |
| imibenconazole | −9.66 | −9.46 | 154.5 | sertaconazole | −10.6 | −10.6 | 129.0 |
| ipconazole | −9.26 | −8.92 | 176.9 | sulconazole | −9.76 | −9.76 | 162.4 |
| ipfentrifluconazole | −9.53 | −9.53 | 124.0 | tebuconazole | −8.44 | −7.30 | 145.9 |
| isavuconazole | −11.14 | −10.22 | 162.3 | tetraconazole | −7.4 | −6.53 | 170.6 |
| isoconazole | −10.02 | −9.30 | 147.4 | tioconazole | −9.16 | −9.16 | 134.4 |
| itraconazole | −13.8 | −12.52 | 146.1 | triadimenol | −7.98 | −7.36 | 138.7 |
| ketoconazole | −12.59 | −12.59 | 174.7 | triticonazole | −9.64 | −8.37 | 153.1 |
| cyproconazole | −7.72 | −7.72 | 166.6 | voriconazole | −7.75 | −7.75 | 170.0 |
| difenoconazole | −11.09 | −11.09 | 158.8 | oteseconazole | −10.39 | −10.24 | 152.4 (f) |
| diniconazole−M | −8.7 | −7.41 | 109.7 | Average | −9.43 | −9.01 | 150.8 |
(a) Minimum energy conformation of all the 100 conformations considered. (b) Represents the energy of the conformation that has the lowest binding energy (ΔG), but only in subsets among those conformations that contain an Fe-N distance of < 3Å. Others containing Fe-O < 3Å or Fe-Cl < 3Å are not mentioned in this table. (c) Displayed Fe-N-(R)C angle (in degrees, see Figure 1) representing the conformation that has the highest Fe-N-(R)C angle of coordinated ligand among those conformations containing an Fe-N < 3Å. The energy of such a conformation ΔG(max α) is given in Supplementary Table S7. The following rings are considered: imidazole, triazole, tetrazole, pyridine, pyrimidine. (d) It is about the Fe-N-C angle, where C denotes nitrile (C≡N) carbon and N denotes nitrile nitrogen. (e) Triazolithione ring. (f) The highest angle for oteseconazole is 174.7° but involving the Fe-coordinating N(26) tetrazole atom which is tetrazole N3, not tetrazole N4, and is therefore different then the specific coordinating N(27) atom in literature. The maximum angle for N(27) is 152.4, as displayed, and it was used in QM/MM.
Analysis of step 5a using the 100 docked conformations of the hit compounds (ΔG in kcal/mol) for the 5tz1 hit compounds that passed prefiltering (so with ΔG(10) < −10.1 kcal/mol), taking into account both the energy (in kcal/mol) and geometry of the docked complex.
| Hit Comp. | Comment, Default Fe-N<3Å Con-Taining, Otherwise as Commented | For | Hit | Comment, Default Fe-N<3Å Con-Taining, Otherwise as Commented | For |
|---|---|---|---|---|---|
| DB12623 | No Fe-N, but with Fe-O, E −16.1 | Yes | DB04107 | No Fe-N, but with Fe-O, E −10.4 | No |
| DB08387 | max | No | DB04600 | 2 × max | Yes |
| DB00354 | No dist(Fe-X) < 3Å (X=N or O) | No | DB07227 | max | Yes × 2 |
| DB08746 | No Fe-N, but with Fe-O, | No | DB06021 | No dist(Fe-X) < 3Å (X=N or O) | No |
| DB12345 | max | Yes | DB12218 | max | Yes |
| DB08745 | No Fe-N, but with Fe-O, E −16.1 | Yes | DB02917 | No Fe-N, but with Fe-O, E −10.6 | No |
| DB07578 | max | Yes | DB12640 | No dist(Fe-X) < 3.5Å (X = any!) | No |
| DB12561 | max | No | DB00699 | No dist(Fe-X) < 3Å (X=N or O) | No |
| DB12017 | Fe-O, | Yes × 2 | DB13113 | max | Yes |
| DB11679 | No Fe-N, but with Fe-O, E −11.0 | No | DB00737 | No dist(Fe-X) < 3Å (X=N or O) | No |
| DB06834 | No dist(Fe-X) < 3Å (X=N or O) | No | DB07255 | No Fe-N, but with Fe-O, E −9.7 | No |
| DB08922 | No Fe-N, but with Fe-O, E −10.1 | No | DB12364 | No dist(Fe-X) < 3.5Å (X = any!) | No |
| DB04591 | max | Yes | DB01149 | No dist(Fe-X) < 3Å (X=N or O) | No |
| DB12644 | max | No | DB07008 | max | Yes |
| DB04960 | No dist(Fe-X) < 3Å (X=N or O) | No | DB02491 | No Fe-N, but with Fe-O, E −10.2 | No |
| DB13083 | max | Yes | DB09195 | max | No |
| DB12963 | No dist(Fe-X) < 3.5Å (X = any!) | No | DB07011 | max | Yes |
| DB04957 | No dist(Fe-X) < 3Å (X=N or O) | No | DB02706 | No Fe-N, but with Fe-O, E −15.0 | Yes |
| DB07878 | No Fe-N, but with Fe-O, E −11.0 | No | DB08560 | No Fe-N, but with Fe-O, E −10.5 | No |
| DB12682 | max | Yes |
* Two different binding groups of ligands oriented to Fe were selected, one with Adt4 regarding the COO group, and the other regarding the azole group but selected with Autodock vina, although maximum angle was only 139°. ** Two different binding groups of ligands oriented to Fe were selected, both with Adt4, one regarding the nitrile group with a max. Fe-N-C of 146.7, and the other regarding imidazole N3, with two max. Fe-N-C of 134° and 130°.
QM/MM score results of the hit compounds (with Drugbank numbers), six fungicides (clotrimazole-voriconazole), and water (H2O) used for a reference comparison. Presented is Gibbs free binding energy with all (including ZPE correction and entropy) applied corrections. The orientation bound to the iron center with the lowest energy is presented. More detailed tables can be found in Supplementary Tables S11 and S12.
| Compound | Δ | Compound | Δ |
|---|---|---|---|
| clotrimazole (fungicide) | −37.60 (78) (d) | DB07008 | −124.67 * |
| fluconazole (fungicide) | −46.37 (54) (d) | DB12623 | −71.59 |
| miconazole (fungicide) | −91.74 (79) (d) | DB07578 | −70.49 |
| ketoconazole (fungicide) | −86.58 (85) (d) | DB12017 (b) | −70.17 |
| oteseconazole (fungicide) | −106.36 (98) (d) | DB04591 | −82.30 * |
|
| DB07011 | −77.96 * | |
| water molecule (H2O) | −12.22 | DB08745 | −57.15 |
| DB13083 | −82.73 * | DB13113 | −31.15 |
| DB04600 | −225.38 (a) | DB12345 | −96.20 * |
| DB07227 (b) | −128.79 * | DB12218 | - (c) |
| DB02706 | −59.96 | DB12682 | −115.59 * |
* Hit compounds that passed the QM/MM step, due to stronger binding than voriconazole and without a negative note. (a) Negative note: the compound decomposes due to unstable C-O-S(OH)2NH2 in the active center, which implies that it could decompose in similar conditions but with different proteins and in many different sites. (b) Presented is the stronger energy binding orientation; for DB07227, it is nitrile mode and for DB12017, azole mode. (c) Negative note: an attempt with QM/MM gradient minimization has been made and, due to the repulsion steric effect from the neighboring N-H group, the calculations ultimately failed to converge. (d) Number in brackets “( )” for the corresponding fungicide denotes the percent of CYP51 inhibition determined in Reference [11].
List of interacting amino-acid residues with the selected docked fungicides and determined 5tz1 lead compounds. Each interaction is defined as the distance within 4.0 Å between any ligand atom and any protein amino-acid atom (so only heme atoms are excluded and therefore not taken into account).
| Docked Comp. | No. Inter. | List of Interacting Amino-Acid Residues |
|---|---|---|
| voriconazole (fung) | 283 | Tyr118, Leu121, Thr122, Phe126, Ile131, Tyr132, Phe228, Gly303, Ile304, Gly307, Gly308, Thr311, Leu376, Ser378, Ile379, Met508 |
| oteseconazole (fung) | 383 | Tyr64, Gly65, Tyr118, Leu121, Thr122, Phe126, Ile131, Tyr132, Phe228, Pro230, Phe233, Gly303, Ile304, Met306, Gly307, Thr311, Leu376, His377, Ser378, Phe380, Tyr505, Ser506, Ser507, Met508 |
| DB13083 (lead) | 406 | Tyr118, Leu121, Thr122, Phe126, Ile131, Tyr132, Phe228, Pro230, Phe233, Gly303, Ile304, Gly307, Gly308, Thr311, Leu376, His377, Ser378, Ile379, Phe380, Ser507, Met508 |
| DB07227 (lead) | 354 | Tyr64, Gly65, Leu87, Leu88, Tyr118, Leu121, Tyr132, Phe228, Pro230, Phe233, Gly307, His310, Thr311, Leu376, His377, Ser378, Ile379, Phe380, Tyr505, Ser506, Ser507, Met508, Val509 |
| DB07008 (lead) | 312 | Tyr118, Leu121, Tyr132, Phe228, Pro230, Phe233, Gly307, Gly308, Thr311, Leu376, His377, Ser378, Ile379, Phe380, Tyr505, Ser506, Ser507, Met508, Val509 |
| DB12345 (lead) | 477 | Tyr64, Gly65, Tyr118, Leu121, Thr122, Tyr132, Phe228, Pro230, Phe233, Gly307, Thr311, Leu376, His377, Ser378, Ile379, Phe380, Tyr505, Ser506, Ser507, Met508 |
| DB04591 (lead) | 442 | Tyr64, Tyr118, Leu121, Thr122, Phe126, Ile131, Tyr132, Phe228, Pro230, Phe233, Gly303, Ile304, Gly307, Gly308, Thr311, Pro375, Leu376, His377, Ser378, Phe380, Tyr505, Ser507, Met508, Val509, Val510 |
| DB07011 (lead) | 401 | Tyr64, Gly65, Pro68, Tyr118, Leu121, Tyr132, Phe228, Pro230, Phe233, Gly307, Gly308, His310, Thr311, Leu376, His377, Ser378, Ile379, Phe380, Tyr505, Ser506, Ser507, Met508, Val509 |
| DB12682(lead) | 377 | Ala61, Tyr64, Gly65, Leu87, Leu88, Tyr118, Leu121, Thr122, Phe126, Ile131, Tyr132, Phe228, Pro230, Phe233, Gly303, Ile304, Gly307, Thr311, Leu376, His377, Ile379, Phe380, Tyr505, Ser506, Met508, Val509 |
Figure 2All seven finally selected lead compounds bound to the active center in 5tz1. Only the quantum mechanics (QM) region atoms are shown: (a) DB13083, (b) DB07227, (c) DB07008, (d) DB12345, (e) DB04591, (f) DB07011, and (g) DB12682.
QM/MM docking results of the literature ligand compound “(l with reference)”, and the fungicide (dodemorph) and hit compounds (hit) analyses for both the uncharged species and for the ionized species at pH 7.4 (with charge). Presented is the Gibbs free binding energy with ZPE, BSSE, and dispersion correction applied (ΔG).
| Uncharged Species | Δ | Compounds at pH 7,4 (Charged) | Δ |
|---|---|---|---|
| Pentazocine (l) [ | +141.41 | dodemorph (fung) (+1) | −294.39 (r) |
| dodemorph (fung) | −54.39 (r) | tamoxifene (l) (+1) | −285.95 |
| opipramol (l) [ | −55.16 | Pentazocine (l) (+1) | −102.98 |
| PD144418 (l) [ | −86.53 | PD144418 (l) (+1) | −367.39 |
| tamoxifene (l) [ | −45.94 | DB06555 (hit) (+1) | −295.03 |
| DB07075 (hit) | −185.13* | DB08622 (hit) (+1) | −319.22 * |
| DB07075 (hit) (a) | −200.68 | DB08746 (hit) (+1) | −370.02 * |
| DB08622 (hit) | −65.06 * | DB02491 (hit) (+1) | −159.12 |
| DB08746 (hit) | −55.11 * | DB00637 (hit) (+1) | −163.40 |
| DB12345 (hit) | −25.24 | ||
| DB00637 (hit) | +16.96 | ||
| DB06555 (hit) | −50.70 | opipramol (l) (+2) | −498.73 (r) |
| DB02491 (hit) | −8.19 | DB07075 (+2) | −519.81 * |
* Hit compounds that passed the QM/MM step due to stronger binding both as uncharged species relative to dodemorph (r) and in charged state at a pH of 7.4 relative to dodemorph ((r) as +1 species) or opipramol ((r) as +2 species). (a) Same hit (DB07075) as in the row above but with a repeated run to verify the calculated QM/MM score again, with the ORCA input geometry used from a slightly different Autodock4 output conformation.
Figure 3(a) DB07075 free ligand optimized in water solution (PCM). (b) DB07075 ligand optimized in the bound state with the 5hk1 protein. Please note that when compared with free ligand (a), in this case (b) the C2 piperidine atom is negative, lacking one proton which is translocated to the -NH2 group forming -NH3+. (c) The DB08622 ligand is optimized in the bound state with the 5hk1 protein (nothing unusual). (d) The DB08746 ligand is optimized in the bound state with the 5hk1 protein (nothing unusual). This might explain why the QM/MM score is much more negative for DB07075 than for DB08622 or DB08746. Proton translocation, when binding from free ligand to the complex, took place only with DB07075.