| Literature DB >> 33803906 |
Emmanuel Broni1, Samuel K Kwofie1,2, Seth O Asiedu3, Whelton A Miller4,5,6, Michael D Wilson3,4.
Abstract
The huge burden of leishmaniasis caused by the trypanosomatid protozoan parasite Leishmania is well known. This illness was included in the list of neglected tropical diseases targeted for elimination by the World Health Organization. However, the increasing evidence of resistance to existing antimonial drugs has made the eradication of the disease difficult to achieve, thus warranting the search for new drug targets. We report here studies that used computational methods to identify inhibitors of receptors from natural products. The cell division cycle-2-related kinase 12 (CRK12) receptor is a plausible drug target against Leishmania donovani. This study modelled the 3D molecular structure of the L. donovani CRK12 (LdCRK12) and screened for small molecules with potential inhibitory activity from African flora. An integrated library of 7722 African natural product-derived compounds and known inhibitors were screened against the LdCRK12 using AutoDock Vina after performing energy minimization with GROMACS 2018. Four natural products, namely sesamin (NANPDB1649), methyl ellagic acid (NANPDB1406), stylopine (NANPDB2581), and sennecicannabine (NANPDB6446) were found to be potential LdCRK12 inhibitory molecules. The molecular docking studies revealed two compounds NANPDB1406 and NANPDB2581 with binding affinities of -9.5 and -9.2 kcal/mol, respectively, against LdCRK12 which were higher than those of the known inhibitors and drugs, including GSK3186899, amphotericin B, miltefosine, and paromomycin. All the four compounds were predicted to have inhibitory constant (Ki) values ranging from 0.108 to 0.587 μM. NANPDB2581, NANPDB1649 and NANPDB1406 were also predicted as antileishmanial with Pa and Pi values of 0.415 and 0.043, 0.391 and 0.052, and 0.351 and 0.071, respectively. Molecular dynamics simulations coupled with molecular mechanics Poisson-Boltzmann surface area (MM/PBSA) computations reinforced their good binding mechanisms. Most compounds were observed to bind in the ATP binding pocket of the kinase domain. Lys488 was predicted as a key residue critical for ligand binding in the ATP binding pocket of the LdCRK12. The molecules were pharmacologically profiled as druglike with inconsequential toxicity. The identified molecules have scaffolds that could form the backbone for fragment-based drug design of novel leishmanicides but warrant further studies to evaluate their therapeutic potential.Entities:
Keywords: CRK12; Leishmania donovani; Leishmaniasis; leishmanicide; molecular docking; molecular dynamics simulation; natural products
Year: 2021 PMID: 33803906 PMCID: PMC8003136 DOI: 10.3390/biom11030458
Source DB: PubMed Journal: Biomolecules ISSN: 2218-273X
Figure 1Methodology schema employed in this study for predicting potential antileishmanial compounds. Three modelling techniques comprising Modeller [29,30], I-TASSER [31,32,33,34] and Robetta [35,36,37] were used to predict potential LdCRK12 structures. Evaluation of the predicted protein structures revealed the reasonably best model. Natural compounds from the African Natural Product Database (AfroDB), as well as the North African Natural Product Database (NANPDB) and known antileishmanial compounds, were docked against LdCRK12 and the human CDK9 receptors. The potential lead compounds were subjected to absorption, distribution, metabolism, excretion, and toxicity (ADMET), biological activity predictions, and molecular dynamics (MDs) computations.
BLAST results showing identical proteins to the LdCRK12. The best template is selected based on the E-value, sequence identity, BLAST score, and availability of a 3D structure.
| ID | Protein Name | E-Value | BLAST Score | Identity (%) |
|---|---|---|---|---|
| O14098 | C-terminal heptapeptide repeat domain CTD kinase subunit alpha (Schizosaccharomyces pombe (strain 972/ATCC 24843) (Fission yeast)) | 4.2 × 10−34 | 356 | 36 |
| Q9TVL3-2 | Isoform a, of Probable cyclin-dependent kinase 9 (Caenorhabditis elegans) | 9.7 × 10−34 | 348 | 35 |
| Q641Z4 | Cyclin-dependent kinase 9 (Rattus norvegicus) | 7.4 × 10−34 | 345 | 31.3 |
| Q99J95 | Cyclin-dependent kinase 9 (Mus musculus) | 7.4 × 10−34 | 345 | 31.3 |
| P50750 | Cyclin-dependent kinase 9 (Homo sapiens) | 7.4 × 10−34 | 345 | 31.3 |
| Q5EAB2 | Cyclin-dependent kinase 9 (Bos taurus) | 1.4 × 10−33 | 343 | 31 |
Discrete optimized protein energy (DOPE) and GA341 scores of the 5 generated models using Modeller 9.2.
| Models | Dope Score | GA341 Score |
|---|---|---|
| MOD1 | −49,545.96484 | 0.36807 |
| MOD2 | −49,137.34766 | 0.20576 |
| MOD3 | −47,466.54688 | 0.10907 |
| MOD4 | −49,459.57422 | 0.28138 |
| MOD5 | −50,486.88281 | 0.21007 |
Predicted I-TASSER models and C-scores.
| Models | ITAS1 | ITAS2 | ITAS3 | ITAS4 | ITAS5 |
|---|---|---|---|---|---|
| C-Score | −3.68 | −3.85 | −2.87 | −2.77 | −2.66 |
Model evaluation of the Robetta predicted models using SAVES v5.0.
| Models | Model Score | |||
|---|---|---|---|---|
| Verify (%) | Errat | Prove (%) | Procheck | |
| ROB1 | 82.97 | 88.0579 | 0.0 (Pass) | 4E, 3W and 2P |
| ROB2 | 66.06 | 87.4259 | 0.0 (Pass) | 5E, 2W and 2P |
| ROB3 | 65.83 | 84.7073 | 6.1 (Error) | 5E, 1W and 3P |
| ROB4 | 67.54 | 83.6538 | 6.8 (Error) | 5E, 2W and 2P |
| ROB5 | 78.55 | 87.822 | 5.8 (Error) | 5E, 0W and 4P |
Model evaluation of the top 3 LdCRK12 structures modelled via Modeller, Robetta, and I-TASSER. Models ROB1, ITAS5, and MOD5 were generated using Robetta, I-TASSER, and Modeller, respectively. E: error; W: warning; and P: pass.
| Tool | Model Score | ||
|---|---|---|---|
| ROB1 | ITAS5 | MOD5 | |
| Verify (%) | 82.97 | 85.36 | 41.20 |
| Errat (Quality Factor) | 88.0579 | 80.2158 | 10.0536 |
| Prove (%) | 0.0 (Pass) | 9.5 (Error) | 16.1 (Error) |
| Procheck | 4E, 3W and 2P | 6E, 2W and 1P | 5E, 2W and 1P |
Figure 2Cartoon views of the top 3 predicted tertiary structures of the LdCRK12 from the 3 techniques used: (a) MOD5; (b) ITAS5; (c) ROB1; and (d) top 3 models aligned. MOD5, ITAS5, and ROB1 are colored in magenta, cyan and green, respectively.
Ramachandran plot statistics for the best models from the 3 modelling techniques. For all 3 models, the number of end residues (excluding Gly and Pro) = 2, Glycine residues = 65, Proline residues = 85, and the total number of residues = 881.
| Model | MOD5 | ITAS5 | ROB1 | |||
|---|---|---|---|---|---|---|
| No. of Residues | Percentage | No. of Residues | Percentage | No. of Residues | Percentage | |
| Most favored regions [A, B, L] | 581 | 79.7 | 445 | 61.0 | 598 | 82.0 |
| Additionally allowed regions [a, b, l, p] | 113 | 15.5 | 217 | 29.8 | 125 | 17.1 |
| Generously allowed regions [~a, ~b, ~l, ~p] | 22 | 3.0 | 43 | 5.9 | 3 | 0.4 |
| Disallowed regions | 13 | 1.8 | 24 | 3.3 | 3 | 0.4 |
| Non-glycine and non-proline residues | 729 | 100.0 | 729 | 100.0 | 729 | 100.0 |
Figure 3Ramachandran plot of the selected LdCRK12 structure (ROB1) obtained via PROCHECK. The percentages of residues in the most favored regions, additionally allowed regions, generously allowed regions and disallowed regions are 82.0%, 17.1%, 0.4%, and 0.4%, respectively.
Figure 4Model quality assessment using ProSA-web. (a) A z-score of the best LdCRK12 structure indicating the overall model quality, and (b) a local model quality of the selected LdCRK12 structure by plotting energies as a function of amino acid sequence position.
Predicted binding sites located around the kinase domain of the LdCRK12. Solvent accessible (SA) values are shown.
| Area (SA)/Å2 | Volume (SA)/Å3 | Residues Lining the Pocket | |
|---|---|---|---|
| 1 | 566.585 | 712.561 | Leu438, Pro439, Ala441, Pro442, Pro443, Pro444, Ser445, Glu463, Lys464, Leu465, Ser466, Glu467, Gly468, Thr469, Tyr470, Val473, Lys475, Ala486, Leu487, Lys488, Glu506, Leu510, Ser544, Phe563, Ala564, Tyr565, Ala566, Thr567, Ala568, Ser569, Ala571, Gly572, Arg575, Arg576, His606, Asp608, Lys610, Asp612, Asn613, Leu615, Thr625, Asp626, Phe627, Leu629, Cys630, Val650, Thr652 |
| 2 | 312.963 | 420.314 | Met492, Thr495, His496, Gly498, Phe499, Pro500, Gln501, Thr502, Arg505, Arg607, Gly628, Leu629, Cys630, Ser631, Arg639, Cys640, Val644, Thr647, Pro648, Ser649, Val650, Ile651, Arg656, Met660, Thr665, Tyr667, Ser708, Ala709, Glu712 |
| 3 | 443.095 | 377.107 | Ile536, Arg597, Lys598, His600, Glu601, Arg603, Pro635, Asp668, Glu669, Lys670, Thr823, Ala825, Glu826, Leu828, Arg829, Leu836, Asp837, Asp838, Ala839, Pro840, Leu841, Leu842, Tyr845, Gln846, Arg847, Val848, Leu849 |
| 4 | 207.834 | 203.064 | Arg692, His693, Ala695, Gln696, Gln699, Gln700, Arg703, Pro705, Thr711, Glu714, Gln715, Ser717, Thr720, Glu721, Gln749, Ala758, Ala759, Gln760, Ala762 |
| 5 | 110.211 | 105.944 | Thr720, Pro725, Leu726, Pro727, Pro728, Val731, Leu743, Leu746, Glu747, Gln749, Gly750, Arg751, Glu754, Pro761, Ala762, Asn763 |
| 6 | 141.403 | 95.351 | Ala571, Arg575, Lys610, Ser611, Asp612, Thr652, Ala654, Tyr655, Gln682, Leu686, Glu687, Pro688, Tyr691, Arg694, Phe780 |
| 7 | 50.346 | 89.822 | Pro635, Gly636, Ser637, Leu849, Pro850, Thr852 |
| 8 | 159.056 | 87.650 | Leu685, Glu687, Pro688, Pro689, Tyr691, Arg692, Arg694, Ala695, Gln698, Gln699, Arg718, Glu721, Ser774, Phe775, Leu776, Gln778, Gln779, Phe780 |
| 9 | 101.847 | 84.378 | Ala342, Val402, Ala403, Met404, Gly405, Leu412, Arg413, Leu415, Pro417, Tyr420, Arg429 |
| 10 | 60.970 | 41.895 | Phe580, Glu584, Leu587, Leu588, Lys591, Glu619, Gly620, Val622 |
| 11 | 150.732 | 38.172 | Cys574, Phe578, Ala579, Phe580, Thr581, Pro582, Met585, Gln682, Met683, Phe684, Leu686, Ile770, Phe771, Gly785, Trp786, Glu788, Glu790, His799, Arg801, Pro802 |
| 12 | 68.900 | 28.853 | Thr642, His643, Val644, Pro658, Glu659, Leu662, Gly663, Ser664, Leu726, Ser736, His739, Met740, Leu816, Pro818, Arg821 |
| 13 | 69.295 | 22.359 | Pro427, Arg429, Arg430, Val434, Gly435, Phe448, Gln452, Lys456 |
| 14 | 71.553 | 21.095 | Pro658, Leu662, Leu713, Leu716, Ser717, Ile719, Thr720, Gly724, Pro725, Leu726, Val742, Gln745, Leu746, Gln749, Leu816 |
Figure 5Superimposition of human CDK9-T6Q complex with the LdCRK12. LdCRK12, CDK9, and T6Q are colored in red, blue, and green, respectively.
The binding energies and intermolecular bonds between LdCRK12 and selected compounds.
| Compound | Binding Energy (kcal/mol) | Hydrogen Bonds (Bond Length (Å)) | Hydrophobic Bonds | |||
|---|---|---|---|---|---|---|
| CDK9 | CDK9 | CDK9 | ||||
| ZINC000095485940 | −10.1 | −7.7 | Gly468 (2.93), Ser569 (2.95), Asp626 (2.70) | Arg195 (3.06, 3.22), Glu234 (2.9), Arg343 (3.1) | Leu465, Ser466, Thr469, Val473, Ala486, Lys488, Ser544, Phe563, Asp612, Asn613, Leu615, Thr625 | Arg188, Leu192, Arg195, Thr233, Glu234, Tyr338, Ala340, Arg343 |
| NANPDB1406 | −9.5 | −7.3 | Lys488 (3.26), Ala566 (2.89, 2.97, 3.07), Ser569 (3.01) | Asn232 (2.8), Phe336 (3.08), Ala340 (2.69, 3.15), Arg343 (2.88, 3.1, 3.17, 3.25) | Leu465, Ser466, Gly468, Val473, Ala486, Tyr565, Thr567, Ala568, Asp612, Leu615, Asp626 | Asn232, Thr233, Met335, Phe336, Tyr338, Ala340, Arg343 |
| NANPDB2581 | −9.2 | −7.5 | Lys610 (3.08) | Arg195 (3.32), Arg343 (3.09, 3.35) | Leu465, Ser466, Thr469, Tyr470, Ala568, Ser569, Asp612, Asn613, Leu615, Asp626 | Leu192, Arg195, Thr233, Glu234, Pro341, Pro342, Arg343 |
| NANPDB6446 | −9.1 | −7.3 | Ser569 (2.77, 3.02), Arg575 (2.87, 3.15) | Asn179 (3.04), Tyr259 (2.89) | Leu465, Ser466, Ala568, Gly572, Asp612, Asp626 | Asn179, Pro182, Glu203, Asp205, Trp253, Asn258, Tyr259, Pro300 |
| Compound 8 | −9.1 | −9.0 | Leu723 (2.83) | Lys48 (2.91), Asp149 (2.99, 3.08) | Gly724, Pro725, Leu726, Pro727, Pro728, Val731, Leu743, Glu747, Asn763, Trp764, Gln815, Leu816 | Thr29, Phe30, Leu51, Pro60, Thr62, Asp149, Leu170, Arg188, Val190, Thr191, Leu192, Met335 |
| T6Q | −9.1 | −8.6 | Thr469 (3.06) | - | Leu465, Ser466, Gly468, Thr469, Val473, Lys488, Ala568, Ser569, Arg575, Lys610, Asp612, Asn613, Leu615, Asp626 | Ile25, Phe30, Val33, Ly48, Asp109, Gly112, Ala153, Leu156, Ala166, Asp167, His331, Leu332, Thr333 |
| DDD853651/GSK3186899/Compound 7 | −8.5 | −8.8 | Ser466 (2.96), Gly468 (3.19), Lys488 (3.03), Ser544 (3.27), Thr625 (3.12), Asp626 (3.31, 3.3), Tyr691 (2.98) | Glu107 (3.07, 2.98) | Gly468, Thr469, Tyr470, Val473, Ala486, Lys488, Phe563, Lys610, Asp612, Leu615, Asp626, Tyr691 | Ile25, Val33, Lys35, Lys48, Phe103, Glu107, His108, Asp109, Ala166, Asp167 |
| Compound 5 | −7.2 | −8.6 | Leu723 (2.98, 3.07) | Cys106 (3.2, 3.0) | Gly724, Pro725, Leu726, Pro727, Pro728, Val731, Tyr732, Leu743, Asn763, Trp764, Gln815, Leu816, Asp817, Gln820 | Ile25, Val33, Ala46, Lys48, Phe103, Phe105, Glu107, His108, Asp109, Gly112, Leu113, Ala153, Asn154, Leu156, Ala166, Asp167 |
Figure 6Cartoon representation of LdCRK12 in complex with: (a) ZINC000095485940; (b) NANPDB1406 (methyl ellagic acid); (c) NANPDB2581 (stylopine); and (d) GSK3186899 (Compound 7). The binding site is shown as surface representation with the ligands shown as sticks.
Ligand quality assessment metrics for selected compounds. The metrics include inhibitory constant (Ki), ligand efficiency (LE), LE scale (LE_scale), fit quality (FQ), LE-dependent lipophilicity (LELP), and calculated logP (cLogP).
| Compound | Binding Energy | NHA | cLogP | Ki (µM) | LE | LE_Scale | FQ | LELP |
|---|---|---|---|---|---|---|---|---|
| ZINC000095485940 | −10.1 | 29 | −0.1814 | 0.039 | 0.348 | 0.347 | 1.003 | 0.521 |
| NANPDB1406 | −9.5 | 23 | 1.5531 | 0.108 | 0.413 | 0.416 | 0.993 | 3.761 |
| NANPDB2581 | −9.2 | 24 | 3.3633 | 0.180 | 0.383 | 0.404 | 0.948 | 8.781 |
| NANPDB6446 | −9.1 | 26 | −0.8296 | 0.213 | 0.35 | 0.380 | 0.921 | 2.370 |
| NANPDB1649 | −8.5 | 26 | 3.2246 | 0.587 | 0.327 | 0.380 | 0.861 | 9.861 |
Selected compounds and known LdCRK12 inhibitors with their two-dimensional (2D) structures and common names or International Union of Pure and Applied Chemistry (IUPAC) names. The IUPAC names were generated using the Marvin suite (http://www.chemaxon.com/; accessed on 27 February 2020).
| Compound ID | Common/IUPAC Name | 2D Structure |
|---|---|---|
| ZINC000095485940 | (1R,2R,4R,7S,8R,10R,11R,12R,13R,16S)-7-(furan-3-yl)-10,13-dihydroxy-8,13-dimethyl-3,6,14-trioxapentacyclo[9.7.0.02,4.02,8.012,16]octadecane-5,18-dione |
|
| NANPDB1406 | methyl ellagic acid |
|
| NANPDB2581 | stylopine |
|
| NANPDB6446 | sennecicannabine |
|
| NANPDB1649 | sesamin |
|
| Compound 5 | 2-methyl-N-[(1r,4r)-4-{[3-(2-methoxyphenyl)-1H-pyrazolo[3,4-d]pyrimidin-6-yl]amino}cyclohexyl]propane-1-sulfonamide |
|
| GSK3186899 | DDD853651/Compound 7 |
|
| Compound 8 | 2-(2,4-difluorophenyl)-1-(4-{[3-(2-methoxyphenyl)-1H-pyrazolo[3,4-d]pyrimidin-6-yl]amino}piperidin-1-yl)ethan-1-one |
|
Figure 7The root mean square deviation (RMSD), radius of gyration (Rg), and root mean square fluctuation (RMSF) graphs of the LdCRK12–ligand complexes generated over a 10 ns molecular dynamics simulation. (a) RMSD versus time graph of LdCRK12–ligand complexes; (b) Radius of gyration (Rg) versus time graph of LdCRK12–ligand complexes; and (c) Analysis of RMSF trajectories of residues of LdCRK12–ligand complexes. For the 3 graphs, the unbound protein (LdCRK12), compound 5, GSK3186899 (compound 7), compound 8, NANPDB1406, NANPDB1649, NANPDB2581, NANPDB6446, and ZINC000095485940 are represented as black, red, green, blue, yellow, brown, grey, purple and cyan, respectively.
Contributing energy terms of the MM/PBSA computations for the LdCRK12–ligand complexes. Energy values are shown as average ± standard deviations in kJ/mol. SASA denote solvent accessible surface area.
| van der Waal Energy (kJ/mol) | Electrostatic Energy (kJ/mol) | Polar Solvation Energy (kJ/mol) | SASA Energy (kJ/mol) | Binding Energy (kJ/mol) | |
|---|---|---|---|---|---|
| Compound 5 | −98.909 ± 22.552 | −9.113 ± 13.340 | 67.471 ± 24.307 | −13.471 ± 2.359 | −54.023 ± 17.067 |
| DDD853651/GSK3186899/Compound 7 | −107.423 ± 24.517 | −43.202 ± 23.069 | 140.188 ± 36.187 | −16.945 ± 2.444 | −27.382 ± 20.792 |
| Compound 8 | −138.191 ± 15.201 | −17.732 ± 9.037 | 103.997 ± 24.258 | −16.683 ± 1.392 | −68.609 ± 13.327 |
| NANPDB1406 | −125.840 ± 9.460 | −40.995 ± 10.072 | 157.502 ± 22.889 | −15.186 ± 0.971 | −24.518 ± 14.412 |
| NANPDB1649 | −111.638 ± 18.534 | −4.178 ± 8.634 | 80.033 ± 25.137 | −14.651 ± 1.960 | −50.434 ± 13.538 |
| NANPDB2581 | −110.229 ± 10.366 | −7.999 ± 7.595 | 83.680 ± 20.147 | −14.826 ± 1.181 | −49.374 ± 14.169 |
| NANPDB6446 | −84.419 ± 19.455 | −64.626 ± 32.749 | 125.008 ± 32.850 | −13.141 ± 2.305 | −37.179 ± 17.980 |
| ZINC000095485940 | −91.882 ± 13.394 | −29.485 ± 12.960 | 136.331 ± 31.350 | −14.372 ± 1.704 | 0.593 ± 16.180 |
Figure 8Molecular mechanics Poisson–Boltzmann surface area (MM/PBSA) plot of binding free energy contribution per residue of the LdCRK12-NANPDB1406 complex.