| Literature DB >> 30120601 |
Yuezhou Zhang1,2, Henri Xhaard1,2, Leo Ghemtio3.
Abstract
Betulin derivatives have been proven effective in vitro against Leishmania donovani amastigotes, which cause visceral leishmaniasis. Identifying the molecular targets and molecular mechanisms underlying their action is a currently an unmet challenge. In the present study, we tackle this problem using computational methods to establish properties essential for activity as well as to screen betulin derivatives against potential targets. Recursive partitioning classification methods were explored to develop predictive models for 58 diverse betulin derivatives inhibitors of L. donovani amastigotes. The established models were validated on a testing set, showing excellent performance. Molecular fingerprints FCFP_6 and ALogP were extracted as the physicochemical properties most extensively involved in separating inhibitors from non-inhibitors. The potential targets of betulin derivatives inhibitors were predicted by in silico target fishing using structure-based pharmacophore searching and compound-pharmacophore-target-pathway network analysis, first on PDB and then among L. donovani homologs using a PSI-BLAST search. The essential identified proteins are all related to protein kinase family. Previous research already suggested members of the cyclin-dependent kinase family and MAP kinases as Leishmania potential drug targets. The PSI-BLAST search suggests two L. donovani proteins to be especially attractive as putative betulin target, heat shock protein 83 and membrane transporter D1.Entities:
Keywords: Betulin derivatives; Classification models; In silico target prediction; Leishmania donovani inhibitors; Network analysis; Predictive modeling; Recursive partitioning; Structure-based pharmacophore
Year: 2018 PMID: 30120601 PMCID: PMC6097978 DOI: 10.1186/s13321-018-0291-x
Source DB: PubMed Journal: J Cheminform ISSN: 1758-2946 Impact factor: 5.514
Experimental Leishmanial growth inhibitory activities of betulin derivatives against L. donovani axenic amastigotes
Dataset used for recursive partitioning classification models
| Class 1 (inactive) | Class 2 (moderate active) | Class 3 (active) | Total | |
|---|---|---|---|---|
| Dataset 1 | Activity < 49 | Activity ≥ 49 | ||
| 31 | 0 | 27 | 58 | |
| Dataset 2 | Activity < 36 | 36 ≥ Activity ≤ 69 | Activity > 69 | |
| 22 | 19 | 17 | 58 | |
| Dataset 3 | Activity < 36 | 36 ≥ Activity ≤ 69 | Activity > 69 | |
| 22 | Excluded | 17 | 39 |
Bagged forest models
| Splitting method | Training | Test | ||||
|---|---|---|---|---|---|---|
| Dataset | ROC score | Dataset | ROC score | |||
| In-bag training | Out-of-bag training | |||||
| Dataset 1 | Diversity | 1 (20) | 0.97 | 0.72 | 1 (11) | 0.73 |
| R C | 1 (25) | 0.98 | 0.63 | 1 (6) | 0.73 | |
| Dataset 2 | Diversity | 1 (13) | 0.99 | 0.59 | 1 (9) | 0.87 |
| R C | 1 (20) | 0.97 | 0.71 | 1 (2) | 0.94 | |
| Dataset 3 | Diversity | 1 (13) | 0.96 | 0.58 | 1 (8) | 0.67 |
| R C | 1 (20) | 0.97 | 0.65 | 1 (2) | 0.59 | |
Simple tree models
| Training | Test | |||||
|---|---|---|---|---|---|---|
| Dataset | ROC score | Roc score (cross-validated) | Dataset | ROC score | ||
| Dataset 1 | Diverse | 1 (20) | 0.89 | 0.62 | 1 (11) | 0.71 |
| R C | 1 (25) | 0.91 | 0.62 | 1 (6) | 0.91 | |
| Dataset 2 | Diverse | 1 (13) | 0.91 | 0.63 | 1 (9) | 0.80 |
| R C | 1 (20) | 0.94 | 0.57 | 1 (2) | 0.59 | |
| Dattaset 3 | Diverse | 1 (13) | 0.90 | 0.65 | 1 (8) | 0.70 |
| R C | 1 (20) | 0.76 | 0.54 | 1 (2) | 0.72 | |
Fig. 1a Compounds similarity distance (Tanimoto score). b Distribution of compounds in % inhibition classes for the three datasets
Forest important features used to build the models
| Property attributes |
|---|
| FCFP_6 |
| ALogP |
| Molecular_Weight |
| Num_H_Donors |
| Num_H_Acceptors |
| Num_RotatableBonds |
| Num_Rings |
| Num_AromaticRings |
| Molecular_FractionalPolarSurfaceArea |
Targets selected by the target pharmacophore screening with fit value > 0.9
| ID | Uniprot ID | Gene name | Title | Family |
|---|---|---|---|---|
| 3ac3 | P06239 | LCK_HUMAN | Tyrosine-protein kinase Lck | Protein kinase |
| 3ad4 | P06239 | LCK_HUMAN | Tyrosine-protein kinase Lck | Protein kinase |
| 3mzs | P00189 | CP11A_BOVIN | Cholesterol side-chain cleavage enzyme, mitochondrial | Cytochrome P450 |
| 3my0 | P37023 | ACVL1_HUMAN | Serine/threonine-protein kinase receptor R3 | Protein kinase |
| 1lox | P12530 | LOX15_RABIT | Arachidonate 15-lipoxygenase | Lipoxygenase |
| 1qyx | P14061 | DHB1_HUMAN | Estradiol 17-beta-dehydrogenase 1 | Short chain dehydrogenases/reductases (SDR) |
| 2bik | P11309 | PIM1_HUMAN | Serine/threonine-protein kinase pim-1 | Protein kinase |
| 2br1 | O14757 | CHK1_HUMAN | Serine/threonine-protein kinase Chk1 | Protein kinase |
| 2chw | P48736 | PK3CG_HUMAN | Phosphatidylinositol 4,5-bisphosphate 3-kinase catalytic subunit gamma isoform | PI3/PI4 kinase |
| 2chz | P48736 | PK3CG_HUMAN | Phosphatidylinositol 4,5-bisphosphate 3-kinase catalytic subunit gamma isoform | PI3/PI4 kinase |
| 2p0m | P12530 | LOX15_RABIT | Arachidonate 15-lipoxygenase | Lipoxygenase |
| 2wxq | Q3UDT3 | Q3UDT3_MOUSE | Phosphatidylinositol 4,5-bisphosphate 3-kinase catalytic subunit delta isoform | PI3/PI4 kinase |
| 3ddu | P48147 | PPCE_HUMAN | Prolyl endopeptidase | Peptidase S9A |
| 3h3c | Q14289 | FAK2_HUMAN | Protein-tyrosine kinase 2-beta | Protein kinase |
| 3hrb | Q16539 | MK14_HUMAN | Mitogen-activated protein kinase 14 | Protein kinase |
| 3le6 | P24941 | CDK2_HUMAN | Cyclin-dependent kinase 2 | Protein kinase |
| 3mqe | Q05769 | PGH2_MOUSE | Prostaglandin G/H synthase 2 | Prostaglandin G/H synthase |
| 3p2v | P15121 | ALDR_HUMAN | Aldose reductase | Aldo/keto reductase |
| 3r7r | P48736 | PK3CG_HUMAN | Phosphatidylinositol 4,5-bisphosphate 3-kinase catalytic subunit gamma isoform | PI3/PI4 kinase |
| 3rgz | O22476 | BRI1_ARATH | Protein brassinosteroid insensitive 1 | Protein kinase |
| 3s3g | P15121 | ALDR_HUMAN | Aldose reductase | Aldo/keto reductase |
| 3tfq | P28845 | DHI1_HUMAN | Corticosteroid 11-beta-dehydrogenase isozyme 1 | Short chain dehydrogenases/reductases (SDR) |
| 3ugr | P42330 | AK1C3_HUMAN | Aldo-keto reductase family 1 member C3 | Aldo/keto reductase |
| 3zrl | P49841 | GSK3B_HUMAN | Glycogen synthase kinase-3 beta | Protein kinase |
| 4a79 | P27338 | AOFB_HUMAN | Amine oxidase [flavin-containing] B | Flavin monoamine oxidase |
| 2hpy | P02699 | OPSD_BOVIN | Rhodopsin | G-protein coupled receptor 1 |
| 3gql | P11362 | FGFR1_HUMAN | Fibroblast growth factor receptor 1 | Protein kinase |
| 2ab2 | P08235 | MCR_HUMAN | Mineralocorticoid receptor | Nuclear hormone receptor |
| 3lmp | P37231 | PPARG_HUMAN | Peroxisome proliferator-activated receptor gamma | Nuclear hormone receptor |
| 1pzo | P62593 | BLAT_ECOLX | Beta-lactamase TEM | Class-A beta-lactamase |
| 1zhx | P35844 | KES1_YEAST | Oxysterol-binding protein homolog 4 | Oxysterol-binding protein |
| 1zhz | P35844 | KES1_YEAST | Oxysterol-binding protein homolog 4 | Oxysterol-binding protein |
| 2oxd | P28523 | CSK2A_MAIZE | Casein kinase II subunit alpha | Protein kinase |
| 2x00 | Q8WSF8 | Q8WSF8_APLCA | Soluble acetylcholine receptor | Neurotransmitter-gated ion-channel |
| 3hgy | Q7B8P6 | Q7B8P6_CAMJU | cmeR | Transcriptional regulator |
| 3ov4 | P00947 | SDIS_COMTE | Steroid delta-isomerase | Steroid delta-5-4-isomerase |
| 3s92 | Q15059 | BRD3_HUMAN | Bromodomain-containing protein 3 | Bromodomain and extraterminal domain |
| 3smo | P31947 | 1433S_HUMAN | 14-3-3 protein sigma | 14-3-3 protein |
| 4a01 | O22124 | O22124_VIGRA | Pyrophosphate-energized vacuolar membrane proton pump | H(+)-translocating pyrophosphatase (TC 3.A.10) |
| 4a86 | P15494 | BEV1A_BETPN | Major pollen allergen Bet v 1-A | Bet v I type allergen |
| 4a8v | P43183 | BEV1J_BETPN | Major pollen allergen Bet v 1-J | Bet v I type allergen |
| 2iws | P02829 | HSP82_YEAST | ATP-dependent molecular chaperone HSP82 | Heat shock protein Hsp90 |
| 3b27 | P07900 | HS90A_HUMAN | Heat shock protein HSP 90-alpha | Heat shock protein Hsp90 |
| 2xa4 | O60674 | JAK2_HUMAN | Tyrosine-protein kinase JAK2 | Protein kinase |
| 4a9n | P25440 | BRD2_HUMAN | Bromodomain-containing protein 2 | Bromodomain and extraterminal domain |
| 4alh | P25440 | BRD2_HUMAN | Bromodomain-containing protein 2 | Bromodomain and extraterminal domain |
| 3svg | O60885 | BRD4_HUMAN | Bromodomain-containing protein 4 | Bromodomain and extraterminal domain |
Fig. 2Compounds that fit each pharmacophore with shape fit value > 0.9
Leishmania donovani homologous targets from PSIBLAST search wit evalue < 3
| Uniprot ID | Gene name | Title | %Identity | Evalue | Bit score | Query |
|---|---|---|---|---|---|---|
| P27890 | HSP83_LEIDO | Heat shock protein 83 | 61.76 | 0 | 568 | P07900|HS90A_HUMAN |
| Q01440 | GTR1_LEIDO | Membrane transporter D1 | 34.83 | 0.004 | 30.8 | P21616|AVP_VIGRR |
| A4ZZ93 | DHYSL_LEIDO | Inactive deoxyhypusine synthase | 19.70 | 0.042 | 26.2 | P42330|AK1C3_HUMAN |
| P39050 | TYTR_LEIDO | Trypanothione reductase | 64.71 | 0.051 | 27.7 | P11362|FGFR1_HUMAN |
| E9BDA8 | GMPR_LEIDB | GMP reductase | 25.64 | 0.073 | 25.4 | P24941|CDK2_HUMAN |
| P43151 | GPA_LEIDO | Putative guanine nucleotide-binding protein subunit alpha | 27.03 | 0.099 | 25.4 | P49841|GSK3B_HUMAN |
| P17804 | HSP70_LEIDO | Heat shock 70 kDa protein | 33.33 | 0.24 | 23.9 | P62593|BLAT_ECOLX |
| P36889 | SAHH_LEIDO | Adenosylhomocysteinase | 26.72 | 0.24 | 23.5 | P28845|DHI1_HUMAN |
| A7LBL2 | PURA_LEIDO | Adenylosuccinate synthetase | 27.66 | 0.27 | 25 | Q15059|BRD3_HUMAN |
| Q27675 | CYAA_LEIDO | Receptor-type adenylate cyclase A | 29.73 | 0.31 | 21.6 | P00947|SDIS_COMTE |
| P23223 | GP63_LEIDO | Leishmanolysin | 37.84 | 0.33 | 24.3 | P37231|PPARG_HUMAN |
| Q25263 | CYAB_LEIDO | Receptor-type adenylate cyclase B | 28.36 | 0.52 | 24.6 | Q3UDT3|Q3UDT3_MOUSE |
| P21620 | IMDH_LEIDO | Inosine-5’-monophosphate dehydrogenase | 47.37 | 0.57 | 22.7 | P15121|ALDR_HUMAN |
| Q9BIC6 | PFKA_LEIDO | ATP-dependent 6-phosphofructokinase | 28.95 | 0.59 | 22.7 | P14061|DHB1_HUMAN |
| Q05889 | LPG1_LEIDO | Galactofuranosyl glycosyltransferase | 28.07 | 0.69 | 24.6 | O60885|BRD4_HUMAN |
| D9IFD5 | DOHH_LEIDO | Deoxyhypusine hydroxylase | 38.46 | 0.71 | 23.1 | P37023|ACVL1_HUMAN |
| Q01441 | GTR2_LEIDO | Membrane transporter D2 | 38.46 | 0.79 | 23.1 | P27338|AOFB_HUMAN |
| P12522 | ATXB_LEIDO | Probable proton ATPase 1B | 25.69 | 1.1 | 22.3 | P35844|KES1_YEAST |
| P11718 | ATXA_LEIDO | Probable proton ATPase 1A | 25.23 | 1.2 | 22.3 | P35844|KES1_YEAST |
| O00874 | DPOLA_LEIDO | DNA polymerase alpha catalytic subunit | 22.22 | 1.2 | 23.1 | P48147|PPCE_HUMAN |
| P55905 | COQ5_LEIDO | 2-methoxy-6-polyprenyl-1,4-benzoquinol methylase, mitochondrial | 41.18 | 1.5 | 22.3 | P02829|HSP82_YEAST |
| Q05885 | ARD1_LEIDO | N-terminal acetyltransferase complex ARD1 subunit homolog | 32.56 | 1.6 | 21.9 | P02829|HSP82_YEAST |
| P12522 | ATXB_LEIDO | Probable proton ATPase 1B | 50 | 1.8 | 21.6 | P35844|KES1_YEAST |
| P27116 | DCOR_LEIDO | Ornithine decarboxylase | 29.55 | 2.2 | 22.7 | P48736|PK3CG_HUMAN |
| Q36736 | KM11_LEIDO | Kinetoplastid membrane protein 11 | 27.59 | 2.6 | 19.2 | P31947|1433S_HUMAN |
| Q25264 | DCAM_LEIDO | S-adenosylmethionine decarboxylase proenzyme | 42.11 | 2.7 | 21.6 | Q15059|BRD3_HUMAN |
| B5APK2 | DHYS_LEIDO | Deoxyhypusine synthase | 36.36 | 2.9 | 20.4 | P28845|DHI1_HUMAN |
Fig. 3Subset of the pharmacological network of L. donovani Betulin derivatives inhibitors limited to the hubs of the network for target prediction. Betulin derivatives inhibitors, pharmacophore, and targets with a red to gray gradient scale
Fig. 4Nodes degree distribution in the pharmacological Betulin derivatives inhibitors network. a Degree distribution of Betulin derivatives inhibitor nodes, b degree distribution of target nodes, c degree distribution of pathway nodes