Literature DB >> 30120601

Predictive classification models and targets identification for betulin derivatives as Leishmania donovani inhibitors.

Yuezhou Zhang1,2, Henri Xhaard1,2, Leo Ghemtio3.   

Abstract

Betulin derivatives have been proven effective in vitro against Leishmania donovani amastigotes, which cause visceral leishmaniasis. Identifying the molecular targets and molecular mechanisms underlying their action is a currently an unmet challenge. In the present study, we tackle this problem using computational methods to establish properties essential for activity as well as to screen betulin derivatives against potential targets. Recursive partitioning classification methods were explored to develop predictive models for 58 diverse betulin derivatives inhibitors of L. donovani amastigotes. The established models were validated on a testing set, showing excellent performance. Molecular fingerprints FCFP_6 and ALogP were extracted as the physicochemical properties most extensively involved in separating inhibitors from non-inhibitors. The potential targets of betulin derivatives inhibitors were predicted by in silico target fishing using structure-based pharmacophore searching and compound-pharmacophore-target-pathway network analysis, first on PDB and then among L. donovani homologs using a PSI-BLAST search. The essential identified proteins are all related to protein kinase family. Previous research already suggested members of the cyclin-dependent kinase family and MAP kinases as Leishmania potential drug targets. The PSI-BLAST search suggests two L. donovani proteins to be especially attractive as putative betulin target, heat shock protein 83 and membrane transporter D1.

Entities:  

Keywords:  Betulin derivatives; Classification models; In silico target prediction; Leishmania donovani inhibitors; Network analysis; Predictive modeling; Recursive partitioning; Structure-based pharmacophore

Year:  2018        PMID: 30120601      PMCID: PMC6097978          DOI: 10.1186/s13321-018-0291-x

Source DB:  PubMed          Journal:  J Cheminform        ISSN: 1758-2946            Impact factor:   5.514


Background

Leishmaniasis is a neglected tropical disease caused by Leishmania protozoan parasites that affect millions of people worldwide [1-3]. During the past decade, leishmaniasis has spread considerably, and an increasing number of new cases are being reported every year [3]. Several treatments exist for leishmaniasis [4], but they are not fully active, have adverse effects, loss of efficacy and are highly expensive [5]. Hence, there is an urgent need to develop new, safe and effective medications. Betulin derivatives have a significant in vitro inhibition growth of L. donovani amastigotes, which cause visceral leishmaniasis, the most severe form of the disease [6, 7]. Betulinic acid and other betulin derivatives have furthermore remarkable antiviral [8-11], anti-HIV [12], antiulcer [13], anti-inflammatory [14, 15], anti-malaria [16, 17] and anti-tumoral [18-20] activity that make this class of compounds promising for new drugs discovery [21-24]. Structure–activity relationships and pharmacological properties of betulin have been studied previously [25-29]. Recently, our collaborators have synthesized 58 betulin heterocyclic derivatives and evaluated their activity and selectivity against L. donovani amastigotes with similar or better inhibitory activity (> 80%) than some well-known antibiotics (Nystatin, Pentamycin, Amphotericin) [6, 30, 31]. Computational methods such as QSAR [32] and pharmacophore modeling [33] are important methods in modern drug discovery that have been successfully applied for modeling activities of betulin derivatives [34-42]. However, the congeneric series are still limited, and the mechanism of action of these compounds are still undefined. To date, very few computational studies and models have been done on Betulin derivatives to explore the full potential of this class of compounds, with one derivatives in clinical phase 3 (Oleogel-S10), and accelerate the understanding of their mode of action. In the present study, we report an application of classification method, recursive partitioning (RP) to build predictive models of the inhibitory activity of betulin derivatives and characterize their molecular properties. RP models can select essential molecular descriptors according to the decrease of the performance resulting from the random permutation of the variables. Also, we investigated the compound-target interaction network and potential pharmacological actions by reverse pharmacophore database screening. Although it can be to some extent debated [43], it is commonly accepted that structurally similar compounds have similar biological activity [44] and may also recognize homologous targets across organisms [45]. This concept spurs us to assume the proteins interacting with compounds that are similar to betulin derivatives in the structure are potential binding targets as well. We thus screened potent betulin inhibitors of Leishmania growth against PharmaDB [46], a database containing a collection of pharmacophores model built from protein-ligand complexes, to identify possible targets.

Materials and methods

Compounds and biological data

The molecular structures and biological data used in this study, 58 betulin derivatives synthesized by the Yli-Kauhaluoma group, were retrieved from references [6, 30, 31] (Table 1). The biological activities are reported as the percentage inhibition of L. donovani axenic amastigotes growth at 50 μM concentrations. Three datasets were generated, and the compounds were categorized in different classes depending on their % of inhibition (%I) in three different ways (Table 2). Dataset 1, the compounds were divided into two classes as active (%I ≥ 49) and inactive (%I < 49). Dataset 2, the compounds were divided into three classes as active (%I > 69), moderate active (%I ≥ 36 et ≤ 69) and inactive (%I < 36). Dataset 3, is similar to Dataset 2 but the group of moderately active compounds, considered as an uncertainty buffer, is not used.
Table 1

Experimental Leishmanial growth inhibitory activities of betulin derivatives against L. donovani axenic amastigotes

Table 2

Dataset used for recursive partitioning classification models

Class 1 (inactive)Class 2 (moderate active)Class 3 (active)Total
Dataset 1Activity < 49Activity ≥ 49
3102758
Dataset 2Activity < 3636 ≥ Activity ≤ 69Activity > 69
22191758
Dataset 3Activity < 3636 ≥ Activity ≤ 69Activity > 69
22Excluded1739
Experimental Leishmanial growth inhibitory activities of betulin derivatives against L. donovani axenic amastigotes Dataset used for recursive partitioning classification models

Generating the molecular structures and conformational analysis

The skeleton of betulin derivatives was drawn using ChemBioDraw Ultra 12.0, assigning hydrogen atoms with Maestro 9.6 (Schrödinger). After that, the dataset was prepared by Discovery Studio 4.5 (Accelrys Inc.) (DS 4.5). Partial charges of structures were calculated based on the CHARMm force field. Full minimization was run with the Smart minimizer algorithm until root mean square gradient was 0.01 and maximum 2000 steps. No implicit solvent model was included.

Recursive partitioning (RP) models

RP is a classification method for multivariable data analysis. It creates a decision tree to correctly classify and uncover relationships between members of the dataset based on a dichotomous splitting of a dependent property, in our case compounds properties and their %I. RP analysis was carried out using DS 4.5 to develop decision trees that categorize the compounds into two and three classes based on the % inhibition. RP single tree (ST) models and multi-tree bagged forest (BF) models made up of multiple trees were used. Both ST and BF models are particularly appropriate in case of imbalanced training data and are easily interpretable, while also providing a significant degree of predictive accuracy [47-50]. For both methods, a training set was used to build the decision trees, and a test set was utilized to evaluate the predictive power of the models. Using two splitting methods, we generated two training and test sets from each of the three datasets (see Tables 3 and 4). The first method (split by diversity) assigns a diverse subset of compounds to the training and test set. The second way (random per cluster) cluster the compounds by similarity and then randomly assigned compounds from each group between the training and test set. Both methods use 2D fingerprint molecular descriptors and a proportion of 70% data for the training versus 30% for the test set.
Table 3

Bagged forest models

Splitting methodTrainingTest
DatasetROC scoreDatasetROC score
In-bag trainingOut-of-bag training
Dataset 1(2 Class)Diversity1 (20)2 (21)0.970.721 (11)2 (6)0.73
R C1 (25)2 (17)0.980.631 (6)2 (10)0.73
Dataset 2(2 Class from 3)Diversity1 (13)2 (13)0.990.591 (9)2 (4)0.87
R C1 (20)2 (9)0.970.711 (2)2 (8)0.94
Dataset 3(3 Class)Diversity1 (13)2 (15)3 (13)0.960.581 (8)2 (4)3 (4)0.67
R C1 (20)2 (13)3 (9)0.970.651 (2)2 (6)3 (8)0.59
Table 4

Simple tree models

TrainingTest
DatasetROC scoreRoc score (cross-validated)DatasetROC score
Dataset 1(2 Class)Diverse1 (20)2 (21)0.890.621 (11)2 (6)0.71
R C1 (25)2 (17)0.910.621 (6)2 (10)0.91
Dataset 2(2 Class from 3)Diverse1 (13)2 (13)0.910.631 (9)2 (4)0.80
R C1 (20)3 (9)0.940.571 (2)3 (8)0.59
Dattaset 3(3 Class)Diverse1 (13)2 (15)3 (13)0.900.651 (8)2 (4)3 (4)0.70
R C1 (20)2 (13)3 (9)0.760.541 (2)2 (6)3 (8)0.72
Bagged forest models Simple tree models BF has a relatively small number of trees (10) generated using a separate bootstrap sample of the original data for each tree. All descriptors are considered as possible splitting criteria for each node and weighting method is set to “by class” by default, to compensate for imbalanced data. All others parameters were set to default. BF can measure how each descriptor contributes to the prediction accuracy in the course of training. We estimated the predictive ability of the ST models with five fold cross-validation and BF models using out-of bag statistics. For BF, in each bootstrap training set, around one-third of the instances are left out, constituting the out-of-bag sample. The test set was used to estimate the fitting ability of the ST and RF models on a new dataset that was not used in the model construction. The performance of the ST and BF models are based on three metrics: true positive rate (recall or sensitivity), specificity, and the area under the curve (AUC) of the receiver operating characteristics (ROC) plot. AUC or ROC score represents the probability that a classifier will be estimated correctly, with values 0.5 indicating better than random prediction and 1 signifying perfect prediction [51].

Target fishing

By screening a compound against a panel of pharmacophore models derived from multiple pharmacological targets, the potential targets of the compound can be outlined. Automated ligand profiling available in DS 4.5 so-called “Ligand Profiler” protocol was used [52]. DS 4.5 is equipped with a pharmacophore database PharmaDB that is the largest ever-reported collection of structure-based pharmacophores, 68,056 entries from 8166 protein-ligand X-ray structures [46, 53, 54]. These pharmacophores are derived from the sc-PDB dataset, a collection of 3D structures of binding sites found in the Protein Data Bank. For most actives betulin derivatives, all the pharmacophore models from PharmaDB were selected for the virtual screening with default settings. The rigid mode was used as the molecular mapping algorithm. No molecular features were allowed to be missed while mapping these compounds to the pharmacophore models to increase selectivity. The minimal inter-feature distance was set at 0.5 Å. For each target, the name and pathway information was collected from ChEMBL [55] and WikiPathways [56] databases using KNIME [57] version 3.1.2. Compound-Target-Pathway networks were generated by Cytoscape 3.0 (Cytoscape Consortium, USA) [58] where network nodes illustrate compounds, targets, and biological pathways. The edges linking the compound-target and target-pathway describe their relationships. Position-Specific Iterated BLAST (PSI-BLAST) search is done to identify the homologous protein in L. donovani from the selected target as the query sequence [59].

Results

Structural diversity analysis, RP (ST/BF) model development and interpretation

The robustness and efficiency of classification models are usually affected by the diversity of dataset used for modeling, with the effect that the more diverse are the compounds, the broader will be the applicability of the model. The dissimilarity between any two molecules was computed using a Tanimoto coefficient. In this study, the average fingerprint distance for the dataset of 58 betulin derivatives inhibitors is 0.7 with a minimum of 0.12 and maximum at 0.9. Figure 1a shows a broad range of diversity across compounds. Also, the data set has an average molecular property distance of 1.33, minimum at 0.067 and maximum at 2.75 which shows good structural and property diversity of the dataset. Two different methods, diversity and random per cluster were used to split the dataset into test and training sets. (see Tables 3 and 4) Different inhibitory classes with varying distributions of training-test where thus created (Fig. 1b).
Fig. 1

a Compounds similarity distance (Tanimoto score). b Distribution of compounds in % inhibition classes for the three datasets

a Compounds similarity distance (Tanimoto score). b Distribution of compounds in % inhibition classes for the three datasets Detailed result of ST and BF models are reported Tables 3 and 4 respectively. The ST and RF models performance are comparable. As shown, BF was able to find predictive models from dataset 2 with both splitting method. The ROC score for the in-bag training data for all trees in the forest model is 0.99 and 0.96, and the out-of-bag ROC score is 0.59 and 0.71 for the training set. The in-bag results are predictions for the data used to train the tree, while the out-of-bag results are predictions for the left-out data. The external test sets including 13 and 10 compounds respectively were used to evaluate the predictive ability of the two models. The ROC score on external test sets is good, 0.87 and 0.94 respectively. The confusion matrix, as well as sensitivity and specificity values, are presented in Additional file 1: Table S1, Additional file 2: Table S2. In the betulin derivatives inhibitors models, RF and ST method can correctly classify most of the molecules of the external test set. These outcomes indicate that the developed ST and RF models show favorable and robust prediction performance. The Y-randomization test was performed four times, and the AUC values for the model using the data set with experimental activity values were significantly higher than those obtained from the dataset with randomized values, indicating the robustness of our models. The most suitable sets of molecular descriptors for predicting Betulin derivatives inhibitors were extracted from the RF prediction models via feature selection. A summary of descriptors based on their frequency of occurrences in the models are given in Table 5. The FCFP_6 feature, number aromatic rings, number rings, molecular fractional polar surface area, molecular weight, number rotatable bonds are predominant in all models. In general, the frequency at which a descriptor was selected empirically appears to distinguish truly important descriptors from others best. In the RF models of betulin derivatives inhibitors, FCFP_6 feature, number aromatic rings are the most critical descriptors for classification.
Table 5

Forest important features used to build the models

Property attributes
FCFP_6
ALogP
Molecular_Weight
Num_H_Donors
Num_H_Acceptors
Num_RotatableBonds
Num_Rings
Num_AromaticRings
Molecular_FractionalPolarSurfaceArea
Forest important features used to build the models

Profiling results

The profiling results from 13 most actives compounds are presented in Table 6. The fit value was used to measure the fitness of the ligand and pharmacophore. A fit value equal or higher to 0.9 was used as a threshold to select targets from the activity profiler result (see Fig. 2). The 13 compounds mapped 47 pharmacophores models out of a total of 68,056 models with a rigid mapping and the presence of all molecular features required. These models belonged to 32 protein targets and were involved in 184 pathways. Protein sequences of all the predicted targets were collected, and a blast search was run on NCBI server to identify homolog in L. donovani (Table 7).
Table 6

Targets selected by the target pharmacophore screening with fit value > 0.9

IDUniprot IDGene nameTitleFamily
3ac3P06239LCK_HUMANTyrosine-protein kinase LckProtein kinase
3ad4P06239LCK_HUMANTyrosine-protein kinase LckProtein kinase
3mzsP00189CP11A_BOVINCholesterol side-chain cleavage enzyme, mitochondrialCytochrome P450
3my0P37023ACVL1_HUMANSerine/threonine-protein kinase receptor R3Protein kinase
1loxP12530LOX15_RABITArachidonate 15-lipoxygenaseLipoxygenase
1qyxP14061DHB1_HUMANEstradiol 17-beta-dehydrogenase 1Short chain dehydrogenases/reductases (SDR)
2bikP11309PIM1_HUMANSerine/threonine-protein kinase pim-1Protein kinase
2br1O14757CHK1_HUMANSerine/threonine-protein kinase Chk1Protein kinase
2chwP48736PK3CG_HUMANPhosphatidylinositol 4,5-bisphosphate 3-kinase catalytic subunit gamma isoformPI3/PI4 kinase
2chzP48736PK3CG_HUMANPhosphatidylinositol 4,5-bisphosphate 3-kinase catalytic subunit gamma isoformPI3/PI4 kinase
2p0mP12530LOX15_RABITArachidonate 15-lipoxygenaseLipoxygenase
2wxqQ3UDT3Q3UDT3_MOUSEPhosphatidylinositol 4,5-bisphosphate 3-kinase catalytic subunit delta isoformPI3/PI4 kinase
3dduP48147PPCE_HUMANProlyl endopeptidasePeptidase S9A
3h3cQ14289FAK2_HUMANProtein-tyrosine kinase 2-betaProtein kinase
3hrbQ16539MK14_HUMANMitogen-activated protein kinase 14Protein kinase
3le6P24941CDK2_HUMANCyclin-dependent kinase 2Protein kinase
3mqeQ05769PGH2_MOUSEProstaglandin G/H synthase 2Prostaglandin G/H synthase
3p2vP15121ALDR_HUMANAldose reductaseAldo/keto reductase
3r7rP48736PK3CG_HUMANPhosphatidylinositol 4,5-bisphosphate 3-kinase catalytic subunit gamma isoformPI3/PI4 kinase
3rgzO22476BRI1_ARATHProtein brassinosteroid insensitive 1Protein kinase
3s3gP15121ALDR_HUMANAldose reductaseAldo/keto reductase
3tfqP28845DHI1_HUMANCorticosteroid 11-beta-dehydrogenase isozyme 1Short chain dehydrogenases/reductases (SDR)
3ugrP42330AK1C3_HUMANAldo-keto reductase family 1 member C3Aldo/keto reductase
3zrlP49841GSK3B_HUMANGlycogen synthase kinase-3 betaProtein kinase
4a79P27338AOFB_HUMANAmine oxidase [flavin-containing] BFlavin monoamine oxidase
2hpyP02699OPSD_BOVINRhodopsinG-protein coupled receptor 1
3gqlP11362FGFR1_HUMANFibroblast growth factor receptor 1Protein kinase
2ab2P08235MCR_HUMANMineralocorticoid receptorNuclear hormone receptor
3lmpP37231PPARG_HUMANPeroxisome proliferator-activated receptor gammaNuclear hormone receptor
1pzoP62593BLAT_ECOLXBeta-lactamase TEMClass-A beta-lactamase
1zhxP35844KES1_YEASTOxysterol-binding protein homolog 4Oxysterol-binding protein
1zhzP35844KES1_YEASTOxysterol-binding protein homolog 4Oxysterol-binding protein
2oxdP28523CSK2A_MAIZECasein kinase II subunit alphaProtein kinase
2x00Q8WSF8Q8WSF8_APLCASoluble acetylcholine receptorNeurotransmitter-gated ion-channel
3hgyQ7B8P6Q7B8P6_CAMJUcmeRTranscriptional regulator
3ov4P00947SDIS_COMTESteroid delta-isomeraseSteroid delta-5-4-isomerase
3s92Q15059BRD3_HUMANBromodomain-containing protein 3Bromodomain and extraterminal domain
3smoP319471433S_HUMAN14-3-3 protein sigma14-3-3 protein
4a01O22124O22124_VIGRAPyrophosphate-energized vacuolar membrane proton pumpH(+)-translocating pyrophosphatase (TC 3.A.10)
4a86P15494BEV1A_BETPNMajor pollen allergen Bet v 1-ABet v I type allergen
4a8vP43183BEV1J_BETPNMajor pollen allergen Bet v 1-JBet v I type allergen
2iwsP02829HSP82_YEASTATP-dependent molecular chaperone HSP82Heat shock protein Hsp90
3b27P07900HS90A_HUMANHeat shock protein HSP 90-alphaHeat shock protein Hsp90
2xa4O60674JAK2_HUMANTyrosine-protein kinase JAK2Protein kinase
4a9nP25440BRD2_HUMANBromodomain-containing protein 2Bromodomain and extraterminal domain
4alhP25440BRD2_HUMANBromodomain-containing protein 2Bromodomain and extraterminal domain
3svgO60885BRD4_HUMANBromodomain-containing protein 4Bromodomain and extraterminal domain
Fig. 2

Compounds that fit each pharmacophore with shape fit value > 0.9

Table 7

Leishmania donovani homologous targets from PSIBLAST search wit evalue < 3

Uniprot IDGene nameTitle%IdentityEvalueBit scoreQuery
P27890HSP83_LEIDOHeat shock protein 8361.760568P07900|HS90A_HUMAN
Q01440GTR1_LEIDOMembrane transporter D134.830.00430.8P21616|AVP_VIGRR
A4ZZ93DHYSL_LEIDOInactive deoxyhypusine synthase19.700.04226.2P42330|AK1C3_HUMAN
P39050TYTR_LEIDOTrypanothione reductase64.710.05127.7P11362|FGFR1_HUMAN
E9BDA8GMPR_LEIDBGMP reductase25.640.07325.4P24941|CDK2_HUMAN
P43151GPA_LEIDOPutative guanine nucleotide-binding protein subunit alpha27.030.09925.4P49841|GSK3B_HUMAN
P17804HSP70_LEIDOHeat shock 70 kDa protein33.330.2423.9P62593|BLAT_ECOLX
P36889SAHH_LEIDOAdenosylhomocysteinase26.720.2423.5P28845|DHI1_HUMAN
A7LBL2PURA_LEIDOAdenylosuccinate synthetase27.660.2725Q15059|BRD3_HUMAN
Q27675CYAA_LEIDOReceptor-type adenylate cyclase A29.730.3121.6P00947|SDIS_COMTE
P23223GP63_LEIDOLeishmanolysin37.840.3324.3P37231|PPARG_HUMAN
Q25263CYAB_LEIDOReceptor-type adenylate cyclase B28.360.5224.6Q3UDT3|Q3UDT3_MOUSE
P21620IMDH_LEIDOInosine-5’-monophosphate dehydrogenase47.370.5722.7P15121|ALDR_HUMAN
Q9BIC6PFKA_LEIDOATP-dependent 6-phosphofructokinase28.950.5922.7P14061|DHB1_HUMAN
Q05889LPG1_LEIDOGalactofuranosyl glycosyltransferase28.070.6924.6O60885|BRD4_HUMAN
D9IFD5DOHH_LEIDODeoxyhypusine hydroxylase38.460.7123.1P37023|ACVL1_HUMAN
Q01441GTR2_LEIDOMembrane transporter D238.460.7923.1P27338|AOFB_HUMAN
P12522ATXB_LEIDOProbable proton ATPase 1B25.691.122.3P35844|KES1_YEAST
P11718ATXA_LEIDOProbable proton ATPase 1A25.231.222.3P35844|KES1_YEAST
O00874DPOLA_LEIDODNA polymerase alpha catalytic subunit22.221.223.1P48147|PPCE_HUMAN
P55905COQ5_LEIDO2-methoxy-6-polyprenyl-1,4-benzoquinol methylase, mitochondrial41.181.522.3P02829|HSP82_YEAST
Q05885ARD1_LEIDON-terminal acetyltransferase complex ARD1 subunit homolog32.561.621.9P02829|HSP82_YEAST
P12522ATXB_LEIDOProbable proton ATPase 1B501.821.6P35844|KES1_YEAST
P27116DCOR_LEIDOOrnithine decarboxylase29.552.222.7P48736|PK3CG_HUMAN
Q36736KM11_LEIDOKinetoplastid membrane protein 1127.592.619.2P31947|1433S_HUMAN
Q25264DCAM_LEIDOS-adenosylmethionine decarboxylase proenzyme42.112.721.6Q15059|BRD3_HUMAN
B5APK2DHYS_LEIDODeoxyhypusine synthase36.362.920.4P28845|DHI1_HUMAN
Targets selected by the target pharmacophore screening with fit value > 0.9 Compounds that fit each pharmacophore with shape fit value > 0.9 Leishmania donovani homologous targets from PSIBLAST search wit evalue < 3

Pharmacological network analysis

A topological analysis of the network pharmacology compound-pharmacophore-target-pathway offered insights into the biologically relevant connectivity patterns, and profoundly essential targets or pathways. A general overview of the global topological properties of the network was obtained from the statistical data by the Network Analyzer of Cytoscape. The full pharmacological network of L. donovani betulin derivatives inhibitors had three types of nodes, compounds, pharmacophores, and targets with related pathway information (Additional file 3: Fig. S1). The 13 compounds nodes formed the core of the network which fit 47 pharmacophores and was surrounded by the target nodes. Each target was linked to at least one pathway. A total of 209 pathway nodes constituted the outer layer of the network. Most pharmacophores were the center of a sub network-shaped connection. For seven targets, no pathway was identified. Three pharmacophores are involved in a little number of pathways, between 2 and 3 for each proposed target. Six pharmacophores formed a closed network of 2–4 pathways for each target. Pharmacophores, targets, and pathways were strongly interconnected in many-to-many relationships. Figure 3 presents a subset of the pharmacological network of L. donovani betulin derivatives inhibitors limited to its most connected compounds and targets nodes. The diameter of the network was 10, the centralization was 0.18, and the density was 0.011. To reduce the number of candidate targets and identify more potential targets based on targets identified from network pharmacology, the degrees distribution of all the alkaloids (Fig. 4a) and essential targets (Fig. 4b) were investigated. The compounds with higher degree values (≥ 9), such as 1, 3, 4, 5, 6, 7 and 8, that participate in more interactions than the other components are the hubs in the network. The target degree values ranged between 1 and 50. The targets with the highest degree (≥ 10) values are MAP kinase p38 alpha (50), Glycogen synthase kinase-3 beta (36), Cyclin-dependent kinase 2 (29), Tyrosine-protein kinase JAK2 (27), Heat shock protein HSP 90-alpha (23), PI3-kinase p110-gamma subunit (17), Tyrosine-protein kinase LCK (14), Tyrosine-protein kinase 2 beta (12), Serine/threonine-protein kinase Chk1(11) and 14-3-3 protein sigma (10). The highly connected nodes are referred to as the hubs of the network for target prediction. To find the relations between target proteins and the critical pathway further, we analyzed the target-pathway network. Logically, the weight of one pathway which contains many druggable target proteins is more significant than for many pathways including a single target protein that can be actioned by many drug molecules. The critical pathways (highest degree level) are summarized in Fig. 4c. These results suggested that B Cell Receptor Signaling, Brain-derived neurotrophic factor (BDNF) signaling, Integrated Pancreatic Cancer, Oncostatin M Signaling pathways may bind compounds with pharmacophoric similarities to betulin derivatives. Homologous targets were identified in L. donovani from the PSI-BLAST search as the potential target of Betulin derivatives. Table 7 shows a summary of L. donovani homologous targets with E-value < 3. A total of 27 proteins selected as similar to one or more targets identified by target fishing.
Fig. 3

Subset of the pharmacological network of L. donovani Betulin derivatives inhibitors limited to the hubs of the network for target prediction. Betulin derivatives inhibitors, pharmacophore, and targets with a red to gray gradient scale

Fig. 4

Nodes degree distribution in the pharmacological Betulin derivatives inhibitors network. a Degree distribution of Betulin derivatives inhibitor nodes, b degree distribution of target nodes, c degree distribution of pathway nodes

Subset of the pharmacological network of L. donovani Betulin derivatives inhibitors limited to the hubs of the network for target prediction. Betulin derivatives inhibitors, pharmacophore, and targets with a red to gray gradient scale Nodes degree distribution in the pharmacological Betulin derivatives inhibitors network. a Degree distribution of Betulin derivatives inhibitor nodes, b degree distribution of target nodes, c degree distribution of pathway nodes

Discussion

It is well known that the unknown targets and underlying mechanisms restrict the development of novel therapeutics against Leishmania. In silico predictive modeling offer new tools to overcome these shortages. However, many existing methodologies offers complex predictive models and relative applicability by the experimental chemist. To increase the utility, we proposed classification models and compounds-target-pathway interaction network to predict Leishmania activity of new compounds and discern the targets and potential pathways from a set of betulin derivatives active in vitro against L. Donovani. We successfully build two type of recursive partitioning classification models, single tree and bagged forest models. A forest model is less directly interpretable than a single-tree model in that there is not merely one tree to interpret, but depending on the type of forest, anywhere from tens to hundreds of trees. On the other hand, a forest model provides statistical measures of the relative importance of the various descriptors in distinguishing among the different classes, which is not available with single-tree models. When none of the descriptors is strongly correlated with the class membership, single-tree models can be brittle, in that a relatively small change in the training data results in a significant difference in the structure of the tree, and thus in the tree’s predictions. A forest model helps to address this problem. In principle, Network analysis has the potential to allow the target identification of L. donovani betulin derivatives inhibitors. The proteins in the hubs of the network (highly connected nodes) are highly associated with each other. The most critical proteins with high degree value are all related to protein kinase family. Among them, MAP kinase p38 alpha, Glycogen synthase kinase-3 beta, Cyclin-dependent kinase 2, Tyrosine-protein kinase JAK2, Heat shock protein HSP 90-alpha, PI3-kinase p110-gamma subunit, Tyrosine-protein kinase LCK, Protein tyrosine kinase 2 beta, Serine/threonine-protein kinase Chk and 14-3-3 protein sigma. They are involved in directing cellular responses to a diverse array of stimuli (such as mitogens, heat shock, and pro-inflammatory cytokines) and regulate proliferation, gene expression, mitosis, cell survival, apoptosis and many other cell functions [60]. The mode of action of these critical proteins may be done through the integrated biological network rather than by individual target. The four central pathways, B Cell Receptor, Brain-derived neurotrophic (BDNF), Integrated Pancreatic Cancer and Oncostatin M, have higher frequencies than the rest. Members of the cyclin-dependent kinase family and MAP kinases had been previously identified as essential for Leishmania and suggested as potential drug targets [61]. Homologous targets, Heat shock protein 83 and Membrane transporter D1 were identified as a possible target in L. donovani and proposed for experimental validation. Among the chaperones, heat shock protein 83 (Hsp83) is alternately referred to as Hsp90 or Hsp86 due to the variable molecular weight amongst different orthologues is a family of emerging targets for infectious diseases. Hsp83 is best known as cancer targets with some drug candidates in clinical development [62, 63]. Transporters are proteins that play a role in bringing small molecules across biological membranes. The function of transporters as therapeutic targets is a well-established new field of research [64]. Transporters are new therapeutic targets for treating rare diseases. But there is no, till today, a case of exploration of Hsp83 or Membrane transporter D1 as a drug target in L. donovani. The results offer the opportunity to characterize the chemical sensitivity of the parasitic chaperone and Membrane transporter D1 against our library of Betulin derivatives L. donovani inhibitors with biophysical and biochemical techniques.

Conclusion

In this study, Recursive partitioning (both ST and BF) methods were firstly used to develop classification models for the inhibitory activity of 58 betulin derivatives in vitro against L. donovani amastigotes. These models can be used to screen a large compound library for facilitating the discovery of the novel lead compounds. Most relevant molecular features of betulin derivative inhibition were identified. These features provide an excellent analytical perspective to explain the similarities and differences between betulin derivative inhibitors and non-inhibitors. The potential targets of these compounds were determined through in silico target fishing, which combines 3D structure-based pharmacophore searching and network pharmacology analysis. Using this strategy, we inferred links between most active compounds and Leishmaniasis disease through molecular targets and keys signaling pathways. Further studies need to validate identified targets and to test the effects of betulin derivatives on identified pathways and their interactions (Additional file 4: Fig. S2, Additional file 5). Additional file 1: Table S1. Bagged forest confusion matrix. Additional file 2: Table S2. Single tree confusion matrix. Additional file 3: Fig. S1. Full pharmacological network of Leishmania donovani Betulin derivatives inhibitors. Betulin derivatives inhibitors, pharmacophore, targets and biopathway with a red to gray gradient scale. Additional file 4: Fig. S2. Superimposition of potential protein target structures. Additional file 5. Zip file with all potential protein target structures protein data bank files.
  54 in total

1.  Do structurally similar molecules have similar biological activity?

Authors:  Yvonne C Martin; James L Kofron; Linda M Traphagen
Journal:  J Med Chem       Date:  2002-09-12       Impact factor: 7.446

2.  Protein-ligand-based pharmacophores: generation and utility assessment in computational ligand profiling.

Authors:  Jamel Meslamani; Jiabo Li; Jon Sutter; Adrian Stevens; Hugues-Olivier Bertrand; Didier Rognan
Journal:  J Chem Inf Model       Date:  2012-04-11       Impact factor: 4.956

3.  3D pharmacophores as tools for activity profiling.

Authors:  Daniela Schuster
Journal:  Drug Discov Today Technol       Date:  2010

4.  Betulin Derivatives Effectively Suppress Inflammation in Vitro and in Vivo.

Authors:  Mirka Laavola; Raisa Haavikko; Mari Hämäläinen; Tiina Leppänen; Riina Nieminen; Sami Alakurtti; Vânia M Moreira; Jari Yli-Kauhaluoma; Eeva Moilanen
Journal:  J Nat Prod       Date:  2016-01-14       Impact factor: 4.050

Review 5.  Structure–activity relationship of terpenes with anti-inflammatory profile – a systematic review.

Authors:  Marilia Trindade de Santana Souza; Jackson Roberto Guedes da Silva Almeida; Adriano Antunes de Souza Araujo; Marcelo Cavalcante Duarte; Daniel Pens Gelain; José Cláudio Fonseca Moreira; Marcio Roberto Viana dos Santos; Lucindo José Quintans-Júnior
Journal:  Basic Clin Pharmacol Toxicol       Date:  2014-09       Impact factor: 4.080

6.  Ionic derivatives of betulinic acid exhibit antiviral activity against herpes simplex virus type-2 (HSV-2), but not HIV-1 reverse transcriptase.

Authors:  Robert J Visalli; Hannah Ziobrowski; Kameswara R Badri; Johnny J He; Xiugen Zhang; Sri Ranjini Arumugam; Hua Zhao
Journal:  Bioorg Med Chem Lett       Date:  2015-06-06       Impact factor: 2.823

7.  Antimalarial activity of betulinic acid and derivatives in vitro against Plasmodium falciparum and in vivo in P. berghei-infected mice.

Authors:  Matheus Santos de Sá; José Fernando Oliveira Costa; Antoniana Ursine Krettli; Mariano Gustavo Zalis; Gabriela Lemos de Azevedo Maia; Ivana Maria Fechine Sette; Celso de Amorim Câmara; José Maria Barbosa Filho; Ana Maria Giulietti-Harley; Ricardo Ribeiro Dos Santos; Milena Botelho Pereira Soares
Journal:  Parasitol Res       Date:  2009-04-15       Impact factor: 2.289

8.  Retrospective molecular docking study of WY-25105 ligand to β-secretase and bias of the three-dimensional structure flexibility.

Authors:  Leo Ghemtio; Nicolas Muzet
Journal:  J Mol Model       Date:  2013-04-07       Impact factor: 1.810

9.  Mitochondria-Targeted Lupane Triterpenoid Derivatives and Their Selective Apoptosis-Inducing Anticancer Mechanisms.

Authors:  Yaqing Ye; Tao Zhang; Huiqing Yuan; Defeng Li; Hongxiang Lou; Peihong Fan
Journal:  J Med Chem       Date:  2017-07-18       Impact factor: 7.446

10.  Discovery of betulinic acid as a selective inhibitor of human melanoma that functions by induction of apoptosis.

Authors:  E Pisha; H Chai; I S Lee; T E Chagwedera; N R Farnsworth; G A Cordell; C W Beecher; H H Fong; A D Kinghorn; D M Brown
Journal:  Nat Med       Date:  1995-10       Impact factor: 53.440

View more
  2 in total

1.  Leishmanicidal Activity of Betulin Derivatives in Leishmania amazonensis; Effect on Plasma and Mitochondrial Membrane Potential, and Macrophage Nitric Oxide and Superoxide Production.

Authors:  Wilmer Alcazar; Sami Alakurtti; Maritza Padrón-Nieves; Maija Liisa Tuononen; Noris Rodríguez; Jari Yli-Kauhaluoma; Alicia Ponte-Sucre
Journal:  Microorganisms       Date:  2021-02-04

2.  Optimal Modeling of Anti-Breast Cancer Candidate Drugs Based on Graph Model Feature Selection.

Authors:  Rongyuan Chen; Zhixiong He; Shaonian Huang; Lizhi Shen; Xiancheng Zhou
Journal:  Comput Math Methods Med       Date:  2022-08-30       Impact factor: 2.809

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.