Literature DB >> 21773063

Models for the prediction of receptor tyrosine kinase inhibitory activity of substituted 3-aminoindazole analogues.

Monika Gupta1, Harish Dureja, Anil Kumar Madan.   

Abstract

The inhibition of tumor angiogenesis has become a compelling approach in the development of anticancer drugs. In the present study, topological models were developed through decision tree and moving average analysis using a data set comprising 42 analogues of 3-aminoindazoles. A total of 22 descriptors (distance based, adjacency based, pendenticity and distance-cum-adjacency based) were used. The values of all 22 topological indices for each analogue in the dataset were computed using an in-house computer program. A decision tree was constructed for the receptor tyrosine kinase KDR (kinase insert domain receptor) inhibitory activity to determine the importance of topological indices. The decision tree learned the information from the input data with an accuracy of 88%. Three independent topological models were also developed for prediction of receptor tyrosine kinase inhibitory (KDR) activity using moving average analysis. The models developed were also found to be sensitive towards the prediction of other receptor tyrosine kinases i.e. FLT3 (fms-like tyrosine kinase-3) and cKIT inhibitory activity. The accuracy of classification of single index based models using moving average analysis was found to be 88%. The performance of models was assessed by calculating precision, sensitivity, overall accuracy and Mathew's correlation coefficient (MCC). The significance of the models was also assessed by intercorrelation analysis.

Entities:  

Keywords:  3-Aminoindazoles; Decision tree; Moving average analysis; Receptor tyrosine kinase inhibitors; Topological indices

Year:  2011        PMID: 21773063      PMCID: PMC3134851          DOI: 10.3797/scipharm.1102-08

Source DB:  PubMed          Journal:  Sci Pharm        ISSN: 0036-8709


Introduction

Cancer is a leading cause of death worldwide and accounted for 7.9 million deaths (around 13% of all deaths) in 2007. Moreover, the deaths resulting from cancer are expected to rise continuously with an estimated 12 million deaths in 2030 [1]. Cancer is thought to reflect a multistep process, resulting from an accumulation of inherited, acquired or both defects in genes involved in the positive or negative regulation of cell proliferation and survival. Activation or inactivation of just four or five different genes may be required for the development of clinically recognizable human cancer [2]. Despite advances in diagnosis and treatment, overall survival of patients still remains poor. Surgery, chemotherapy, radiotherapy, and endocrine therapy have been the standard options available for treatment of cancer patients. This has improved survival in several types of solid tumors, but treatment-related toxicity and emergence of drug resistance have been the major causes of morbidity and mortality [3]. Consequently, there is an urgent need to develop newer more effective therapies to improve patient outcomes. Blood is essential for solid tumors to manage nutritional supplies and waste removal to manage the tumor growth and metastasis. Angiogenesis is a process in which new blood vessels are formed from pre-existing vasculatures [4]. It has been reported that the angiogenesis is a rate limiting step in tumor development. Tumors that lack adequate vasculature become necrotic or apoptotic and don’t grow beyond a limited size [5, 6]. Inhibition of tumor angiogenesis has become a compelling approach in development of anticancer agents [7, 8]. Vascular endothelial growth factor (VEGF) is the primary endothelial cell specific angiogenic factor [9]. VEGF activity is mediated by three higher affinity receptors belonging to the class-V subfamily of receptor tyrosine kinases (RTKs). These are widely known for regulating angiogenesis, vasculogenesis, and lymphangiogenesis. The VEGFR family includes VEGFR-I/FLT-1 (fms-like tyrosine kinase-I), VEGFR-2/FLK-I (fetal liver kinase-I)/KDR (kinase inert domain containing receptor) and VEGFR-3/FLT-4 (fms-like tyrosine kinase-4). VEGFR-I is required for endothelial organization during vascular development while VEGFR-2 is required for formation of blood islands and also hematopoiesis [3, 5]. VEGFR-3 plays a significant role in VEGF-C and VEGF-D-mediated lymphangiogenesis. Over activation of KDR by VEGFs has been linked to progression of variety of human cancers. VEGF mediated KDR signaling induces a series of endothelial responses such as proliferation, migration and survival which ultimately leads to new vessel formation and maturation [10-13]. The immature tumor vasculature lacking a close association with pericytes appears to be most susceptible to inhibition of VEGFR signaling [14] and resistance to continued inhibition of VEGFR signalling has been reported [15]. Evidence increasingly points to a key role for PDGFR (platelet derived growth factor receptor tyrosine kinsases), located on the pericytes in the tumour stroma, in angiogenesis and vessel maturation and highlights its potential as a therapeutic target [16]. PDGFR is class III subfamily of RTKs including five receptors i.e. PDGFR-α, PDGFR-β, CSF-IR (colony stimulating factor 1 receptor), cKIT and FLT3 involved in cellular growth, differentiation, cytokine vascular regulation, gliomas and leukemia [3, 17]. FLT3 is shown to be commonly overexpressed in most B lineage acute lymphocytic leukemia (ALL), acute myeloid leukemias (AMLs) and chronic myeloid leukemias (CML) [18]. Preclinical studies have already shown the benefits of combining VEGFR and PDGFR inhibition with respect to tumour response [19]. Due to vital role that RTKs play in tumor angiogenesis, inhibition of these may prove to be an effective therapeutic intervention and potential inhibitors can be explored [2, 3]. The drug research and development is comprehensive, expensive, time-consuming and full of risk [20]. The traditional approach of drug discovery involves target identification, validation, lead search and optimization followed by clinical development phases [21]. The experimental search for better activities in drug discovery is commonly carried out in the laboratory by optimizing the structure–activity relationship (SAR) of the functional groups present in a leading structure in terms of their biological endpoint. However, an interesting alternative to this trial-error based procedure that constitutes an active field in complex biochemical phenomena are the analysis through Quantitative Structure–Activity/Property/Toxicity Relationships (QSAR/QSPR/QSTR) [22]. Quantitative structure-activity relationship (QSAR) represents an attempt to correlate structure descriptors of compounds with their biological activity [23, 24]. An important aspect of this method is the use of good structural descriptors that represent the molecular features responsible for the relevant biological activity [25]. The chemical graph theory is largely applied to the quantitative characterization of molecular structures for predicting physicochemical, pharmacological and toxicological properties using graph theoretical invariants [26, 27].The graph theoretical invariants have been termed as topological indices [28, 29]. The computation of TI is very swift and the TIs have the advantage of being true structural Invariants, which means that their values are independent of molecular conformations [25]. In last few decades, the topological indices have emerged as powerful tools for predicting biological activity of molecules, and lead identification forming an integral part of new molecular research [30-33]. In the present study, relationship of topological descriptors with KDR inhibitory activities of 3-aminoindazoles has been investigated using decision tree and moving average analysis. The proposed models were also evaluated for the prediction of FLT3 and cKIT inhibitory activities.

Methodology

Dataset

A dataset comprising of 42 analogues of substituted 3-aminoindazoles [34] was selected for the present investigations. The basic structure for these analogues is depicted in Fig. 1. and various substituents are enlisted in Tab. 1.
Fig. 1.

Basic structures of 3-aminoindazole analogues [34]

Tab. 1.

Relationship between topological indices and KDR inhibitory activity

Cpd. No.Basic StructureRR1M2CWcAξc2KDR inhibitory activity
Predicted using MAA models
Reported
M2CWcAξc2
1A 86.79494.184.11
2B 123.291674.222.02
3B 164.962093.782.15
4B 129.461949.521.76
5C 135.452172.951.81+
6C 135.461750.11.98
7C 125.461729.091.98
8C 123.291674.222.02
9D2–F136.202160.531.82
10D3–F137.782180.531.81
11D4–F137.792200.531.58
12D2–Me134.452152.951.82
13D4–Me135.462192.951.58
14D3–Et139.462423.381.62++++
15D3–Cl143.292198.401.81++
16D3–Br158.122246.621.81
17D3–CF3152.702955.61.72++
18D3–OH136.792177.281.81
19D2–F–5–Me141.202389.251.87+++
20D3–Me–4–F144.372427.251.63++++
21D3–F–4–Me144.372427.251.63+++
22D2–F–5–CF3153.703191.651.78
23E3–Me–Me141.622383.541.73++++
24E3–Me 147.12635.781.67+
25E3–Me 144.632391.411.73++++
26E2–F–5–Me 170.902460.041.73+++
27F3–Me–Me165.413909.631.43
28F3–Me–OMe173.574669.821.39
29F3–Me–F179.734664.311.41
30F3–Me–Br190.455587.481.28
31F3–Me 192.515554.981.36
32F3–Me 161.783572.181.42
33F3–Me 175.234126.541.5
34F3–Me 173.854123.641.5
35F3–Me 141.962409.051.54++
36F3–Me 150.182924.311.42
37F3–Me 179.934595.261.28
38F3–Me 161.093501.971.54
39F3–Cl 169.623602.531.42
40F2–F–5–Me 167.533873.601.45
41F2–F–5–Me 179.325030.371.41
42F2–F–5–Me 192.265466.491.37

Active analogue;

Inactive analogue.

Kinase enzymatic assays of al the 42 analogues were preformed by Dai et al. [34] utilizing the homogeneous time-resolved fluorescence (HTRF) protocol. Peptide substrate at 4 μM, 1 mM ATP, enzyme and inhibitors (3.2 nM to 50 μM) were incubated for 1 hour at ambient temperature in 50mM NaOH (pH 7.5), 10mM MgCl2, 2mM MnCl2, 2.5 mM DTT, 0.1 mM orthovandate and 0.01% bovine serum albumin. The reactions were stopped with 0.5 M EDTA and then 75 μL buffer containing detecting agents (streptividine-allphycocyanin and PT66 antibody europium cryptate) was added. The plates were read from 1to 4 hour for time-resolved fluorescence. The inhibition was calculated using control and background reading. Each IC50 determination was preformed with seven concentrations and each assay point was reportedly determined in duplicate [34]. Subsequently, based on the results of KDR enzymatic assay, the potent inhibitors were reportedly characterized by cellular assay using 3T3 – murine fibroblasts cells [34]. The in vivo activity of compounds with potent cellular activity was also reportedly carried out using an estradiol-induced mouse uterine edema (UE) model. The said assay served as a valuable tool for a rapid and preliminary evaluation of KDR inhibitor’s oral activity. The dataset comprised of variable degree of activities. Compounds having reported IC50 values of ≤ 10 nM were considered to be active while those possessing IC50 values >10 nM were treated to be inactive for the purpose of present study.

Topostructural and topochemical indices

Twelve topostructural and ten topochemical indices [30, 35–55] used for the present study are presented in Tab. 2. The distance based topological descriptor (Wiener index, Balaban index), adjacency based descriptors (Zagreb group parameter, M1 and M2, molecular connectivity index) and distance cum adjacency based topological descriptors (eccentric adjacency index, augmented eccentric connectivity index, superadjacency index, eccentric connectivity index, connective eccentric index, superaugmented eccentric connectivity index-2) and pendenticity based descriptor (superpendentic index) were calculated using an in-house computer program. The topochemical versions of topostructural descriptors were calculated from distance and adjacency matrices weighed by molecular mass with respect to that of carbon atom.
Tab. 2.

Topostructural and topochemical indices

CodeIndexRefe.
A1Molecular connectivity topochemical index35,36
A2Eccentric adjacency topochemical index37
A3Augmented eccentric connectivity topochemical index38
A4Superadjacency topochemical index39
A5Eccentric connectivity topochemical index40
A6Connective eccentricity topochemical index41
A7Zagreb topochemical index, M1C42
A8Zagreb topochemical index, M2C42
A9Wiener’s topochemical index43
A10Superaugmented eccentric connectivity topochemical index-244
A11Molecular connectivity index30
A12Eccentric adjacency index45
A13Augmented eccentric connectivity index46
A14Superadjacency index39
A15Eccentric connectivity index47
A16Connective eccentricity index48
A17Zagreb index, M149,50
A18Zagreb index, M249,50
A19Wiener’s index51,52
A20Superaugmented eccentric connectivity index-253
A21Balaban mean square distance index54
A22Superpendentic index55

Decision tree

Decision tree provides a useful solution for many problems of classification where large datasets are used and the information contained is complex. A decision tree consisting of nodes and branches represents a collection of rules with each terminal node corresponding to a specific decision rule [56]. Decision trees are constructed beginning with the roots of tree and proceeding down to its leaves. In terms of ability, decision trees are a rapid and effective method of classifying data set entries and can provide good decision support capabilities [57, 58]. Applications of classification-based decision tree methods predominate in science and medicine [59]. It has been applied to some bioinformatics and cheminformatics problems, such as characterizations of tumor [60, 61], prediction of drug response [62], classification of antagonist of receptors [63] and identification of DNA sections coding exons [64]. In present study, decision tree was grown to identify the importance of topological indices. In a decision tree, the molecules at each parent node are classified, based upon the index value, into two child nodes. The prediction for molecule reaching a given terminal node is obtained by majority vote of molecules reaching the same terminal node in the training set. In this study, R program (version 2.1.0) along with the RPART library was used to grow decision tree. The active compounds were labeled as “A” (n = 9) and the inactive compounds were labeled “B” (n = 33). Each analogue was assigned a biological activity which was then compared with the reported KDR inhibitory activity [34].

Moving average analysis

Moving average analysis of correctly predicted compounds is the basis of development of single topological index based model [45, 65]. For the selection and evaluation of range specific features, exclusive activity ranges were discovered from the frequency distribution of response level and subsequently identify the active range by analyzing the resulting data by maximization of the moving average with respect to active compounds (<35% inactive, 35–65 % transitional, > 65 % active) The KDR inhibitory activity assigned to each compound was compared with reported biological activity. The various ranges obtained were also studied for the cKIT and FLT3 inhibitory activities. The average IC50 (nm) for each range and activity was also calculated.

Data analysis

The sensitivity and specificity values were calculated which represents the classification accuracies for the active and inactive compounds, respectively. The randomness of model was also predicted by calculating Mathew’s correlation coefficient (MCC). The MCC values ranging between −1 to +1 indicates the potential of model. MCC took both sensitivity and specificity into account and it is generally used as a balanced measure in dealing with data imbalance situation [66]. The intercorrelation between M2C and Wc, and was also investigated. The degree of correlation was appraised by correlation coefficient ‘r’. Pairs of indices with r ≥ 0.97 were considered highly inter-correlated, those with 0.90 ≤ r ≤ 0.97 were appreciably correlated, those with 0.50 ≤ r ≤ 0.89 were weakly correlated and finally the pairs of indices with r < 0.50 were not intercorrelated [67].

Results and Discussion

In the present study, decision tree was built from a set of 22 topological indices. The index at root node is most important and the importance of index decreases as the length of tree increases. The classification of 3-aminoindazoles analogues as inactive and active using a single tree, based on Zagreb topochemical index A8 is shown in Fig. 2.
Fig. 2.

A decision tree for distinguishing active analogue (A) from inactive analogue (B); A8- Zagreb topochemical index, M2C

The decision tree identified the Zagreb topochemical index A8 as the most important index. The decision tree classified the analogues with an accuracy of 88%. The precision and sensitivity of inactive analogues was of the order of 91.11% and 93.93%, whereas the precision and sensitivity of active analogues was of the order of 75% and 66.6% respectively (Tab. 3).
Tab. 3.

Confusion matrix for KDR inhibitory activity using models based on decision tree

RangesNumber of cpds. predicted using decision tree
Precision (%)Sensitivity (%)MCC
ActiveInactive
Active637566.60.63
Inactive23191.1193.93
Three independent moving average analysis (MAA) based topological models were developed (Tab. 4.). The topological index A8, Zagreb topochemical index identified as most important index by decision tree was used to construct model for the prediction of KDR inhibitory activity. Two more indices i.e. Wiener’s topochemical index, A9 and superaugmented eccentric connectivity index-2, A20 were also used to develop the models for predicting KDR inhibitory activity. The methodology used in the present study aims at development of suitable models for providing lead molecules through exploitation of the active ranges in the proposed models based on topological indices. Proposed models are unique and differ widely from the conventional QSAR models. Both systems of modelling have their own advantages and limitations. In the instant case, the modelling system adopted has distinct advantage of identification of narrow active range, which may be erroneously skipped during routine regression analysis in conventional QSAR modelling [44]. Since the ultimate goal of modelling is to provide lead structures, therefore, these active ranges can play vital role in lead identification.
Tab. 4.

MAA based models for the prediction of receptors tyrosine inhibitory activity

Model IndexNature of range in proposed modelIndex valueNumber of analogues falling in the range
Overall accuracy of predict. (%)KDR Aver. IC50 (nM)FLT3 Aver.* IC50 (nM)cKIT Aver.* IC50 (nM)
TotalCorrect
M2CLower inactive<139.4613121379.7784.541629.62
Active139.46–144.638688.099.6319.8619.71
Upper inactive>144.632119557.76189.31893.14

WcLower inactive<2383.5416141412.5680.532029.19
Active2383.54–2460.048688.099.8721.2923.43
Upper inactive>2460.041817391.5207.86108.73

Aξc2Lower inactive<1.621818389.5188.36101.4
Active1.62–1.738688.0912.52531
Upper inactive>1.7316131413.5972037.25

Average in a range is taken only for the reported IC50 values;

#Values in brackets are based upon correctly predicted analogues in the particular range.

Retrofit analysis of the data reveals the following information with regards to different models used in this study. The biological activity was assigned to 42 analogues in both active and inactive ranges, out of which activity of 37 analogues was correctly predicted resulting in 88.09% accuracy with regard to KDR inhibition using model based on Zagreb topochemical index (A8). 31 out of 34 compounds (91%) in both the inactive ranges were predicted correctly. The average IC50 value of all the correctly predicted analogues in both the inactive ranges was 1494.5 nM and 615.421 nM respectively (Fig. 3). The average IC50 of correctly predicted analogues in the active range was found to be only 5 nM with regard to KDR inhibitory activity. Such a low average IC50 value signifies high potency of the active range. The said active range also exhibited significant FLT3 activity with average IC50 value of 19.86 nM (Fig. 4) and cKIT activity with average IC50 value of 19.714 nM (Fig. 5). The precision and sensitivity of inactive analogues was found to be 93.93% and 91.11%, whereas the precision and sensitivity of active analogues was of the order of 66.6% and 75% respectively. The 3-aminoindazoles analogues were correctly classified as active or inactive using Wiener’s topochemical index with an accuracy of 88%. The biological activity was assigned to 42 analogues in both active and inactive ranges, out of which 37 analogues were correctly predicted. 31 out of 34 compounds (91.1%) in both the inactive ranges were predicted correctly. The average IC50 value of all the correctly predicted analogues in both the inactive ranges was 1613.57 nM and 413.94 nM, respectively whereas the average IC50 of correctly predicted analogues in the active range was found to be only 5.33 nM (Fig. 3). Such a low average IC50 value signifies high potency of the active range.
Fig. 3.

Average IC50 (nM) values of 3-aminoindazoles for KDR inhibitory activity in various ranges of topological models derived through moving average analysis

Fig. 4.

Average IC50 (nM) values of 3-aminoindazoles for FLT3 inhibitory activity in various ranges of topological models derived through moving average analysis

Fig. 5.

Average IC50 (nM) values of 3-aminoindazoles for cKIT inhibitory activity in various ranges of topological models derived through moving average analysis

The above active range also exhibited significant FLT3 activity with average IC50 value of 21.29 nM (Fig. 4) and cKIT activity with average IC50 value of 23.43 nM (Fig. 5). The precision and sensitivity of inactive analogues was found to be 93.93% and 91.11%, whereas the precision and sensitivity of active analogues was of the order of 66.6% and 75% respectively. Superaugmented eccentric connectivity-2, classified the 3-aminoindazoles analogues as active and inactive with an accuracy of 88%. The biological activity was assigned to 42 analogues in both active and inactive ranges, out of which 37 analogues were correctly predicted. 31 out of 34 compounds (91.1%) in both the inactive ranges were predicted correctly. The average IC50 value of all the correctly predicted analogues in both the inactive ranges was 389.5 nM and 841.75 nM, respectively whereas the average IC50 of correctly predicted analogues in the active range was found to be only 6.33 nM (Fig. 3). Such a low average IC50 value signifies high potency of the active range. The said active range also exhibited significant FLT3 activity with average IC50 value of 25 nM (Fig. 4) and cKIT activity with average IC50 value of 31 nM (Fig. 5). The overall accuracy of prediction was found to be 88%. The precision and sensitivity of inactive analogues was found to be 93.93% and 91.11%, whereas the precision and sensitivity of active analogues was of the order of 66.6% and 75% respectively. The MCC value was found to be 0.633 (Tab. 3.) suggesting the randomness in the distribution of data. The result of intercorrelation analysis (Tab. 5) reveals that the was not correlated with M2C and Wc while the pair M2C and Wc was found to be appreciably correlated
Tab. 5.

Intercorrelation matrix

M2CWcAξc2
M2C10.92−0.73
Wc1−0.68
Aξc21

Conclusion

Models based on all the three topological descriptors exhibited high degree of predictability with regard to KDR inhibitory activity using decision tree and moving average analysis. Moreover, the active ranges of the proposed models also exhibited significant FLT3 and cKIT inhibitory activities. A combination of VEGFR and PDGFR inhibitory activities will naturally be more beneficial for the treatment of tumors. High degree of predictability of the proposed models can provide valuable lead structures for the development of potent receptor tyrosine kinase inhibitors (RTKs).
  40 in total

Review 1.  Recent advances on the role of topological indices in drug discovery research.

Authors:  E Estrada; E Uriarte
Journal:  Curr Med Chem       Date:  2001-11       Impact factor: 4.530

2.  Predicting anti-HIV activity: computational approach using a novel topological descriptor.

Authors:  S Gupta; M Singh; A K Madan
Journal:  J Comput Aided Mol Des       Date:  2001-07       Impact factor: 3.686

3.  Topochemical models for prediction of cyclin-dependent kinase 2 inhibitory activity of indole-2-ones.

Authors:  Harish Dureja; Anil Kumar Madan
Journal:  J Mol Model       Date:  2005-06-02       Impact factor: 1.810

4.  Data mining with decision trees for diagnosis of breast tumor in medical ultrasonic images.

Authors:  W J Kuo; R F Chang; D R Chen; C C Lee
Journal:  Breast Cancer Res Treat       Date:  2001-03       Impact factor: 4.872

Review 5.  Vascular remodeling and clinical resistance to antiangiogenic cancer therapy.

Authors:  Julia Glade Bender; Erin M Cooney; Jessica J Kandel; Darrell J Yamashiro
Journal:  Drug Resist Updat       Date:  2004 Aug-Oct       Impact factor: 18.500

Review 6.  Protein tyrosine kinase structure and function.

Authors:  S R Hubbard; J H Till
Journal:  Annu Rev Biochem       Date:  2000       Impact factor: 23.643

Review 7.  Tyrosine kinase inhibitors in cancer therapy.

Authors:  Srinivasan Madhusudan; Trivadi S Ganesan
Journal:  Clin Biochem       Date:  2004-07       Impact factor: 3.281

Review 8.  Angiogenesis in cancer, vascular, rheumatoid and other disease.

Authors:  J Folkman
Journal:  Nat Med       Date:  1995-01       Impact factor: 53.440

Review 9.  Mechanisms of resistance to FLT3 inhibitors.

Authors:  S Haihua Chu; Donald Small
Journal:  Drug Resist Updat       Date:  2009-01-21       Impact factor: 18.500

10.  Selective ablation of immature blood vessels in established human tumors follows vascular endothelial growth factor withdrawal.

Authors:  L E Benjamin; D Golijanin; A Itin; D Pode; E Keshet
Journal:  J Clin Invest       Date:  1999-01       Impact factor: 14.808

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.