Literature DB >> 23901159

Molecular Descriptors in Modelling the Tumour Necrosis Factor-α Converting Enzyme Inhibition Activity of Novel Tartrate-Based Analogues.

P Singh1.   

Abstract

The tumour necrosis factor-α converting enzyme inhibition activity of a series comprising of novel tartrate-based analogues has been quantitatively analysed in terms of molecular descriptors. The statistically validated quantitative structure-activity relationship models provided rationales to explain the inhibition activity of these congeners. The descriptors identified through combinatorial protocol in multiple linear regression analysis have highlighted the role of Moran autocorrelation of lag 7, weighted by atomic van der Waals volume, presence of both prime and nonprime amide carbonyl oxygen in the tartrate moiety and occurrence of five membered ring bearing substituents at varying sites. A few potential novel tartrate-based analogues have been suggested for further investigation.

Entities:  

Keywords:  Combinatorial protocol in multiple linear regression analysis; inhibitors of tumour necrosis factor-α converting enzyme; molecular descriptors; novel tartrate-based compounds; quantitative structure-activity relationship

Year:  2013        PMID: 23901159      PMCID: PMC3719148          DOI: 10.4103/0250-474X.113539

Source DB:  PubMed          Journal:  Indian J Pharm Sci        ISSN: 0250-474X            Impact factor:   0.975


Tumour necrosis factor-α (TNF-α) is one of the cytokines, which is involved in immunomodulation and proinflammation events. The overproduction of TNF-α has been concerned in many autoimmune disorders namely rheumatoid arthritis, Crohn’s disease and psoriasis.[1-4] The reduction of TNF-α levels has been managed for successful treatment of inflammatory diseases.[5] Thus, the finding of a low cost, orally active small drug, which could moderate TNF-α levels is of prime importance at clinical level at present. One important strategy to reduce the levels of soluble TNF-α is to block the release of TNF-α from the cell surface by the inhibition of TNF-α converting enzyme (TACE).[6-8] This enzyme being a membrane-bound zinc-metalloprotease is able to convert the 26-kD transmembrane pro-form of TNF-α to the mature 17-kD soluble form.[910] It has been shown that the active site of TACE shares many common features with the matrix metalloproteinases (MMPs).[1112] However, a unique feature of TACE is a tunnel interconnecting the S1’ and S3’ pockets into a single large cavity. The selectivity of TACE may be accomplished by incorporating appropriate substitutions that bind in the narrow S1’ tunnel and large S3’ pocket.[13-17] As the broad-spectrum MMP inhibitors exhibit a dose-limiting toxicity leading to side effects known as musculoskeletal syndrome (MMS),[18-25] therefore, selective inhibitors of TACE are desirable at present. As most of the common TACE inhibitors are hydroxamate based,[2627] the exploration of selective nonhydroxamate drugs, devoid of MMS, may be more potential TACE inhibitors. In view of this, Rosner, et al.[28] have screened their proprietary mixture-based combinatorial library with the automated ligand identification system[29-31] and were able to identify four compounds (Compound. 1-4; Table 1, fig. 1) of moderate TACE affinities. The structures of these compounds were similar to bis-amides of l-tartaric acid (tartrates)[32] while the corresponding d-tartrates were reported as inactive. This was the first report in which a unique tridentate zinc binding mode was revealed with the tartrate scaffold and is defined by the two hydroxyl groups and the nonprime amide carbonyl interacting with the catalytic zinc atom (fig. 2). The zinc atom maintains its coordination with the three imidazole nitrogen’s of His405, His409 and His415 and attains pseudo-octahedral coordination geometry during the binding mode. The prime amide carbonyl oxygen makes hydrogen bonds with the backbone -NH of both Leu348 and Gly349. The OH near the nonprime side also forms hydrogen bonds with the carboxylate oxygen of Glu406. In this way, all the four oxygen atoms on the tartrate core collectively make both the nonprime and prime binding interactions with the TACE protein. A novel series of compounds, able to undergo such interactions, were further prepared and evaluated for their binding affinity (Ki) for TACE.[28] However this study, being sort of structure-activity relationship (SAR), was targeted at the alterations of substituents at different positions and provided no rationale to reduce the trial-and-error factors. Hence, the present communication is aimed at to perform a 2D quantitative structure-activity relationship (2D  QSAR) for the reported compounds so as to provide the rationale for drug-design and to grasp some molecular features of the compounds for the property study relating to their binding affinity for TACE. In the congeneric series, where a relative study is being carried out, the 2D descriptors may play important role in deriving the significant relationships with biological activities of the compounds. The novelty and importance of a 2D-QSAR study is due to its simplicity for the calculations of different descriptors and their interpretation in physical sense. Thus the study may not fully explain the mode of interaction at the receptor site(s) but it will certainly reflect upon important molecular features relevant for the interaction.
Table 1

MOLECULAR DESCRIPTORS, OBSERVED, CALCULATED AND PREDICTED TUMOR NECROSIS FACTOR-α CONVERTING ENZYME INHIBITION ACTIVITY OF NOVEL TARTRATE-BASED ANALOGUES

Fig. 1

The generalized structures of tartrate-based compounds (Table 1). (a) Compounds 1-32: X=Y=O, 33: X=O, Y=S, 34: X=S, Y=O; (b) compounds 35 and 36.

Fig. 2

Tridentate chelation of the zinc atom with the tartrate core.[28]

MOLECULAR DESCRIPTORS, OBSERVED, CALCULATED AND PREDICTED TUMOR NECROSIS FACTOR-α CONVERTING ENZYME INHIBITION ACTIVITY OF NOVEL TARTRATE-BASED ANALOGUES The generalized structures of tartrate-based compounds (Table 1). (a) Compounds 1-32: X=Y=O, 33: X=O, Y=S, 34: X=S, Y=O; (b) compounds 35 and 36. Tridentate chelation of the zinc atom with the tartrate core.[28]

MATERIALS AND METHODS

The active compounds along with their binding constant, Ki under present investigation (Table 1) have been taken from the literature.[28] The generalised structure of these compounds is shown in fig. 1a and b. The binding affinity has been expressed on the negative logarithm as pKi (-logKi) on the molar basis and stand as the dependent descriptor for present quantitative analysis. For modelling purpose, the data-set was divided into training- and test-sets to insure external validation of models derived through identified descriptors. Additionally, leave-one-out (LOO) and leave-five-out (L5O) procedures were employed for internal validation of generated models from the training-set. The selection of compounds for test set has been made through SYSTAT, Systat Software Inc., Chicago, USA[33] using the single linkage hierarchical cluster procedure involving the Euclidean distances of the activity values. Nearly 25% of the compounds, from total population, were selected for this purpose. Based on the pKi values of the data set, a cluster tree was generated and compounds were selected in such a way to keep them at a maximum possible distance from each other. In this way, the test set includes the highest to lowest active congeners of the data set. In SYSTAT, by default, the normalised Euclidean distances are computed to join the objects of cluster. The normalised distances are root mean-squared distances. The single linkage uses distance between two closest members in clustering. It generates long clusters and provides scope to choose objects at different intervals. Due to this reason, a single linkage clustering procedure was applied.

Molecular descriptors:

The structures of the compounds under study have been drawn in ChemDraw, Cambridge Soft Corporation, Cambridge, USA[34] using the standard procedure. These structures were converted into 3D objects using the default conversion procedure implemented in the CS Chem3D Ultra, Cambridge Soft Corporation, Cambridge, USA. The generated 3D-structures of the compounds were subjected to energy minimization in the MOPAC module, using the AM1 procedure for closed shell systems, implemented in the CS Chem3D Ultra. This will ensure a well-defined conformer relationship across the compounds of the study. All these energy minimised structures of respective compounds have been ported to DRAGON software (Virtual Computational Chemistry Laboratory, Munich, Germany)[35] for computing the descriptors corresponding to 0D, 1D, and 2D classes. Table 2 provides the definition and scope of these descriptor classes in addressing the structural features which were employed in present QSAR work. The combinatorial protocol in multiple linear regression (CP-MLR) computational procedure[36] has been used for present work in developing QSAR models. Prior to application of the CP-MLR procedure, all those descriptors which are intercorrelated beyond 0.90 and showing a correlation of less than 0.1 with the biological endpoints (descriptor vs. activity, r<0.1) were excluded. The remaining descriptors, able to address the biological activity of these compounds, will serve as the database (pool) at the end of this initial stage.
Table 2

DESCRIPTOR CLASSES USED FOR THE ANALYSIS OF TUMOR NECROSIS FACTOR-α CONVERTING ENZYME ACTIVITY OF TARTRAE-BASED ANALOGUES AND IDENFIFIED CATEGORIES IN MODELING THE ACTIVITY

DESCRIPTOR CLASSES USED FOR THE ANALYSIS OF TUMOR NECROSIS FACTOR-α CONVERTING ENZYME ACTIVITY OF TARTRAE-BASED ANALOGUES AND IDENFIFIED CATEGORIES IN MODELING THE ACTIVITY

Model development:

The CP-MLR is a ‘filter’- based variable selection procedure for model development in QSAR studies.[36] Its procedural aspects and implementation are discussed in some of our recent publications.[37-42] The thrust of this procedure is in its embedded ‘filters’. They are briefly as follows: Filter-1 seeds the variables by way of limiting interparameter correlations to predefined level (upper limit ≤0.79); filter-2 controls the variables entry to a regression equation through t-values of coefficients (threshold value ≥2.0); filter-3 provides comparability of equations with different number of variables in terms of square root of adjusted multiple correlation coefficient of regression equation, r-bar; filter-4 estimates the consistency of the equation in terms of cross-validated r2 or q2 with LOO cross-validation as default option (threshold value 0.3≤q2 ≤1.0). All these filters make the variable selection process efficient and lead to a unique solution. In order to collect the descriptors with higher information content and explanatory power, the threshold of filter-3 was successively incremented with increasing number of descriptors (per equation) by considering the r-bar value of the preceding optimum model as the new threshold for next generation. Furthermore, in order to discover any chance correlations associated with the models recognized in CP-MLR, each cross-validated model has been put to a randomisation test[4344] by repeated randomisation of the activity to discover the chance correlations, if any, associated with them. For this, every model has been subjected to 100 simulation runs with scrambled activity. The scrambled activity models with regression statistics better than or equal to that of the original activity model have been counted, to express the per cent chance correlation of the model under scrutiny.

Applicability domain:

The utility of a QSAR model is based on its accurate prediction ability for new compounds. A model is valid only within its training domain and new compounds must be assessed as belonging to the domain before the model is applied. The applicability domain is assessed by the leverage values for each compound.[4546] The Williams plot (the plot of standardised residuals versus leverage values, h) can then be used for an immediate and simple graphical detection of both the response outliers (Y-outliers) and structurally influential chemicals (X-outliers) in the model. In this plot, the applicability domain is established inside a squared area within ±x (standard deviation) and a leverage threshold h*. The threshold h* is generally fixed at 3(k+1)/n (n is the number of training-set compounds and k is the number of model parameters) whereas x=2 or 3. Prediction must be considered unreliable for compounds with a high leverage value (h>h*). On the other hand, when the leverage value of a compound is lower than the threshold value, the probability of accordance between predicted and observed values is as high as that for the training set compounds.

RESULTS AND DISCUSSION

From the listed compounds in Table 1, two analogues (compound 2 and 24) have been removed from the study. The X-ray structure of compound 2 has revealed that the 2-chlorophenyl-piperazine group binds to the S1 subsite, defined by Val314, Lys315, Thr347 and Leu350. This subsite, being a flat hydrophobic patch, was solvent exposed. The chlorophenyl group appeared disordered and lacks 2fofc electron density.[28] The compound was unable to bind properly to the subsite and behave indifferently from other analogues of the series. Likewise, shortening of ethylene linker to the methylene linker resulted into significantly less active compound 24 compared to compound 21. Because of larger size of phenyl ring and shorter linker, the benzyl amide group of compound 24 may not able to bind properly in the narrow S1’ tunnel. This compound, therefore, also remained the ‘outlier’ of present study. The remaining 34 compounds have been further divided into training and test sets. As mentioned above, the selection of test-set compounds was made through SYSTAT using the single linkage hierarchical cluster procedure involving the Euclidean distances of the pKi values. Nine compounds (compound 13, 14, 16, 19, 22, 23, 28, 29 and 35; Table 1) were selected from the generated cluster tree in such a way to keep them at a maximum possible distance from each other. The test set was employed for external validation of models derived from remaining 25 analogues of the series. The internal consistency, for each of the generated models from training set, was achieved through LOO and L5O procedures. A total number of 481 descriptors, belonging to 0D, 1D and 2D classes, were computed for these compounds utilizing DRAGON software. The descriptors which were poorly correlated with dependent variable, pKi and were intercorrelated among themselves were eliminated initially. The leftover 99 descriptors were collated in a pool and were subjected to CP-MLR. A large number of models were obtained in one, two and three descriptors. In doing so the threshold of filter-3 was successively incremented with increasing number of descriptors (per equation) by considering the r-bar value of the preceding optimum model as the new threshold for next generation. However, the statistical significance was achieved only for 21 models that were obtained in three descriptors. The identified descriptors for them along with their average regression coefficients and the total incidence are given in Table 3. The name of each descriptor is given in the footnote under this Table. Three such models, in the increasing level of significance, are given through Eqs. 1-3
Table 3

DESCRIPTORS IDENTIFIED FOR MODELING THE TUMOR NECROSIS FACTOR-α CONVERTING ENZYME INHIBITION ACTIVITY OF TARTRATE-BASED ANALOGUES ALONG WITH THEIR AVERAGE REGRESSION COEFFICIENTS AND THE TOTAL INCIDENCE

DESCRIPTORS IDENTIFIED FOR MODELING THE TUMOR NECROSIS FACTOR-α CONVERTING ENZYME INHIBITION ACTIVITY OF TARTRATE-BASED ANALOGUES ALONG WITH THEIR AVERAGE REGRESSION COEFFICIENTS AND THE TOTAL INCIDENCE pKi=42.594(±15.951) PW3-8.231(±1.581) MATS7v+1.463(±0.278) O-058-10.552 n=25, r=0.845, s=0.431, F(3, 21)=17.460, AIC=0.257, FIT=1.541, LOF=0.270, q2LOO=0.554, q2L5O=0.563, r2Test=0.663 (1) pKi=-0.209(±0.068) RBN-9.081(±1.516) MATS7v+1.433(±0.264) O-058+6.467 n=25, r=0.857, s=0.415, F(3, 21)=19.379, AIC=0.238, FIT=1.710, LOF=0.250, q2LOO=0.624, q2L5O=0.625, r2Test=0.517 (2) pKi=0.631(±0.129) nR05-7.946(±1.255) MATS7v+1.498(±0.218) O-058+2.900 n=25, r=0.906, s=0.342, F(3, 21) =31.910, AIC=0.161, FIT=2.816, LOF=0.170, q2LOO=0.732, q2L5O=0.690, r2Test=0.660 (3) where n and F represent, respectively, the number of data points and the F-ratio between the variances of calculated and observed activities. The ±data within the parentheses are the standard errors associated with regression coefficients. FIT is the Kubinyi function,[4748] AIC is the Akaike’s information criterion[4950] and LOF is the Friedman’s lack of fit factor.[51] The FIT function is closely related to the F-statistic but proved to be a useful parameter for the assessment of the quality of the models. The disadvantage of the F-value is its sensitivity to changes in the number of independent variables, k in the equation that describes the model. The F-value is more sensitive if k is small, whereas it is less sensitive if k is large. The FIT function, on the other hand, is less sensitive to a lower number k but is more sensitive to a larger number k The best model would yield the highest value for this function. The AIC takes into account the statistical goodness of fit and the number of parameters that have to be estimated to achieve that degree of fit. The model that produces the lower AIC value should be considered potentially the most useful. The LOF factor takes into account the number of terms used in the equation and is not biased, as are the indicator variables, toward large number of parameters. A statistical sound model will generate the lower value of LOF. In a comparative study, where QSAR models are generated from the descriptors belonging to different categories, the FIT function, the AIC criterion and the LOF factor are very important parameters in explaining the best model.[52-54] In all above equations, the F-values remained significant at 99% level [F3,21 (0.01)=4.874] and indices q2LOOand q2L5O(>0.5) have accounted for their internal robustness. The r2Test value, greater than 0.5, specified that the identified test-set is able to validate these models externally. The descriptors RBN, nR05, O-058 and MATS7v involved in these models represent, respectively, number of rotatable bonds, number of five-membered rings, number of doubly bonded oxygen atoms and Moran autocorrelation of lag 7/weighted by atomic van der Waals volume. Though all the above models are reliable enough in statistical sense, but the highest variance (r2), in observed activities, is explained only through Eq. 3. The other statistical parameters, s, F, AIC, FIT, LOF and q2 also favoured Eq. 3 as a statistically reliable model and thus retained for further discussion. The computed values of descriptors employed in the derivation of Eq. 3 are given in Table 1 for the sake of convenience. That these descriptors have no mutual correlation is shown in Table 4. The descriptor, MATS7v divulged the implication of lag 7, weighted by atomic van der Waals volume. The highest value of descriptor O-058 could be two (Table 1) for the compound under study, advocating the need of both oxygen (O=) atoms. Alternatively, it is essential to have a tartrate core for key binding interaction involving these oxygen atoms through hydrogen bonding with receptor sites. Replacement of any of these oxygen atoms with sulphur (Compounds 33 and 34) and substitution of the nonprime amide with amide isosteres, such as a thiazole or imidazole (Compound 35 and 36), afforded compounds with weaker TACE inhibition and were not advantageous. The loss of activity in these compounds is thought to be caused by the poorer hydrogen-bond acceptor character of sulphur and lack of interactions with the S1 subsite or the face-edge interactions with His415. From Eq. 3, it appeared that the higher values of descriptors nR05 and O-058 and the lower (or more negative) value of MATS7v are conducive in improving the activity of a compound. The calculated and predicted pKis using, respectively, Eq. 3 and LOO procedure, remained in parity with the observed ones (Table 1). The plot, showing the variation of observed versus calculated and predicted pKis is given in fig. 3. Except two ‘outlier’ congeners (Compound 2 and 24), all other compounds have exhibited systematic variation between observed and calculated pKi values, reflecting upon the goodness of fit. Based on Eq. 3, a few potential inhibitors of TACE have been suggested for further exploration. These are given in Table 5 along with descriptors and calculated pKi values. The predicted activities of some of these congeners were much superior to that of highest potent compounds reported in the original series.
Table 4

CORRELATION MATRIXa AMONGST THE DESCRIPTORS OF EQ. 3

Fig. 3

Plot of observed versus calculated and predicted pKi values. pKi values Calculated form Eq.(3) Ο, pKi values Predicted by LOO Δ

Table 5

PREDICTED TUMOUR NECROSIS FACTOR-α CONVERTING ENZYME INHIBITION ACTIVITY OF SOME NEW TARTRATE-BASED ANALOGUES

CORRELATION MATRIXa AMONGST THE DESCRIPTORS OF EQ. 3 Plot of observed versus calculated and predicted pKi values. pKi values Calculated form Eq.(3) Ο, pKi values Predicted by LOO Δ PREDICTED TUMOUR NECROSIS FACTOR-α CONVERTING ENZYME INHIBITION ACTIVITY OF SOME NEW TARTRATE-BASED ANALOGUES The applicability domain was visualised through the Williams plot (fig. 4) for the highest significant model, obtained for complete data-set as
Fig. 4

Williams plot

Williams plot for the training set and external prediction set for tumour necrosis factor-α converting enzyme inhibition activity of tartrate-based compounds, listed in Table 1 (h*=0.33 and residual limits=±0.92). Training-set Ο, Test-set Δ.

Williams plot Williams plot for the training set and external prediction set for tumour necrosis factor-α converting enzyme inhibition activity of tartrate-based compounds, listed in Table 1 (h*=0.33 and residual limits=±0.92). Training-set Ο, Test-set Δ. pKi=0.638(±0.150) nR05-6.269(±1.383) MATS7v+1.540(±0.251) O-058+2.848 n=36, r=0.821, s=0.461, F(3,32)=22.011, q2LOO=0.586, q2L5O=0.524 AIC=0.266, FIT=1.467, LOF=0.272 (4) The limits of normal values for the Y-outliers (response outliers) was set equal to ±2 times the standard deviation and a leverage threshold at h*. For present work, the residual limits and leverage threshold were ±0.92 and 0.33, respectively. From fig. 4, it appeared that compounds 2 and 24 were obvious ‘outliers’ while compound 36 (X-outlier) is a prominent congener to influence the statistics of present series. The residual and leverage of this influential compound were 0.127 and 0.349, respectively. All remaining compounds (training set and test set), present within the square, indicated that the applicability domain is fully justified and the identified model has been evaluated correctly. Furthermore, the derived model matches the high quality parameters with good fitting power and capability of assessing external data. The present study has, therefore, provided guidelines to develop new tartrate-based analogues which may be potent inhibitors of TNF-α converting enzyme (TACE). The emerged descriptors enlightened the role of Moran autocorrelations pertaining to lag 7, weighted by atomic van der Waals volume, presence of both prime and nonprime amide carbonyl oxygen in the tartrate moiety and occurrence of 5 membered ring bearing substituents at prime and nonprime sites. A few potential novel tartrate-based analogues, as the inhibitors of TNF-α converting enzyme (TACE), have been suggested for further investigation.
  41 in total

1.  Design and therapeutic application of matrix metalloproteinase inhibitors.

Authors:  M Whittaker; C D Floyd; P Brown; A J Gearing
Journal:  Chem Rev       Date:  1999-09-08       Impact factor: 60.622

2.  2D-autocorrelation descriptors for predicting cytotoxicity of naphthoquinone ester derivatives against oral human epidermoid carcinoma.

Authors:  Liane Saíz-Urra; Maykel Pérez González; Marta Teijeira
Journal:  Bioorg Med Chem       Date:  2007-02-22       Impact factor: 3.641

3.  QSAR study about ATP-sensitive potassium channel activation of cromakalim analogues using CP-MLR approach.

Authors:  Susheela Sharma; Yenamandra S Prabhakar; Prithvi Singh; Brij Kishore Sharma
Journal:  Eur J Med Chem       Date:  2008-01-31       Impact factor: 6.514

4.  Hydroxamates: relationships between structure and plasma stability.

Authors:  Marion Flipo; Julie Charton; Akila Hocine; Sandrine Dassonneville; Benoit Deprez; Rebecca Deprez-Poulain
Journal:  J Med Chem       Date:  2009-11-12       Impact factor: 7.446

5.  Cloning of a disintegrin metalloproteinase that processes precursor tumour-necrosis factor-alpha.

Authors:  M L Moss; S L Jin; M E Milla; D M Bickett; W Burkhart; H L Carter; W J Chen; W C Clay; J R Didsbury; D Hassler; C R Hoffman; T A Kost; M H Lambert; M A Leesnitzer; P McCauley; G McGeehan; J Mitchell; M Moyer; G Pahel; W Rocque; L K Overton; F Schoenen; T Seaton; J L Su; J D Becherer
Journal:  Nature       Date:  1997-02-20       Impact factor: 49.962

6.  A quantitative structure-activity relationship study on serotonin 5-HT6) receptor ligands: indolyl and piperidinyl sulphonamides.

Authors:  B K Sharma; P Singh; K Sarbhai; Y S Prabhakar
Journal:  SAR QSAR Environ Res       Date:  2010-04       Impact factor: 3.000

7.  Acetylenic TACE inhibitors. Part 3: Thiomorpholine sulfonamide hydroxamates.

Authors:  J I Levin; J M Chen; L M Laakso; M Du; J Schmid; W Xu; T Cummons; J Xu; G Jin; D Barone; J S Skotnicki
Journal:  Bioorg Med Chem Lett       Date:  2006-01-19       Impact factor: 2.823

Review 8.  Preclinical and clinical studies of MMP inhibitors in cancer.

Authors:  A H Drummond; P Beckett; P D Brown; E A Bone; A H Davidson; W A Galloway; A J Gearing; P Huxley; D Laber; M McCourt; M Whittaker; L M Wood; A Wright
Journal:  Ann N Y Acad Sci       Date:  1999-06-30       Impact factor: 5.691

9.  Modelling of serotonin reuptake inhibitory and histamine H₃antagonistic activity of piperazine and diazepane amides: QSAR rationales for co-optimization of the activity profiles.

Authors:  B K Sharma; P Singh; M Shekhawat; K Sarbhai; Y S Prabhakar
Journal:  SAR QSAR Environ Res       Date:  2011-06       Impact factor: 3.000

Review 10.  Current perspective of TACE inhibitors: a review.

Authors:  Shirshendu DasGupta; Prashant R Murumkar; Rajani Giridhar; Mange Ram Yadav
Journal:  Bioorg Med Chem       Date:  2008-12-03       Impact factor: 3.641

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.