| Literature DB >> 19758437 |
Abstract
BACKGROUND: Small molecules that bind reversibly to DNA are among the antitumor drugs currently used in chemotherapy. In the pursuit of a more rational approach to cancer chemotherapy based upon these molecules, it is necessary to exploit the interdependency between DNA-binding affinity, sequence selectivity and cytotoxicity. For drugs binding noncovalently to DNA, it is worth exploring whether molecular descriptors, such as their molecular weight or the number of potential hydrogen acceptors/donors, can account for their DNA-binding affinity and cytotoxicity.Entities:
Mesh:
Substances:
Year: 2009 PMID: 19758437 PMCID: PMC2758867 DOI: 10.1186/1471-2210-9-11
Source DB: PubMed Journal: BMC Pharmacol ISSN: 1471-2210
Molecular descriptors for noncovalent DNA-binding drugsa.
| Actinomycin D | 3053 | Intercalation | 1255 | 1.6 | 5 | 18 | 356 | 3030 | 5.38 | 2 | 8.7 |
| Bleomycin | 125066 | Intercalation | 1416 | -1.9 | 20 | 30 | 627 | 2580 | 5.57 | 1 | 5.9 |
| Chartreusin | 5159 | Intercalation | 641 | 2.6 | 5 | 14 | 200 | 1150 | 5.45 | 2 | 5.7 |
| Chromomycin | 58514 | minor-groove | 1185 | -0.6 | 8 | 26 | 359 | 2480 | 4.41 | 1 | 8.3 |
| Daunorubicin | 82151 | intercalation | 528 | 0.1 | 5 | 11 | 186 | 960 | 6.65 | 2 | 7.1 |
| Distamycin A | 82150 | minor-groove | 482 | 0.2 | 6 | 9 | 181 | 825 | 6.89 | 3 | 4.1 |
| Doxorubicin | 123127 | intercalation | 544 | -0.5 | 6 | 12 | 206 | 977 | 6.30 | 1 | 7.2 |
| Echinomycin | 526417 | bis-intercalation | 1101 | 2.5 | 4 | 16 | 302 | 2200 | 5.52 | 2 | 8.1 |
| Elsamicin A | 369327 | intercalation | 654 | 2.9 | 5 | 14 | 206 | 1210 | 6.54 | 2 | 7.5 |
| Epirubicin | 256942 | intercalation | 544 | -0.5 | 6 | 12 | 206 | 977 | 6.57 | 2 | 6.7 |
| Ethidium | 268986 | intercalation | 394 | 4.3 | 2 | 3 | 56 | 419 | 4.90 | 4 | 5.5 |
| m-AMSA | 249992 | intercalation | 394 | 3.8 | 2 | 6 | 80 | 601 | 4.30 | 4 | 6.2 |
| Mitoxantrone | 301739 | intercalation | 445 | -3.1 | 8 | 10 | 163 | 571 | 6.78 | 3 | 7.2 |
| Mithramycin A | 24559 | minor-groove | 1085 | -0.4 | 11 | 24 | 358 | 1940 | 5.08 | 1 | 7.9 |
| Netropsin | 3067 | minor-groove | 431 | -1.7 | 7 | 10 | 211 | 723 | 6.40 | 2 | 4.0 |
a Mw: Molecular weight, XlogP: partition coefficient that measures the differential solubility of a compound in two solvents, HbD: number of hydrogen bond donors in the structure, HbA: number of hydrogen bond acceptors, PSA: polar surface area (in Å2). Complexity: a rough estimate of how complicated a structure is. Both the elements contained and the displayed structural features including symmetry are considered. log Keq: logarithmic-transformed equilibrium binding constant for a drug-DNA complex (Keq in M-1). Lipinski: Lipinski's score, the rule-of-five value used to measure bioavailability. GI50 is used in place of -log(GI50), the negative logarithm of the average drug concentration that inhibits cell growth in the NCI-60 cell lines (August 2008 data) as a measure of cytotoxicity or cytostasis.
Figure 1Molecular formulae of six of the noncovalent DNA-binding drugs used in the present study. The drugs displayed are characterized by their high activity in the NCI's tumor screening panel.
Results of the Shapiro-Wilk normality test(a)
| MW | 0.827 | 15 | 0.008 |
| XlogP | 0.951 | 15 | 0.542 |
| HbD | 0.767 | 15 | 0.001 |
| HbA | 0.933 | 15 | 0.298 |
| PSA | 0.862 | 15 | 0.026 |
| Complexity | 0.873 | 15 | 0.038 |
| logKeq | 0.915 | 15 | 0.163 |
| Lipinski | 0.847 | 15 | 0.016 |
| GI50 | 0.960 | 15 | 0.697 |
(a) Molecular descriptors were considered to pass the normality test if p > 0.01
Calculated Pearson and Spearman's rgcorrelation coefficients between each molecular descriptor and log Keq.
| Mw | -0.396 | 7.22 × 10-2 | -0.177 | 2.64 × 10-1 | -0.096 | 3.96 × 10-1 | 0.073 | 4.20 × 10-1 | -0.909 | 8.72 × 10-4 | -0.857 | 3.27 × 10-3 |
| XlogP | -0.461 | 4.20 × 10-2 | -0.388 | 7.66 × 10-2 | -0.662 | 1.85 × 10-2 | -0.699 | 1.20 × 10-2 | -0.118 | 3.91 × 10-1 | 0.000 | 5.00 × 10-1 |
| HbD | 0.047 | 4.34 × 10-1 | 0.252 | 1.83 × 10-1 | 0.189 | 3.01 × 10-1 | 0.636 | 2.40 × 10-2 | -0.368 | 1.85 × 10-1 | -0.258 | 2.69 × 10-1 |
| HbA | -0.272 | 1.63 × 10-1 | -0.244 | 1.91 × 10-1 | 0.143 | 3.50 × 10-1 | 0.049 | 4.47 × 10-1 | -0.955 | 1.11 × 10-4 | -0.976 | 1.66 × 10-5 |
| PSA | -0.151 | 2.96 × 10-1 | -0.211 | 2.25 × 10-1 | 0.079 | 4.10 × 10-1 | 0.141 | 3.50 × 10-1 | -0.950 | 1.47 × 10-4 | -0.994 | 1.00 × 10-6 |
| Complexity | -0.373 | 8.56 × 10-2 | -0.252 | 1.82 × 10-1 | -0.088 | 4.00 × 10-1 | -0.067 | 4.30 × 10-1 | -0.844 | 4.18 × 10-3 | -0.857 | 3.27 × 10-3 |
a Significance level (actual p values)
Equations used to predict logKeq values for DNA-binding drugsa.
| All drugs | 0.603 | 6.6 × 10-2 | 0.258 | |
| All drugs | 0.461 | 8.4 × 10-2 | 0.152 | |
| Intercalators | 0.662 | 3.7 × 10-2 | 0.368 | |
| 'M-region' | 0.984 | 2.0 × 10-3 | 0.944 | |
| 'M-region' | 0.955 | 2.2 × 10-4 | 0.897 |
The predictive equations are presented for the three sets of drugs analyzed (All DNA-binding drugs, Intercalators and 'M-region' compounds) described in the main textb.
aObtained by multiple regression analysis, in which molecular descriptors showing multicollinearity were discarded (see the main text for details). The predictive equations displayed are those statistically "more significant" for each set of predictors (actual p values, ANOVA test, are shown in the Table), r is the correlation coefficient of the linear fit, AdR2 is the fraction of the variance in logKeq that is explained (predicted) by the model, corrected for the number the variables in the model, as described in Methods.
bThe cases (drugs) used in the calculations for each set were 15, 10 and 8 respectively.
Figure 2Common molecular descriptors and noncovalent binding to DNA. (A) Predicted logKeq values obtained by multiple regression analysis using molecular descriptors (Keq (cal)) are plotted together with experimentally calculated values (Keq (cal)) for the complete set of drugs, intercalators and 'M region' compounds respectively. Equations used to calculate logKeq are shown in Table 4. ACT (actinomycin D), BLEO (bleomycin), CHAR (chartreusin), CRO (chromomycin), DAU (daunorubicin), DIST (distamycin), DOXO (doxorubicin), ECH (echinomycin), ELSA (elsamicin A), EPI (epirubicin), ETH (ethidium), AMSA (m-AMSA), MTA (mithramycin A), NETR (netropsin). (B) Dendogram showing average linkage hierarchical clustering of six molecular descriptors for noncovalent DNA-binding drugs, based on the Pearson correlation coefficients. Descriptors with higher similarity are clustered together. (C) Hierarchical clustering applied to the 15 drugs binding reversibly to DNA (Table 1) on the basis of their proximities. Connecting lines further to the right indicate more distance between clusters of either molecular descriptors (B) or drugs (C).
Figure 3Principal component analysis of molecular descriptors for noncovalent DNA-binding drugs. Two-dimensional representation of loading values and factor scores on principal components 1 and 2 are shown in a rotated space. Component 1 may be labeled "molecular size" while component 2 would be "hydrophilicity-hydrophobicity", with XlogP clearly loading in the hydrophobic part of the axis. Loadings are displayed side-by-side with a representation of the factor scores for the different drugs on the two principal components. Graphical representations correspond to the analysis of all fifteen noncovalent DNA-binding drugs (A) (B), intercalators (C) (D), and the 'M-region' compounds (E) (F), respectively.
Equations used to predict GI50 values for DNA-binding drugsa.
| All drugsc | 0.713 | 1.4 × 10-2 | 0.426 | |
| Intercalators | 0.894 | 1.6 × 10-2 | 0.700 | |
| 'M-region' | 0.984 | 1.8 × 10-4 | 0.955 |
The predictive equations are presented for the three sets of drugs analyzed (All 15 drugs, Intercalators and 'M-region' compounds) described in the main textb.
a Obtained by multiple regression analysis. The predictive equations displayed are those statistically more significant for each set of predictors (actual p values, ANOVA test, are shown in the Table). Other details as in legend to Table 4.
bThe cases (drugs) used in the calculations for each set were 15, 10 and 8 respectively.
clogKeq was not a significant predictor for GI50 for the set that contains the 15 (all) drugs.
Figure 4Molecular descriptors and cell growing inhibition (GI. (A) Plot comparing the GI50, retrieved from the NCI-60 cell lines (Table 1) and the values calculated by multiple regression analysis (GI50 (obs)). Equations used to calculate GI50are shown in Table 5. The plot corresponds to the more active 'M-region' compounds. (B) Dendrograms showing a hierarchical clustering of all DNA-binding drugs, which takes into account all the descriptors, including logKeq and Lipinski's scores. (C) Principal component analysis of molecular descriptors plus logKeq and Lipinski's scores for all the DNA-binding drugs, shown in a rotated space; two-dimensional representation of loading values are shown (C), and the drugs represented according to their factor scores in principal component analysis (D).