| Literature DB >> 25505989 |
Ying-Hsin Chang1, Jun-Yan Chen2, Chiou-Yi Hor3, Yu-Chung Chuang4, Chang-Biau Yang3, Chia-Ning Yang5.
Abstract
Human estrogen receptor (ER) isoforms, ERα and ERβ, have long been an important focus in the field of biology. To better understand the structural features associated with the binding of ERα ligands to ERα and modulate their function, several QSAR models, including CoMFA, CoMSIA, SVR, and LR methods, have been employed to predict the inhibitory activity of 68 raloxifene derivatives. In the SVR and LR modeling, 11 descriptors were selected through feature ranking and sequential feature addition/deletion to generate equations to predict the inhibitory activity toward ERα. Among four descriptors that constantly appear in various generated equations, two agree with CoMFA and CoMSIA steric fields and another two can be correlated to a calculated electrostatic potential of ERα.Entities:
Year: 2013 PMID: 25505989 PMCID: PMC4245501 DOI: 10.1155/2013/743139
Source DB: PubMed Journal: Int J Med Chem ISSN: 2090-2077
Structures, experimental activities presented in IC50 and pIC50 values, and predicted pIC50 values by different modeling approaches.
|
|
| |||||||
| 1–54 | 55–68 | |||||||
|
| ||||||||
| No. | Substituents | IC50 (nM) | pIC50 | Predicted | ||||
| R | R′ | CoMFA | CoMSIA | LR | SVM | |||
|
| ||||||||
| 1 | H | H | 300 | 6.52 | 7.17 | 6.72 | 6.88 | 6.39 |
| 2 | H | 4′-OH | 35 | 7.46 | 7.33 | 6.98 | 7.71 | 7.46 |
| 3∗ | H | 4′-OMe | 100 | 7.00 |
|
| 7.00 | 6.70 |
| 4 | 6-C | 4′-OH | 20 | 7.70 | 7.64 | 7.03 | 7.23 | 7.15 |
| 5∗ | 6-CO2Me | 4′-OH | 30 | 7.52 |
|
| 6.17 | 6.58 |
| 6 | 6-COMe | 4′-OH | 60 | 7.22 | 7.28 | 7.49 | 6.81 | 6.72 |
| 7 | 6-OMe | 4′-OH | 250 | 6.60 | 6.83 | 7.02 | 7.38 | 7.38 |
| 8 | 6-Me | 4′-OH | 300 | 6.52 | 7.05 | 6.81 | 7.25 | 7.33 |
| 9∗∗ | 6-Cl | 4′-OH | 1000 | 6.00 | 7.22 | 7.39 | ||
| 10 | 6-CONH2 | 4′-OH | 1000 | 6.00 | 6.10 | 5.67 | 7.46 | 7.62 |
| 11 | 5-F, 6-OH | 4′-OH | 3 | 8.52 | 8.27 | 8.32 | 7.85 | 7.90 |
| 12 | 5-OH | 4′-OH | 100 | 7.00 | 7.03 | 6.45 | 7.15 | 7.22 |
| 13∗ | 4,7-di(Me),6-OH | 4′-OH | 100 | 7.00 |
|
| 6.72 | 6.74 |
| 14 | 4-OH | 4′-OH | 190 | 6.72 | 6.68 | 6.16 | 7.14 | 6.89 |
| 15 | 7-OH | 4′-OH | 300 | 6.52 | 6.99 | 7.07 | 7.63 | 7.47 |
| 16 | 4,6-di(OH) | 4′-OH | 350 | 6.46 | 6.71 | 6.72 | 7.98 | 8.03 |
| 17 | 5,6-di(OH) | 4′-OH | 400 | 6.40 | 6.33 | 6.74 | 6.73 | 6.75 |
| 18 | 5,7-di(Me),6-OH | 4′-OH | 500 | 6.30 | 6.38 | 6.46 | 7.35 | 7.62 |
| 19 | 4,5-Benzo,6-OH | 4′-OH | 500 | 6.30 | 5.97 | 5.98 | 6.78 | 7.55 |
| 20 | 6-OMe | 4′-OMe | 300 | 6.52 | 6.56 | 6.82 | 6.61 | 6.30 |
| 21 | 5,6,7-tri(OMe) | 4′-OMe | 350 | 6.46 | 6.67 | 6.25 | 5.74 | 5.97 |
| 22 | 6-OMe | 3′,4′-OCH2O | 500 | 6.30 | 6.72 | 7.14 | 6.29 | 6.15 |
| 23 | 6-OMe | 4′-CH2OH | 600 | 6.22 | 5.87 | 5.86 | 6.68 | 6.40 |
| 24 | 6-OH | H | 2.5 | 8.60 | 7.92 | 8.19 | 7.94 | 8.24 |
| 25∗ | 6-OH | 4′-OH | 0.2 | 9.70 |
|
| 7.90 | 8.21 |
| 26 | 6-OH | 4′-C | 0.8 | 9.10 | 7.83 | 8.11 | 7.81 | 8.20 |
| 27 | 6-OH | 4′-Cl | 1 | 9.00 | 8.10 | 8.25 | 7.41 | 7.70 |
| 28 | 6-OH | 4′-F | 2.3 | 8.64 | 7.71 | 8.26 | 8.08 | 8.19 |
| 29 | 6-OH | 4′-Et | 5 | 8.30 | 7.81 | 7.84 | 7.86 | 7.94 |
| 30 | 6-OH | 4′-CH=CH2 | 7 | 8.15 | 7.43 | 7.97 | 7.77 | 7.84 |
| 31 | 6-OH | 4′-n-Bu | 10 | 8.00 | 8.22 | 7.79 | 7.84 | 7.68 |
| 32∗ | 6-OH | 4′-i-Pr | 30 | 7.52 |
|
| 7.65 | 7.62 |
| 33 | 6-OH | 4′-Me | 50 | 7.30 | 7.93 | 7.98 | 7.44 | 7.41 |
| 34 | 6-OH | 4′-Ph | 100 | 7.00 | 7.25 | 7.72 | 6.80 | 7.04 |
| 35 | 6-OH | 4′-CH2SEt | 100 | 7.00 | 7.25 | 7.30 | 7.49 | 7.06 |
| 36 | 6-OH | 4′-NO2 | 500 | 6.30 | 6.65 | 6.36 | 6.80 | 6.55 |
| 37∗∗ | 6-OH | 4′-OMe | 1000 | 6.00 | 7.83 | 7.97 | ||
| 38∗ | 6-OH | 4′-CONMe2 | 20 | 7.70 |
|
| 6.67 | 7.43 |
| 39 | 6-OH | 4′-COMe | 32 | 7.49 | 7.19 | 7.82 | 6.95 | 7.02 |
| 40 | 6-OH | 4′-CON(H)Me | 40 | 7.40 | 6.87 | 7.29 | 7.12 | 7.89 |
| 41∗ | 6-OH | 4′-CO2Me | 50 | 7.30 |
|
| 7.03 | 6.79 |
| 42 | 6-OH | 4′-CO2Et | 50 | 7.30 | 7.16 | 7.30 | 7.77 | 7.67 |
| 43 | 6-OH | 4′-CONH2 | 200 | 6.70 | 6.95 | 7.18 | 7.42 | 7.78 |
| 44 | 6-OH | 4′-CO2H | 325 | 6.49 | 7.08 | 6.78 | 6.92 | 6.79 |
| 45 | 6-OH | 3′-F, 4′-OH | 0.3 | 9.52 | 9.10 | 8.78 | 8.47 | 8.30 |
| 46 | 6-OH | 2′-Me | 0.7 | 9.15 | 8.85 | 8.37 | 8.20 | 8.25 |
| 47 | 6-OH | 3′-Me, 4′-OH | 1 | 9.00 | 9.31 | 9.03 | 8.41 | 8.39 |
| 48 | 6-OH | 2′-Me, 4′-OH | 2 | 8.70 | 9.29 | 9.05 | 8.46 | 8.40 |
| 49 | 6-OH | 2′-OMe, 4′-OH | 2 | 8.70 | 8.66 | 8.81 | 9.65 | 9.73 |
| 50 | 6-OH | 3′-Cl, 4′-OH | 2.3 | 8.64 | 8.98 | 9.17 | 8.55 | 9.01 |
| 51 | 6-OH | 3′-F | 2.5 | 8.60 | 8.36 | 8.52 | 8.51 | 8.57 |
| 52 | 6-OH | 3′-OH | 3.2 | 8.49 | 8.39 | 8.39 | 8.32 | 8.29 |
| 53 | 6-OH | 2′-OH | 10 | 8.00 | 8.55 | 8.39 | 8.41 | 8.39 |
| 54 | 6-OH | 3′,5′-Di(Me), 4′-OH | 100 | 7.00 | 6.98 | 7.27 | 6.58 | 6.01 |
| 55 | 6-OH | 1′-Naphthyl | 0.8 | 9.10 | 9.39 | 8.70 | 8.74 | 8.17 |
| 56 | 6-OH | 4′-OH-1′-Naphthyl | 2 | 8.70 | 8.74 | 8.70 | 8.60 | 8.35 |
| 57 | 6-OH |
| 2 | 8.70 | 8.76 | 8.46 | 8.28 | 8.71 |
| 58 | 6-OH | Cyclohexyl | 2.5 | 8.60 | 8.66 | 8.47 | 9.00 | 9.34 |
| 59 | 6-OH | Isopropyl | 3 | 8.52 | 8.32 | 8.48 | 7.85 | 7.26 |
| 60 | 6-OH | Cyclopentyl | 5 | 8.30 | 8.47 | 8.48 | 8.10 | 7.80 |
| 61∗ | 6-OH | 4′-Hydroxybenzyl | 5 | 8.30 |
|
| 8.41 | 8.15 |
| 62∗ | 6-OH | 3′-Thienyl | 10 | 8.00 |
|
| 7.47 | 7.35 |
| 63 | 6-OH | 2′-Thienyl | 20 | 7.70 | 7.64 | 7.96 | 7.19 | 7.24 |
| 64∗ | 6-OH | Ethyl | 20 | 7.70 |
|
| 8.00 | 8.36 |
| 65 | 6-OH | Methyl | 35 | 7.46 | 7.30 | 8.26 | 7.16 | 6.68 |
| 66 | 6-OH | 2′-Naphthyl | 80 | 7.10 | 7.09 | 6.95 | 8.81 | 8.70 |
| 67 | 6-OH | 4′-Pyridyl | 100 | 7.00 | 7.80 | 7.39 | 7.73 | 7.96 |
| 68 | 6-OH | 4′-Pyridyl N-oxide | 100 | 7.00 | 7.01 | 7.01 | 7.20 | 7.02 |
*Compounds included in test set of CoMFA and CoMSIA modeling.
∗∗Compounds not included in the training or test set of CoMFA and CoMSIA.
Figure 1(a) The alignment core used in this study. (b) The result of alignment using align database in Sybyl.
Statistical data of CoMFA and CoMSIA models on MCF-7 cell inhibitiona.
| CoMFA | CoMSIA | |
|---|---|---|
|
| 0.519 | 0.511 |
| ONC | 4 | 4 |
| SEE | 0.434 | 0.443 |
|
| 0.816 | 0.819 |
|
| 60.320 | 57.534 |
|
| ||
| Contribution fraction | ||
|
| ||
|
| 0.515 | 0.106 |
|
| 0.485 | 0.239 |
| HB donor | 0.442 | |
| HB acceptor | 0.212 | |
aAbbreviations used are as follows.
q 2: Leave-one-out cross-validated (LOOCV) correlation coefficient.
ONC: Optimum number of principal components.
r 2: Non-cross-validated correlation coefficient.
SEE: Standard error of estimate.
F: F-test value.
S: Steric field contribution fraction.
E: Electrostatic field contribution fraction.
HB donor: Hydrogen bond donor field contribution fraction.
HB acceptor: Hydrogen bond acceptor field contribution fraction.
Figure 2Comparison of actual versus predicted raloxifene inhibitory activity based on (a) CoMFA, (b) CoMSIA, (c) LR, and (d) SVM models. The diagonal in the four plots is the y = x line, whereas the dashed lines indicate the ±1 log point margins of error for analyses. The solid dots represent the modeling results on training set, whereas the open triangle points in CoMFA and CoMSIA plots represent the test sets.
The intercorrelations between the 11 selected features (descriptors) and the activity presented in pIC50 for the studied compoundsa.
|
|
|
|
|
|
|
|
|
|
|
| |
|---|---|---|---|---|---|---|---|---|---|---|---|
| pIC50 | −0.32 | 0.2 | −0.22 | −0.25 | −0.33 | 0.11 | −0.02 | 0.19 | 0.24 | −0.22 | 0.39 |
|
| 1 | ||||||||||
|
| −0.08 | 1 | |||||||||
|
| 0.33 | −0.01 | 1 | ||||||||
|
| 0.11 | −0.11 | −0.1 | 1 | |||||||
|
| 0.56 | −0.12 | 0.34 | 0.43 | 1 | ||||||
|
| 0.11 | 0.18 | 0.29 | −0.01 | 0.5 | 1 | |||||
|
| −0.28 | 0.26 | 0 | −0.2 | −0.13 | 0.43 | 1 | ||||
|
| −0.12 | −0.21 | 0.05 | 0.04 | 0.26 | 0.63 | 0.05 | 1 | |||
|
| −0.11 | −0.03 | −0.02 | 0.37 | 0.07 | −0.02 | 0.06 | −0.01 | 1 | ||
|
| 0.49 | −0.04 | 0.52 | 0.29 | 0.93 | 0.68 | −0.07 | 0.39 | 0.06 | 1 | |
|
| −0.43 | 0.2 | −0.12 | −0.35 | −0.52 | −0.25 | 0.09 | −0.07 | −0.06 | −0.47 | 1 |
aDescriptors used are
x 1: Complementary information content (CIC) (fast descriptors).
x 2: Estate keys (sums): S_ssCH2 (fast descriptors).
x 3: Estate keys (sums): S_aasC (fast descriptors).
x 4: Estate keys (sums): S_dO (fast descriptors).
x 5: Principal moment of inertia X (spatial descriptors).
x 6: Shadow area: ZX plane (spatial descriptors).
x 7: Shadow area fraction: YZ plane (spatial descriptors).
x 8: Shadow ratio (spatial descriptors).
x 9: Dipole moment Z (spatial descriptors).
x 10: SASA (jurs descriptors).
x 11: RPCS (jurs descriptors).
Figure 3Results obtained by modeling MCF-7 cell inhibitory activity based on 3D QSAR methods. (a) Superimposed steric fields determined by CoMFA (mesh) and CoMSIA (solid) methodologies, in which green contours indicate regions where a relatively bulky substituent would increase inhibitory activity, whereas the yellow contours indicate areas where a bulkier substituent would decrease inhibitory activity. Compound 55 is displayed in the background for reference. The Cartesian coordinate frame is given. (b) Superimposed electrostatic fields determined by CoMFA (mesh) and CoMSIA (solid) methodologies, in which blue contours indicate regions where a positively charged substituent would increase inhibitory activity, whereas the red contours indicate regions where a negatively charged substituent would increase inhibitory activity. Compound 25 is displayed in the background for reference. (c) Hydrogen bond donor field, in which a cyan region favors hydrogen bond donors while a purple region disfavors hydrogen bond donors. Compound 25 is displayed in the background for reference. (d) Hydrogen bond acceptor field, in which a pink region favors hydrogen bond acceptors, while a red region disfavors hydrogen bond acceptors. Compound 25 is displayed in the background for reference.
Figure 4Overlay of CoMSIA fields onto ERα binding cavity. (a) Steric, (b) electrostatic, and (c) hydrogen bond donor and (d) hydrogen bond acceptor fields. Color codes are the same as specified in Figure 3. Raloxifene is displayed in the background for reference.
The model equations generated by support vector regression and linear regression.
|
|
|
|
|
|
|
|
|
|
|
| Constant |
| ||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| LR models | ||||||||||||||
|
| ||||||||||||||
| Equation (1) | pIC50 = | +0.08 | −0.26 | −0.24 | −0.41 | +0.66 | −0.48 | +0.41 | +0.28 | +7.54 |
| |||
| Equation (2) | pIC50 = | +0.08 | −0.26 | −0.24 | −0.41 | +0.66 | −0.48 | +0.41 | −0.01 | +0.28 | +7.54 |
| ||
| Equation (3) | pIC50 = | −0.26 | −0.23 | −0.42 | +0.68 | −0.47 | +0.41 | +0.30 | +7.54 |
| ||||
| Equation (4) | pIC50 = | −0.26 | −0.23 | −0.42 | +0.68 | −0.47 | +0.41 | −0.01 | +0.30 | +7.54 |
| |||
| Equation (5) | pIC50 = | −0.28 | −0.22 | −0.47 | +0.77 | −0.51 | −0.11 | +0.41 | +0.03 | +0.31 | +7.54 |
| ||
| Equation (6) | pIC50 = | −0.01 | +0.08 | −0.25 | −0.24 | −0.39 | +0.67 | −0.48 | +0.41 | −0.03 | +0.28 | +7.54 |
| |
| Equation (7) | pIC50 = | −0.21 | +0.91 | −0.56 | +0.43 | −0.70 | +0.30 | +7.54 |
| |||||
| Equation (8) | pIC50 = | −0.26 | −0.23 | −0.41 | +0.69 | −0.47 | +0.41 | −0.03 | +0.30 | +7.54 |
| |||
| Equation (9) | pIC50 = | −0.23 | −0.56 | +0.84 | −0.51 | −0.12 | +0.34 | +0.34 | +7.54 |
| ||||
| Equation (10) | pIC50 = | −0.21 | −0.54 | +0.73 | −0.46 | +0.34 | +0.33 | +7.54 |
| |||||
|
| ||||||||||||||
| SVM models | ||||||||||||||
|
| ||||||||||||||
| Equation (1) | pIC50 = | −0.28 | −0.32 | −0.34 | +0.62 | −0.37 | +0.42 | +0.32 | +7.50 |
| ||||
| Equation (2) | pIC50 = | −0.30 | −0.51 | +0.84 | −0.48 | −0.19 | +0.33 | +0.38 | +7.57 |
| ||||
| Equation (3) | pIC50 = | +0.09 | −0.26 | −0.32 | +0.74 | −0.50 | −0.10 | +0.54 | −0.34 | +0.26 | +7.55 |
| ||
| Equation (4) | pIC50 = | −0.23 | −0.35 | +0.71 | −0.49 | +0.57 | −0.38 | +0.39 | +7.60 |
| ||||
| Equation (5) | pIC50 = | −0.08 | +0.06 | −0.21 | −0.31 | +0.67 | −0.45 | −0.14 | +0.43 | −0.23 | +0.25 | +7.56 |
| |
| Equation (6) | pIC50 = | +0.10 | −0.28 | −0.24 | −0.38 | +0.61 | −0.47 | −0.11 | +0.46 | +0.24 | +7.62 |
| ||
| Equation (7) | pIC50 = | −0.23 | −0.19 | +0.59 | −0.45 | −0.05 | +0.47 | −0.29 | +0.31 | +7.57 |
| |||
| Equation (8) | pIC50 = | +0.13 | −0.20 | −0.29 | +0.65 | −0.45 | +0.53 | −0.33 | +0.33 | +7.59 |
| |||
| Equation (9) | pIC50 = | +0.03 | −0.38 | −0.31 | −0.29 | +0.53 | −0.37 | +0.53 | +0.07 | +0.34 | +7.62 |
| ||
| Equation (10) | pIC50 = | +0.82 | −0.53 | +0.38 | −0.71 | +0.37 | +7.58 |
| ||||||
| Equation (11) | pIC50 = | −0.05 | −0.16 | −0.47 | +0.69 | −0.40 | +0.31 | +0.33 | +7.53 |
| ||||
| Equation (12) | pIC50 = | −0.28 | −0.54 | +0.67 | −0.40 | +0.42 | +0.02 | +0.34 | +7.62 |
| ||||
| Equation (13) | pIC50 = | −0.25 | +0.79 | −0.57 | +0.48 | −0.59 | +0.40 | +7.57 |
| |||||
| Equation (14) | pIC50 = | −0.24 | −0.48 | +0.73 | −0.43 | +0.40 | +0.30 | +7.55 |
| |||||
| Equation (15) | pIC50 = | −0.11 | +0.03 | −0.25 | −0.30 | −0.29 | +0.50 | −0.45 | +0.48 | +0.03 | +0.30 | +7.67 |
| |
Figure 5Electrostatic potential on the ERα protein surface around the active site of raloxifene (PDB code: 1ERR). Electronegative and electropositive charges are colored in red and blue, respectively.