| Literature DB >> 20957105 |
Zhizhong Wang1, Yan Li, Chunzhi Ai, Yonghua Wang.
Abstract
Over the years development of selective estrogen receptor (ER) ligands has been of great concern to researchers involved in the chemistry and pharmacology of anticancer drugs, resulting in numerous synthesized selective ER subtype inhibitors. In this work, a data set of 82 ER ligands with ERα and ERβ inhibitory activities was built, and quantitative structure-activity relationship (QSAR) methods based on the two linear (multiple linear regression, MLR, partial least squares regression, PLSR) and a nonlinear statistical method (Bayesian regularized neural network, BRNN) were applied to investigate the potential relationship of molecular structural features related to the activity and selectivity of these ligands. For ERα and ERβ, the performances of the MLR and PLSR models are superior to the BRNN model, giving more reasonable statistical properties (ERα: for MLR, R(tr) (2) = 0.72, Q(te) (2) = 0.63; for PLSR, R(tr) (2) = 0.92, Q(te) (2) = 0.84. ERβ: for MLR, R(tr) (2) = 0.75, Q(te) (2) = 0.75; for PLSR, R(tr) (2) = 0.98, Q(te) (2) = 0.80). The MLR method is also more powerful than other two methods for generating the subtype selectivity models, resulting in R(tr) (2) = 0.74 and Q(te) (2) = 0.80. In addition, the molecular docking method was also used to explore the possible binding modes of the ligands and a relationship between the 3D-binding modes and the 2D-molecular structural features of ligands was further explored. The results show that the binding affinity strength for both ERα and ERβ is more correlated with the atom fragment type, polarity, electronegativites and hydrophobicity. The substitutent in position 8 of the naphthalene or the quinoline plane and the space orientation of these two planes contribute the most to the subtype selectivity on the basis of similar hydrogen bond interactions between binding ligands and both ER subtypes. The QSAR models built together with the docking procedure should be of great advantage for screening and designing ER ligands with improved affinity and subtype selectivity property.Entities:
Keywords: QSAR; docking; receptor; selectivity
Mesh:
Substances:
Year: 2010 PMID: 20957105 PMCID: PMC2956105 DOI: 10.3390/ijms11093434
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
The SMILEs and pIC50 information of the compounds studied herein.
| NO. | SMILES | pIC50(α) | pIC50(β) | S |
|---|---|---|---|---|
| compound1 | OC1=CC=C(C2=CC(F)=C(C(Cl)=C(O)C=C3)C3=C2)C=C1 | 6.40 | 7.96 | 1.55 |
| compound2 | OC1=C(F)C=C(C2=CC=C(C=C(O)C=C3C#C)C3=C2)C=C1 | 6.14 | 7.92 | 1.78 |
| compound3 | OC1=C(F)C=C(C2=CC=C(C=C(O)C=C3F)C3=C2)C=C1 | 6.68 | 7.82 | 1.11 |
| compound4 | OC1=CC=C(C2=CC=C(C=C(O)C=C3C#N)C3=C2)C=C1 | 6.08 | 7.70 | 1.61 |
| compound5 | OC1=CC(F)=C(C2=CC=C(C=C(O)C=C3C#N)C3=C2)C(F)=C1 | 6.35 | 7.66 | 1.29 |
| compound6 | OC1=CC=C(C2=CC(C#N)=C(C=C(O)C=C3)C3=C2)C=C1 | 5.98 | 7.64 | 1.65 |
| compound7 | OC1=CC=C(C2=CC(CC)=C(C=C(O)C=C3)C3=C2)C=C1F | 5.95 | 7.60 | 1.65 |
| compound8 | OC1=CC=C(C2=CC(C#N)=C(C=C(O)C=C3)C3=C2)C=C1F | 5.68 | 7.57 | 1.89 |
| compound9 | OC1=CC=C(C2=CC=C3C(Cl)=C(O)C=CC3=C2)C(Cl)=C1 | 6.44 | 7.48 | 1.00 |
| compound10 | BrC2=CC(C3=CC=C(O)C(F)=C3)=NC1=CC=C(O)C=C12 | 5.55 | 7.47 | 1.92 |
| compound11 | BrC2=CC(C3=CC=C(O)C=C3)=NC1=CC=C(O)C=C12 | 5.67 | 7.37 | 1.68 |
| compound12 | ClC2=CC(C3=CC=C(O)C=C3)=NC1=CC=C(O)C=C12 | 5.67 | 7.34 | 1.66 |
| compound13 | ClC2=CC(C3=CC=C(O)C(F)=C3)=NC1=CC=C(O)C=C12 | 5.61 | 7.28 | 1.66 |
| compound14 | OC3=CC=C(C=C3F)C2=CC=C(C1=C2)C(C)=C(C=C1C#N)O | 5.39 | 7.22 | 1.82 |
| compound15 | OC3=CC=C(C=C3)C2=CC=C1C(F)=C(C=CC1=C2)O | 6.11 | 7.15 | 1.00 |
| compound16 | OC3=C(F)C=C(C=C3F)C2=CC=C1C=C(C=CC1=C2)O | 6.04 | 7.08 | 1.00 |
| compound17 | OC1=C(C=C(C3=CC=C2C=C(O)C=C(C2=C3)C=O)C=C1)F | 6.14 | 7.96 | 1.82 |
| compound18 | OC1=CC=C(C2=CC=C3C=C(O)C=CC3=C2)C(Cl)=C1 | 7.00 | 7.85 | 0.79 |
| compound19 | OC3=CC=C(C=C3)C2=CC(F)=C1C=C(C=CC1=C2)O | 6.66 | 7.80 | 1.11 |
| compound20 | OC3=C(F)C=C(C=C3)C2=CC=C1C=C(C=C(C#N)C1=C2)O | 6.02 | 7.68 | 1.65 |
| compound21 | OC3=CC=C(C=C3)C2=CC(Cl)=C1C=C(C=CC1=C2)O | 6.52 | 7.64 | 1.08 |
| compound22 | OC3=CC=C(C=C3F)C2=CC(CC)=C1C=C(C=CC1=C2)O | 5.63 | 7.62 | 1.99 |
| compound23 | OC3=CC=C(C=C3)C2=CC=C1C(Cl)=C(C=CC1=C2)O | 6.04 | 7.60 | 1.55 |
| compound24 | OC3=C(F)C=C(C(F)=C3)C2=CC=C1C=C(C=CC1=C2)O | 6.57 | 7.55 | 0.94 |
| compound25 | OC3=CC(F)=C(C(F)=C3)C2=CC=C1C(Cl)=C(C=CC1=C2)O | 6.46 | 7.47 | 0.97 |
| compound26 | OC3=CC=C(C=C3F)C2=CC=C1C(Cl)=C(C=CC1=C2)O | 5.84 | 7.40 | 1.54 |
| compound27 | OC3=C(F)C=C(C=C3)C2=CC=C1C(Br)=C(C=C(C#N)C1=C2)O | 5.94 | 7.35 | 1.39 |
| compound28 | OC3=CC=C(C=C3F)C2=CC=C1C=C(C=CC1=C2)O | 6.04 | 7.30 | 1.24 |
| compound29 | OC3=C(F)C=C(C=C3)C2=CC=C1C=C(C=C(C#CC)C1=C2)O | 5.74 | 7.26 | 1.50 |
| compound30 | OC3=C(F)C=C(C(F)=C3)C2=CC(C#N)=C1C=C(C=CC1=C2)O | 5.73 | 7.16 | 1.41 |
| compound31 | OC3=C(F)C=C(C=C3)C2=CC=C1C=C(C=C(C=C)C1=C2)O | 5.28 | 7.14 | 1.85 |
| compound32 | OC3=C(F)C=C(C(F)=C3)C2=CC=C1C(Cl)=C(C=CC1=C2)O | 5.93 | 7.07 | 1.11 |
| compound33 | OC3=CC=C(C(C)=C3)C2=CC=C1C=C(C=CC1=C2)O | 6.40 | 7.00 | 0.48 |
| compound34 | OC3=C(F)C=C(C=C3F)C2=CC=C1C(Cl)=C(C=CC1=C2)O | 5.28 | 6.97 | 1.68 |
| compound35 | OC3=CC=C(C=C3)C2=CC(C#N)=C1C(Br)=C(C=CC1=C2)O | 5.88 | 6.92 | 1.00 |
| compound36 | OC3=CC=C(C=C3)C2=CC=C1C=C(C=CC1=C2)O | 5.68 | 6.79 | 1.08 |
| compound37 | OC1=CC=C2C(C(C#N)=CC(C3=CC=C(O)C(F)=C3)=N2)=C1 | 4.98 | 6.64 | 1.65 |
| compound38 | OC3=CC=C(C=C3Cl)C2=CC=C1C(Cl)=C(C=CC1=C2)O | 5.45 | 6.49 | 1.01 |
| compound39 | OC1=CC=C2C(C(C=C)=CC(C3=CC=C(O)C(F)=C3)=N2)=C1 | 5.41 | 6.36 | 0.89 |
| compound40 | OC3=C(F)C=C(C=C3F)C2=CC(C#N)=C1C=C(C=CC1=C2)O | 5.26 | 6.24 | 0.93 |
| compound41 | OC1=CC=C2C(C(C#C)=CC(C3=CC=C(O)C=C3)=N2)=C1 | 4.82 | 6.12 | 1.28 |
| compound42 | OC1=CC=C2C(C=CC(C3=CC=C(O)C=C3)=N2)=C1Br | 4.94 | 6.06 | 1.08 |
| compound43 | OC1=CC=C2C(C=CC(C3=CC=C(O)C=C3)=N2)=C1 | 4.75 | 5.77 | 0.97 |
| compound44 | OC1=CC=CC2=CC(C3=CC=CC(O)=C3)=CC=C12 | 4.84 | 5.69 | 0.78 |
| compound45 | OC1=CC=C2C(C(C(C)=O)=CC(C3=CC=C(O)C=C3)=N2)=C1 | 4.50 | 5.66 | 1.12 |
| compound46 | OC(C=CC2=C3)=CC2=CC=C3C1=CC=CC=C1 | 4.87 | 5.43 | 0.41 |
| compound47 | OC(C=CC2=C3)=CC2=C(C#CC)C=C3C1=CC=C(O)C(F)=C1 | 5.46 | 7.00 | 1.52 |
| compound48 | OC(C=CC2=C3)=C(Cl)C2=C(C#N)C=C3C1=CC=C(O)C(F)=C1 | 5.52 | 6.96 | 1.42 |
| compound49 | OC(C=CC2=C3)=C(Br)C2=CC=C3C1=CC=C(O)C=C1 | 5.58 | 6.89 | 1.29 |
| compound50 | OC(C=CC2=C3)=C(C)C2=CC=C3C1=CC=C(O)C=C1 | 5.55 | 6.77 | 1.19 |
| compound51 | OC1=CC=C2C(C=CC(C3=CC=C(O)C=C3)=N2)=C1 | 5.20 | 6.52 | 1.30 |
| compound52 | OC1=CC=C2C(C(Br)=CC(C3=CC(F)=C(O)C(F)=C3)=N2)=C1 | 5.11 | 6.44 | 1.32 |
| compound53 | OC1=CC=C2C(C(CC)=CC(C3=CC=C(O)C=C3)=N2)=C1 | 5.20 | 6.28 | 1.05 |
| compound54 | OC1=CC=C2C(C(C=C)=CC(C3=CC=C(O)C=C3)=N2)=C1 | 5.30 | 6.22 | 0.87 |
| compound55 | OC1=CC=C2C(C(CC)=CC(C3=CC=C(O)C(F)=C3)=N2)=C1 | 4.76 | 6.10 | 1.33 |
| compound56 | OC(C=CC2=C3)=C(OC)C2=CC=C3C1=CC=C(O)C=C1 | 5.05 | 5.94 | 0.83 |
| compound57 | OC(C=CC2=C3)=C( [N+]( [O−])=O)C2=CC=C3C1=CC=C(O)C=C1 | 5.15 | 5.70 | 0.41 |
| compound58 | OC1=CC=C2C(C(C4=CC=CC=C4)=CC(C3=CC=C(O)C(F)=C3)=N2)=C1 | 4.74 | 5.68 | 0.88 |
| compound59 | OC3=CC=C(C=C3)C2=CC=C1C=CC=CC1=C2 | 5.20 | 5.61 | 0.21 |
| compound60 | OC1=CC(C3=CC=C2C=CC(O)=CC2=C3)=CC=C1 | 4.58 | 5.25 | 0.56 |
| compound61 | OC3=CC=C(C=C3)C2=CC=C1C(C4=CC=CC=C4)=C(O)C=CC1=C2 | 4.91 | 5.13 | −0.19 |
| compound62 | OC1=CC=C2C(C(OC)=CC(C3=CC=C(O)C=C3)=N2)=C1 | 4.18 | 4.92 | 0.66 |
| compound63 | OC1=CC=C2C(C(C(O)C)=CC(C3=CC=C(O)C(O)=C3)=N2)=C1 | 4.30 | 4.30 | - |
| --mpound64 | OC3=CC=C(C(F)=C3)C2=CC=C1C(Cl)=C(O)C=CC1=C2 | 6.24 | 7.92 | 1.68 |
| compound65 | OC3=CC(F)=C(C(F)=C3)C2=CC=C1C=C(O)C=CC1=C2 | 6.99 | 7.64 | 0.54 |
| compound66 | OC3=CC=C(C=C3)C2=CC=C1C(Cl)=C(O)C=C(C#N)C1=C2 | 6.01 | 7.52 | 1.50 |
| compound67 | OC3=CC=C(C=C3F)C2=CC(C=C)=C1C=C(O)C=CC1=C2 | 5.60 | 7.36 | 1.75 |
| compound68 | OC3=CC=C(C=C3)C2=CC(C#N)=C1C(Cl)=C(O)C=CC1=C2 | 5.96 | 7.22 | 1.23 |
| compound69 | OC3=CC=C(C=C3Cl)C2=CC=C1C=C(O)C=CC1=C2 | 5.97 | 6.96 | 0.94 |
| compound70 | OC3=CC=C(C(OC)=C3)C2=CC=C1C=C(O)C=CC1=C2 | 5.76 | 6.57 | 0.74 |
| compound71 | OC1=CC=C2C(C(C(C)=O)=CC(C3=CC=C(O)C(F)=C3)=N2)=C1 | 4.47 | 6.03 | 1.55 |
| compound72 | OC1=CC=C2C(C(C#C)=CC(C3=CC(F)=C(O)C(F)=C3)=N2)=C1 | 4.32 | 5.12 | 0.73 |
| compound73 | OC(C=CC2=C3)=CC2=CC=C3C1=CC=CC=C1O | 4.30 | 4.70 | 0.18 |
| compound74 | OC(C=CC2=C3)=CC2=CC=C3C1=CC=C(O)C=C1F | 6.62 | 7.70 | 1.04 |
| compound75 | OC(C=C(CC)C2=C3)=CC2=CC=C3C1=CC(F)=C(O)C=C1 | 5.95 | 7.60 | 1.65 |
| compound76 | OC(C=CC2=C3)=CC2=C(C=O)C=C3C1=CC=C(O)C(F)=C1 | 5.64 | 7.47 | 1.83 |
| compound77 | OC(C=CC2=C3)=C(F)C2=C(C#N)C=C3C1=CC=C(O)C(F)=C1 | 5.51 | 7.25 | 1.74 |
| compound78 | OC(C=CC2=C3)=CC2=C(C#C)C=C3C1=CC=C(O)C(F)=C1 | 5.61 | 7.20 | 1.58 |
| compound79 | OC(C=CC2=C3)=C(Cl)C2=CC=C3C1=CC=C(O)C=C1C | 6.40 | 6.89 | 0.32 |
| compound80 | OC1=CC=C2C(C(C#N)=CC(C3=CC=C(O)C=C3)=N2)=C1 | 5.34 | 6.55 | 1.18 |
| compound81 | OC1=CC2=CC(C3=CC=CC=C3)=CC=C2C=C1 | 4.47 | 5.28 | 0.74 |
| compound82 | OC1=CC(O)=CC2=C1C(C(C3=CC=C(O)C=C3)=CO2)=O | 5.40 | 7.01 | 1.60 |
The crystals used in the docking process and 2D structures of their co-crystallized ligand.
| Crystal | ligand | Crystal | ligand |
|---|---|---|---|
Figure 1The distribution the 82 compounds in the 5 × 5 top-map of the Kohonen neural network: (A) is for the ERα set and (B) is for the ERβ set. (C) is the distribution of 81 compounds in the 4 × 4 top-map Kohonen neural network for the Selectivity set. Those numbers with grey circle background are compounds of the test set, while the others are the ones of the training set. Numbers in blue rectangles are compounds further split for the validation of the BRNN models.
Figure 2Experimental vs predicted pIC50 values of ligands for the ER alpha (left), ER beta (middle) and experimental vs predicted S values of ligands for Selectivity (right) by the MLR models (A) for the training and test sets, by the PLSR models (the pIC50 and descriptor values were normalized) (B). (C) Experimental and predicted values by Baysian-regularized neural network for the training, validation and independent test sets for ER alpha (left), ER beta (middle), and the Selectivity (right). The empty circles represent the outliers present.
Figure 3Trend of the statistical results of the PLSR models with vary latent factors based on the data sets for ER alpha (A), ER beta (B) and Selectivity (C).
Figure 4(A) Superpositon of Docking conformations of Compound 18 in 1X7R (green) and 1X7E (cyans).The interacting modes of compound18 with 1X7R (B) and 1X7E (C). Compound18 and the important residues for binding interaction are represented by stick and line models, respectively. The magentas dash lines denote the hydrogen bonds.
Figure 5Compound17 and the potent interacting residues docking into 1QKM, 1X78, 1YY4 and 1YYE in sequence (A–D). The magenta dashed lines denote the hydrogen bonds.
The statistical results of the BRNN models.
| Data set | A | B | Rtraining | Rvalidation | Rtest | SSEtraining | SSEvalidation | SSEtest |
|---|---|---|---|---|---|---|---|---|
| 5 | 5 | 0.87 | 0.76 | 0.73 | 0.19 | 0.09 | 0.10 | |
| 5 | 11 | 0.91 | 0.70 | 0.74 | 0.29 | 0.14 | 0.15 | |
| 5 | 14 | 0.81 | 0.65 | 0.77 | 0.009 | 0.005 | 0.005 |
A: represents the number of hidden neurons. B: represents the number of input neurons for PCs. SSE is abbreviation of Sum Squared error.
Figure 6(A) Overlay of binding conformations of compound10 docking into 1QKM(ERβ) vs. 1X7R(ERα) (left) and 1X78(ERβ) vs. 1X7E(ERα) (right). (B) Overlay of binding conformations of compound22 docking into 1QKM(ERβ) vs. 1X7R(ERα) (left) and 1X78(ERβ) vs. 1X7E(ERα) (right). 1QKM and 1X78 were colored green, while 1X7R and1X7E colored cyan. The magenta dashed lines denote the hydrogen bonds.
Summary of the RMSD information when the cocrystalized ligand redocked into the corresponding crystals.
| Crystal | AVG_RMSD | SD_RMSD | MAX_RMSD/NO. of pose | MIN_RMSD/NO. of pose |
|---|---|---|---|---|
| 0.66 | 0.18 | 0.94/7th | 0.46/10th | |
| 0.32 | 0.03 | 0.36/9th, 10th | 0.27/5th | |
| 0.39 | 0.06 | 0.47/1th, 3th | 0.32/8th | |
| 0.53 | 0.32 | 1.04/5th,7th | 0.14/1th | |
| 0.63 | 0.30 | 1.01/7th | 0.14/6th | |
| 0.77 | 0.54 | 1.81/7th | 0.34/1th |
The numbers here match along with the ten plausible poses ranking with the docking score descending order.