| Literature DB >> 32042954 |
Muhammad Tukur Ibrahim1, Adamu Uzairu1, Gideon Adamu Shallangwa1, Sani Uba1.
Abstract
In-silico activity prediction was performed to predict new inhibitory activities of 2, 9-disubstituted 8-phenylthio/phenylsulfinyl-9h-purine derivatives as anti-proliferative agents using QSAR technique. The anti-proliferative agents were optimized using Density Functional Theory (DFT) method utilizing the B3LYP/6-31G* level of theory. Genetic Function Algorithm (GFA) was used to build the QSAR models. Out of the models built, the best one was selected and reported because of its fitness statistically with the following assessment parameters: R2 trng = 0.919035, R2 adj = 0.893733, Q2 cv = 0.866475, R2 test = 0.636217, and LOF = 0.215884. The selected model was further subjected to other assessment such as VIF, Y-scrambling test, applicability domain and found to be statistically significant. The binding mode of some selected 2, 9-disubstituted 8-phenylthio/phenylsulfinyl-9H-purine (ligands) in the active site of EGFR-tyrosine kinase (EGFR-TK) (receptor) was studied via Molecular docking. Molecule 22 was identified to have the highest binding energy (-10.4 kcal/mol) among the other selected ligands which it might be as a result of hydrogen interactions formed with MET793 (2.48599 Å, 2.04522 Å) & THR854 (3.76616 Å) amino acid residues and hydrophobic/other interactions with amino acid residues (LEU718, LEU844, MET766, VAL726, ALA743, LYS745 and MET790) in the active site of EGFR-tyrosine kinase (EGFR-TK). The drug-likeness of these selected anti-proliferative agents were predicted via the pharmacokinetics profile of the molecules utilizing SWISS ADME. The anti-proliferative agents were found to be orally safe by not having more than 1 violation of the Lipinski's rule of five. This research proposed a way for designing potent anti-proliferative agents against their target enzyme.Entities:
Keywords: ADME; B3LYP/6-31G*; DFT; EGFR; NSCLC; Physical chemistry; Theoretical chemistry
Year: 2020 PMID: 32042954 PMCID: PMC7002806 DOI: 10.1016/j.heliyon.2020.e03158
Source DB: PubMed Journal: Heliyon ISSN: 2405-8440
The Molecular formula, pIC50, Predicted pIC50, the residual values and binding energy for the studied molecules.
| S/No | Molecular formula | pIC50 | Predicted pIC50 | Residuals | Binding energy (kcal/mol) |
|---|---|---|---|---|---|
| 1 | C25H29N7S | 6.537752 | 6.675639 | -0.13789 | -9.3 |
| 2 | C25H27N7S | 6.516413 | 6.369865 | 0.146548 | -9.5 |
| 3 | C26H29N7S | 6.180522 | 6.347182 | -0.16666 | -9.4 |
| 4 | C27H31N7S | 7.29073 | 7.213299 | 0.077431 | -9.9 |
| 5 | C27H31N7OS | 7.37059 | 7.429651 | -0.05906 | -9.1 |
| 6 | C28H33N7OS | 7.319664 | 7.329735 | -0.01007 | -9.1 |
| 7 | C26H30N6O2S | 7.353596 | 7.636959 | -0.28336 | -9.6 |
| 8 | C27H33N7OS | 6.302074 | 6.453033 | -0.15096 | -9.7 |
| 9 | C29H35N7OS | 7.531653 | 7.149655 | 0.381998 | -9.6 |
| 10 | C28H32N6O2S | 6.854493 | 6.95115 | -0.09666 | -10.1 |
| 11y | C26H29FN6O2S | 7.289037 | 7.289037 | 0.065794 | -9.3 |
| 12 | C26H29FN6O2S | 7.320572 | 6.295373 | 0.079107 | -9.4 |
| 13 | C25H29N7OS | 6.668573 | 5.973654 | 0.3732 | -9.2 |
| 14 | C25H27N7OS | 5.788399 | 6.109828 | -0.18526 | -9.8 |
| 15 | C26H29N7OS | 6.076601 | 7.001534 | -0.03323 | -9.5 |
| 16 | C27H31N7OS | 7.189096 | 7.365658 | 0.187562 | -10.2 |
| 17y | C27H31N7O2S | 6.386581 | 6.386581 | 0.386269 | -10.3 |
| 18y | C28H33N7O2S | 6.571379 | 6.571379 | 0.3376 | -9.2 |
| 19 | C26H30N6O3S | 7.298432 | 7.298432 | -0.06723 | -9.4 |
| 20y | C27H33N7O2S | 5.876377 | 5.876377 | -0.2037 | -9.5 |
| 21 | C29H35N7O2S | 5.890219 | 5.890219 | -0.2963 | -9 |
| 22 | C28H32N6O3S | 6.069204 | 6.069204 | 0.003431 | -10.4 |
| 23 | C26H29FN6O3S | 6.445269 | 6.445269 | 0.057485 | -9.7 |
| 24y | C26H29FN6O3S | 6.257196 | 6.257196 | 0.066387 | -9.7 |
| 25 | C25H28N8S | 5.053911 | 5.053911 | 0.092906 | -9.1 |
| 26 | C27H32N8S | 6.286258 | 6.286258 | -0.14532 | -9.5 |
| 27y | C27H32N8OS | 6.301378 | 6.301378 | 0.300808 | -9.7 |
| 28 | C28H33N7OS | 6.97265 | 6.97265 | 0.232312 | -9.0 |
| 29y | C27H32N8OS | 5.628323 | 5.628323 | 0.590811 | -9.9 |
| 30y | C28H33N7O2S | 6.363412 | 6.363412 | -0.39023 | -9.0 |
| Gefitinib | C22H24ClFN4O3 | 33.3 | 5.505041 | -1.97251 | -8.0 |
y = Test set.
General minimum required value for the assessment of QSAR model.
| Symbol | Name | Recommended Value | Reported Model |
|---|---|---|---|
| R2 | Co-efficient of determination | 0.919035 | |
| Q2 | Cross-Validation Co-efficient | 0.866475 | |
| R2- Q2 | Difference between R2 and Q2 | 0.05256 | |
| N(ext, test set) | Minimum number of external test set | 8 | |
| R2ext. | Co-efficient of determination of external and test set | 0.636217 |
The symbols, descriptions and classes of descriptors for the selected model.
| S/no | Symbol | Description | Class |
|---|---|---|---|
| 1 | Average Broto-Moreau autocorrelation - lag 7/weighted by Sanderson electronegativities | 2D | |
| 2 | Average Broto-Moreau autocorrelation - lag 8/weighted by Sanderson electronegativities | 2D | |
| 3 | Centered Broto-Moreau autocorrelation - lag 3/weighted by Sanderson electronegativities | 2D | |
| 4 | Moran autocorrelation - lag 7/weighted by mass | 2D | |
| 5 | Logarithmic Randic-like eigenvector-based index from Barysz matrix/weighted by atomic number | 3D |
Figure 1(A) XY (Scatter) Plot of the actual pIC50 against predicted pIC50 of training set (B) XY (Scatter) Plot of the actual pIC50 against predicted pIC50 of test set of the selected model.
Figure 2XY (Scatter) Plot of actual pIC50 against the residuals of both the test and training sets of the selected model.
MF, VIF and correlation between descriptors of the selected model.
| AATS7e | 1 | 7.726502 | 2.33884 | ||||
| AATS8e | 0.585238 | 1 | 8.27239 | -1.09383 | |||
| ATSC3e | -0.07618 | -0.45661 | 1 | 1.457493 | -0.0027 | ||
| MATS7m | -0.39681 | 0.419105 | -0.28992 | 1 | 5.410414 | -0.05726 | |
| VR3_D | 0.621244 | 0.675433 | -0.30276 | 0.085265 | 1 | 2.190966 | -0.18505 |
Y-scrambling test.
| Model | R | R2 | Q2 |
|---|---|---|---|
| Original | 0.842447 | 0.709717 | 0.504914 |
| Random 1 | 0.63572 | 0.40414 | 0.013407 |
| Random 2 | 0.577014 | 0.332946 | -0.05171 |
| Random 3 | 0.440502 | 0.194042 | -0.35335 |
| Random 4 | 0.252932 | 0.063975 | -0.53688 |
| Random 5 | 0.631791 | 0.39916 | -0.04994 |
| Random 6 | 0.25435 | 0.064694 | -0.46379 |
| Random 7 | 0.279886 | 0.078336 | -0.59263 |
| Random 8 | 0.307425 | 0.09451 | -0.59972 |
| Random 9 | 0.424458 | 0.180165 | -0.3578 |
| Random 10 | 0.627664 | 0.393962 | -0.14752 |
| Average r: | 0.443174 | ||
| Average r2: | 0.220593 | ||
| Average Q2: | -0.31399 | ||
| cRp2: | 0.603579 |
Figure 3Williams Plot of the selected model.
The binding energy, Amino acid residues, Hydrogen bond (bond length Å) of some selected ligands.
| S/N | Binding energy (Kcal/mol) | Amino Acid Residues | Hydrogen Bond (bond length Å) |
|---|---|---|---|
| 22 | -10.4 | LEU718, LEU844, MET766, VAL726 | MET793(2.49, 2.05 & THR854(3.77) |
| 17 | -10.3 | LEU718, LEU844, MET766, VAL726, LA743 LYS745 & MET790 | MET793(2.61, 2.16) & LYS745(2.88) |
| 16 | -10.2 | LEU718, LEU844, MET766, VAL726, ALA743, LYS745 & MET790 | MET793 (2.54, 2.13) LYS745 (2.82) & ASP800 (3.78) |
| 10 | -10.1 | ALA743, LYS745, MET790, ARG841, VAL726 & LEU844 | ARG841(2.75, 3.04) LYS745(2.48), LEU788(3.58) & PHE723(3.50) |
Figure 42D structures of (A) Complex 22, (B) Complex 17, (C) Complex 16 and (D) Complex 10 with bond distances using Discovery studio visualizer.
Figure 53D structures of (A) Complex 22, (B) Complex 17, (C) Complex 16 and (D) Complex 10 using PyMOL.
ADME properties.
| S/N | MW | HB donor | HB acceptor | WLOGP | TPSA | Lipinski violations |
|---|---|---|---|---|---|---|
| 22 | 532.66 | 1 | 7 | 5.05 | 113.61 | 1 |
| 17 | 517.65 | 1 | 6 | 3.94 | 107.62 | 1 |
| 16 | 501.65 | 1 | 5 | 4.71 | 98.39 | 1 |
| 10 | 516.66 | 1 | 6 | 5.16 | 102.63 | 1 |
Drug-likeness properties.
| S/N | GI absorption | BBB permeant | Pgp substrate | Bioavailability Score |
|---|---|---|---|---|
| 22 | High | No | Yes | 0.55 |
| 17 | High | No | Yes | 0.55 |
| 16 | High | No | Yes | 0.55 |
| 10 | High | No | Yes | 0.55 |
Figure 6The Bioavailability Radar of (A) Molecule 22 (B) Molecule 17 (C) Molecule 16 (D) Molecule 10 with the highest docking score.
Figure 7The Boiled-egg plot of the selected molecules.