| Literature DB >> 24992720 |
Jagat Singh Chauhan1, Sandeep Kumar Dhanda1, Deepak Singla1, Subhash M Agarwal2, Gajendra P S Raghava1.
Abstract
Overexpression of EGFR is responsible for causing a number of cancers, including lung cancer as it activates various downstream signaling pathways. Thus, it is important to control EGFR function in order to treat the cancer patients. It is well established that inhibiting ATP binding within the EGFR kinase domain regulates its function. The existing quinazoline derivative based drugs used for treating lung cancer that inhibits the wild type of EGFR. In this study, we have made a systematic attempt to develop QSAR models for designing quinazoline derivatives that could inhibit wild EGFR and imidazothiazoles/pyrazolopyrimidines derivatives against mutant EGFR. In this study, three types of prediction methods have been developed to design inhibitors against EGFR (wild, mutant and both). First, we developed models for predicting inhibitors against wild type EGFR by training and testing on dataset containing 128 quinazoline based inhibitors. This dataset was divided into two subsets called wild_train and wild_valid containing 103 and 25 inhibitors respectively. The models were trained and tested on wild_train dataset while performance was evaluated on the wild_valid called validation dataset. We achieved a maximum correlation between predicted and experimentally determined inhibition (IC50) of 0.90 on validation dataset. Secondly, we developed models for predicting inhibitors against mutant EGFR (L858R) on mutant_train, and mutant_valid dataset and achieved a maximum correlation between 0.834 to 0.850 on these datasets. Finally, an integrated hybrid model has been developed on a dataset containing wild and mutant inhibitors and got maximum correlation between 0.761 to 0.850 on different datasets. In order to promote open source drug discovery, we developed a webserver for designing inhibitors against wild and mutant EGFR along with providing standalone (http://osddlinux.osdd.net/) and Galaxy (http://osddlinux.osdd.net:8001) version of software. We hope our webserver (http://crdd.osdd.net/oscadd/ntegfr/) will play a vital role in designing new anticancer drugs.Entities:
Mesh:
Substances:
Year: 2014 PMID: 24992720 PMCID: PMC4081576 DOI: 10.1371/journal.pone.0101079
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Flow chart showing training and validating datasets used in developing prediction models.
Figure 2A diagram demonstrating list of softwares used for computing chemical descriptors and types of different descriptors.
Figure 3The superimposed structure of docked substrate over crystal structure conformation.
Our docked structure is overlaid on the top of Erlotinib bound EGFR cristal stucture (PDB ID: 1M17).
SMOreg based Performance of QSAR models based on selected descriptors of 128 wild EGFR inhibitors.
| Descriptors | R | R2 | MAE | RMSE |
| Vlife | 0.801 | 0.773 | 0.513 | 0.692 |
| Vlife+Dock energy | 0.813 | 0.770 | 0.507 | 0.673 |
| Dragon | 0.835 | 0.791 | 0.456 | 0.640 |
| Dragon+Dock energy | 0.841 | 0796 | 0.444 | 0.624 |
| WebCDK | 0.773 | 0.734 | 0.570 | 0.730 |
| WebCDK+Dock energy | 0.777 | 0.734 | 0.557 | 0.723 |
| PaDEL | 0.891 | 0.835 | 0.438 | 0.567 |
| PaDEL+Dock energy | 0.892 | 0.836 | 0.425 | 0.546 |
| PowerMV | 0.811 | 0.786 | 0.531 | 0.677 |
| PowerMV+Dock energy | 0.815 | 0.787 | 0.529 | 0.667 |
| Hybrid | 0.911 | 0.843 | 0.371 | 0.497 |
| Hybrid+Dock energy | 0.921 | 0.847 | 0.349 | 0.450 |
Figure 4List of descriptors showing positive and negative correlation with the wild EGFR inhibitory activity.
Figure 5Scatter plots between experimental versus predicted pIC50 values of wild types EGFR inhibitors.
Performance of SMOreg based models developing for predicting inhibitors against wild, mutant and hybrid EGFR on the training and validation data set on PaDEL descriptors.
| Inhibitors | Descriptors | R | R2 | MAE | RMSE |
| Wild EGFR | wild_whole (128 inhibitors) | 0.891 | 0.835 | 0.438 | 0.567 |
| wild_train (103 inhibitors) | 0.892 | 0.841 | 0.392 | 0.514 | |
| wild_valid (25 inhibitors) | 0.901 | 0.839 | 0.347 | 0.486 | |
| Mutant EGFR (L858R) | mutant_whole (56 inhibitors) | 0.834 | 0.710 | 0.413 | 0.524 |
| mutant_train (42 inhibitors) | 0.846 | 0.721 | 0.386 | 0.501 | |
| mutant_valid(14 inhibitors) | 0.850 | 0.745 | 0.368 | 0.467 | |
| Hybrid | hybrid_whole(184 inhibitors) | 0.833 | 0.731 | 0.491 | 0.636 |
| hybrid_train(147 inhibitors) | 0.850 | 0.723 | 0.464 | 0.628 | |
| hybrid_valid (37 inhibitors) | 0.761 | 0.623 | 0.617 | 0.724 |
The SMOreg based performance of QSAR models developed using selected descriptors calculated from mutant_whole datasets.
| Descriptors | No. of descriptors | R | R2 | MAE | RMSE |
| Docking Energy | 6 | 0.445 | 0.315 | 0.676 | 0.856 |
| PaDEL descriptors | 14 | 0.834 | 0.710 | 0.413 | 0.524 |
| PaDELdescriptors+Docking Energy | 20 | 0.841 | 0.728 | 0.398 | 0.517 |
Figure 6Positive and negative correlation of selected highly significant descriptors of mutant EGFR inhibitors.
Comparative performance of existing method with our method developing for predicting inhibitors against wild type EGFR inhibitors.
| Methods | Datasets | R | R2 | MAE | RMSE |
| Vema A | Wild_whole | 0.877 | 0.768 | 0.434 | 0.551 |
| Wild_train | 0.922 | 0.849 | 0.354 | 0.435 | |
| Wild_valid | 0.730 | 0.499 | 0.719 | 0.846 | |
| Hongying Du | Wild_whole | 0.918 | 0.843 | 0.369 | 0.455 |
| Wild_train | 0.921 | 0.849 | 0.354 | 0.442 | |
| Wild_valid | 0.901 | 0.807 | 0.432 | 0.504 | |
| ntEGFR | Wild_whole | 0.921 | 0.847 | 0.349 | 0.450 |
| Wild_train | 0.917 | 0.837 | 0.333 | 0.462 | |
| Wild_valid | 0.902 | 0.810 | 0.342 | 0.501 |