| Literature DB >> 31640190 |
Onat Kadioglu1, Thomas Efferth2.
Abstract
P-glycoprotein (P-gp) is an important determinant of multidrug resistance (MDR) because its overexpression is associated with increased efflux of various established chemotherapy drugs in many clinically resistant and refractory tumors. This leads to insufficient therapeutic targeting of tumor populations, representing a major drawback of cancer chemotherapy. Therefore, P-gp is a target for pharmacological inhibitors to overcome MDR. In the present study, we utilized machine learning strategies to establish a model for P-gp modulators to predict whether a given compound would behave as substrate or inhibitor of P-gp. Random forest feature selection algorithm-based leave-one-out random sampling was used. Testing the model with an external validation set revealed high performance scores. A P-gp modulator list of compounds from the ChEMBL database was used to test the performance, and predictions from both substrate and inhibitor classes were selected for the last step of validation with molecular docking. Predicted substrates revealed similar docking poses than that of doxorubicin, and predicted inhibitors revealed similar docking poses than that of the known P-gp inhibitor elacridar, implying the validity of the predictions. We conclude that the machine-learning approach introduced in this investigation may serve as a tool for the rapid detection of P-gp substrates and inhibitors in large chemical libraries.Entities:
Keywords: P-glycoprotein; artificial intelligence; drug discovery; machine learning; molecular docking; multidrug resistance
Mesh:
Substances:
Year: 2019 PMID: 31640190 PMCID: PMC6829872 DOI: 10.3390/cells8101286
Source DB: PubMed Journal: Cells ISSN: 2073-4409 Impact factor: 6.600
Compounds selected for learning and external validation for the P-glycoprotein (P-gp) modulator/non-modulator prediction model.
| Learning Set | External Validation Set | ||||
|---|---|---|---|---|---|
| Compound | Category | Compound | Category | Compound | Category |
| Escitalopram | Modulator | Hydroxyzine | Non-modulator | Terfenadine | Modulator |
| Simvastatin acid | Modulator | Oxybutynin | Non-modulator | Prazosin | Modulator |
| Neostigmine | Modulator | Ethosuximide | Non-modulator | Prednisone | Modulator |
| Zolmitriptan | Modulator | Warfarin | Non-modulator | Chloroquine | Modulator |
| Atomoxetine | Modulator | Mexilitene | Non-modulator | Lopinavir | Modulator |
| Methysergide | Modulator | Sulpiride | Non-modulator | Prednisolone | Modulator |
| Famciclovir | Modulator | Thiopental | Non-modulator | Vincristine | Modulator |
| Lovastatin acid | Modulator | Lamotrigine | Non-modulator | Sertraline | Modulator |
| Darifenacin | Modulator | Diphenhydramine | Non-modulator | Loperamide | Modulator |
| Paliperidone | Modulator | Enoxacin | Non-modulator | Etoposide | Modulator |
| Trospium | Modulator | Methylphenidate | Non-modulator | Indinavir | Modulator |
| Aprepitant | Modulator | Itraconazole | Non-modulator | Dipyridamole | Modulator |
| Apomorphine | Modulator | Nortriptyline | Non-modulator | Mitoxantrone | Modulator |
| Cetirizine | Modulator | Galantamine | Non-modulator | Cimetidine | Modulator |
| Cyclosporin A | Modulator | Ramelteon | Non-modulator | Bromocriptine | Modulator |
| Labetalol | Modulator | Rivastigmine | Non-modulator | Reserpine | Modulator |
| Amisulpride | Modulator | Ropivacaine | Non-modulator | Oxprenolol | Non-modulator |
| 5-Hydroxymethyl tolterodine | Modulator | Zonisamide | Non-modulator | Alprazolam | Non-modulator |
| Cabergoline | Modulator | Zolpidem | Non-modulator | Oxcarbazepine | Non-modulator |
| Ximelagatran | Modulator | Sulfasalazine | Non-modulator | Tolterodine | Non-modulator |
| Hoechst 33342 | Modulator | Metoclopramide | Non-modulator | Zaleplon | Non-modulator |
| Rhodamine 123 | Modulator | Nalmefene | Non-modulator | Cyclobenzaprine | Non-modulator |
| Actinomycin D | Modulator | Oxycodone | Non-modulator | Nimodipine | Non-modulator |
| Olanzapine | Modulator | Topiramate | Non-modulator | Riluzole | Non-modulator |
| Ranitidine | Modulator | Hydrocodone | Non-modulator | Tiagabine | Non-modulator |
| Astemizole | Modulator | Rosuvastatin | Non-modulator | Nalbuphine | Non-modulator |
| Verapamil | Modulator | Tropisetron | Non-modulator | Duloxetine | Non-modulator |
| Ziprasidone | Modulator | Varenicline | Non-modulator | Pravastatin acid | Non-modulator |
| Chlorpromazine | Modulator | Clemastine | Non-modulator | Promazine | Non-modulator |
| Clozapine | Modulator | Clonazepam | Non-modulator | Bromazepam | Non-modulator |
| Trimethoprim | Modulator | Ropinirole | Non-modulator | Lorazepam | Non-modulator |
| Paroxetine | Modulator | Solifenacin | Non-modulator | Mirtazapine | Non-modulator |
Compounds selected for learning and external validation for the P-gp inhibitor/substrate prediction model.
| Learning Set | External Validation Set | ||||||
|---|---|---|---|---|---|---|---|
| Compound | Category | Compound | Category | Compound | Category | Compound | Category |
| Ginsenoside | Inhibitor | Epirubicin | Substrate | Agosterol | Inhibitor | Colchicin | Substrate |
| Laniquidar | Inhibitor | Etoposide | Substrate | Amiodarone | Inhibitor | Dexamethazone | Substrate |
| Loratidine | Inhibitor | Fexofenadine | Substrate | Amorinin | Inhibitor | Digoxin | Substrate |
| Mibefradil | Inhibitor | Hoechst 33342 | Substrate | Apigenin | Inhibitor | Docetaxel | Substrate |
| Naringenin | Inhibitor | Idarubicin | Substrate | Atorvastatin | Inhibitor | Doxorubicin | Substrate |
| Pgp-4008 | Inhibitor | Irinotecan | Substrate | Atovaquone | Inhibitor | Daunorubicin | Substrate |
| Phloretin | Inhibitor | Kaempferol | Substrate | Biochanin | Inhibitor | ||
| Quercetin | Inhibitor | Loperamide | Substrate | Biricodar | Inhibitor | ||
| Quinine | Inhibitor | Mitomycin | Substrate | Catechin | Inhibitor | ||
| Rotenone | Inhibitor | Mitoxantrone | Substrate | Cefoperazone | Inhibitor | ||
| Sakuranetin | Inhibitor | Ondansetron | Substrate | Chrysine | Inhibitor | ||
| Sertraline | Inhibitor | Paclitaxel | Substrate | Cyclosporine | Inhibitor | ||
| Sinensetin | Inhibitor | Procyanidin B2 | Substrate | Diltiazem | Inhibitor | ||
| Stigmasterol | Inhibitor | Rhodamine 123 | Substrate | Elacridar | Inhibitor | ||
| Syringaresinol | Inhibitor | Tenoposide | Substrate | ||||
| Tamoxifen | Inhibitor | Topotecan | Substrate | ||||
| Tariquidar | Inhibitor | Vinblastine | Substrate | ||||
| Valspodar | Inhibitor | Vincristine | Substrate | ||||
| Verapamil | Inhibitor | Vindesine | Substrate | ||||
| Zosuquidar | Inhibitor | Vinorelbine | Substrate | ||||
Figure 1Receiver operating characteristic (ROC) curves of k Nearest Neighboring (kNN), Neural Network, Random Forest (RF), and Support Vector Machine (SVM) classification algorithms based on random leave-one-out sampling for the P-gp modulator/non-modulator prediction model for the learning step.
Performance of the P-gp modulator/non-modulator prediction model based on the RF classifier algorithm.
| Steps | Sensitivity | Specificity | Overall Predictive Accuracy | Precision |
|---|---|---|---|---|
|
| 0.938 | 0.969 | 0.953 | 0.968 |
|
| 0.938 | 0.938 | 0.938 | 0.938 |
Figure 2ROC curves of kNN, Neural Network, RF, and SVM classification algorithms based on random leave-one-out sampling for the P-gp inhibitor/substrate prediction model for the learning step.
Performance of the P-gp inhibitor/substrate prediction model based on the RF classifier algorithm.
| Steps | Sensitivity | Specificity | Overall Predictive Accuracy | Precision |
|---|---|---|---|---|
|
| 0.750 | 0.700 | 0.725 | 0.714 |
|
| 0.786 | 0.833 | 0.800 | 0.917 |
Grid parameters for molecular docking analyses on human P-gp.
| x | y | z | |
|---|---|---|---|
|
| 126 | 98 | 116 |
|
| 168.614 | 166.372 | 162.000 |
|
| 0.375 |
Prediction of the top 20 P-gp inhibitors identified by the RF classification algorithm using the ChEMBL P-gp modulator list of 493 compounds. The results were validated by determining the binding affinities using Autodock VINA.
| Name | ChEMBL ID | Inhibitor Probability | Class | VINA LBE (kcal/mol) |
|---|---|---|---|---|
| Karavoate P | CHEMBL1641677 | 0.849 | Synthetic | −12.200 ± 1.212 |
| Tribenzoylbalsaminol F | CHEMBL1928854 | 0.549 | Synthetic | −12.033 ± 0.896 |
| Zosuquidar | CHEMBL444172 | 0.513 | Synthetic | −11.967 ± 0.058 |
| Latilagascenes D | CHEMBL435917 | 0.566 | Synthetic | −11.700 ± 0.001 |
| Dihydrocytochalasin B | CHEMBL2074735 | 0.513 | Synthetic | −11.367 ± 0.231 |
| Jolkinoate I | CHEMBL2315618 | 0.593 | Synthetic | −11.300 ± <0.001 |
| Karavoate K | CHEMBL1641672 | 0.849 | Synthetic | −11.267 ± 0.493 |
| Fanchinin | CHEMBL176045 | 0.586 | Synthetic | −11.233 ± 0.208 |
| Latilagascene I | CHEMBL511018 | 0.586 | Synthetic | −11.167 ± 0.058 |
| Karavoate L | CHEMBL1641673 | 0.766 | Synthetic | −11.133 ± 0.808 |
| 3-Methylcholanthrene | CHEMBL40583 | 0.788 | Synthetic | −11.100 ± <0.001 |
| Lonafarnib | CHEMBL298734 | 0.567 | Synthetic | −11.000 ± <0.001 |
| Karavoate N | CHEMBL1641675 | 0.666 | Synthetic | −10.933 ± 0.058 |
| Tariquidar | CHEMBL348475 | 0.619 | Synthetic | −10.933 ± 0.404 |
| Pimozide | CHEMBL1423 | 0.517 | Synthetic | −10.900 ± 0.100 |
| Karavoate I | CHEMBL1641670 | 0.766 | Synthetic | −10.767 ± 0.058 |
| Cryptotanshinone | CHEMBL187460 | 0.663 | Natural | −10.700 ± <0.001 |
| Jolkinol B | CHEMBL489265 | 0.577 | Synthetic | −10.700 ± <0.001 |
| Astemizole | CHEMBL296419 | 0.617 | Synthetic | −10.667 ± 0.115 |
| Metergoline | CHEMBL19215 | 0.732 | Natural | −10.600 ± <0.001 |
Prediction of P-gp substrates identified by the RF classification algorithm using the ChEMBL P-gp modulator list of 150 compounds. The results were validated by determining the binding affinities using Autodock VINA.
| Name | ChEMBL ID | Substrate probability | Class | VINA LBE (kcal/mol) |
|---|---|---|---|---|
| Vindoline | CHEMBL526546 | 0.771 | Synthetic | −15.000 ± <0.001 |
| Cepharanthin | CHEMBL2074948 | 0.614 | Natural | −12.600 ± <0.001 |
| Latilagascene G | CHEMBL448193 | 0.514 | Synthetic | −12.300 ± <0.001 |
| Mk3207 | CHEMBL1910936 | 0.733 | Synthetic | −12.167 ± 0.058 |
| Ergocristine | CHEMBL446315 | 0.767 | Natural | −12.067 ± 0.058 |
| Cytochalasin E | CHEMBL494856 | 0.6 | Natural | −11.800 ± <0.001 |
| Jolkinoate L | CHEMBL2315621 | 0.567 | Synthetic | −11.533 ± 0.058 |
| Irinotecan | CHEMBL481 | 0.967 | Natural | −11.400 ± 0.819 |
| Latilagascenes E | CHEMBL373511 | 0.614 | Synthetic | −11.367 ± 0.116 |
| Dofequidar | CHEMBL65067 | 0.583 | Synthetic | −11.300 ± 0.001 |
| Acetyldigoxin | CHEMBL2074725 | 0.708 | Natural | −11.233 ± 0.808 |
| Dihydroergocristine | CHEMBL601773 | 0.767 | Natural | −11.133 ± 0.666 |
| Telcagepant | CHEMBL236593 | 0.517 | Synthetic | −11.067 ± 0.058 |
| Ergotamine | CHEMBL442 | 0.8 | Natural | −10.933 ± 0.058 |
| Candesartan Cilexetil | CHEMBL1014 | 0.567 | Synthetic | −10.900 ± 0.200 |
| Digoxin | CHEMBL1751 | 0.708 | Natural | −10.833 ± 1.097 |
| Bromocriptine | CHEMBL493 | 0.767 | Natural | −10.800 ± 0.100 |
| Itrazole | CHEMBL64391 | 0.564 | Synthetic | −10.700 ± 0.436 |
| Digitoxin | CHEMBL254219 | 0.725 | Natural | −10.667 ± 0.462 |
| Paclitaxel | CHEMBL428647 | 0.808 | Natural | −10.633 ± 0.462 |
Lowest binding energies (LBE) and predicted inhibition constants obtained by molecular docking of the top 20 P-gp inhibitors.
| P-gp Inhibitor | AutoDock LBE (kcal/mol) | Predicted Inhibition Constant (µM) |
|---|---|---|
| 3-Methylcholanthrene | −8.900 ± 0.001 | 0.300 ± <0.001 |
| Astemizole | −9.693 ± 0.047 | 0.079 ± 0.007 |
| Cryptotanshinone | −9.010 ± 0.001 | 0.251 ± <0.001 |
| Dihydrocytochalasin B | −10.460 ± 0.020 | 0.0212 ± 0.001 |
| Fanchinin | −9.937 ± 0.067 | 0.0522 ± 0.006 |
| Jolkinoate I | −10.440 ± 0.200 | 0.0232 ± 0.008 |
| Jolkinol B | −10.250 ± 0.044 | 0.0307 ± 0.002 |
| Karavoate I | −12.310 ± 0.235 | 0.001 ± <0.001 |
| Karavoate K | −12.330 ± 0.213 | 0.001 ± <0.001 |
| Karavoate L | −12.807 ± 0.200 | 0.0004 ± <0.001 |
| Karavoate N | −12.160 ± 0.560 | 0.002 ± 0.001 |
| Karavoate P | −13.537 ± 0.605 | 0.0002 ± <0.001 |
| Latilagascene I | −11.147 ± 0.561 | 0.009 ± 0.009 |
| Latilagascenes D | −12.220 ± 0.370 | 0.001 ± 0.001 |
| Lonafarnib | −11.433 ± 0.087 | 0.004 ± 0.001 |
| Metergoline | −9.737 ± 0.029 | 0.073 ± 0.004 |
| Pimozide | −10.220 ± 0.324 | 0.031 ± 0.025 |
| Tariquidar | −11.273 ± 0.274 | 0.006 ± 0.002 |
| Tribenzoylbalsaminol F | −12.403 ± 0.118 | 0.001 ± <0.001 |
| Zosuquidar | −11.257 ± 0.361 | 0.006 ± 0.004 |
| Elacridar (positive control) | −11.093 ± 0.361 | 0.008 ± 0.004 |
Lowest binding energies (LBE) and predicted inhibition constants obtained by molecular docking of the top 20 P-gp substrates.
| P-gp substrate | AutoDock LBE (kcal/mol) | Predicted Inhibition Constant (µM) |
|---|---|---|
| Acetyldigoxin | −11.767 ± 0.480 | 0.003 ± 0.002 |
| Bromocriptine | −12.360 ± 1.02 | 0.002 ± 0.001 |
| Candesartan Cilexetil | −11.153 ± 0.370 | 0.007 ± 0.004 |
| Cepharanthin | −10.753 ± 0.006 | 0.013 ± <0.001 |
| Cytochalasin E | −10.957 ± 0.006 | 0.093 ± 0.001 |
| Digitoxin | −11.390 ± 0.517 | 0.006 ± 0.004 |
| Digoxin | −11.500 ± 0.151 | 0.004 ± 0.001 |
| Dihydroergocristine | −11.670 ± 0.056 | 0.003 ± <0.001 |
| Dofequidar | −10.970 ± 0.351 | 0.010 ± 0.006 |
| Ergocristine | −12.407 ± 0.012 | 0.001 ± <0.001 |
| Ergotamine | −11.227 ± 0.150 | 0.006 ± 0.001 |
| Irinotecan | −11.380 ± 0.020 | 0.005 ± <0.001 |
| Itrazole | −10.843 ± 0.186 | 0.012 ± 0.003 |
| Jolkinoate L | −10.643 ± 0.681 | 0.022 ± 0.016 |
| Latilagascenes E | −11.770 ± 0.185 | 0.002 ± 0.001 |
| Latilagescene G | −12.500 ± 0.316 | 0.001 ± <0.001 |
| Mk-3207 | −11.650 ± 0.020 | 0.003 ± <0.001 |
| Paclitaxel | −9.607 ± 0.359 | 0.103 ± 0.065 |
| Telcagepant | −9.333 ± 0.021 | 0.144 ± 0.005 |
| Vindoline | −7.337 ± 0.211 | 4.363 ± 1.389 |
| Doxorubicin (positive control) | −11.070 ± 0.135 | 0.008 ± 0.002 |
Lowest binding energies (LBE) and predicted inhibition constants obtained by molecular docking of the non-modulators.
| P-gp Inhibitor | AutoDock LBE (kcal/mol) | Predicted Inhibition Constant (µM) |
|---|---|---|
| Oxprenolol | −5.743 ± 0.398 | 70.273 ± 40.057 |
| Promazine | −6.933 ± 0.021 | 8.273 ± 0.262 |
| Riluzole | −5.380 ± 0.010 | 114.080 ± 2.326 |
Figure 3Molecular docking results for selected non-modulators (pink).
Figure 4Molecular docking results for selected inhibitors (red) and substrates (green) yielded from the P-gp inhibitor/substrate prediction model. Elacridar (blue) and doxorubicin (yellow) were selected as control drugs.
Figure 5Boxplot analysis of the descriptors used for the model and comparison of the predicted inhibitors and substrates.
Average values of descriptors for inhibitors and substrates.
| Descriptor | Inhibitor | Substrate |
|---|---|---|
| cLogP | 3.498 ± 2.464 | 3.134 ± 2.962 |
| Total surface area | 311.199 ± 188.142 | 461.870 ± 286.187 |
| Shape index | 0.529 ± 0.125 | 0.429 ± 0.081 |
| Molecular flexibility | 0.395 ± 0.141 | 0.332 ± 0.114 |
| Rotatable bonds | 6.799 ± 12.158 | 9.818 ± 11.778 |
| Aromatic rings | 1.450 ± 1.168 | 1.918 ± 1.330 |
| Aromatic atoms | 8.237 ± 6.470 | 10.759 ± 7.098 |
| Symmetric atoms | 2.649 ± 3.637 | 3.582 ± 4.477 |
| Aromatic nitrogens | 0.301 ± 0.772 | 0.559 ± 1.141 |
| Basic nitrogens | 0.441 ± 0.625 | 0.659 ± 0.762 |
| Acidic oxygens | 0.117 ± 0.361 | 0.171 ± 0.462 |