| Literature DB >> 30333748 |
Daniel J Mason1,2, Richard T Eastman3, Richard P I Lewis1, Ian P Stott4, Rajarshi Guha3, Andreas Bender1.
Abstract
The parasite Plasmodium falciparum is the most lethal species of Plasmodium to cause serious malaria infection in humans, and with resistance developing rapidly novel treatment modalities are currently being sought, one of which being combinations of existing compounds. The discovery of combinations of antimalarial drugs that act synergistically with one another is hence of great importance; however an exhaustive experimental screen of large drug space in a pairwise manner is not an option. In this study we apply our machine learning approach, Combination Synergy Estimation (CoSynE), which can predict novel synergistic drug interactions using only prior experimental combination screening data and knowledge of compound molecular structures, to a dataset of 1,540 antimalarial drug combinations in which 22.2% were synergistic. Cross validation of our model showed that synergistic CoSynE predictions are enriched 2.74 × compared to random selection when both compounds in a predicted combination are known from other combinations among the training data, 2.36 × when only one compound is known from the training data, and 1.5 × for entirely novel combinations. We prospectively validated our model by making predictions for 185 combinations of 23 entirely novel compounds. CoSynE predicted 20 combinations to be synergistic, which was experimentally validated for nine of them (45%), corresponding to an enrichment of 1.70 × compared to random selection from this prospective data set. Such enrichment corresponds to a 41% reduction in experimental effort. Interestingly, we found that pairwise screening of the compounds CoSynE individually predicted to be synergistic would result in an enrichment of 1.36 × compared to random selection, indicating that synergy among compound combinations is not a random event. The nine novel and correctly predicted synergistic compound combinations mainly (where sufficient bioactivity information is available) consist of efflux or transporter inhibitors (such as hydroxyzine), combined with compounds exhibiting antimalarial activity alone (such as sorafenib, apicidin, or dihydroergotamine). However, not all compound synergies could be rationalized easily in this way. Overall, this study highlights the potential for predictive modeling to expedite the discovery of novel drug combinations in fight against antimalarial resistance, while the underlying approach is also generally applicable.Entities:
Keywords: artificial intelligence; combinations; malaria; modeling; plasmodium falciparum; synergy
Year: 2018 PMID: 30333748 PMCID: PMC6176478 DOI: 10.3389/fphar.2018.01096
Source DB: PubMed Journal: Front Pharmacol ISSN: 1663-9812 Impact factor: 5.810
Figure 1Artemether and Lumefantrine, Artesunate and Amodiaquine, and Dihydroartemisinin and Piperaquine are antimalarial combinations recommended by the WHO as the current standard of care to help protect against drug resistance in P. falciparum.
Figure 2Three different rounds of cross-validation (CV) were employed to test model performance prior to making final predictions. Numbers on axes represent compound IDs in a compound combination training dataset. K-fold randomly selects a 1/K fraction of combinations to remove from the training data and predict in each round; Leave One Compound Out (LOCO) chooses pairs to remove based upon one compound in each round, and Leave One Pair Out chooses pairs to remove based upon a choice of two compounds in each round. Green; training combinations; blue; test combinations, red; held-out combinations, black; self-self crosses (not included in training data).
Figure 3Multi-Dimensional Scaling (MDS) plot of chemical space for all compounds used in this study, based upon pairwise similarity of radius 2, 2,048-bit Morgan fingerprints. Compounds that comprise the top five synergistic combinations in the training (red dots) and prospective validation (green dots) datasets are highlighted, together with their synergistic connection. The lack of a clear clustering suggests that pairs of synergistic compounds do not always arise from those in distinct or well-defined chemical space. Out of these predictions in green, none were predicted by CoSynE, but paroxetine + guanethidine would be discovered following the indirect route described in the Results section, and is the second-most synergistic combination in the validation dataset. Structures for validation and training compounds are included in Supplementary Tables 5, 7, respectively.
Dataset statistics.
| 3d7 | 264 (22.1%) | 762 (63.8%) | 168 (14.1%) | 1,194 |
| Dd2 | 277 (22.2%) | 817 (65.6%) | 151 (12.1%) | 1,245 |
| Hb3 | 242 (20.9%) | 767 (66.2%) | 150 (12.9%) | 1,159 |
| 3d7 | 18 (15.1%) | 100 (84%) | 1 (0.8%) | 119 |
| Dd2 | 49 (26.5%) | 134 (72.4%) | 2 (1.1%) | 185 |
| Hb3 | 29 (35.8%) | 52 (64.2%) | 0 | 81 |
Counts for the number of synergistic, additive, and antagonistic compounds in each of the datasets available for the current study, after filtering for high quality (HQ) data. The Dd2 training dataset had the highest number of HQ datapoints, which was reflected during cross validation (CV). The Dd2 dataset also contained the highest number of HQ datapoints in the prospectively validated dataset.
Dd2 training performance.
| 5-Fold | SFP | 0.45 | 0.56 | 0.84 | 0.61 | 0.53 | 0.82 | 2.74 | 2 |
| TFP | 0.44 | 0.55 | 0.83 | 0.60 | 0.51 | 0.81 | 2.69 | 3 | |
| STFP | 0.47 | 0.57 | 0.84 | 0.64 | 0.52 | 0.83 | 2.89 | 1 | |
| LOCO | SFP | 0.27 | 0.31 | 0.81 | 0.52 | 0.33 | 0.77 | 2.36 | 1 |
| TFP | 0.03 | 0.08 | 0.58 | 0.07 | 0.11 | 0.76 | 0.31 | 3 | |
| STFP | 0.03 | 0.32 | 0.55 | 0.23 | 0.89 | 0.31 | 1.04 | 2 | |
| LOPO | SFP | 0.02 | 0.01 | 0.44 | 0.33 | 0.01 | 0.78 | 1.50 | 1 |
| TFP | −0.02 | 0.10 | 0.49 | 0.20 | 0.07 | 0.73 | 0.89 | 3 | |
| STFP | 0.02 | 0.36 | 0.47 | 0.23 | 0.82 | 0.34 | 1.02 | 2 | |
The results from three increasingly difficult rounds of cross validation (CV); shuffled and stratified 5-fold CV, leave one compound out (LOCO), and leave one pair out (LOPO), for each model type (SFP, structural fingerprint; TFP, target fingerprint; and STFP, combined structure-target fingerprint). Since the current study concerns the prediction of novel compound combinations, our chosen model followed the expected performance of the SFP model during LOPO CV, since this is the most challenging test of the model. AUC, area under receiver operating curve; Pr, precision; Re, recall; Ac, accuracy; Ef, enrichment factor. The “cross descriptor average” is the average score for each metric across each cross validation routine.
Dd2 SFP predictions.
| NCGC00167488 | Sorafenib (216239) | Hydroxyzine (3658) | 0.4 | 0.917 |
| NCGC00263624 | Apicidin (6918328) | Dihydroergotamine (10531) | 0.42 | 0.924 |
| NCGC00016272 | Guanethidine (3518) | Trifluoperazine (5566) | 0.36 | 0.926 |
| NCGC00021152 | Hydroxyzine (3658) | Dihydroergotamine (10531) | 0.4 | 0.932 |
| NCGC00167488 | Sorafenib (216239) | Trifluoperazine (5566) | 0.43 | 0.937 |
| NCGC00181117 | Virginiamycin s1 (46937022) | Dihydroergotamine (10531) | 0.49 | 0.941 |
| NCGC00263624 | Apicidin (6918328) | Hydroxyzine (3658) | 0.47 | 0.952 |
| NCGC00263624 | Apicidin (6918328) | Virginiamycin s1 (46937022) | 0.62 | 0.957 |
| NCGC00017400 | Dihydroergotamine (10531) | Trifluoperazine (5566) | 0.43 | 0.958 |
The 9 combinations out of 20 predicted by CoSynE, which were prospectively validated to be synergistic, which cover a total of 7 unique compounds. The probability of being synergistic that was assigned by CoSynE is shown, which does not correlate with the experimentally quantified degree of synergy.
Synergistic drugs correctly predicted by CoSynE.
| Apicidin | Known to target histone deacetylase and has previously shown activity against P. falciparum via inhibition of apicomplexan histone deacetylase (HDA) (Darkin-Rattray et al., | |
| Dihydroergotamine | An inhibitor of P. falciparum (Weisman et al., | |
| Guanethidine | Annotated in PubChem as having an inconclusive potency against P. falciparum of 5.72 uM (AID:504834). Also annotated as active against MDR-1 (AID:377); the P. falciparum analog of which (pfmdr1) is involved in resistance and guanethidine may therefore play a role in preventing drug efflux (Hyde, | |
| Hydroxyzine | Shown to act as an efflux pump inhibitor in bacteria (Aybey et al., | |
| Sorafenib tosylate | Tyrosine kinase inhibitor that exhibits antimalarial properties, and has been shown to inhibit the function of calcium-dependent protein kinase 3 in P. falciparum (PfCDPK1), which affects parasite egress from the host cell (Gaji et al., | |
| Trifluoperazine | Calmodulin inhibitor, and a potent antiplasmodial inhibitor of calcium-dependent protein kinase 4 (PfCDPK4) (Cavagnino et al., | |
| Virginiamycin s1 | An antibiotic that is annotated as targeting 60S Ribosomal Protein L37 in PubChem. Similar in structure to azithromycin (which is known to target apicoplast 50S ribosomal subunit and inhibit P. falciparum). |
Depiction and description of the seven compounds that were part of combinations predicted to be synergistic by CoSynE.
Dd2 SFP Performance.
| SFP | 20 | 9 | 0.15 | 0.26 | 0.63 | 0.45 | 0.18 | 0.72 | 1.70 |
Overall performance of the Dd2 SFP model, after the full pairwise screen of prospective compounds was carried out. Overall, the precision and recall for the prediction of novel synergistic combinations, however this still provides greater enrichment of synergistic combinations than expected by random selection (1.70-fold) from the prospectively validated dataset. AUC, area under receiver operating curve; Pr, precision; Re, recall; Ac, accuracy; Ef, enrichment factor.