| Literature DB >> 23840562 |
Twan van Laarhoven1, Elena Marchiori.
Abstract
In silico discovery of interactions between drug compounds and target proteins is of core importance for improving the efficiency of the laborious and costly experimental determination of drug-target interaction. Drug-target interaction data are available for many classes of pharmaceutically useful target proteins including enzymes, ion channels, GPCRs and nuclear receptors. However, current drug-target interaction databases contain a small number of drug-target pairs which are experimentally validated interactions. In particular, for some drug compounds (or targets) there is no available interaction. This motivates the need for developing methods that predict interacting pairs with high accuracy also for these 'new' drug compounds (or targets). We show that a simple weighted nearest neighbor procedure is highly effective for this task. We integrate this procedure into a recent machine learning method for drug-target interaction we developed in previous work. Results of experiments indicate that the resulting method predicts true interactions with high accuracy also for new drug compounds and achieves results comparable or better than those of recent state-of-the-art algorithms. Software is publicly available at http://cs.ru.nl/~tvanlaarhoven/drugtarget2013/.Entities:
Mesh:
Substances:
Year: 2013 PMID: 23840562 PMCID: PMC3694117 DOI: 10.1371/journal.pone.0066952
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
The number of drug compounds and target proteins, their ratio, and the number of interactions in the drug-target datasets from[12].
| Dataset | Drugs | Targets |
| Interactions |
| Enzyme | 445 | 664 | 0.67 | 2926 |
| Ion Channel | 210 | 204 | 1.03 | 1476 |
| GPCR | 223 | 95 | 2.35 | 635 |
| Nuclear Receptor | 54 | 26 | 2.08 | 90 |
Results of 5 fold cross validation: average AUC and AUPR over 5 runs.
| Method | AUC (std) | AUPR (std) |
|
| Enzyme | |||
| GIP | 0.685 (0.006) | 0.150 (0.008) | |
| WNN | 0.819 (0.004) |
| 0.809 (0.068) |
| WNN-GIP |
|
| 0.908 (0.019) |
| KBMF2K | 0.812 (0.004) |
| |
| Ion Channel | |||
| GIP | 0.637 (0.008) |
| |
| WNN | 0.757 (0.006) |
| 0.535 (0.200) |
| WNN-GIP | 0.775 (0.006) |
| 0.730 (0.171) |
| KBMF2K |
|
| |
| GPCR | |||
| GIP | 0.679 (0.014) | 0.260 (0.023) | |
| WNN | 0.848 (0.008) | 0.308 (0.032) | 0.713 (0.084) |
| WNN-GIP |
|
| 0.702 (0.081) |
| KBMF2K | 0.840 (0.009) |
| |
| Nuclear Receptor | |||
| GIP | 0.758 (0.026) | 0.357 (0.060) | |
| WNN | 0.788 (0.027) |
| 0.305 (0.205) |
| WNN-GIP |
|
| 0.527 (0.103) |
| KBMF2K |
| 0.354 (0.063) | |
Standard deviation is reported between parentheses. The best AUC and AUPR results are indicated in bold, results that are not significantly different from the best (at ) are indicated in italic.
Highest ranked predicted new interactions for each of the datasets.
| Rank | Drug compound | Target protein | |||
| Enzyme | |||||
| M |
|
|
|
|
|
| C,M,D |
|
|
|
|
|
| M,D |
|
|
|
|
|
| M |
|
|
|
|
|
| C,M,D |
|
|
|
|
|
| Ion Channel | |||||
| D,K |
|
|
|
|
|
| 2 | D00726 | Metoclopramide | hsa1138 | cholinergic receptor, nicotinic, alpha 5 (neuronal) | |
| C,D |
|
|
|
|
|
| 4 | D02098 | Proparacaine hydrochloride | hsa8645 | KCNK5: potassium channel, subfamily K, member 5 | |
| K |
|
|
|
|
|
| GPCR | |||||
| C,M,D |
|
|
|
|
|
| C,D |
|
|
|
|
|
| 3 | D00604 | Clonidine hydrochloride | hsa147 | adrenoceptor alpha 1B | |
| C |
|
|
|
|
|
| C |
|
|
|
|
|
| Nuclear Receptor | |||||
| 1 | D00316 | Etretinate | hsa6096 | RAR-related orphan receptor B | |
| C |
|
|
|
|
|
| K |
|
|
|
|
|
| 4 | D01132 | Tazarotene | hsa6097 | RAR-related orphan receptor C | |
| K |
|
|
|
|
|
Interactions found in ChEMBL, Matador, DrugBank and KEGG are indicated in italic and marked as C, M, D and K respectively.
Results of LOOCV on pairs.
| Method | AUC | AUPR |
| Enzyme | ||
| GIP | 0.978 | 0.915 |
| WNN | 0.558 | 0.141 |
| WNN-GIP | 0.983 | 0.944 |
| Const | 0.577 | 0.179 |
| Const-GIP |
|
|
| BLM-NII |
| 0.929 |
| Ion Channel | ||
| GIP | 0.984 | 0.943 |
| WNN | 0.528 | 0.125 |
| WNN-GIP | 0.986 | 0.953 |
| Const | 0.535 | 0.138 |
| Const-GIP |
|
|
| BLM-NII |
| 0.950 |
| GPCR | ||
| GIP | 0.954 | 0.790 |
| WNN | 0.580 | 0.219 |
| WNN-GIP | 0.972 | 0.863 |
| Const | 0.604 | 0.266 |
| Const-GIP |
|
|
| BLM-NII |
| 0.865 |
| Nuclear Receptor | ||
| GIP | 0.922 | 0.684 |
| WNN | 0.694 | 0.478 |
| WNN-GIP | 0.958 | 0.857 |
| Const | 0.744 | 0.568 |
| Const-GIP |
|
|
| BLM-NII |
| 0.866 |
Results of BLM-NNII are from [17]. The best AUC and AUPR results are indicated in bold, results that are not significantly different from the best (at ) are indicated in italic, see the main text for details.