| Literature DB >> 33553921 |
Ying Zheng1, Zheng Wu1.
Abstract
Drug repositioning is the identification of interactions between drugs and target proteins in pharmaceutical sciences. Traditional large-scale validation through chemical experiments is time-consuming and expensive, while drug repositioning can drastically decrease the cost and duration taken by traditional drug development. With the rapid advancement of high-throughput technologies and the explosion of various biological and medical data, computational drug repositioning methods have been used to systematically identify potential drug-target interactions. Some of them are based on a particular class of machine learning algorithms called kernel methods. In this paper, we propose a new machine learning prediction method combining multiple kernels into a tripartite heterogeneous drug-target-disease interaction spaces in order to integrate multiple sources of biological information simultaneously. This novel network algorithm extends the traditional drug-target interaction bipartite graph to the third disease layer. Meanwhile, Gaussian kernel functions on heterogeneous networks and the regularized least square method of the Kronecker product are used to predict new drug-target interactions. The values of AUPR (area under the precision-recall curve) and AUC (the area under the receiver operating characteristic curve) of the proposed algorithm are significantly improved. Especially, the AUC values are improved to 0.99, 0.99, 0.97, and 0.96 on four benchmark data sets. These experimental results substantiate that the network topology can be used for predicting drug-target interactions.Entities:
Year: 2021 PMID: 33553921 PMCID: PMC7860102 DOI: 10.1021/acsomega.0c05377
Source DB: PubMed Journal: ACS Omega ISSN: 2470-1343
Sources and Verification of Data Sets
| resource | description | URL | drug-related entities |
|---|---|---|---|
| DrugBank | free access database with comprehensive drug data | drug and drug–target data | |
| Kegg | open access database for molecular-level information | system information, health information, genomic information, and chemical information | |
| UniProt | free accessible protein sequence and annotation database | UniProt knowledgebase, UniProt reference cluster, and UniProt archive | |
| OMIM | free access compendium for Mendelian disorder | phenotypic and genotypic information for human disease | |
| DisGeNET | free access human disease database | genotype and phenotype relationships for diseases–diseases and diseases–target | |
| ChEMBL | free access drug and target database | bioactivity and genomic data to aid the translation of genomic information into effective new drugs |
Benchmark Data Sets
| data set | drugs | targets | interactions | |
|---|---|---|---|---|
| enzyme | 445 | 664 | 0.67 | 2926 |
| ion channel | 210 | 204 | 1.03 | 1476 |
| GPCR | 223 | 95 | 2.35 | 635 |
| nuclear receptor | 54 | 26 | 2.08 | 90 |
Figure 1Example of a bipartite graph model for drug–target interactions.
Figure 2Tripartite heterogeneous network model.
Result of FLapRLS, RLS-Kron, and THN_KRLSa
| data sets | method | AUC | sensitivity | specificity | AUPR |
|---|---|---|---|---|---|
| enzyme | FLapRLS | 0.985 | 0.913 | 0.999* | 0.92 |
| RLS-Kron | 0.978 | 0.905 | 0.997 | 0.915 | |
| THN_KRLS | 0.99* | 0.979* | 0.998 | 0.99* | |
| ion channel | FLapRLS | 0.991* | 0.688 | 0.986 | 0.89 |
| RLS-Kron | 0.984 | 0.721 | 0.98 | 0.943 | |
| THN_KRLS | 0.99 | 0.977* | 0.998* | 0.99* | |
| GPCR | FLapRLS | 0.944 | 0.737 | 0.986 | 0.83 |
| RLS-Kron | 0.954 | 0.753 | 0.975 | 0.79 | |
| THN_KRLS | 0.97* | 0.934* | 0.990* | 0.97* | |
| nuclear receptor | FLapRLS | 0.746 | 0.52 | 0.915 | 0.608 |
| RLS-Kron | 0.92 | 0.713 | 0.937 | 0.684 | |
| THN_KRLS | 0.96* | 0.930* | 0.993* | 0.94* |
For each data set, * indicates the highest value.
Figure 3Precision–recall curves of the three methods. There is a significant improvement in the GPCR and nuclear receptor data sets
Figure 4ROC curve of THN_KRLS, FLapRLS, and RLS-Kron.
Figure 5Red dot in the figure is the best cutoff point (0.014, 0.688), the radius is about 0.3123, and the area of the quarter circle is 0.0766, so the maximum remaining AUC area is less than 0.9234.
Top 10 New Drug–Target Interactions on the Enzyme Data Set
| enzyme rank | pair & name |
|---|---|
| DrugBank | |
| D00225: alprazolam (DrugBank ID DB00404), Hsa:1557: cytochrome P450 2C19 (UniProtKB P33261) | |
| DrugBank | |
| D00394: trimipramine (DrugBank ID DB00726), Hsa:28: histo-blood group ABO system transferase (UniProtKB P16442) | |
| D00225: alprazolam (DrugBank ID DB00404), Hsa:28: histo-blood group ABO system transferase (UniProtKB P16442) | |
| DrugBank | |
| D00380: tolbutamide (DrugBank ID DB01124), Hsa:28: histo-blood group ABO system transferase (UniProtKB P16442) | |
| DrugBank | |
| D01071: hexobarbital (DrugBank ID DB01355), Hsa:28: histo-blood group ABO system transferase (UniProtKB P16442) | |
| D00139: methoxsalen (DrugBank ID DB00553), Hsa:1557: cytochrome P450 2C19 (UniProtKB P33261) |
Top 10 New Drug–Target Interactions on the GPCR Data Set
| GPCR rank | pair & name |
|---|---|
| DrugBank | |
| D00465: oxybutynin (DrugBank ID DB01062), Hsa:57105: cysteinyl leukotriene receptor 2 (UniProtKB Q9NS75) | |
| D00465: oxybutynin (DrugBank ID DB01062), Hsa:10800: cysteinyl leukotriene receptor 1 (UniProtKB Q9Y271) | |
| D00645: bretylium (DrugBank ID DB01158), Hsa:1128: muscarinic acetylcholine receptor M1 (UniProtKB P11229) | |
| Kegg | |
| DrugBank | |
| D00465: oxybutynin (DrugBank ID DB01062), Hsa:134: adenosine receptor A1 (UniProtKB P30542) | |
| D00645: bretylium (DrugBank ID DB01158), Hsa:1129: muscarinic acetylcholine receptor M2 (UniProtKB P08172) | |
| D00465: oxybutynin (DrugBank ID DB01062), Hsa:135: adenosine receptor A2a (UniProtKB P29274) | |
| DrugBank |
One-to-One ID Information of the Drugs and Targets involved, Including Kegg ID, DrugBank ID, or UniProt ID and the Drug Name or Target Name
| Kegg ID | DrugBank ID or UniProt ID | drug name or target name |
|---|---|---|
| D00394 | DB00726 | trimipramine |
| D00225 | DB00404 | alprazolam |
| D01071 | DB01355 | hexobarbital |
| D00380 | DB01124 | tolbutamide |
| D00574 | DB00357 | aminoglutethimide |
| D00139 | DB00553 | methoxsalen |
| D01699 | DB00496 | darifenacin |
| D00465 | DB01062 | oxybutynin |
| D00645 | DB01158 | bretylium |
| D00765 | DB00728 | rocuronium |
| Hsa:28 | P16442 | histo-blood group ABO system transferase |
| Hsa:1557 | P33261 | cytochrome P450 2C19 |
| Hsa:57105 | Q9NS75 | cysteinyl leukotriene receptor 2 |
| Hsa:10800 | Q9Y271 | cysteinyl leukotriene receptor 1 |
| Hsa:1128 | P11229 | muscarinic acetylcholine receptor M1 |
| Hsa:1129 | P08172 | muscarinic acetylcholine receptor M2 |
| Hsa:134 | P30542 | adenosine receptor A1 |
| Hsa:135 | P29274 | adenosine receptor A2a |