| Literature DB >> 30627536 |
Minhui Wang1, Chang Tang2, Jiajia Chen3.
Abstract
Drug-target interactions play an important role for biomedical drug discovery and development. However, it is expensive and time-consuming to accomplish this task by experimental determination. Therefore, developing computational techniques for drug-target interaction prediction is urgent and has practical significance. In this work, we propose an effective computational model of dual Laplacian graph regularized matrix completion, referred to as DLGRMC briefly, to infer the unknown drug-target interactions. Specifically, DLGRMC transforms the task of drug-target interaction prediction into a matrix completion problem, in which the potential interactions between drugs and targets can be obtained based on the prediction scores after the matrix completion procedure. In DLGRMC, the drug pairwise chemical structure similarities and the target pairwise genomic sequence similarities are fully exploited to serve the matrix completion by using a dual Laplacian graph regularization term; i.e., drugs with similar chemical structure are more likely to have interactions with similar targets and targets with similar genomic sequence similarity are more likely to have interactions with similar drugs. In addition, during the matrix completion process, an indicator matrix with binary values which indicates the indices of the observed drug-target interactions is deployed to preserve the experimental confirmed interactions. Furthermore, we develop an alternative iterative strategy to solve the constrained matrix completion problem based on Augmented Lagrange Multiplier algorithm. We evaluate DLGRMC on five benchmark datasets and the results show that DLGRMC outperforms several state-of-the-art approaches in terms of 10-fold cross validation based AUPR values and PR curves. In addition, case studies also demonstrate that DLGRMC can successfully predict most of the experimental validated drug-target interactions.Entities:
Mesh:
Substances:
Year: 2018 PMID: 30627536 PMCID: PMC6304580 DOI: 10.1155/2018/1425608
Source DB: PubMed Journal: Biomed Res Int Impact factor: 3.411
Figure 1Overview of the proposed DTIs prediction method. The chemical structure similarity between drugs and the genomic sequence similarity between targets are used to serve the matrix completion. Meanwhile, the experimental validated DTIs are preserved by a binary indicator matrix.
The statistics of drugs, targets, and interactions in each dataset.
|
|
|
|
|
|
|
|---|---|---|---|---|---|
| Drugs | 54 | 223 | 210 | 445 | 1936 |
| Targets | 26 | 95 | 204 | 664 | 1609 |
| Interactions | 90 | 635 | 1476 | 2926 | 7019 |
| Average No. of drugs per target | 3.46 | 6.68 | 7.24 | 4.41 | 4.36 |
| Average No. of targets per drug | 1.67 | 2.85 | 7.03 | 6.58 | 3.63 |
| Sparsity of the interaction matrix (%) | 93.59 | 97.00 | 96.55 | 99.01 | 99.77 |
| Percentage of drugs with only one interaction target (%) | 72.22 | 47.53 | 38.57 | 39.78 | 75.20 |
| Percentage of targets with only one interaction drug (%) | 30.77 | 35.79 | 11.27 | 43.37 | 30.53 |
Algorithm 1Iterative algorithm for solving DLGRMC.
Figure 2An intuitive showing of the imbalance ratio between interacting and noninteracting drug-target pairs of different datasets.
Average AUPR values of different methods on different datasets under CV1 (the values following the symbol “±" are the standard deviations of 5 repetition results).
| Methods | NRs | GPCRs | Ics | Es | DB |
|---|---|---|---|---|---|
| BLM-NII | 0.641±0.038 | 0.483±0.019 | 0.645±0.010 | 0.624±0.013 | 0.667±0.024 |
| WNN | 0.567±0.024 | 0.559±0.020 | 0.583±0.018 | 0.591±0.016 | 0.652±0.027 |
| CMF | 0.577±0.038 | 0.674±0.011 | 0.858±0.008 | 0.806±0.005 | 0.883±0.019 |
| GRMF | 0.592±0.025 | 0.679±0.012 | 0.367±0.015 | 0.324±0.014 | 0.704±0.029 |
| NRLMF | 0.675±0.034 | 0.687±0.017 | 0.889±0.010 | 0.847±0.007 | 0.902±0.030 |
| LGRMC | 0.696±0.022 | 0.701±0.014 | 0.899±0.013 | 0.874±0.009 | 0.921±0.016 |
Average AUPR values of different methods on different datasets under CV2 (the values following the symbol “±" are the standard deviations of 5 repetition results).
| Methods | NRs | GPCRs | Ics | Es | DB |
|---|---|---|---|---|---|
| BLM-NII | 0.427±0.045 | 0.308±0.020 | 0.289±0.029 | 0.246±0.021 | 0.443±0.031 |
| WNN | 0.501±0.051 | 0.286±0.018 | 0.237±0.034 | 0.251±0.037 | 0.543±0.034 |
| CMF | 0.465±0.052 | 0.358±0.016 | 0.268±0.031 | 0.203±0.022 | 0.482±0.026 |
| GRMF | 0.481±0.056 | 0.357±0.017 | 0.284±0.027 | 0.252±0.018 | 0.507±0.024 |
| NRLMF | 0.540±0.052 | 0.361±0.019 | 0.348±0.031 | 0.345±0.033 | 0.575±0.027 |
| LGRMC | 0.572±0.054 | 0.377±0.018 | 0.364±0.028 | 0.373±0.020 | 0.601±0.030 |
Average AUPR values of different methods on different datasets under CV3 (the values following the symbol “±" are the standard deviations of 5 repetition results).
| Methods | NRs | GPCRs | Ics | Es | DB |
|---|---|---|---|---|---|
| BLM-NII | 0.412±0.042 | 0.332±0.014 | 0.205±0.011 | 0.167±0.010 | 0.446±0.023 |
| WNN | 0.517±0.024 | 0.364±0.009 | 0.319±0.012 | 0.385±0.013 | 0.527±0.016 |
| CMF | 0.484±0.035 | 0.407±0.007 | 0.352±0.009 | 0.376±0.006 | 0.535±0.012 |
| GRMF | 0.517±0.026 | 0.367±0.010 | 0.343±0.017 | 0.346±0.010 | 0.539±0.018 |
| NRLMF | 0.491±0.048 | 0.409±0.042 | 0.358±0.016 | 0.395±0.014 | 0.550±0.029 |
| LGRMC | 0.527±0.023 | 0.415±0.012 | 0.362±0.015 | 0.410±0.012 | 0.574±0.019 |
Figure 3The PR curves of different methods on four datasets.
The top 10 interacting targets of drug “D00094" in dataset NRs predicted by different methods (“√" denotes experimental validated targets and “×" denotes nonvalidated targets).
| Rank | Targets predicted by different methods | |||||
|---|---|---|---|---|---|---|
| BLM-NII | WNN | CMF | GRMF | NRLMF | DLGRMC | |
| 1 | hsa5914 (√) | hsa190 (√) | hsa6096 (√) | hsa6257 (√) | hsa5915 (√) | hsa5914 (√) |
| 2 | hsa5915 (√) | hsa6257 (√) | hsa6257 (√) | hsa5915 (√) | hsa190 (√) | hsa5915 (√) |
| 3 | hsa6257 (√) | hsa5915 (√) | hsa5915 (√) | hsa6256 (√) | hsa6096 (√) | hsa190 (√) |
| 4 | hsa190 (√) | hsa6256 (√) | hsa190 (√) | hsa190 (√) | hsa5914 (√) | hsa6096 (√) |
| 5 | hsa6258 (√) | hsa190 (√) | hsa6256 (√) | hsa6258 (√) | hsa6097 (√) | hsa6257 (√) |
| 6 | hsa6097 (√) | hsa6097 (√) | hsa5916 (√) | hsa5916 (√) | hsa6258 (√) | hsa6256 (√) |
| 7 | hsa2099 (×) | hsa5916 (√) | hsa2104 (×) | hsa5915 (√) | hsa5916 (√) | hsa6258 (√) |
| 8 | hsa4306 (×) | hsa2908 (×) | hsa2421 (×) | hsa2101 (×) | hsa6257 (√) | hsa5916 (√) |
| 9 | hsa5465 (×) | hsa2104 (×) | hsa4306 (×) | hsa2104 (×) | hsa367 (×) | hsa2099 (×) |
| 10 | hsa2104 (×) | hsa2421 (×) | hsa9970 (×) | hsa5465 (×) | hsa4306 (×) | hsa2908 (×) |
The top 10 interacting targets of drug “D00255” in dataset GPCRs predicted by different methods (“√” denotes experimental validated targets and “×” denotes nonvalidated targets).
| Rank | Targets predicted by different methods | |||||
|---|---|---|---|---|---|---|
| BLM-NII | WNN | CMF | GRMF | NRLMF | DLGRMC | |
| 1 | hsa147 (√) | hsa150 (√) | hsa151 (√) | hsa155 (√) | hsa155 (√) | hsa147 (√) |
| 2 | hsa148 (√) | hsa146 (√) | hsa146 (√) | hsa150 (√) | hsa147 (√) | hsa155 (√) |
| 3 | hsa146 (√) | hsa155 (√) | hsa147 (√) | hsa151 (√) | hsa146 (√) | hsa151 (√) |
| 4 | hsa150 (√) | hsa153 (√) | hsa148 (√) | hsa147 (√) | hsa150 (√) | hsa150 (√) |
| 5 | hsa1812 (×) | hsa154 (√) | hsa155 (√) | hsa154 (√) | hsa148 (√) | hsa146 (√) |
| 6 | hsa2550 (×) | hsa1234 (×) | hsa154 (√) | hsa1268 (×) | hsa2550 (×) | hsa154 (√) |
| 7 | hsa2913 (×) | hsa1241 (×) | hsa2911 (×) | hsa135 (×) | hsa3361 (×) | hsa1128 (×) |
| 8 | hsa5739 (×) | hsa3354 (×) | hsa1241 (×) | hsa2911 (×) | hsa5729 (×) | hsa2911 (×) |
| 9 | hsa7201 (×) | hsa7201 (×) | hsa3354 (×) | hsa57105 (×) | hsa9052 (×) | hsa3269 (×) |
| 10 | hsa552 (×) | hsa6751 (×) | hsa6751 (×) | hsa886 (×) | hsa2911 (×) | hsa3352 (×) |
The top 10 interacting targets of drug “D00110" in dataset ICs predicted by different methods (“√" denotes experimental validated targets and “×" denotes nonvalidated targets).
| Rank | Targets predicted by different methods | |||||
|---|---|---|---|---|---|---|
| BLM-NII | WNN | CMF | GRMF | NRLMF | DLGRMC | |
| 1 | hsa6336 (√) | hsa11280 (√) | hsa6530 (√) | hsa6532 (√) | hsa6529 (√) | hsa6331 (√) |
| 2 | hsa6532 (√) | hsa6530 (√) | hsa6532 (√) | hsa11280 (√) | hsa6532 (√) | hsa6336 (√) |
| 3 | hsa6530 (√) | hsa6529 (√) | hsa11280 (√) | hsa6336 (√) | hsa6336 (√) | hsa6530 (√) |
| 4 | hsa11280 (√) | hsa6331 (√) | hsa6529 (√) | hsa6336 (√) | hsa6331 (√) | hsa6532 (√) |
| 5 | hsa6529 (√) | hsa6532 (√) | hsa6331 (√) | hsa6530 (√) | hsa11280 (√) | hsa11280 (√) |
| 6 | hsa2554 (×) | hsa2554 (×) | hsa6336 (√) | hsa6529 (√) | hsa9312 (×) | hsa6529 (√) |
| 7 | hsa2901 (×) | hsa9177 (×) | hsa2901 (×) | hsa1137 (×) | hsa93589 (×) | hsa1141 (×) |
| 8 | hsa3748 (×) | hsa773 (×) | hsa27012 (×) | hsa9312 (×) | hsa23704 (×) | hsa1137 (×) |
| 9 | hsa1134 (×) | hsa8514 (×) | hsa8973 (×) | hsa3762 (×) | hsa2892 (×) | hsa9312 (×) |
| 10 | hsa9177 (×) | hsa9311 (×) | hsa2560 (×) | hsa1139 (×) | hsa3756 (×) | hsa93589 (×) |
The top 10 interacting targets of drug “D00002" in dataset Es predicted by different methods (“√" denotes experimental validated targets and “×" denotes nonvalidated targets).
| Rank | Targets predicted by different methods | |||||
|---|---|---|---|---|---|---|
| BLM-NII | WNN | CMF | GRMF | NRLMF | DLGRMC | |
| 1 | hsa216 (√) | hsa108 (√) | hsa1725 (×) | hsa196883 (√) | hsa191 (√) | hsa191 (√) |
| 2 | hsa108 (√) | hsa1725 (×) | hsa108 (√) | hsa191 (√) | hsa196883 (√) | hsa1725 (×) |
| 3 | hsa1725 (×) | hsa191 (√) | hsa2936 (√) | hsa7498 (√) | hsa108 (√) | hsa196883 (√) |
| 4 | hsa2746 (√) | hsa3939 (√) | hsa2639 (√) | hsa3033 (√) | hsa3292 (√) | hsa108 (√) |
| 5 | hsa196883 (√) | hsa3292 (√) | hsa115 (√) | hsa108 (√) | hsa3615 (√) | hsa2936 (√) |
| 6 | hsa7015 (×) | hsa349565 (√) | hsa2597 (√) | hsa7299 (×) | hsa3939 (√) | hsa3033 (√) |
| 7 | hsa4594 (×) | hsa34 (×) | hsa3156 (×) | hsa84152 (×) | hsa3818 (×) | hsa349565 (√) |
| 8 | hsa3035 (×) | hsa8435 (×) | hsa51095 (×) | hsa590 (×) | hsa5536 (×) | hsa339221 (×) |
| 9 | hsa306 (×) | hsa51095 (×) | hsa90 (×) | hsa3156 (×) | hsa34 (×) | hsa3156 (×) |
| 10 | hsa8435 (×) | hsa306 (×) | hsa761 (×) | hsa34 (×) | hsa90 (×) | hsa3991 (×) |
The top 10 interacting targets of drug “DB00171" in dataset DB predicted by different methods (“√" denotes experimental validated targets and “×" denotes nonvalidated targets).
| Rank | Targets predicted by different methods | |||||
|---|---|---|---|---|---|---|
| BLM-NII | WNN | CMF | GRMF | NRLMF | DLGRMC | |
| 1 | P10398 (√) | P00519 (√) | P36896 (√) | P35626 (√) | O95477 (√) | Q09428 (√) |
| 2 | P36896 (√) | P35626 (√) | O43681 (√) | Q08828 (√) | P00519 (√) | P49902 (√) |
| 3 | P42684 (√) | Q9UM73 (√) | Q07912 (√) | Q9UM73 (√) | P35626 (√) | O95477 (√) |
| 4 | Q9UM73 (√) | O43681 (√) | P35626 (√) | P10398 (√) | P10398 (√) | P00519 (√) |
| 5 | O43681 (√) | P36896 (√) | P49902 (√) | P36896 (√) | P42684 (√) | P35626 (√) |
| 6 | Q07912 (√) | Q16671 (√) | Q08828 (√) | O43681 (√) | Q9UM73 (√) | Q08828 (√) |
| 7 | O95477 (√) | O95342 (√) | Q9UM73 (√) | Q07912 (√) | P36896 (√) | O14727 (√) |
| 8 | P31749 (×) | Q13131 (×) | P31749 (×) | Q15822 (×) | O43681 (√) | Q9UM73 (√) |
| 9 | P20839 (×) | P20839 (×) | P20839 (×) | P53985 (×) | Q13131 (×) | P10398 (√) |
| 10 | Q8NFJ5 (×) | P31749 (×) | P16219 (×) | P31749 (×) | O15270 (×) | P31749 (×) |
Figure 4The AUPR values versus the parameter (a) α with β = λ = 1, (b) β with α = λ = 1, and (c) λ with α = β = 1 on different datasets.