| Literature DB >> 28079135 |
Ming Hao1, Stephen H Bryant1, Yanli Wang1.
Abstract
In this work, we propose a dual-network integrated logistic matrix factorization (DNILMF) algorithm to predict potential drug-target interactions (DTI). The prediction procedure consists of four steps: (1) inferring new drug/target profiles and constructing profile kernel matrix; (2) diffusing drug profile kernel matrix with drug structure kernel matrix; (3) diffusing target profile kernel matrix with target sequence kernel matrix; and (4) building DNILMF model and smoothing new drug/target predictions based on their neighbors. We compare our algorithm with the state-of-the-art method based on the benchmark dataset. Results indicate that the DNILMF algorithm outperforms the previously reported approaches in terms of AUPR (area under precision-recall curve) and AUC (area under curve of receiver operating characteristic) based on the 5 trials of 10-fold cross-validation. We conclude that the performance improvement depends on not only the proposed objective function, but also the used nonlinear diffusion technique which is important but under studied in the DTI prediction field. In addition, we also compile a new DTI dataset for increasing the diversity of currently available benchmark datasets. The top prediction results for the new dataset are confirmed by experimental studies or supported by other computational research.Entities:
Mesh:
Year: 2017 PMID: 28079135 PMCID: PMC5227688 DOI: 10.1038/srep40376
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Summary of our compiled dataset, and the other two benchmark datasets.
| Number of drugs | Number of targets | Number of interactions | Sparsity | ||
|---|---|---|---|---|---|
| Yamanishi | Enzymes | 445 | 664 | 2,926 | 0.010 |
| IC | 210 | 204 | 1,476 | 0.034 | |
| GPCR | 223 | 95 | 635 | 0.030 | |
| NR | 54 | 26 | 90 | 0.064 | |
| Kuang | — | 786 | 809 | 3,681 | 0.006 |
| Hao | — | 829 | 733 | 3,688 | 0.006 |
Figure 1Four scenarios of DTI predictions.
For the D1-T1 pair surrounded by circle, (A) known drug - known target; (B) known drug - new target; (C) new drug - known target; and (D) new drug - new target.
Figure 2Flowchart of the whole procedure in the proposed DNILMF algorithm.
The comparison of DNILMF with NRLMF using 5 trials of 10-fold cross-validation based on the setting CVP.
| Data | Method | AUPR | AUC |
|---|---|---|---|
| Enzymes | NRLMF | 0.892 ± 0.006 | 0.987 ± 0.001 |
| DNILMF | 0.922 ± 0.008 | 0.989 ± 0.001 | |
| IC | NRLMF | 0.906 ± 0.008 | 0.989 ± 0.001 |
| DNILMF | 0.938 ± 0.008 | 0.990 ± 0.001 | |
| GPCR | NRLMF | 0.749 ± 0.015 | 0.969 ± 0.004 |
| DNILMF | 0.812 ± 0.009 | 0.975 ± 0.003 | |
| NR | NRLMF | 0.728 ± 0.041 | 0.950 ± 0.011 |
| DNILMF | 0.751 ± 0.031 | 0.955 ± 0.004 |
The comparison of DNILMF with NRLMF using 5 trials of 10-fold cross-validation based on the setting CVR.
| Data | Method | AUPR | AUC |
|---|---|---|---|
| Enzymes | NRLMF | 0.358 ± 0.040 | 0.871 ± 0.017 |
| DNILMF | 0.796 ± 0.029 | 0.964 ± 0.009 | |
| IC | NRLMF | 0.344 ± 0.033 | 0.813 ± 0.027 |
| DNILMF | 0.822 ± 0.047 | 0.961 ± 0.010 | |
| GPCR | NRLMF | 0.364 ± 0.023 | 0.895 ± 0.011 |
| DNILMF | 0.781 ± 0.050 | 0.967 ± 0.006 | |
| NR | NRLMF | 0.545 ± 0.054 | 0.900 ± 0.021 |
| DNILMF | 0.776 ± 0.026 | 0.956 ± 0.010 |
The comparison of DNILMF with NRLMF using 5 trials of 10-fold cross-validation based on the setting CVC.
| Data | Method | AUPR | AUC |
|---|---|---|---|
| Enzymes | NRLMF | 0.812 ± 0.018 | 0.966 ± 0.005 |
| DNILMF | 0.889 ± 0.023 | 0.978 ± 005 | |
| IC | NRLMF | 0.785 ± 0.028 | 0.964 ± 0.007 |
| DNILMF | 0.887 ± 0.010 | 0.970 ± 0.004 | |
| GPCR | NRLMF | 0.556 ± 0.038 | 0.930 ± 0.012 |
| DNILMF | 0.684 ± 0.036 | 0.933 ± 0.009 | |
| NR | NRLMF | 0.449 ± 0.079 | 0.851 ± 0.027 |
| DNILMF | 0.483 ± 0.050 | 0.856 ± 0.042 |
The comparison of DNILMF with EigenTrans using 5 trials of 10-fold cross-validation based on the setting CVP.
| Method | AUPR | AUC |
|---|---|---|
| EigenTrans | 0.649 ± 0.034 | 0.941 ± 0.005 |
| DNILMF | 0.748 ± 0.009 | 0.965 ± 0.001 |
Prediction results of DNILMF for the new compiled dataset.
| Combination | AUPR | AUC |
|---|---|---|
| fp2-clusto | 0.772 ± 0.011 | 0.970 ± 0.001 |
| fp2-kmer3 | 0.772 ± 0.010 | 0.970 ± 0.001 |
| pcfp-clusto | 0.774 ± 0.009 | 0.971 ± 0.001 |
| pcfp-kmer3 | 0.775 ± 0.011 | 0.972 ± 0.001 |
Top 5 predicted interactions for the new compiled dataset using DNILMF based on the pcfp-kmer3 combination.
| Rank | CID | DrugBank ID | Drug name | UniProt ID | Target name | Score |
|---|---|---|---|---|---|---|
| 1 | 4205 | DB00370 | Mirtazapine | P08908 | 5HR1A | 0.921 |
| 2 | 3380 | DB01544 | Flunitrazepam | P14867 | GARSA1 | 0.906 |
| 3 | 2818 | DB00363 | Clozapine | P21918 | DD5R | 0.903 |
| 4 | 6540428 | DB00247 | Methysergide | P28221 | 5HR1D | 0.898 |
| 5 | 3964 | DB00408 | Loxapine | P41595 | 5HR2B | 0.892 |