| Literature DB >> 36014371 |
Jiaxin Li1, Xixin Yang1,2, Yuanlin Guan3,4, Zhenkuan Pan1.
Abstract
Nowadays, drug-target interactions (DTIs) prediction is a fundamental part of drug repositioning. However, on the one hand, drug-target interactions prediction models usually consider drugs or targets information, which ignore prior knowledge between drugs and targets. On the other hand, models incorporating priori knowledge cannot make interactions prediction for under-studied drugs and targets. Hence, this article proposes a novel dual-network integrated logistic matrix factorization DTIs prediction scheme (Ro-DNILMF) via a knowledge graph embedding approach. This model adds prior knowledge as input data into the prediction model and inherits the advantages of the DNILMF model, which can predict under-studied drug-target interactions. Firstly, a knowledge graph embedding model based on relational rotation (RotatE) is trained to construct the interaction adjacency matrix and integrate prior knowledge. Secondly, a dual-network integrated logistic matrix factorization prediction model (DNILMF) is used to predict new drugs and targets. Finally, several experiments conducted on the public datasets are used to demonstrate that the proposed method outperforms the single base-line model and some mainstream methods on efficiency.Entities:
Keywords: drug–target interactions prediction; dual-network integrated logistic matrix factorization; knowledge graph embedding
Mesh:
Year: 2022 PMID: 36014371 PMCID: PMC9412517 DOI: 10.3390/molecules27165131
Source DB: PubMed Journal: Molecules ISSN: 1420-3049 Impact factor: 4.927
Figure 1Basic architecture of DNIMLF.
Figure 2Graphical illustration of the proposed DTI prediction scheme.
Dataset information.
| Dataset | Group | Drugs | Proteins | DTIs |
|---|---|---|---|---|
| Yamanishi_08 | EN | 445 | 664 | 2926 |
| IC | 210 | 204 | 1476 | |
| GPCR | 223 | 95 | 635 | |
| NR | 54 | 26 | 90 | |
| ALL | 932 | 989 | 5127 | |
| KEGG | - | 10,979 | 13,959 | 12,112 |
| DrugBank | - | 1482 | 1408 | 9881 |
The parameter settings.
| Parameter | Value | |||||
|---|---|---|---|---|---|---|
|
| 125 | 250 | 500 | 750 | 1000 | - |
|
| 128 | 256 | 512 | 1024 | 2048 | - |
|
| 3 | 6 | 9 | 12 | 24 | 30 |
|
| 0.5 | 0.6 | 0.7 | 0.8 | 0.9 | 1 |
|
| 0.25 | 0.2 | 0.15 | 0.1 | 0.05 | 0 |
|
| 0.25 | 0.2 | 0.15 | 0.1 | 0.05 | 0 |
|
| 5 | 6 | 7 | 8 | 9 | 10 |
The results of the optimization function.
| Sigmoid | Tanh | |||||||
|---|---|---|---|---|---|---|---|---|
| MRR | H@1 | H@3 | H@10 | MRR | H@1 | H@3 | H@10 | |
| EN | 0.683 | 0.490 | 0.524 | 0.741 | 0.692 | 0.537 | 0.547 | 0.732 |
| IC | 0.359 | 0.512 | 0.537 |
| 0.467 | 0.564 | 0.582 | 0.861 |
| GPCR | 0.723 | 0.509 | 0.518 | 0.803 | 0.743 | 0.521 | 0.539 |
|
| NR | 0.436 | 0.427 | 0.469 | 0.654 | 0.584 | 0.433 | 0.486 | 0.747 |
The blue part 0.817 is the best performance of the sigmoid function and the blue part 0.817 is the best performance of the tanh function.
Figure 3Hit@N of the score function. (a) Description of Hit@1 with three knowledge graph embedding models. (b) Description of Hit@3 with three knowledge graph embedding models. (c) Description of Hit@10 with three knowledge graph embedding models.
Figure 4The AUC and AUPR with the different number of samples.
Figure 5Comparative results of the presented method and other mainstream methods.
Comparative results of the presented method and other combination models.
| Metrics | Embedding Model | PredictingModel | EN | IC | GPCR | NR |
|---|---|---|---|---|---|---|
| AUPR | - | NRLMF | 0.812 | 0.785 | 0.556 | 0.449 |
| DNILMF | 0.869 | 0.887 | 0.684 | 0.483 | ||
| Ro-DNILMF |
|
|
|
| ||
| TransE | NRLMF | 0.816 | 0.789 | 0.560 | 0.478 | |
| DNILMF | 0.873 | 0.889 | 0.684 | 0.535 | ||
| DisMult | NRLMF | 0.815 | 0.786 | 0.574 | 0.497 | |
| DNILMF | 0.872 | 0.893 | 0.687 | 0.530 | ||
| HolE | NRLMF | 0.813 | 0.795 | 0.587 | 0.513 | |
| DNILMF | 0.889 | 0.903 | 0.695 | 0.573 | ||
| ComplEx | NRLMF | 0.824 | 0.793 | 0.593 | 0.510 | |
| DNILMF | 0.886 | 0.904 | 0.703 | 0.542 | ||
| ConvE | NRLMF | 0.818 | 0.814 | 0.609 | 0.526 | |
| DNILMF | 0.873 | 0.903 | 0.721 | 0.568 | ||
| pRotatE | NRLMF | 0.820 | 0.817 | 0.614 | 0.523 | |
| DNILMF | 0.886 | 0.905 | 0.715 | 0.567 | ||
| RotatE | NRLMF | 0.823 | 0.826 | 0.627 | 0.527 | |
| DNILMF |
|
|
|
| ||
| AUC | - | NRLMF | 0.966 | 0.943 | 0.930 | 0.851 |
| DNILMF | 0.971 | 0.962 | 0.933 | 0.856 | ||
| Ro-DNILMF |
|
|
|
| ||
| TransE | NRLMF | 0.966 | 0.944 | 0.930 | 0.859 | |
| DNILMF | 0.968 | 0.964 | 0.933 | 0.854 | ||
| DisMult | NRLMF | 0.965 | 0.947 | 0.932 | 0.864 | |
| DNILMF | 0.968 | 0.963 | 0.934 | 0.867 | ||
| HolE | NRLMF | 0.966 | 0.946 | 0.931 | 0.873 | |
| DNILMF | 0.971 | 0.975 | 0.936 | 0.875 | ||
| ComplEx | NRLMF | 0.967 | 0.969 | 0.940 | 0.884 | |
| DNILMF | 0.972 | 0.982 | 0.938 | 0.891 | ||
| ConvE | NRLMF | 0.965 | 0.974 | 0.939 | 0.873 | |
| DNILMF | 0.970 | 0.979 | 0.937 | 0.886 | ||
| pRotatE | NRLMF | 0.964 | 0.981 | 0.938 | 0.896 | |
| DNILMF | 0.971 | 0.982 | 0.939 | 0.893 | ||
| RotatE | NRLMF | 0.970 | 0.982 | 0.939 | 0.901 | |
| DNILMF |
|
|
|
|
The green part is the performance of the Ro-DNILMF model and the blue part is the best performance of the other combination models.