| Literature DB >> 32894154 |
Bo-Ya Ji1,2, Zhu-Hong You3,4, Han-Jing Jiang1,2, Zhen-Hao Guo1,2, Kai Zheng5.
Abstract
BACKGROUND: The prediction of potential drug-target interactions (DTIs) not only provides a better comprehension of biological processes but also is critical for identifying new drugs. However, due to the disadvantages of expensive and high time-consuming traditional experiments, only a small section of interactions between drugs and targets in the database were verified experimentally. Therefore, it is meaningful and important to develop new computational methods with good performance for DTIs prediction. At present, many existing computational methods only utilize the single type of interactions between drugs and proteins without paying attention to the associations and influences with other types of molecules.Entities:
Keywords: Drug-target interactions; Heterogeneous information network; LINE; Random forest
Mesh:
Substances:
Year: 2020 PMID: 32894154 PMCID: PMC7487884 DOI: 10.1186/s12967-020-02490-x
Source DB: PubMed Journal: J Transl Med ISSN: 1479-5876 Impact factor: 5.531
Fig. 1The heterogeneous association information network
Fig. 2Computation framework of our model
The association information in the network
| Association | Database | Amount |
|---|---|---|
| miRNA-lncRNA | lncRNASNP2 [ | 8374 |
| miRNA-disease | HMDD v3.0 [ | 16,427 |
| miRNA-protein | miRTarBase:update 2018 [ | 4944 |
| LncRNA-disease | LncRNADisease [ | |
| lncRNASNP2 [ | 1264 | |
| Drug-disease | CTD: update 2019 [ | 18,416 |
| LncRNA-protein | LncRNA2Target v2.0 [ | 690 |
| Protein–protein | STRING: in 2017 [ | 19,237 |
| Protein-disease | DisGeNET [ | 25,087 |
| Total | N/A | 94,439 |
The node information in the network
| Node | Amount |
|---|---|
| Drug | 134 |
| MiRNA | 1023 |
| Disease | 2062 |
| Protein | 613 |
| LncRNA | 769 |
| Total | 4601 |
Evaluation of our model under five-fold cross-validation
| Fold | ACC. (%) | Spec. (%) | Prec. (%) | MCC (%) | Sen. (%) | AUC (%) |
|---|---|---|---|---|---|---|
| 0 | 86.45 | 91.49 | 90.54 | 73.28 | 81.41 | 92.90 |
| 1 | 85.87 | 90.86 | 89.85 | 72.10 | 80.87 | 92.31 |
| 2 | 85.08 | 90.82 | 89.63 | 70.63 | 79.34 | 92.05 |
| 3 | 85.13 | 91.27 | 90.05 | 70.79 | 78.98 | 91.71 |
| 4 | 86.64 | 91.53 | 90.61 | 73.63 | 81.75 | 92.66 |
| Average | 85.83±0.72 | 91.19±0.34 | 90.14±0.43 | 72.09±1.38 | 80.47±1.24 | 92.33±0.47 |
Fig. 3The ROC curves of our model under fivefold cross-validation
Fig. 4The PR curves of our model under five-fold cross-validation
Comparison of different feature combinations
| Feature | Acc. (%) | Spec. (%) | Prec. (%) | MCC (%) | Sen. (%) | AUC (%) |
|---|---|---|---|---|---|---|
| Attribute | 80.73 ± 0.79 | 84.36 ± 1.05 | 83.14 ± 1.04 | 61.63 ± 1.61 | 77.11 ± 0.60 | 87.77 ± 0.83 |
| Behavior | 85.75 ± 0.59 | 91.12 ± 0.90 | 90.06 ± 0.92 | 71.92 ± 1.21 | 80.37 ± 0.68 | 92.18 ± 0.51 |
| Both | 85.83 ± 0.72 | 91.19 ± 0.34 | 90.14 ± 0.43 | 72.09 ± 1.38 | 80.47 ± 1.24 | 92.33 ± 0.47 |
Fig. 5Comparison of different feature combinations under fivefold cross validation
Comparison of different machine learning classifiers
| Classifier | ACC. (%) | Spec. (%) | Prec. (%) | MCC (%) | Sen. (%) | AUC (%) |
|---|---|---|---|---|---|---|
| Logistic | 77.63 ± 1.03 | 81.19 ± 0.75 | 79.74 ± 0.91 | 55.40 ± 2.04 | 74.06 ± 1.41 | 84.27 ± 1.30 |
| KNN | 82.04 ± 1.19 | 79.83 ± 2.26 | 80.72 ± 1.74 | 64.15 ± 2.32 | 84.24 ± 0.78 | 88.99 ± 0.81 |
| Naive Bayes | 72.57 ± 1.16 | 73.74 ± 1.09 | 73.11 ± 1.04 | 45.15 ± 2.31 | 71.39 ± 1.91 | 77.30 ± 1.57 |
| DecisionTree | 79.81 ± 0.66 | 79.73 ± 1.29 | 79.78 ± 1.01 | 59.63 ± 1.32 | 79.89 ± 0.60 | 79.81 ± 0.66 |
| RandomForest | 85.83 ± 0.72 | 91.19 ± 0.34 | 90.14 ± 0.43 | 72.09 ± 1.38 | 80.47 ± 1.24 | 92.33 ± 0.47 |
Fig. 6Comparison of different machine learning classifiers under fivefold cross-validation
Prediction of the top 10 targets associated with Caffeine
| UniProt ID | Target | Evidence |
|---|---|---|
| 9606.ensp00000342007 | Cytochrome P450 1A2 | SuperTarget |
| 9606.ensp00000360372 | Cytochrome P450 2C19 | Unconfirmed |
| 9606.ensp00000337915 | Cytochrome P450 3A4 | SuperTarget |
| 9606.ensp00000478255 | ATP-dependent translocase ABCB1 | DrugBank |
| 9606.ensp00000360317 | Cytochrome P450 2C8 | SuperTarget |
| 9606.ensp00000260682 | Cytochrome P450 2C9 | SuperTarget |
| 9606.ensp00000324648 | Cytochrome P450 2B6 | Unconfirmed |
| 9606.ensp00000440689 | Cytochrome P450 2E1 | SuperTarget |
| 9606.ensp00000353820 | Cytochrome P450 2D6 | SuperTarget |
| 9606.ensp00000222982 | Cytochrome P450 3A5 | SuperTarget |
Prediction of the top 10 targets associated with Clozapine
| UniProt ID | Target | Evidence |
|---|---|---|
| 9606.ensp00000478255 | ATP-dependent translocase ABCB1 | DrugBank |
| 9606.ensp00000342007 | Cytochrome P450 1A2 | SuperTarget |
| 9606.ensp00000360372 | Cytochrome P450 2C19 | SuperTarget |
| 9606.ensp00000260682 | Cytochrome P450 2C9 | SuperTarget |
| 9606.ensp00000337915 | Cytochrome P450 3A4 | SuperTarget |
| 9606.ensp00000324648 | Cytochrome P450 2B6 | Unconfirmed |
| 9606.ensp00000353820 | Cytochrome P450 2D6 | SuperTarget |
| 9606.ensp00000222982 | Cytochrome P450 3A5 | SuperTarget |
| 9606.ensp00000295897 | Serum albumin | Unconfirmed |
| 9606.ensp00000480571 | Cytochrome P450 3A7 | Unconfirmed |
Prediction of the top 10 targets associated with Pioglitazone
| UniProt ID | Target | Evidence |
|---|---|---|
| 9606.ensp00000337915 | Cytochrome P450 3A4 | SuperTarget |
| 9606.ensp00000478255 | ATP-dependent translocase ABCB1 | Unconfirmed |
| 9606.ensp00000353820 | Cytochrome P450 2D6 | SuperTarget |
| 9606.ensp00000367102 | Solute carrier family 22 member 6 | Unconfirmed |
| 9606.ensp00000222982 | Cytochrome P450 3A5 | Unconfirmed |
| 9606.ensp00000260682 | Cytochrome P450 2C9 | SuperTarget |
| 9606.ensp00000360372 | Cytochrome P450 2C19 | DrugBank |
| 9606.ensp00000369050 | Cytochrome P450 1A1 | Unconfirmed |
| 9606.ensp00000360317 | Cytochrome P450 2C8 | SuperTarget |
| 9606.ensp00000256958 | Solute carrier organic anion transporter family member 1B1 | DrugBank |