| Literature DB >> 34054920 |
Bo-Wei Zhao1,2,3, Zhu-Hong You1,2,3, Leon Wong1,2,3, Ping Zhang4, Hao-Yuan Li5, Lei Wang1,2,3.
Abstract
Drug repositioning is an application-based solution based on mining existing drugs to find new targets, quickly discovering new drug-disease associations, and reducing the risk of drug discovery in traditional medicine and biology. Therefore, it is of great significance to design a computational model with high efficiency and accuracy. In this paper, we propose a novel computational method MGRL to predict drug-disease associations based on multi-graph representation learning. More specifically, MGRL first uses the graph convolution network to learn the graph representation of drugs and diseases from their self-attributes. Then, the graph embedding algorithm is used to represent the relationships between drugs and diseases. Finally, the two kinds of graph representation learning features were put into the random forest classifier for training. To the best of our knowledge, this is the first work to construct a multi-graph to extract the characteristics of drugs and diseases to predict drug-disease associations. The experiments show that the MGRL can achieve a higher AUC of 0.8506 based on five-fold cross-validation, which is significantly better than other existing methods. Case study results show the reliability of the proposed method, which is of great significance for practical applications.Entities:
Keywords: disease; drug; drug repositioning; graph embedding; multi-graph representation learning
Year: 2021 PMID: 34054920 PMCID: PMC8153989 DOI: 10.3389/fgene.2021.657182
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
Figure 1Flowchart of the proposed method.
Five-fold cross-validation results performed by MGRL.
| 0 | 76.93 | 73.59 | 80.27 | 78.85 | 53.97 | 84.93 |
| 1 | 76.47 | 73.45 | 79.48 | 78.16 | 53.03 | 84.90 |
| 2 | 77.67 | 75.11 | 80.24 | 79.17 | 55.42 | 85.93 |
| 3 | 76.53 | 74.10 | 78.96 | 77.89 | 53.13 | 84.79 |
| 4 | 76.70 | 73.05 | 80.35 | 78.80 | 53.54 | 84.79 |
| Average |
Figure 2The ROCs, AUCs, PRs, and AUPRs of MGRL under five-fold cross-validation on the benchmark dataset.
Comparison of different feature using Random Forest Classifier under five-fold cross-validation.
| Attribute | 75.53 ± 0.37 | 76.38 ± 0.82 | 74.68 ± 0.47 | 75.10 ± 0.31 | 51.07 ± 0.73 | 83.40 ± 0.45 |
| Embedding | 76.31 ± 0.52 | 72.05 ± 0.64 | 80.58 ± 0.76 | 78.77 ± 0.68 | 52.82 ± 1.06 | 84.50 ± 0.54 |
| GCN+Embedding |
Figure 3The comparison of different feature using Gradient Boosting Decision Tree classifier.
Comparison of different machine learning classifier under five-fold cross-validation.
| SVM | 70.62 ± 0.85 | 71.71 ± 1.17 | 69.53 ± 1.61 | 70.20 ± 1.05 | 41.26 ± 1.69 | 77.58 ± 0.77 |
| Logistic | 71.48 ± 0.60 | 71.34 ± 0.78 | 71.61 ± 0.65 | 71.54 ± 0.59 | 42.95 ± 1.20 | 78.66 ± 0.56 |
| KNN | 69.13 ± 0.48 | 86.33 ± 0.34 | 51.92 ± 0.94 | 64.23 ± 0.45 | 40.74 ± 0.91 | 78.87 ± 0.60 |
| GBDT | 74.40 ± 0.43 | 60.90 ± 0.80 | 87.90 ± 0.78 | 83.44 ± 0.83 | 50.69 ± 0.91 | 84.67 ± 0.66 |
| Random Forest |
Figure 4The performance comparison between Random Forest and GDBT, KNN, Logistic Regression, and SVM.
Figure 5Under the CTD Dataset (contains 18,416 drug-disease associations between 269 drugs, and 598 diseases.), TL-HGBI, DeepDR, Resource allocation and DRRS were compared between the AUCs obtained under five-fold cross-validation.
The top 10 drug candidates of the five popular drugs supported by MGRL.
| Doxorubicin | 1 | Seizures | CTD | 6 | Hemolysis | CTD |
| 2 | Headache | CTD | 7 | Drug eruptions | CTD | |
| 3 | Glioma | CTD | 8 | Cerebral hemorrhage | CTD | |
| 4 | Muscular diseases | CTD | 9 | Pancytopenia | CTD | |
| 5 | Drug hypersensitivity | CTD | 10 | Hyperbilirubinemia | CTD | |
| Etoposide | 1 | Headache | CTD | 6 | Anemia, hemolytic | CTD |
| 2 | Edema | CTD | 7 | Hypertension | CTD | |
| 3 | Thrombosis | CTD | 8 | Ovarian neoplasms | CTD | |
| 4 | Cholestasis | CTD | 9 | Ventricular dysfunction, left | CTD | |
| 5 | Exanthema | CTD | 10 | Carcinoma, hepatocellular | CTD | |
| Levodopa | 1 | Depressive disorder | CTD | 6 | Ataxia | CTD |
| 2 | Chemical and drug induced liver injury | CTD | 7 | Fever | CTD | |
| 3 | Inappropriate adh syndrome | CTD | 8 | Schizophrenia | CTD | |
| 4 | Tachycardia | CTD | 9 | Paresthesia | Unconfirmed | |
| 5 | Edema | CTD | 10 | Mood disorders | Unconfirmed | |
| Clonidine | 1 | Headache | Unconfirmed | 6 | Long qt syndrome | CTD |
| 2 | Memory disorders | CTD | 7 | Dystonia | CTD | |
| 3 | Chemical and drug induced liver injury | CTD | 8 | Nervous system diseases | CTD | |
| 4 | Bipolar disorder | CTD | 9 | Necrosis | CTD | |
| 5 | Cognition disorders | CTD | 10 | Psychotic disorders | CTD | |
| Ciprofloxacin | 1 | Muscle weakness | CTD | 6 | Substance withdrawal syndrome | CTD |
| 2 | Arrhythmias, cardiac | Unconfirmed | 7 | Hyperalgesia | CTD | |
| 3 | Necrosis | CTD | 8 | Tachycardia | CTD | |
| 4 | Liver diseases | CTD | 9 | Gastrointestinal diseases | CTD | |
| 5 | Sleep initiation and maintenance disorders | Unconfirmed | 10 | Anaphylaxis | Unconfirmed |