| Literature DB >> 35409235 |
Ping Xuan1, Zixuan Lu1, Tiangang Zhang2, Yong Liu1, Toshiya Nakaguchi3.
Abstract
Identifying new disease indications for existing drugs can help facilitate drug development and reduce development cost. The previous drug-disease association prediction methods focused on data about drugs and diseases from multiple sources. However, they did not deeply integrate the neighbor topological information of drug and disease nodes from various meta-path perspectives. We propose a prediction method called NAPred to encode and integrate meta-path-level neighbor topologies, multiple kinds of drug attributes, and drug-related and disease-related similarities and associations. The multiple kinds of similarities between drugs reflect the degrees of similarity between two drugs from different perspectives. Therefore, we constructed three drug-disease heterogeneous networks according to these drug similarities, respectively. A learning framework based on fully connected neural networks and a convolutional neural network with an attention mechanism is proposed to learn information of the neighbor nodes of a pair of drug and disease nodes. The multiple neighbor sets composed of different kinds of nodes were formed respectively based on meta-paths with different semantics and different scales. We established the attention mechanisms at the neighbor-scale level and at the neighbor topology level to learn enhanced neighbor feature representations and enhanced neighbor topological representations. A convolutional-autoencoder-based module is proposed to encode the attributes of the drug-disease pair in three heterogeneous networks. Extensive experimental results indicated that NAPred outperformed several state-of-the-art methods for drug-disease association prediction, and the improved recall rates demonstrated that NAPred was able to retrieve more actual drug-disease associations from the top-ranked candidates. Case studies on five drugs further demonstrated the ability of NAPred to identify potential drug-related disease candidates.Entities:
Keywords: drug–disease association prediction; fully connected neural networks and autoencoder based on CNN; multiple drug–disease heterogeneous networks; neighbor topology learning based on meta-paths; pairwise node attribute encoding
Mesh:
Year: 2022 PMID: 35409235 PMCID: PMC8999005 DOI: 10.3390/ijms23073870
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
Figure 1ROC and PR curves of all the methods of drug–disease association.
The statistical results of the paired Wilcoxon test on the AUCs over all the 763 drugs by comparing NAPred and all other five methods.
| GFPred | CBPred | SCMFDD | LRSSL | MBiRW | HGBI | |
|---|---|---|---|---|---|---|
| 5.27051 × 10 | 1.83480 × 10 | 5.49787 × 10 | 5.31080 × 10 | 2.89205 × 10 | 1.74747 × 10 | |
| 3.42304 × 10 | 4.72506 × 10 | 1.81013 × 10 | 8.63715 × 10 | 4.68094 × 10 | 4.85712 × 10 |
Figure 2The average recalls of all the drugs under different top-k.
The top-10 candidate diseases of 5 drugs.
| Drug Name | Rank | Disease Name | Description | Rank | Disease Name | Description |
|---|---|---|---|---|---|---|
| 1 | Staphylococcal Infections | CTD, PubChem | 6 | Staphylococcal Skin | PubChem | |
| Infections | ||||||
| 2 | Pneumonia, Bacterial | ClinicalTrials | 7 | Streptococcal Infections | CTD, ClinicalTrials | |
| Ampicillin | 3 | Urinary Tract Infections | CTD, DrugBank, | 8 | Osteomyelitis | PubChem, |
| PubChem | ClinicalTrials | |||||
| 4 | Wound Infection | PubChem, ClinicalTrials | 9 | Postoperative Complications | PubChem | |
| 5 | Proteus Infections | Inferred Candidate | 10 | Bacterial Infections | CTD, DrugBank, | |
| by 2 Literature Works | ClinicalTrials | |||||
| 1 | Escherichia coli Infections | CTD, PubChem, ClinicalTrials | 6 | Salmonella Infections | DrugBank, PubChem, ClinicalTrials | |
| 2 | Urinary Tract Infections | DrugBank, PubChem, | 7 | Enterobacteriaceae Infections | PubChem, ClinicalTrials | |
| ClinicalTrials | ||||||
| Ceftriaxone | 3 | Haemophilus Infections | PubChem | 8 | Septicemia | DrugBank, PubChem, |
| ClinicalTrials | ||||||
| 4 | Gonorrhea | DrugBank, PubChem, | 9 | Endocarditis, Bacterial | DrugBank, ClinicalTrials | |
| ClinicalTrials | ||||||
| 5 | Gram-Negative Bacterial | Inferred Candidate | 10 | Pseudomonas Infections | PubChem | |
| Infections | by 1 Literature Work | |||||
| 1 | Urinary Tract Infections | CTD, PubChem | 6 | Leukemia, Lymphoid | CTD, DrugBank, | |
| ClinicalTrials | ||||||
| 2 | Leukemia, Myeloid, | CTD, DrugBank, | 7 | Bronchitis | CTD | |
| Acute | ClinicalTrials | |||||
| Doxorubicin | 3 | Escherichia coli Infections | CTD | 8 | Sarcoma | CTD, DrugBank, |
| ClinicalTrials | ||||||
| 4 | Neoplasms | ClinicalTrials, PubChem | 9 | Gonorrhea | Unconfirmed | |
| 5 | Staphylococcal Infections | CTD, PubChem | 10 | Precursor Cell Lymphoblastic | CTD | |
| Leukemia-Lymphoma | ||||||
| 1 | Gonorrhea | DrugBank, PubChem | 6 | Gram-Positive Bacterial Infections | PubChem | |
| 2 | Gram-Negative Bacterial | PubChem | 7 | Staphylococcal Infections | CTD, DrugBank, | |
| Erythromycin | Infections | PubChem | ||||
| 3 | Chancroid | DrugBank, PubChem | 8 | Pneumonia, Mycoplasma | Unconfirmed | |
| 4 | Bacterial Infections | DrugBank, PubChem | 9 | Neurosyphilis | PubChem | |
| 5 | Neisseriaceae Infections | DrugBank | 10 | Chlamydiaceae Infections | DrugBank, ClinicalTrials | |
| 1 | Candidiasis, Cutaneous | DrugBank, PubChem, | 6 | Tinea Capitis | DrugBank, PubChem | |
| ClinicalTrials | ||||||
| 2 | Tinea Versicolor | DrugBank, PubChem, | 7 | Fungemia | DrugBank, PubChem, | |
| ClinicalTrials | ClinicalTrials | |||||
| Itraconazole | 3 | Tinea Pedis | DrugBank, PubChem | 8 | Skin Diseases, Infectious | PubChem, ClinicalTrials |
| 4 | Leishmaniasis | CTD, PubChem, | 9 | AIDS-Related Opportunistic | ClinicalTrials | |
| ClinicalTrials | Infections | |||||
| 5 | Chromoblastomycosis | DrugBank, PubChem | 10 | Candidiasis | CTD, DrugBank, PubChem |
Figure 3Framework of the proposed NAPred model. (a) Construct multi-scale meta-path sets and the sets composed of the same-type neighbor nodes. (b) Encode the attribute vectors of neighbor nodes of a drug. (c) Encode the attribute vectors of neighbor nodes of a disease. (d) Learn the neighbor topology of a drug–disease node pair. (e) Learn the attributes of the node pair. (f) Integrate multiple representations.
Figure 4Construction of three heterogeneous networks based on multiple kinds of drug similarities, drug–disease associations, and disease similarities.
Figure 5Illustration of constructing an attribute embedding matrix for a pair of drug and disease nodes.