| Literature DB >> 32351464 |
Abstract
More and more clinical observations have implied that microbes have great effects on human diseases. Understanding the relations between microbes and diseases are of profound significance for disease prevention and therapy. In this paper, we propose a predictive model based on the known microbe-disease associations to discover potential microbe-disease associations through integrating Learning Graph Representations and a modified Scoring mechanism on the Heterogeneous network (called LGRSH). Firstly, the similarity networks for microbe and disease are obtained based on the similarity of Gaussian interaction profile kernel. Then, we construct a heterogeneous network including these two similarity networks and microbe-disease associations' network. After that, the embedding algorithm Node2vec is implemented to learn representations of nodes in the heterogeneous network. Finally, according to these low-dimensional vector representations, we calculate the relevance between each microbe and disease by utilizing a modified rule-based inference method. By comparison with three other methods including LRLSHMDA, KATZHMDA and BiRWHMDA, LGRSH performs better than others. Moreover, in case studies of asthma, Chronic Obstructive Pulmonary Disease and Inflammatory Bowel Disease, there are 8, 8, and 10 out of the top-10 discovered disease-related microbes were validated respectively, demonstrating that LGRSH performs well in predicting potential microbe-disease associations.Entities:
Keywords: Node2vec; heterogeneous network; microbe-disease association; network embedding algorithm; skip-gram
Year: 2020 PMID: 32351464 PMCID: PMC7174569 DOI: 10.3389/fmicb.2020.00579
Source DB: PubMed Journal: Front Microbiol ISSN: 1664-302X Impact factor: 5.640
FIGURE 1The flowchart of LGRSH.
FIGURE 2Description of walking strategy in Node2vec when the traversal has just gone from t to u.
FIGURE 3Description of algorithm Node2vec.
Effect of parameters p and q in fivefold cross validation.
| 0.9251 | 0.9165 | 0.9178 | 0.9246 | 0.9229 | 0.9236 | 0.9244 | |
| 0.9253 | 0.9236 | 0.9251 | 0.9246 | 0.9235 | 0.9229 | ||
| 0.9240 | 0.9250 | 0.9190 | 0.9213 | 0.9234 | 0.9234 | 0.9242 | |
| 0.9214 | 0.9204 | 0.9239 | 0.9230 | 0.9251 | 0.9181 | 0.9208 | |
| 0.9215 | 0.9222 | 0.9206 | 0.9229 | 0.9241 | 0.9239 | 0.9235 |
FIGURE 4Effect of parameters p and q in fivefold cross validation.
FIGURE 5Prediction comparison between LGRSH and other three methods in LOOCV and fivefold cross validation while p = 0.5, q = 4.
FIGURE 6Precision-recall curves for LGRSH and other three methods in fivefold cross validation.
FIGURE 7The number of correctly predicted by LGRSH and other three methods on HMDAD.
Validation results for Top-10 predicted microbes related with asthma.
| Rank | Microbe | Evidence |
| 1 | Clostridium difficile | PMID:21872915 |
| 2 | Firmicutes | PMID:27078029 |
| 3 | Clostridium coccoides | PMID:21477358 |
| 4 | Actinobacteria | PMID:30286807 |
| 5 | Enterobacteriaceae | PMID:28947029 |
| 6 | Lactobacillus | PMID:30400588 |
| 7 | Bacteroides | PMID:18822123 |
| PMID:29161087 | ||
| 8 | Burkholderia | |
| 9 | Lachnospiraceae | PMID:28912020 |
| 10 | Enterococcus |
Validation results for Top-10 predicted microbes related with COPD.
| Rank | Microbe | Evidence |
| 1 | Proteobacteria | PMID:29579057 |
| 2 | Prevotella | PMID:28542929 |
| 3 | Helicobacter pylori | PMID:28558695 |
| 4 | Actinobacteria | PMID:29709671 |
| 5 | Bacteroidetes | PMID:29579057 |
| 6 | Clostridium difficile | PMID:30430993 |
| 7 | Clostridium coccoides | |
| 8 | Lactobacillus | PMID:26630356 |
| 9 | Lachnospiraceae | |
| 10 | Staphylococcus aureus | PMID:30804927 |
Validation results for Top-10 predicted microbes related with IBD.
| Rank | Microbe | Evidence |
| 1 | Prevotella | PMID:24013298 |
| 2 | Bacteroidetes | PMID:29492876 |
| 3 | Clostridium difficile | PMID:24838421 |
| 4 | Helicobacter pylori | PMID:22221289 |
| PMID:28124160 | ||
| 5 | Firmicutes | PMID:25307765 |
| PMID:29492876 | ||
| 6 | Clostridium coccoides | PMID:19235886 |
| 7 | Lactobacillus | PMID:26340825 |
| 8 | Enterobacteriaceae | PMID:30319571 |
| 9 | Veillonella | PMID:30573380 |
| 10 | Haemophilus | PMID:24013298 |