| Literature DB >> 35369503 |
Da Xu1, Hanxiao Xu1, Yusen Zhang1, Rui Gao2.
Abstract
Extensive clinical and biomedical studies have shown that microbiome plays a prominent role in human health. Identifying potential microbe-disease associations (MDAs) can help reveal the pathological mechanism of human diseases and be useful for the prevention, diagnosis, and treatment of human diseases. Therefore, it is necessary to develop effective computational models and reduce the cost and time of biological experiments. Here, we developed a novel machine learning-based joint framework called CWNMF-GLapRLS for human MDA prediction using the proposed collaborative weighted non-negative matrix factorization (CWNMF) technique and graph Laplacian regularized least squares. Especially, to fuse more similarity information, we calculated the functional similarity of microbes. To deal with missing values and effectively overcome the data sparsity problem, we proposed a collaborative weighted NMF technique to reconstruct the original association matrix. In addition, we developed a graph Laplacian regularized least-squares method for prediction. The experimental results of fivefold and leave-one-out cross-validation demonstrated that our method achieved the best performance by comparing it with 5 state-of-the-art methods on the benchmark dataset. Case studies further showed that the proposed method is an effective tool to predict potential MDAs and can provide more help for biomedical researchers.Entities:
Keywords: association prediction; collaborative weighted non-negative matrix factorization; disease; graph Laplacian regularized least squares; microbe
Year: 2022 PMID: 35369503 PMCID: PMC8965656 DOI: 10.3389/fmicb.2022.834982
Source DB: PubMed Journal: Front Microbiol ISSN: 1664-302X Impact factor: 5.640
Summary of microbe–disease association dataset.
| Types | Statistical information |
| Microbes | 292 |
| Diseases | 39 |
| Associations | 450 |
| Sparsity (%) | 96.05 |
FIGURE 1The flowchart of CWNMF-GLapRLS framework for prediction.
FIGURE 2(A) The illustration of determining the optimal values of parameter pair (λ1, λ2) under grid search. (B) The effects between parameter η and AUC value.
FIGURE 3Convergence behavior of CWNMF objection function.
FIGURE 4The performance comparison of different methods.
FIGURE 5The ROC curves and AUC values of six methods under LOOCV framework.
FIGURE 6The ROC curves and average AUC values of six methods under fivefold CV framework.
Prediction results of the top 15 asthma-associated microbes.
| Rank | Microbe | Evidence |
| 1 |
| PMID:23265859 |
| 2 |
| PMID:21477358 |
| 3 |
| PMID:26220531 |
| 4 |
| Unconfirmed |
| 5 |
| PMID:10202341 |
| 6 |
| PMID:21872915 |
| 7 |
| PMID:30400588 |
| 8 |
| PMID:24735374 |
| 9 |
| PMID:31958431 |
| 10 |
| PMID:26424567 |
| 11 |
| PMID:25865368 |
| 12 |
| PMID:25533526 |
| 13 |
| Unconfirmed |
| 14 |
| PMID:30208875 |
| 15 |
| PMID:27838347 |
Prediction results of the top 15 IBD-associated microbes.
| Rank | Microbe | Evidence |
| 1 |
| PMID:25307765 |
| 2 |
| PMID:25307765 |
| 3 |
| PMID:25307765 |
| 4 |
| PMID:19235886 |
| 5 |
| PMID:22221289 |
| 6 |
| PMID:25307765 |
| 7 |
| PMID:31142855 |
| 8 |
| PMID:24013298 |
| 9 |
| PMID:24838421 |
| 10 |
| PMID:24478468 |
| 11 |
| PMID:24478468 |
| 12 |
| PMID:24013298 |
| 13 |
| PMID:19809406 |
| 14 |
| Unconfirmed |
| 15 |
| PMID:32815163 |