| Literature DB >> 34178973 |
Wei Peng1,2, Jielin Du1, Wei Dai1,2, Wei Lan3.
Abstract
MicroRNAs (miRNAs) are a category of small non-coding RNAs that profoundly impact various biological processes related to human disease. Inferring the potential miRNA-disease associations benefits the study of human diseases, such as disease prevention, disease diagnosis, and drug development. In this work, we propose a novel heterogeneous network embedding-based method called MDN-NMTF (Module-based Dynamic Neighborhood Non-negative Matrix Tri-Factorization) for predicting miRNA-disease associations. MDN-NMTF constructs a heterogeneous network of disease similarity network, miRNA similarity network and a known miRNA-disease association network. After that, it learns the latent vector representation for miRNAs and diseases in the heterogeneous network. Finally, the association probability is computed by the product of the latent miRNA and disease vectors. MDN-NMTF not only successfully integrates diverse biological information of miRNAs and diseases to predict miRNA-disease associations, but also considers the module properties of miRNAs and diseases in the course of learning vector representation, which can maximally preserve the heterogeneous network structural information and the network properties. At the same time, we also extend MDN-NMTF to a new version (called MDN-NMTF2) by using modular information to improve the miRNA-disease association prediction ability. Our methods and the other four existing methods are applied to predict miRNA-disease associations in four databases. The prediction results show that our methods can improve the miRNA-disease association prediction to a high level compared with the four existing methods.Entities:
Keywords: disease; heterogeneous network embedding; matrix factorization; miRNA; miRNA-disease association prediction
Year: 2021 PMID: 34178973 PMCID: PMC8223753 DOI: 10.3389/fcell.2021.603758
Source DB: PubMed Journal: Front Cell Dev Biol ISSN: 2296-634X
The number of MiRNAs, diseases and miRNA-disease associations in four datasets.
| HMDD2.0-You | 495 | 380 | 5424 |
| HMDD2.0-Lan | 550 | 329 | 6084 |
| HMDD2.0-Yan | 576 | 356 | 6391 |
| HMDD3.0 | 1207 | 894 | 18732 |
FIGURE 1The flowchart of MDN-NMTF and MDN-NMTF2 to predict miRNA-disease association. The MDN-NMTF model takes four steps to predict miRNA-disease associations: building the similarity networks, learning feature representation for miRNA and diseases; reconstructing the miRNA-disease association network, predicting miRNA-disease associations. MDN-NMTF2 is an extended version of MDN-NMTF. It divides the miRNAs and diseases into several modules on the basis of the representation vectors learned by MDN-NMTF. MDN-NMTF2 calculates the similarity of two miRNAs or two diseases based on the module they belong to and infers the novel miRNA-disease associations from similar miRNAs or diseases in the same modules.
The AUC values for the models on different databases by randomly zeroing cross validation.
| MDN-NMTF | 0.9335 ± 0.0037 | 0.9409 ± 0.0030 | 0.9391 ± 0.0033 | 0.9435 ± 0.0021 |
| MDN-NMTF2 | ||||
| DNRLMF-MDA | 0.9301 ± 0.0036 | 0.9384 ± 0.0031 | 0.9369 ± 0.0030 | 0.9390 ± 0.0015 |
| IMCMDA | 0.8285 ± 0.0068 | 0.8045 ± 0.0062 | 0.7216 ± 0.0072 | 0.6572 ± 0.0052 |
| UBiRW | 0.9196 ± 0.0036 | 0.9191 ± 0.0030 | 0.9198 ± 0.0032 | 0.9280 ± 0.0016 |
| GRNMF | 0.9031 ± 0.0049 | 0.9153 ± 0.0045 | 0.9157 ± 0.0044 | 0.9247 ± 0.0023 |
FIGURE 2The ROC curves of MDN-NMTF and other four methods for 14 diseases on HMDD2.0-Yan Dataset. The figure shows that the ROC curves of MDN-NMTF in 14 diseases are all higher than that of the other four methods.
AUC values of MDN-NMTF and other four compared methods for the 14 diseases on HMDD2.0-Yan Dataset.
| Breast neoplasms | 0.8274 | 0.8340 | 0.8194 | 0.8183 | |
| Non-small-cell lung carcinoma | 0.8748 | 0.8658 | 0.8614 | 0.8582 | |
| Renal cell carcinoma | 0.8089 | 0.7936 | 0.7592 | 0.7846 | |
| Glioblastoma | 0.8280 | 0.8414 | 0.8248 | 0.8336 | |
| Heart failure | 0.7797 | 0.7864 | 0.7962 | 0.8120 | |
| Hepatocellular carcinoma | 0.7585 | 0.7588 | 0.7916 | 0.7846 | |
| Lung neoplasms | 0.9077 | 0.8963 | 0.8922 | 0.8885 | |
| Melanoma | 0.8375 | 0.8211 | 0.8216 | 0.8251 | |
| Neoplasms | 0.9253 | 0.9227 | 0.9223 | 0.9264 | |
| Ovarian neoplasms | 0.8941 | 0.8863 | 0.8835 | 0.8885 | |
| Pancreatic neoplasms | 0.9035 | 0.8943 | 0.8886 | 0.9057 | |
| Prostatic neoplasms | 0.8623 | 0.8395 | 0.8184 | 0.8261 | |
| Stomach neoplasms | 0.8054 | 0.8164 | 0.8055 | 0.8071 | |
| Colorectal neoplasms | 0.8292 | 0.8350 | 0.8463 | 0.8425 |
The AUC values of each method on four different datasets by single-column zeroing cross validation.
| MDN-NMTF | 0.8445 ± 0.1339 | |||
| MDN-NMTF2 | 0.8561 ± 0.1240 | 0.8473 ± 0.1292 | 0.8896 ± 0.1142 | |
| DNRLMF-MDA | 0.8482 ± 0.1355 | 0.8451 ± 0.1431 | 0.8385 ± 0.1487 | 0.8813 ± 0.1181 |
| IMCMDA | 0.8329 ± 0.1297 | 0.8214 ± 0.1290 | 0.8158 ± 0.1357 | 0.8781 ± 0.1308 |
| UBiRW | 0.8512 ± 0.1343 | 0.8403 ± 0.1356 | 0.8326 ± 0.1499 | 0.8794 ± 0.1341 |
| GRNMF | 0.7833 ± 0.1505 | 0.7504 ± 0.1618 | 0.7895 ± 0.1465 | 0.8245 ± 0.1502 |
Top 50 Related miRNAs of Stomach Neoplasms predicted by MDN-NMTF on HMDD2.0-Yan Dataset.
| hsa-mir-21 | dbDEMC, miRCancer | hsa-mir-199a-1 | PMID:22956063 |
| hsa-mir-214 | dbDEMC, miRCancer | hsa-mir-22 | dbDEMC, miRCancer |
| hsa-mir-200b | dbDEMC, miRCancer | hsa-mir-375 | dbDEMC, miRCancer |
| hsa-mir-200c | miRCancer | hsa-mir-486 | PMID:21415212 |
| hsa-mir-182 | dbDEMC, miRCancer | hsa-mir-106a | dbDEMC, miRCancer |
| hsa-mir-221 | dbDEMC, miRCancer | hsa-mir-16-1 | miRCancer |
| hsa-mir-181b-1 | PMID:20162574 | hsa-mir-222 | dbDEMC, miRCancer |
| hsa-mir-148a | dbDEMC, miRCancer | hsa-mir-101-1 | PMID:22450781 |
| hsa-mir-34c | miRCancer | hsa-mir-10b | dbDEMC, miRCancer |
| hsa-mir-146b | miRCancer | hsa-mir-195 | dbDEMC, miRCancer |
| hsa-mir-34a | dbDEMC, miRCancer | hsa-mir-141 | dbDEMC, miRCancer |
| hsa-mir-125b-1 | PMID:23128435 | hsa-mir-101-2 | PMID:26458815 |
| hsa-mir-200a | dbDEMC, miRCancer | hsa-mir-146a | miRCancer |
| hsa-mir-31 | dbDEMC, miRCancer | hsa-mir-199a-2 | PMID:22956063 |
| hsa-mir-145 | dbDEMC, miRCancer | hsa-mir-106b | dbDEMC, miRCancer |
| hsa-mir-126 | dbDEMC, miRCancer | hsa-mir-143 | dbDEMC, miRCancer |
| hsa-mir-34b | miRCancer | hsa-mir-124-1 | PMID:21365509 |
| hsa-mir-16-2 | PMID:18449891 | hsa-mir-124-2 | PMID:21365509 |
| hsa-mir-125b-2 | PMID:26458815 | hsa-mir-103a-2 | PMID:20726036 |
| hsa-mir-107 | dbDEMC, miRCancer | hsa-mir-130a | dbDEMC, miRCancer |
| hsa-mir-223 | dbDEMC, miRCancer | hsa-mir-27b | dbDEMC, miRCancer |
| hsa-mir-183 | dbDEMC, miRCancer | hsa-mir-155 | dbDEMC, miRCancer |
| hsa-mir-27a | dbDEMC, miRCancer | hsa-mir-335 | miRCancer |
| hsa-mir-25 | dbDEMC, miRCancer | hsa-mir-151a | PMID:22956063 |
| hsa-mir-181b-2 | PMID:22539488 | hsa-mir-7-1 | PMID:22139078 |
Top 50 Related miRNAs of Lymphoma predicted by MDN-NMTF on HMDD2.0-Yan Dataset.
| hsa-mir-17 | dbDEMC, miRCancer | hsa-mir-363 | dbDEMC |
| hsa-mir-20a | dbDEMC, miRCancer | hsa-mir-150 | dbDEMC, miRCancer |
| hsa-mir-155 | dbDEMC, miRCancer | hsa-mir-126 | dbDEMC |
| hsa-mir-18a | dbDEMC, miRCancer | hsa-mir-200b | dbDEMC |
| hsa-mir-19a | dbDEMC, miRCancer | hsa-mir-184 | dbDEMC |
| hsa-mir-19b-1 | miRCancer | hsa-mir-200a | dbDEMC |
| hsa-mir-92a-1 | PMID:21383985 | hsa-mir-499a | PMID:19690137 |
| hsa-mir-15a | dbDEMC, miRCancer | hsa-mir-34a | dbDEMC |
| hsa-mir-146a | dbDEMC | hsa-mir-210 | dbDEMC |
| hsa-mir-19b-2 | miRCancer | hsa-mir-200c | dbDEMC |
| hsa-mir-16-1 | miRCancer | hsa-mir-205 | dbDEMC |
| hsa-mir-16-2 | miRCancer | hsa-mir-145 | dbDEMC |
| hsa-mir-21 | dbDEMC, miRCancer | hsa-mir-24-1 | PMID:19177201 |
| hsa-mir-92a-2 | PMID:21383985 | hsa-mir-125b-1 | dbDEMC |
| hsa-mir-181a-1 | dbDEMC | hsa-mir-20b | dbDEMC |
| hsa-mir-181a-2 | PMID:21910161 | hsa-mir-125a | dbDEMC |
| hsa-mir-26a-2 | dbDEMC | hsa-mir-124-1 | PMID:22395483 |
| hsa-mir-26a-1 | PMID:19197161 | hsa-mir-141 | dbDEMC |
| hsa-mir-122 | dbDEMC | hsa-mir-125b-2 | PMID:23527180 |
| hsa-mir-101-1 | PMID:21960592 | hsa-mir-18b | dbDEMC |
| hsa-mir-101-2 | PMID:21960592 | hsa-mir-138-2 | dbDEMC |
| hsa-mir-342 | dbDEMC | hsa-mir-29c | dbDEMC |
| hsa-mir-486 | dbDEMC | hsa-mir-138-1 | PMID:21960592 |
| hsa-mir-203 | dbDEMC | hsa-mir-708 | dbDEMC |
| hsa-mir-223 | dbDEMC, miRCancer | hsa-mir-143 | dbDEMC |
miRNA modules and disease modules detected by MDN-NMTF2 on HMDD2.0-Yan Dataset.
| miRNA | 127 | 40 | 0.4409 | 82.20% |
| disease | 142 | 22 | 0.0939 | 61.28% |
FIGURE 3An example of miRNA module detected by MDN-NMTF2 on HMDD2.0-Yan Dataset. The figure shows that all 36 miRNAs in the module are related to Leukemia Myeloid Acute.
FIGURE 4An example of disease module detected by MDN-NMTF2 on HMDD2.0-Yan Dataset. The figure shows that 12 of 13 diseases in the module are related to a miRNA has-mir-124-1.
| Algorithm MDN-NMTF |
| Input: miRNA similarity |
| Output: |
| 1: Initialize matrices |
| 2: Calculate the dynamic neighbor matrices |
| 3: While objective function value in Eq. (21) not converge do |
| (1) Fix |
| (2) Fix |
| (3) Fix |
| (4) Fix |
| (5) Fix |
| end while |
| 4: Rebuild miRNA-disease association matrix |
| Algorithm MDN-NMTF2 |
| Input: miRNA similarity matrix |
| Output: |
| 1: Get |
| 2: Determine miRNA |
| 3. Determine disease |
| 4. Calculate similarity of two miRNAs If they belong to the same miRNA module by Eq. (29) |
| 5. Calculate |
| 6. Calculate similarity of two diseases If they belong to the same disease module by Eq. (31) |
| 7. Calculate |
| 8: Calculate the final predicted miRNA-disease associations |