| Literature DB >> 31077936 |
Xiangxiang Zeng1, Wen Wang2, Gaoshan Deng3, Jiaxin Bing2, Quan Zou4.
Abstract
Identifying disease-related microRNAs (miRNAs) is an essential but challenging task in bioinformatics research. Much effort has been devoted to discovering the underlying associations between miRNAs and diseases. However, most studies mainly focus on designing advanced methods to improve prediction accuracy while neglecting to investigate the link predictability of the relationships between miRNAs and diseases. In this work, we construct a heterogeneous network by integrating neighborhood information in the neural network to predict potential associations between miRNAs and diseases, which also consider the imbalance of datasets. We also employ a new computational method called a neural network model for miRNA-disease association prediction (NNMDA). This model predicts miRNA-disease associations by integrating multiple biological data resources. Comparison of our work with other algorithms reveals the reliable performance of NNMDA. Its average AUC score was 0.937 over 15 diseases in a 5-fold cross-validation and AUC of 0.8439 based on leave-one-out cross-validation. The results indicate that NNMDA could be used in evaluating the accuracy of miRNA-disease associations. Moreover, NNMDA was applied to two common human diseases in two types of case studies. In the first type, 26 out of the top 30 predicted miRNAs of lung neoplasms were confirmed by the experiments. In the second type of case study for new diseases without any known miRNAs related to it, we selected breast neoplasms as the test example by hiding the association information between the miRNAs and this disease. The results verified 50 out of the top 50 predicted breast-neoplasm-related miRNAs.Entities:
Keywords: disease; disease similarity; miRNA-disease association; miRNAs; neural network
Year: 2019 PMID: 31077936 PMCID: PMC6510966 DOI: 10.1016/j.omtn.2019.04.010
Source DB: PubMed Journal: Mol Ther Nucleic Acids ISSN: 2162-2531 Impact factor: 8.886
Comparison of Various Computational Approaches’ AUC Values through 5-Fold Cross-Validation
| Method | RWRMDA | HDMP | IMCMDA | RLSMDA | MIDP | SPM | NNMDA |
|---|---|---|---|---|---|---|---|
| Breast neoplasm | 0.785 | 0.801 | 0.812 | 0.832 | 0.838 | 0.932 | 0.968 |
| Hepatocellular carcinoma | 0.749 | 0.759 | 0.744 | 0.794 | 0.807 | 0.918 | 0.966 |
| Renal cell carcinoma | 0.815 | 0.833 | 0.793 | 0.839 | 0.862 | 0.901 | 0.912 |
| Squamous cell carcinoma | 0.819 | 0.820 | 0.837 | 0.849 | 0.870 | 0.899 | 0.924 |
| Colorectal neoplasm | 0.793 | 0.802 | 0.766 | 0.831 | 0.845 | 0.885 | 0.927 |
| Glioblastoma | 0.680 | 0.700 | 0.781 | 0.714 | 0.786 | 0.840 | 0.911 |
| Heart failure | 0.722 | 0.770 | 0.924 | 0.738 | 0.821 | 0.950 | 0.945 |
| Acute myeloid leukemia | 0.839 | 0.858 | 0.861 | 0.853 | 0.915 | 0.957 | 0.916 |
| Lung neoplasm | 0.827 | 0.835 | 0.841 | 0.855 | 0.876 | 0.892 | 0.943 |
| Melanoma | 0.784 | 0.790 | 0.761 | 0.807 | 0.837 | 0.951 | 0.949 |
| Ovarian neoplasm | 0.882 | 0.884 | 0.875 | 0.909 | 0.923 | 0.949 | 0.928 |
| Pancreatic neoplasm | 0.871 | 0.895 | 0.894 | 0.887 | 0.945 | 0.954 | 0.954 |
| Prostatic neoplasm | 0.823 | 0.854 | 0.775 | 0.841 | 0.882 | 0.928 | 0.936 |
| Stomach neoplasm | 0.779 | 0.787 | 0.783 | 0.797 | 0.821 | 0.859 | 0.955 |
| Urinary bladder neoplasm | 0.821 | 0.850 | 0.813 | 0.845 | 0.897 | 0.898 | 0.920 |
| Average AUC | 0.799 | 0.816 | 0.817 | 0.826 | 0.862 | 0.914 | 0.937 |
Figure 1Performances on 5-Fold Cross-Validation Precision
(A) Precision on disease heart failure. (B) Recall on disease heart failure. (C) Average recalls for the 15 tested diseases on four methods (NNMDA, IMCMDA, MIDP, and SPM), which contain the diseases breast neoplasm, hepatocellular carcinoma, renal cell carcinoma, squamous cell carcinoma, colorectal neoplasm, glioblastoma, heart failure, acute myeloid leukemia, lung neoplasm, melanoma, ovarian neoplasm, pancreatic neoplasm, prostatic neoplasm, stomach neoplasm, and urinary bladder neoplasm.
Figure 2Comparison of Performance among NNMDA and Baseline Methods (NNMDA, IMCMDA, RWRMDA, HDMP, and RLSMDA)
Prediction Results of the Top 30 Predicted Lung Neoplasm-Related miRNAs Based on Known Associations in HMDD V1.0
| miRNA | Evidence | miRNA | Evidence |
|---|---|---|---|
| hsa-let-7g | dbDEMC mir2disease | hsa-mir-18b | dbDEMC |
| hsa-mir-135b | dbDEMC | hsa-mir-17 | dbDEMC |
| hsa-mir-133b | dbDEMC | hsa-mir-21 | dbDEMC mir2disease |
| hsa-mir-200b | dbDEMC mir2disease | hsa-mir-148a | dbDEMC mir2disease |
| hsa-let-7d | dbDEMC mir2disease | hsa-mir-18a | dbDEMC mir2disease |
| hsa-mir-181b-1 | unverified | hsa-mir-30e | dbDEMC |
| hsa-mir-29c | dbDEMC mir2disease | hsa-mir-101-1 | mir2disease |
| hsa-mir-98 | dbDEMC mir2disease | hsa-mir-30c-2 | unverified |
| hsa-mir-221 | dbDEMC mir2disease | hsa-mir-125a | dbDEMC mir2disease |
| hsa-mir-186 | dbDEMC | hsa-mir-200c | dbDEMC mir2disease |
| hsa-mir-142 | unverified | hsa-mir-126 | dbDEMC mir2disease |
| hsa-mir-146a | dbDEMC | hsa-mir-31 | dbDEMC mir2disease |
| hsa-mir-146b | dbDEMC mir2disease | hsa-mir-30c-1 | unverified |
| hsa-mir-101-1 | mir2disease | hsa-mir-30a | dbDEMC |
| hsa-let-7b | dbDEMC | hsa-mir-192 | mir2disease dbDEMC |
The first column contains the top 1–15 related miRNAs, whereas the third column shows the top 16–30 related miRNAs.
Prediction Results of the Top 50 Predicted Breast Neoplasm-Related miRNAs When the Known Associations of Breast Neoplasms Were Considered as Unknown Ones
| miRNA | Evidence | miRNA | Evidence |
|---|---|---|---|
| hsa-mir-155 | dbDEMC HMDD | hsa-mir-19b-1 | HMDD |
| hsa-mir-21 | dbDEMC HMDD | hsa-mir-1-1 | HMDD |
| hsa-mir-146a | dbDEMC HMDD | hsa-mir-145 | dbDEMC HMDD |
| hsa-mir-29b-1 | HMDD | hsa-mir-29c | dbDEMC HMDD |
| hsa-mir-125b-1 | HMDD | hsa-mir-199a-2 | HMDD |
| hsa-mir-29b-2 | HMDD | hsa-mir-223 | dbDEMC HMDD |
| hsa-mir-34a | dbDEMC HMDD | hsa-mir-126 | dbDEMC HMDD |
| hsa-mir-15a | dbDEMC HMDD | hsa-mir-133a-2 | HMDD |
| hsa-mir-125b-2 | HMDD | hsa-mir-19a | dbDEMC HMDD |
| hsa-mir-20a | dbDEMC HMDD | hsa-mir-199a-1 | HMDD |
| hsa-mir-16-1 | HMDD | hsa-let-7b | dbDEMC HMDD |
| hsa-mir-16-2 | HMDD | hsa-mir-26a-1 | HMDD |
| hsa-mir-221 | dbDEMC HMDD | hsa-let-7c | dbDEMC HMDD |
| hsa-mir-29a | dbDEMC HMDD | hsa-mir-142 | HMDD |
| hsa-let-7a-2 | HMDD | hsa-mir-146b | HMDD |
| hsa-mir-26a-2 | HMDD | hsa-mir-150 | dbDEMC HMDD |
| hsa-mir-1-2 | HMDD | hsa-mir-210 | dbDEMC HMDD |
| hsa-let-7a-1 | HMDD | hsa-mir-196a-2 | HMDD |
| hsa-let-7a-3 | HMDD | hsa-let-7i | dbDEMC HMDD |
| hsa-mir-17 | dbDEMC HMDD | hsa-let-7d | dbDEMC HMDD |
| hsa-mir-31 | dbDEMC HMDD | hsa-mir-195 | dbDEMC HMDD |
| hsa-mir-92a-1 | HMDD | hsa-mir-222 | dbDEMC HMDD |
| hsa-mir-18a | dbDEMC HMDD | hsa-mir-92a-2 | HMDD |
| hsa-mir-122 | dbDEMC HMDD | hsa-mir-24-1 | HMDD |
| hsa-mir-133a-1 | HMDD | hsa-mir-133b | dbDEMC HMDD |
The first column contains the top 1–25 related miRNAs, whereas the third column shows the top 26–50 related miRNAs.
Figure 3Flowchart of NNMDA
(A) NNMDA uses several individual miRNA-related or disease-related networks to construct a heterogeneous network (details of the used datasets are introduced in Materials and Methods). In a heterogeneous network, different types of nodes are connected by distinct types of edges. Two nodes can be connected by more than one edge (e.g., the solid link between diseases representing disease functional similarity and the chain line between them representing disease Gaussian similarity). (B) Each node adopts a neighborhood information aggregation operation to extract information from the neighborhood. Each arrow represents a specific aggregation function with respect to a specific edge type. Each node then updates its feature representation by integrating its current representation with the aggregated information. (C) NNMDA learns the topology-preserving node features that are useful for miRNA-disease interaction prediction by enforcing the node features to reconstruct the original individual networks. (D) Reconstruction of all individual matrices.