| Literature DB >> 31349729 |
Ping Xuan1, Hao Sun1, Xiao Wang2, Tiangang Zhang3, Shuxiang Pan1.
Abstract
Identification of disease-associated miRNAs (disease miRNAs) are critical for understanding etiology and pathogenesis. Most previous methods focus on integrating similarities and associating information contained in heterogeneous miRNA-disease networks. However, these methods establish only shallow prediction models that fail to capture complex relationships among miRNA similarities, disease similarities, and miRNA-disease associations. We propose a prediction method on the basis of network representation learning and convolutional neural networks to predict disease miRNAs, called CNNMDA. CNNMDA deeply integrates the similarity information of miRNAs and diseases, miRNA-disease associations, and representations of miRNAs and diseases in low-dimensional feature space. The new framework based on deep learning was built to learn the original and global representation of a miRNA-disease pair. First, diverse biological premises about miRNAs and diseases were combined to construct the embedding layer in the left part of the framework, from a biological perspective. Second, the various connection edges in the miRNA-disease network, such as similarity and association connections, were dependent on each other. Therefore, it was necessary to learn the low-dimensional representations of the miRNA and disease nodes based on the entire network. The right part of the framework learnt the low-dimensional representation of each miRNA and disease node based on non-negative matrix factorization, and these representations were used to establish the corresponding embedding layer. Finally, the left and right embedding layers went through convolutional modules to deeply learn the complex and non-linear relationships among the similarities and associations between miRNAs and diseases. Experimental results based on cross validation indicated that CNNMDA yields superior performance compared to several state-of-the-art methods. Furthermore, case studies on lung, breast, and pancreatic neoplasms demonstrated the powerful ability of CNNMDA to discover potential disease miRNAs.Entities:
Keywords: convolutional neural network; deep learning; disease-associated miRNAs; network representation learning; non-negative matrix factorization
Year: 2019 PMID: 31349729 PMCID: PMC6696449 DOI: 10.3390/ijms20153648
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
Figure 1ROC curves and precision-recall (PR) curves of CNNMDA and other methods for 15 diseases.
Prediction results of CNNMDA and the other four methods for 15 diseases in terms of the area under the receiver operating characteristic curve (AUC).
| Diseases Name | AUC CNNMDA | GSTRW | DMPred | BNPMDA | Liu’s Method |
|---|---|---|---|---|---|
| Breast neoplasms |
| 0.822 | 0.939 | 0.906 | 0.896 |
| Hepatocellular carcinoma |
| 0.770 | 0.899 | 0.784 | 0.846 |
| Renal cell carcinoma |
| 0.801 | 0.897 | 0.830 | 0.785 |
| Squamous cell carcinoma |
| 0.821 | 0.894 | 0.793 | 0.897 |
| Colorectal neoplasms |
| 0.742 | 0.882 | 0.724 | 0.864 |
| Glioblastoma |
| 0.821 | 0.906 | 0.781 | 0.828 |
| Heart failure |
| 0.823 | 0.984 | 0.929 | 0.816 |
| Acute myeloid leukemia |
| 0.817 | 0.894 | 0.784 | 0.924 |
| Lung neoplasms |
| 0.795 | 0.941 | 0.903 | 0.931 |
| Melanoma |
| 0.788 | 0.909 | 0.909 | 0.859 |
| Ovarian neoplasms |
| 0.831 | 0.934 | 0.924 | 0.855 |
| Pancreatic neoplasms |
| 0.853 | 0.913 | 0.725 | 0.892 |
| Prostatic neoplasms |
| 0.828 | 0.947 | 0.896 | 0.895 |
| Stomach neoplasms |
| 0.781 | 0.922 | 0.740 | 0.838 |
| Urinary bladder neoplasms |
| 0.821 | 0.921 | 0.879 | 0.870 |
The bold values indicate the higher AUCs.
Prediction results of CNNMDA and other four methods for 15 diseases in terms of the area under the precision–recall curve (AUPR).
| Diseases Name | AUPR CNNMDA | GSTRW | DMPred | BNPMDA | Liu’s Method |
|---|---|---|---|---|---|
| Breast neoplasms |
| 0.261 | 0.681 | 0.245 | 0.378 |
| Hepatocellular carcinoma |
| 0.234 | 0.539 | 0.574 | 0.335 |
| Renal cell carcinoma |
| 0.127 | 0.325 | 0.328 | 0.152 |
| Squamous cell carcinoma |
| 0.104 | 0.191 | 0.272 | 0.170 |
| Colorectal neoplasms |
| 0.136 | 0.279 | 0.177 | 0.273 |
| Glioblastoma | 0.277 | 0.142 | 0.270 |
| 0.166 |
| Heart failure |
| 0.160 | 0.669 | 0.451 | 0.157 |
| Acute myeloid leukemia | 0.262 | 0.118 | 0.236 |
| 0.207 |
| Lung neoplasms |
| 0.140 | 0.481 | 0.480 | 0.343 |
| Melanoma |
| 0.157 | 0.410 | 0.477 | 0.309 |
| Ovarian neoplasms |
| 0.152 | 0.453 | 0.386 | 0.239 |
| Pancreatic neoplasms |
| 0.133 | 0.308 | 0.136 | 0.283 |
| Prostatic neoplasms |
| 0.150 | 0.414 | 0.175 | 0.231 |
| Stomach neoplasms |
| 0.207 | 0.503 | 0.306 | 0.303 |
| Urinary bladder neoplasms |
| 0.134 | 0.331 | 0.292 | 0.229 |
The bold values indicate the higher AUPRs.
Comparison of different methods based on AUCs with a paired t-test.
| DMPred | GSTRW | BNPMDA | Liu’s Method | |
|---|---|---|---|---|
| 3.3219 × 10−5 | 8.5916 × 10−23 | 5.4483 × 10−10 | 2.0247 × 10−10 | |
| 1.4386 × 10−8 | 2.7951 × 10−13 | 1.181 × 10−2 | 2.9012 × 10−8 |
Figure 2Recall values of top k candidates of CNNMDA and the other four methods.
The top 50 lung neoplasms-related candidates.
| Rank | miRNA Name | Evidence |
|---|---|---|
| 1 | hsa-mir-106b | dbDEMC, PhenomiR |
| 2 | hsa-mir-15a | Literature [ |
| 3 | hsa-mir-16 | dbDEMC, PhenomiR, miRCancer |
| 4 | hsa-mir-130a | dbDEMC, PhenomiR |
| 5 | hsa-mir-193b | dbDEMC, PhenomiR, TCGA |
| 6 | hsa-mir-520d | dbDEMC |
| 7 | hsa-mir-429 | dbDEMC, miRCancer |
| 8 | hsa-mir-122 | dbDEMC, PhenomiR, miRCancer |
| 9 | hsa-mir-149 | dbDEMC, PhenomiR |
| 10 | hsa-mir-424 | dbDEMC, PhenomiR |
| 11 | hsa-mir-451a | dbDEMC |
| 12 | hsa-mir-378a | Literature [ |
| 13 | hsa-mir-708 | dbDEMC |
| 14 | hsa-mir-20b | dbDEMC, PhenomiR, TCGA |
| 15 | hsa-mir-15b | dbDEMC, PhenomiR, miRCancer |
| 16 | hsa-mir-520a | dbDEMC, TCGA |
| 17 | hsa-mir-10a | dbDEMC |
| 18 | hsa-mir-520b | dbDEMC |
| 19 | hsa-mir-625 | dbDEMC |
| 20 | hsa-mir-141 | dbDEMC, PhenomiR, miRCancer |
| 21 | hsa-mir-449a | dbDEMC, PhenomiR, miRCancer |
| 22 | hsa-mir-99a | dbDEMC, PhenomiR, TCGA |
| 23 | hsa-mir-195 | dbDEMC, PhenomiR, miRCancer |
| 24 | hsa-mir-151a | Literature [ |
| 25 | hsa-mir-296 | Literature [ |
| 26 | hsa-mir-449b | dbDEMC, PhenomiR, miRCancer |
| 27 | hsa-mir-28 | dbDEMC, PhenomiR |
| 28 | hsa-mir-342 | dbDEMC, PhenomiR |
| 29 | hsa-mir-372 | dbDEMC, PhenomiR, TCGA |
| 30 | hsa-mir-345 | dbDEMC, PhenomiR |
| 31 | hsa-mir-92b | dbDEMC, PhenomiR |
| 32 | hsa-mir-328 | dbDEMC, PhenomiR |
| 33 | hsa-mir-367 | dbDEMC, PhenomiR |
| 34 | hsa-mir-373 | dbDEMC, PhenomiR |
| 35 | hsa-mir-302b | dbDEMC, PhenomiR, miRCancer |
| 36 | hsa-mir-194 | dbDEMC, PhenomiR |
| 37 | hsa-mir-1258 | dbDEMC |
| 38 | hsa-mir-320a | dbDEMC, PhenomiR |
| 39 | hsa-mir-152 | dbDEMC, PhenomiR |
| 40 | hsa-mir-302c | dbDEMC, PhenomiR |
| 41 | hsa-mir-151b | dbDEMC |
| 42 | hsa-mir-204 | dbDEMC, PhenomiR |
| 43 | hsa-mir-23b | dbDEMC, PhenomiR |
| 44 | hsa-mir-129 | dbDEMC, PhenomiR, TCGA |
| 45 | hsa-mir-451b | Literature [ |
| 46 | hsa-mir-374a | Literature [ |
| 47 | hsa-mir-211 | dbDEMC, PhenomiR |
| 48 | hsa-mir-208a | Literature [ |
| 49 | hsa-mir-1254 | dbDEMC, miRCancer |
| 50 | hsa-mir-337 | dbDEMC, PhenomiR, TCGA |
Figure 3Functional enrichment analysis of lung cancer-related miRNAs. The horizontal ordinates represent 35 significant enriched functions of the top 50 candidate miRNAs associated with lung neoplasms. The vertical coordinates represent the number of miRNAs associated with each enriched function.
Figure 4Construction of a deep learning framework based on dual convolutional neural networks to learn original representation and global network representation.
Figure 5Establishment of the left embedding layer of miRNA m1 and disease d5 by combining their similarities and associations.
Figure 6Establishment of the right embedding layer miRNA m1 and disease d5 by integrating their projection vectors in low-dimensional space.