| Literature DB >> 35053212 |
Chen Jin1, Zhuangwei Shi2, Ken Lin2, Han Zhang2.
Abstract
Many studies have clarified that microRNAs (miRNAs) are associated with many human diseases. Therefore, it is essential to predict potential miRNA-disease associations for disease pathogenesis and treatment. Numerous machine learning and deep learning approaches have been adopted to this problem. In this paper, we propose a Neural Inductive Matrix completion-based method with Graph Autoencoders (GAE) and Self-Attention mechanism for miRNA-disease associations prediction (NIMGSA). Some of the previous works based on matrix completion ignore the importance of label propagation procedure for inferring miRNA-disease associations, while others cannot integrate matrix completion and label propagation effectively. Varying from previous studies, NIMGSA unifies inductive matrix completion and label propagation via neural network architecture, through the collaborative training of two graph autoencoders. This neural inductive matrix completion-based method is also an implementation of self-attention mechanism for miRNA-disease associations prediction. This end-to-end framework can strengthen the robustness and preciseness of both matrix completion and label propagation. Cross validations indicate that NIMGSA outperforms current miRNA-disease prediction methods. Case studies demonstrate that NIMGSA is competent in detecting potential miRNA-disease associations.Entities:
Keywords: graph autoencoder; inductive matrix completion; miRNA-disease association; self-attention mechanism
Mesh:
Substances:
Year: 2022 PMID: 35053212 PMCID: PMC8774034 DOI: 10.3390/biom12010064
Source DB: PubMed Journal: Biomolecules ISSN: 2218-273X
Figure 1Flowchart of similarity computation.
Figure 2Illustration of NIMGSA. GAEm and GAEd represent graph autoencoders on the miRNA graph and disease graph respectively.
Figure 3Computation procedure of NIMGSA. NIMC denotes Neural Inductive Matrix Completion.
Figure 4ROC and PR curves of different methods.
Mean values and standard deviations of AUROC and AUPR, compared with different methods.
| METHOD | AUROC | AUPR |
|---|---|---|
| IMCMDA | 0.8329 ± 0.0011 | 0.2785 ± 0.0029 |
| SPM | 0.8960 ± 0.0070 | 0.2464 ± 0.0054 |
| NIMCGCN | 0.9279 ± 0.0006 | 0.3943 ± 0.0054 |
| MCLPMDA | 0.9292 ± 0.0069 | 0.4387 ± 0.0106 |
| GAEMDA | 0.9332 ± 0.0005 | 0.4142 ± 0.0034 |
| NIMGSA | 0.9354 ± 0.0047 | 0.4567 ± 0.0147 |
Binary classification metrics of different methods on Dataset2. Sp denotes specificity. Sn denotes sensitivity. Acc denotes accuracy. Pre denotes precision. F1 denotes F1-score. Mcc denotes Matthews correlation coefficient.
| SPEC | METHOD | SEN | ACC | PRE | F1-Score | MCC |
|---|---|---|---|---|---|---|
| 0.99 | IMCMDA | 0.2628 | 0.9692 | 0.4365 | 0.3281 | 0.3239 |
| SPM | 0.1551 | 0.9661 | 0.3137 | 0.2075 | 0.2048 | |
| NIMCGCN | 0.3039 | 0.9703 | 0.4725 | 0.3699 | 0.3645 | |
| MCLPMDA | 0.3567 | 0.9719 | 0.5127 | 0.4207 | 0.4138 | |
| GAEMDA | 0.3650 | 0.9721 | 0.5186 | 0.4284 | 0.4213 | |
| NIMGSA | 0.3718 | 0.9723 | 0.5229 | 0.4346 | 0.4273 |
AUROC and AUPR at different .
|
| 0.1 | 0.3 | 0.5 | 0.7 | 0.9 |
|---|---|---|---|---|---|
|
| 0.9119 | 0.9289 | 0.9354 | 0.9338 | 0.9312 |
|
| 0.3648 | 0.4255 | 0.4567 | 0.4556 | 0.4509 |
AUROC and AUPR at a different learning rate.
| lr | 0.001 | 0.01 | 0.05 | 0.1 |
|---|---|---|---|---|
|
| 0.9193 | 0.9354 | 0.7693 | 0.5557 |
|
| 0.4077 | 0.4567 | 0.2791 | 0.0709 |
AUROC and AUPR at a different dimension of hidden vectors.
| DIMENSION | 16 | 32 | 64 | 128 |
|---|---|---|---|---|
|
| 0.9012 | 0.9228 | 0.9354 | 0.9357 |
|
| 0.3642 | 0.4127 | 0.4567 | 0.4589 |
Ablation studies.
| Models | AUROC | AUPR |
|---|---|---|
| Self-attention | 0.9046 | 0.3768 |
| Without self-attention | 0.8916 | 0.3392 |
| NIMGSA | 0.9354 | 0.4567 |
Top 10 predicted miRNAs associated with esophageal neoplasms.
| MiRNA NAME | EVIDENCE |
|---|---|
| hsa-mir-125b | dbDEMC v2.0; HMDD v3.0 |
| hsa-mir-17 | dbDEMC v2.0 |
| hsa-mir-16 | dbDEMC v2.0 |
| hsa-mir-18a | dbDEMC v2.0 |
| hsa-mir-19b | dbDEMC v2.0 |
| hsa-mir-29a | dbDEMC v2.0 |
| hsa-mir-222 | dbDEMC v2.0 |
| hsa-mir-1 | dbDEMC v2.0 |
| hsa-mir-29b | dbDEMC v2.0 |
| hsa-mir-200b | dbDEMC v2.0 |
Top 10 predicted miRNAs associated with breast neoplasms.
| MiRNA NAME | EVIDENCE |
|---|---|
| hsa-mir-142 | dbDEMC v2.0; HMDD v3.0 |
| hsa-mir-15b | dbDEMC v2.0; HMDD v3.0 |
| hsa-mir-192 | dbDEMC v2.0; HMDD v3.0 |
| hsa-mir-106a | dbDEMC v2.0; HMDD v3.0 |
| hsa-mir-150 | dbDEMC v2.0; HMDD v3.0 |
| hsa-mir-130a | dbDEMC v2.0; HMDD v3.0 |
| hsa-mir-30e | dbDEMC v2.0; HMDD v3.0 |
| hsa-mir-92b | dbDEMC v2.0; HMDD v3.0 |
| hsa-mir-192b | dbDEMC v2.0; miR2Disease; HMDD v3.0 |
| hsa-mir-372 | dbDEMC v2.0; HMDD v3.0 |
Top 10 predicted miRNAs associated with lung neoplasms.
| MiRNA NAME | EVIDENCE |
|---|---|
| hsa-mir-16 | dbDEMC v2.0; miR2Disease; HMDD v3.0 |
| hsa-mir-15a | dbDEMC v2.0; HMDD v3.0 |
| hsa-mir-106b | dbDEMC v2.0; miR2Disease; HMDD v3.0 |
| hsa-mir-141 | dbDEMC v2.0; miR2Disease; HMDD v3.0 |
| hsa-mir-15b | dbDEMC v2.0; HMDD v3.0 |
| hsa-mir-122 | dbDEMC v2.0; HMDD v3.0 |
| hsa-mir-429 | dbDEMC v2.0; miR2Disease; HMDD v3.0 |
| hsa-mir-20b | dbDEMC v2.0; HMDD v3.0 |
| hsa-mir-23b | dbDEMC v2.0; HMDD v3.0 |
| hsa-mir-130a | dbDEMC v2.0; miR2Disease; HMDD v3.0 |