| Literature DB >> 33334307 |
Chengshuai Zhao1, Yang Qiu1, Shuang Zhou2, Shichao Liu1, Wen Zhang3, Yanqing Niu4.
Abstract
BACKGROUND: Researchers discover LncRNA-miRNA regulatory paradigms modulate gene expression patterns and drive major cellular processes. Identification of lncRNA-miRNA interactions (LMIs) is critical to reveal the mechanism of biological processes and complicated diseases. Because conventional wet experiments are time-consuming, labor-intensive and costly, a few computational methods have been proposed to expedite the identification of lncRNA-miRNA interactions. However, little attention has been paid to fully exploit the structural and topological information of the lncRNA-miRNA interaction network.Entities:
Keywords: Attention mechanism; Ensemble learning; Graph embedding; lncRNA-miRNA interactions
Mesh:
Substances:
Year: 2020 PMID: 33334307 PMCID: PMC7745483 DOI: 10.1186/s12864-020-07238-x
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig 1The influence of hyperparameters on performances of GEEL-FI model. a shows the box plot of AUC scores of GEEL-FI with different embedded representation integration. b shows the scatter plot of AUC and AUPR scores of GEEL-FI with different dimensions of lncRNA-miRNA pair embedded representations. c shows the line plot of AUPR scores of GEEL-FI with the different numbers of Random Forest estimators
Parameter settings for proposed methods
| Methods | Components | Parameters |
|---|---|---|
| Graph embedding methods | Representation vector | dimension |
| GraRep | ||
| DeepWalk | walk length | |
| GAE | variational Autoencoder, hidden size | |
| GEEL-PI | Random Forest | default parameters |
| Logistic Regression | L2 regulation with default parameters | |
| GEEL-FI | DANN | hidden layers |
| Representation vector | embeddings | |
| Pair feature | dimension | |
| Random Forest | estimators |
Performances of different methods
| Methods | AUPR | AUC | F1 | ACC | REC | SPEC | PRE |
|---|---|---|---|---|---|---|---|
| EPLMI | 0.0706 | 0.8494 | 0.1055 | 0.9939 | 0.1373 | 0.9962 | 0.0883 |
| INLMI | 0.0723 | 0.8477 | 0.1086 | 0.9935 | 0.1531 | 0.9956 | 0.0867 |
| SLNPM | 0.6207 | 0.9165 | 0.6652 | 0.9972 | 0.9988 | 0.7016 | |
| GEEL-PI | 0.7004 | 0.9537 | 0.5945 | 0.9995 | 0.8342 | ||
| GEEL-FI | 0.6915 | 0.5790 |
Fig 2The top recall and top precision performances for different methods. a shows recall of different methods in top-ranked predictions. b shows precision of different methods in top-ranked predictions
Performances of based predictors and the ensemble models
| Embedding | AUPR | AUC | F1 | ACC | REC | SPEC | PRE |
|---|---|---|---|---|---|---|---|
| LE | 0.6654 ± 0.0033 | 0.9430 ± 0.0017 | 0.6592 ± 0.0040 | 0.9976 ± 0.0001 | 0.5429 ± 0.0079 | 0.9995 ± 0.0001 | 0.8420 ± 0.0144 |
| GraRep | 0.6805 ± 0.0037 | 0.9417 ± 0.0019 | 0.6818 ± 0.0036 | 0.9977 ± 0.0001 | 0.5703 ± 0.0066 | 0.9996 ± 0.0001 | 0.8498 ± 0.0137 |
| HOPE | 0.6573 ± 0.0036 | 0.9281 ± 0.0022 | 0.6796 ± 0.0035 | 0.9976 ± 0.0001 | 0.5813 ± 0.0087 | 0.9994 ± 0.0001 | 0.8198 ± 0.0134 |
| DeepWalk | 0.6511 ± 0.0037 | 0.9383 ± 0.0018 | 0.6463 ± 0.0051 | 0.9974 ± 0.0001 | 0.5452 ± 0.0133 | 0.9994 ± 0.0001 | 0.7993 ± 0.0248 |
| GAE | 0.6664 ± 0.0031 | 0.9292 ± 0.0023 | 0.6754 ± 0.0033 | 0.9976 ± 0.0001 | 0.5666 ± 0.0086 | 0.9995 ± 0.0001 | 0.8395 ± 0.0185 |
| GEEL-PI | 0.7004 ± 0.0035 | 0.9537 ± 0.0022 | 0.9995 ± 0.0001 | 0.8342 ± 0.0128 | |||
| GEEL-FI | 0.6915 ± 0.0029 | 0.5790 ± 0.0063 |
Performances on the network of different sparsity
| Removal ratio | LE | GraRep | HOPE | DeepWalk | GAE | GEEL-PI | GEEL-FI |
|---|---|---|---|---|---|---|---|
| 10% | 0.6496 | 0.6666 | 0.6448 | 0.6341 | 0.6537 | 0.6838 | |
| 20% | 0.6323 | 0.6524 | 0.6311 | 0.6192 | 0.6254 | 0.6719 | |
| 30% | 0.6124 | 0.6355 | 0.6171 | 0.5982 | 0.6206 | 0.6561 | |
| 40% | 0.5884 | 0.6156 | 0.5959 | 0.5761 | 0.6009 | 0.6347 |
Fig 3The AUPR scores of GEEL-F and GEEL-FI when different embeddings involved in feature fusion. GEEL-FI adopts attention mechanism to integration embeddings, GEEL-F does not
Fig 4Attention weights in lncRNA and miRNA representations integration. a shows attention weights of lncRNA representations in GEEL-FI. b shows attention weights of miRNA representations in GEEL-FI
Top 10 prediction of GEEL-PI and GEEL-FI
| GEEL-PI | GEEL-FI | |||||
|---|---|---|---|---|---|---|
| Rank | LncRNAs | MiRNAs | Evidence | LncRNAs | MiRNAs | Evidence |
| 1 | lnc-COL6A3–5:1 | hsa-miR-4500 | × | lnc-COL6A3–5:1 | hsa-miR-4500 | |
| 2 | lnc-ACER2–1:1 | hsa-miR-17-5p | √ | lnc-ALYREF-1:1 | hsa-miR-372-3p | |
| 3 | lnc-FAS-1:1 | hsa-miR-302b-3p | √ | MIR17HG:2 | hsa-miR-520a-3p | |
| 4 | lnc-PDK3–1:1 | hsa-miR-93–5p | √ | lnc-PDK3–1:1 | hsa-miR-302d-3p | |
| 5 | lnc-ACER2–1:1 | hsa-miR-106a-5p | √ | USP2-AS1:10 | hsa-miR-302b-3p | |
| 6 | lnc-ALYREF-1:1 | hsa-miR-372-3p | √ | lnc-PDK3–1:1 | hsa-miR-93–5p | |
| 7 | MIR17HG:2 | hsa-miR-520a-3p | √ | lnc-NMRK1–1:1 | hsa-miR-520d-3p | |
| 8 | lnc-NMRK1:1 | hsa-miR-520d-3p | √ | lnc-ACER2–1:1 | hsa-miR-17-5p | |
| 9 | lnc-RPE-1:1 | hsa-miR-130a-3p | × | lnc-ACER2–1:1 | hsa-miR-106a-5p | |
| 10 | lnc-PDK3–1:1 | hsa-miR-302d-3p | √ | lnc-RPE-1:1 | hsa-miR-130a-3p | × |
Fig 5Flowchart of the proposed GEEL-PI and GEEL-FI. a by integrating the two similarity networks with the known lncRNA-miRNA interaction network, we construct a lncRNA-miRNA heterogeneous network. Different graph embedding methods are applied to the lncRNA-miRNA heterogeneous network to learn low-dimensional representations of lncRNAs and miRNAs. b for GEEL-PI, base predictors are trained based on the learned representations from different embedding methods. Then, their output predictions are integrated for further improving the performance and generalizability. c for GEEL-FI, by constructing a deep attention neural network, we integrate abundant embedded representation of lncRNA and miRNA to obtain distinctive lncRNA-miRNA pair features