| Literature DB >> 31555320 |
Yu-An Huang1, Zhi-An Huang2, Zhu-Hong You1, Zexuan Zhu3, Wen-Zhun Huang1, Jian-Xin Guo1, Chang-Qing Yu1.
Abstract
The interaction of miRNA and lncRNA is known to be important for gene regulations. However, the number of known lncRNA-miRNA interactions is still very limited and there are limited computational tools available for predicting new ones. Considering that lncRNAs and miRNAs share internal patterns in the partnership between each other, the underlying lncRNA-miRNA interactions could be predicted by utilizing the known ones, which could be considered as a semi-supervised learning problem. It is shown that the attributes of lncRNA and miRNA have a close relationship with the interaction between each other. Effective use of side information could be helpful for improving the performance especially when the training samples are limited. In view of this, we proposed an end-to-end prediction model called GCLMI (Graph Convolution for novel lncRNA-miRNA Interactions) by combining the techniques of graph convolution and auto-encoder. Without any preprocessing process on the feature information, our method can incorporate raw data of node attributes with the topology of the interaction network. Based on a real dataset collected from a public database, the results of experiments conducted on k-fold cross validations illustrate the robustness and effectiveness of the prediction performance of the proposed prediction model. We prove the graph convolution layer as designed in the proposed model able to effectively integrate the input data by filtering the graph with node features. The proposed model is anticipated to yield highly potential lncRNA-miRNA interactions in the scenario that different types of numerical features describing lncRNA or miRNA are provided by users, serving as a useful computational tool.Entities:
Keywords: LncRNA–miRNA interactions; computational prediction model; graph convolution network; regulation network; system biology model
Year: 2019 PMID: 31555320 PMCID: PMC6727066 DOI: 10.3389/fgene.2019.00758
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
Figure 1The diagram of spectral graph convolution.
Figure 2The flowchart of GCLMI.
Figure 3The ROC curves yielded by GCLMI on 2-fold, 5-fold and 10-fold cross validation.
Figure 4Evaluation of graph convolution layer w.r.t ROC curves on 5-fold cross validation.
Figure 5Training process of GCLMI in different training epochs with different negative sample sets. (A) and (B) illustrate the training loss and training error in training process, respectively.
Figure 6Comparison of prediction performance of GCLMI with different negative sample sets.
Prediction performance w.r.t. AUC in 2-fold, 5-fold and 10-fold cross validation.
| Cross validation | 2-fold CV | 5-fold CV | 10-fold CV |
|---|---|---|---|
| Average AUC | 0.8492 + /-0.0013 | 0.8567 + /-0.0009 | 0.8590+/-0.0005 |
Performance comparison among different methods by using RNA expression profile-based similarity in the framework of 5-fold cross validation.
| Method | 5-fold cross validation |
|---|---|
| lncRNA-based CF | 0.6359 + /−0.0024 |
| miRNA-based CF | 0.8235 + /−0.0015 |
| SVD-based CF | 0.4967 + /−0.0340 |
| Katz-based method | 0.7439 + /−0.0017 |
| Basic latent factor model | 0.8253 + /−0.0024 |
| EPLMI ( | 0.8447 + /−0.0017 |
| The proposed model | 0.8567 + /−0.0009 |