| Literature DB >> 27703979 |
Jiawei Luo1, Cong Huang2, Pingjian Ding2.
Abstract
MicroRNAs (miRNAs) are short noncoding RNAs that play important roles in regulating gene expressing, and the perturbed miRNAs are often associated with development and tumorigenesis as they have effects on their target mRNA. Predicting potential miRNA-target associations from multiple types of genomic data is a considerable problem in the bioinformatics research. However, most of the existing methods did not fully use the experimentally validated miRNA-mRNA interactions. Here, we developed RMLM and RMLMSe to predict the relationship between miRNAs and their targets. RMLM and RMLMSe are global approaches as they can reconstruct the missing associations for all the miRNA-target simultaneously and RMLMSe demonstrates that the integration of sequence information can improve the performance of RMLM. In RMLM, we use RM measure to evaluate different relatedness between miRNA and its target based on different meta-paths; logistic regression and MLE method are employed to estimate the weight of different meta-paths. In RMLMSe, sequence information is utilized to improve the performance of RMLM. Here, we carry on fivefold cross validation and pathway enrichment analysis to prove the performance of our methods. The fivefold experiments show that our methods have higher AUC scores compared with other methods and the integration of sequence information can improve the performance of miRNA-target association prediction.Entities:
Mesh:
Substances:
Year: 2016 PMID: 27703979 PMCID: PMC5040835 DOI: 10.1155/2016/7460740
Source DB: PubMed Journal: Biomed Res Int Impact factor: 3.411
Figure 1Network schema of the miRNA-target network. The network contains two types of objects, miRNA and its targets. Each box represents one type of nodes, and each dashed line represents one type of links. The numbers in the figure represent the numbers of nodes/links of different types.
Figure 2The ROC curve of the global network.
Figure 3The ROC curve of the OV network.
Figure 4The ROC curve of the Lung network.
Figure 5The ROC curve of the Breast network.
The number of links predicted by our methods based on different thresholds.
| Database | Methods | Validated | Th ≥ 0.9 | Th ≥ 0.8 | Th ≥ 0.7 | Th ≥ 0.6 | Th ≥ 0.5 |
|---|---|---|---|---|---|---|---|
| Global | RMLM | 11,1770 | 17,2912 | 20,4894 | 23,4327 | 26,5883 | 79,8049 |
| RMLMSe | 11,1770 | 17,6625 | 21,0909 | 24,2946 | 28,1782 | 80,7688 | |
|
| |||||||
| OV | RMLM | 4,2730 | 5,3683 | 5,9580 | 6,4676 | 6,9759 | 23,3784 |
| RMLMSe | 4,2730 | 5,3891 | 5,9954 | 6,5526 | 7,1565 | 23,4562 | |
|
| |||||||
| Lung | RMLM | 4,7764 | 5,8511 | 6,4339 | 6,9397 | 7,4816 | 24,5323 |
| RMLMSe | 4,7764 | 5,8870 | 6,4881 | 7,0437 | 7,9293 | 24,6261 | |
|
| |||||||
| Breast | RMLM | 6,4403 | 8,6555 | 9,8883 | 10,9659 | 12,0730 | 36,4375 |
| RMLMSe | 6,4403 | 8,6690 | 9,9540 | 11,1719 | 12,6556 | 36,6573 | |
The “validated” column is the number of links validated in database miRTarbase v6.1 and “Th” represents the threshold.
In RMLMSe, the enrichment KEGG pathways of global dataset.
| Enrichment KEGG pathways |
| |
|---|---|---|
| 1 | p53 signaling pathway | 4.27 |
| 2 | Chronic myeloid leukemia | 8.80 |
| 3 | Bladder cancer | 3.24 |
| 4 | Glioma | 6.03 |
| 5 | Melanoma | 1.35 |
| 6 | Pathways in cancer | 2.34 |
| 7 | Prostate cancer | 1.01 |
| 8 | Cell cycle | 1.61 |
| 9 | Small cell lung cancer | 9.71 |
| 10 | Pancreatic cancer | 3.26 |
The p values have been obtained through hypergeometric test.
In RMLMSe, the enrichment KEGG pathways of lung dataset.
| Enrichment KEGG pathways |
| |
|---|---|---|
| 1 | p53 signaling pathway | 5.15 |
| 2 | Pathways in cancer | 3.11 |
| 3 | Small cell lung cancer | 1.12 |
| 4 | Non-small cell lung cancer | 1.04 |
| 5 | Focal adhesion | 1.53 |
| 6 | Neurotrophin signaling pathway | 1.81 |
| 7 | Adherens junction | 6.05 |
| 8 | ErbB signaling pathway | 1.34 |
| 9 | Pathogenic | 1.89 |
| 10 | MAPK signaling pathway | 1.31 |
The p values have been obtained through hypergeometric test.