| Literature DB >> 25452690 |
Zixing Wang1, Wenlong Xu1, Haifeng Zhu2, Yin Liu3.
Abstract
MicroRNAs (miRNAs) are small regulatory RNAs that play key gene-regulatory roles in diverse biological processes, particularly in cancer development. Therefore, inferring miRNA targets is an essential step to fully understanding the functional properties of miRNA actions in regulating tumorigenesis. Bayesian linear regression modeling has been proposed for identifying the interactions between miRNAs and mRNAs on the basis of the integrated sequence information and matched miRNA and mRNA expression data; however, this approach does not use the full spectrum of available features of putative miRNA targets. In this study, we integrated four important sequence and structural features of miRNA targeting with paired miRNA and mRNA expression data to improve miRNA-target prediction in a Bayesian framework. We have applied this approach to a gene-expression study of liver cancer patients and examined the posterior probability of each miRNA-mRNA interaction being functional in the development of liver cancer. Our method achieved better performance, in terms of the number of true targets identified, than did other methods.Entities:
Keywords: gene expression; gene regulation; microRNA target prediction; prior information; sequence feature
Year: 2014 PMID: 25452690 PMCID: PMC4238384 DOI: 10.4137/CIN.S16348
Source DB: PubMed Journal: Cancer Inform ISSN: 1176-9351
Figure 1Trace plots for (A) the number of selected miRNA–mRNA interactions and (B) the log-posterior probability along the number of iterations.
Enrichment values of experimentally validated targets obtained from our feature-dependent model (feature-MCMC), the model without additional features (non-feature), and the GenMiR++ method. In the top 5000 or 500 predicted interactions, the numbers of experimentally validated targets (true positives), false positives, false negatives, and enrichment significance (P-value) are given. P-values were calculated on the basis of the hypergeometric distribution.
| MODEL | TOP | TRUE POSITIVE | FALSE POSITIVE | FALSE NEGATIVES | |
|---|---|---|---|---|---|
| Feature-MCMC | 5000 | 79 | 4921 | 530 | 2.23E-06 |
| 500 | 15 | 485 | 594 | 7.90E-05 | |
| Non-feature | 5000 | 68 | 4932 | 521 | 1.51E-04 |
| 500 | 15 | 485 | 594 | 7.90E-05 | |
| GenMir++ | 5000 | 56 | 4944 | 553 | 2.31E-02 |
| 500 | 10 | 490 | 599 | 5.41E-03 |
Predicted experimentally validated targets obtained from our feature-dependent model (feature-MCMC), the model without additional features (non-feature), and the GenMiR++ method.
| miRNA | GENE | FEATURE-MCMC | NON-FEATURE | GenMiR++ |
|---|---|---|---|---|
| hsa-mir-103a-3p | Smarce1 | × | × | |
| hsa-mir-103a-3p | FKBP1A | × | × | |
| hsa-mir-103a-3p | BCKDK | × | × | |
| hsa-mir-103a-3p | CCNE1 | × | × | |
| hsa-mir-103a-3p | aadat | × | × | |
| hsa-mir-103a-3p | SCAF1 | × | × | |
| hsa-mir-10a-5p | NDUFB6 | × | × | |
| hsa-mir-145–5p | aph1a | × | × | |
| hsa-mir-145–5p | MUC1 | × | × | |
| hsa-mir-16–5p | CCNE1 | × | × | × |
| hsa-mir-16–5p | Tppp3 | × | × | |
| hsa-mir-185–5p | CCNE1 | × | × | |
| hsa-mir-186–5p | TMEM183A | × | × | |
| hsa-mir-191–5p | Mpst | × | × | |
| hsa-mir-19b-3p | WBP2 | × | × | |
| hsa-mir-21–5p | TPM1 | × | × | × |
| hsa-mir-22–3p | BTF3L1 | × | × | |
| hsa-mir-24–3p | vps25 | × | × | |
| hsa-mir-24–3p | MARCKSL1 | × | × | |
| hsa-mir-29a-3p | DNMT3A | × | × | |
| hsa-mir-32–5p | Hivep1 | × | × | |
| hsa-mir-32–5p | BCAT2 | × | × | |
| hsa-mir-34a-5p | MAGEA12 | × | × | × |
| hsa-mir-34a-5p | Magea6 | × | × | × |
| hsa-mir-7–5p | TCOF1 | × | × | |
| hsa-mir-7–5p | Pole4 | × | × | |
| hsa-mir-7–5p | c18orf10 | × | × | |
| hsa-mir-7–5p | dtymk | × | × | × |
| hsa-mir-93–5p | Gramd1a | × | × |
Top 10 enriched KEGG pathways from our feature-dependent model (feature-MCMC), the model without additional features (non-feature), and the GenMiR++ method.
| FEATURE-MCMC MODEL | NUMBER OF GENES | |
|---|---|---|
| Pathways in cancer | 14 | 5.32E-07 |
| Focal adhesion | 10 | 8.61E-06 |
| Regulation of actin cytoskeleton | 9 | 5.32E-05 |
| Leukocyte transendothelial migration | 6 | 0.0003 |
| Pathways in cancer, focal adhesion | 6 | 0.0001 |
| Leukocyte transendothelial migration, adhere junction | 4 | 4.04E-05 |
| Adhere junction, bacterial invasion of epithelial cells | 4 | 3.94E-05 |
| Leukocyte transendothelial migration, adhere junction, tight junction | 3 | 0.0002 |
| Long-term depression, progesterone-mediated oocyte maturation | 3 | 0.0002 |
| Regulation of actin cytoskeleton, focal adhesion, leukocyte transendothelial migration, adhere junction | 3 | 0.0001 |
| Pathways in cancer | 8 | 0.0001 |
| Focal adhesion | 8 | 2.27E-05 |
| Regulation of actin cytoskeleton | 7 | 0.00022 |
| Huntington’s disease | 6 | 0.0002 |
| Pathways in cancer, focal adhesion | 5 | 0.0001 |
| Regulation of actin cytoskeleton, focal adhesion | 5 | 0.0001 |
| Pathway in cancer, focal adhesion, small cell lung cancer | 4 | 0.0001 |
| Pathway in cancer, focal adhesion, ECM-receptor interaction | 3 | 0.0003 |
| Regulation of actin cytoskeleton, focal adhesion, leukocyte transendothelial migration, bacterial invasion of epithelial cells | 3 | 0.0001 |
| Adhere junction, bacterial invasion of epithelial cells | 0.0001 | |
| Cell cycle | 10 | 6.41E-08 |
| Focal adhesion | 7 | 0.0003 |
| Pyrimidine metabolism | 6 | 8.34E-05 |
| Focal adhesion, amoebiasis | 6 | 1.83E-06 |
| Pathways in cancer, small cell lung cancer | 5 | 0.0003 |
| Focal adhesion, ECM-receptor interaction | 5 | 3.58E-05 |
| DNA replication | 4 | 0.0002 |
| Pathways in cancer, focal adhesion, amoebiasis | 4 | 9.02E-05 |
| Focal adhesion, ECM-receptor interaction, amoebiasis | 4 | 9.02E-05 |
| DNA replication, cell cycle | 3 | 5.44E-05 |
Figure 2Down-regulatory effect of hsa-miR-154 on MUC1. The liver cancer patients were grouped according to their hsa-miR-145 expression levels (higher or lower than average). The cumulative distribution of the MUC1 expression levels in these two groups of patients was plotted, respectively (high hsa-miR-154, red dashed line; low hsa-miR-154, blue solid line). The x-axis represents the MUC1 expression levels represented by the Reads Per Kilobase of transcript per Million mapped reads (RPKM) values from the RNA-seq data.