| Literature DB >> 24267917 |
Cory B Giles, Reshmi Girija-Devi, Mikhail G Dozmorov, Jonathan D Wren.
Abstract
BACKGROUND: Experimentally validated co-expression correlations between miRNAs and genes are a valuable resource to corroborate observations about miRNA/mRNA changes after experimental perturbations, as well as compare miRNA target predictions with empirical observations. For example, when a given miRNA is transcribed, true targets of that miRNA should tend to have lower expression levels relative to when the miRNA is not expressed.Entities:
Mesh:
Substances:
Year: 2013 PMID: 24267917 PMCID: PMC3850996 DOI: 10.1186/1471-2105-14-S14-S17
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Distribution of miRNAs and gene expression in RNA-seq samples. For each sequencing run, the read depth is calculated using the "bigWigAverageOverBed" utility. The per-transcript read depth averaged across all sequencing runs for coding and non-coding transcripts (a) and microRNAs (b) shows a typical log-normal distribution.
miR-target correlations in experimentally validated and computationally predicted subsets
| Dataset | Experimentally validated? | # miR-gene pairs | ΔCorrelation |
|---|---|---|---|
| miRecord | yes | 268 | -0.051 |
| miRanda (conserved) | no | 194414 | -0.009 |
| miRanda(nonconserved) | no | 965520 | -0.016 |
| TargetScan | no | 51487 | -0.015 |
For each miR-target database, the mean miR-gene expression correlation was calculated for miR-target pairs within the database and compared to the mean correlation for all possible miR-gene pairs. The difference between the mean correlation for pairs within a database and the mean correlation for all pairs is given as ∆correlation.
miRanda and TargetScan score parameters can predict co-expression
| Data Source | Parameter | Pearson Correlation | -log10 (P-Value) |
|---|---|---|---|
| miRanda conserved | Alignment Score | 0.041 | 69.83 |
| Conservation Score | -0.023 | 22.16 | |
| Free Energy | 0.024 | 23.33 | |
| mirSVR Score | -0.02 | 16.6 | |
| miRanda nonconserved | Alignment Score | 0.026 | 139.55 |
| Conservation Score | -0.025 | 130.38 | |
| Free Energy | -0.08 | > 307 | |
| mirSVR Score | 0.024 | 120.49 | |
| TargetScan | Aggregate PCT | 0.023 | 5.51 |
| Context Score | -0.112 | 141.13 | |
| # Conserved Sites | 0.0299 | 9.61 | |
| # Nonconserved Sites | -0.064 | 45.92 | |
| # Conserved 7mer-1a sites | 0.019 | 3.27 | |
| # Conserved 7mer-m8 sites | 0.014 | 1.39 | |
| # Conserved 8mer sites | 0.02 | 4.2 | |
| # Nonconserved 7mer-1a sites | -0.094 | 99.64 | |
| # Nonconserved 7mer-m8 sites | 0.031 | 10.64 | |
| # Nonconserved 8mer sites | -0.008 | 0 |
For each predicted miRNA-mRNA pair from the miRanda conserved, miRanda nonconserved, and TargetScan datasets, the Pearson correlation coefficient for co-expression across multiple RNA-seq experiments was obtained. The correlation coefficient was then compared to various predictor variables provided by miRanda/TargetScan, across all pairs, again using Pearson correlation. (P-values after Bonferroni correction; * indicates significance at p < 0.05; ** indicates significance at p < 0.001).
Figure 2Gene-miR pairs with negative expression correlations are overrepresented in predicted target databases. Pearson correlations were calculated for the subset of gene-miR pairs found in the miRanda and TargetScan databases, and binned by correlation value. For each bin, the ratio of observed to expected pairs (odds ratio) was calculated. Bins which have a positive log odds ratio are over-represented in predicted targeting pairs, and bins with negative log odds ratio are under-represented. miR-gene pairs with negative expression correlation are clearly enriched in miRanda predicted interacting pairs, whereas TargetScan shows a bimodal enrichment pattern. A) miRanda conserved, B) miRanda nonconserved, C) TargetScan.
Figure 3mirCoX web server user interface. (A) Screen capture of mirCoX application showing the online search screen. (B) A list of co-relational microRNA datasets based on the user's query by gene symbol and a menu bar with operations which can be performed on the data.