| Literature DB >> 20576699 |
Igor Ulitsky1, Louise C Laurent, Ron Shamir.
Abstract
While it has been established that microRNAs (miRNAs) play key roles throughout development and are dysregulated in many human pathologies, the specific processes and pathways regulated by individual miRNAs are mostly unknown. Here, we use computational target predictions in order to automatically infer the processes affected by human miRNAs. Our approach improves upon standard statistical tools by addressing specific characteristics of miRNA regulation. Our analysis is based on a novel compendium of experimentally verified miRNA-pathway and miRNA-process associations that we constructed, which can be a useful resource by itself. Our method also predicts novel miRNA-regulated pathways, refines the annotation of miRNAs for which only crude functions are known, and assigns differential functions to miRNAs with closely related sequences. Applying our approach to groups of co-expressed genes allows us to identify miRNAs and genomic miRNA clusters with functional importance in specific stages of early human development. A full list of the predicted mRNA functions is available at http://acgt.cs.tau.ac.il/fame/.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20576699 PMCID: PMC2926627 DOI: 10.1093/nar/gkq570
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.(A) FAME outline. A bipartite graph is constructed with edges corresponding to miRNA–target predicted pairs and edge weights determined by TargetScan context scores. Degree-preserving graph randomization is used to evaluate the significance of the total weight of the edges connecting a designated set of miRNAs to a designated set of targets, by computing an empirical P-value. (B) UTR lengths and the average number of miRNA regulators for genes highly expressed in various stem cell-related cell lines. See methods for the description of the mRNA dataset. The numbers are averages over the 200 genes with the highest average expression levels in each group of cell lines.
Figure 2.Comparison of methods for detection of enrichment of miRNA targets. (A) For each miRNA family, all the KEGG pathways were tested for enrichment of miRNA targets and ranked in increasing order of P-value. In case of ties, annotations were ranked in decreasing order of z-score. The chart shows the relative position of the compendium function in each list. (B) Average location of the known KEGG pathway in the ranked lists obtained by using FAME and the HG and LLR tests. Error bars represent one standard error. (C) Average location of the known GO ‘biological process’ annotation in the ranked lists of the three methods. (D) Same as C, but taking into account only annotations that were placed in the top 10% by at least one of the methods.
KEGG pathways predicted by FAME to be regulated by miRNAs
| miRNA | KEGG pathway | Number of targets | Weight enrichment factor | |
|---|---|---|---|---|
| let-7/98 | Aminoacyl-tRNA biosynthesis | 3 | 1.3 × 10−3 | 9.73 |
| mir-1/206 | SNARE interactions in vesicular transport | 6 | 6.5 × 10−4 | 3.64 |
| mir-103/107 | Hedgehog-signaling pathway | 8 | 2.0 × 10−4 | 3.81 |
| mir-122 | Glycolysis / gluconeogenesis | 3 | 5.0 × 10−4 | 17.45 |
| mir-124/506 | Metabolic pathways | 70 | 1.0 × 10−4 | 1.54 |
| mir-125/351 | Tyrosine metabolism | 3 | 8.0 × 10−4 | 8.13 |
| mir-125a-3p | Cytokine–cytokine receptor interaction | 4 | 6.3 × 10−3 | 3.44 |
| mir-129/129-5p | Cardiac muscle contraction | 6 | 1.0 × 10−4 | 8.89 |
| mir-132/212 | TGF-beta-signaling pathway | 10 | 5.5 × 10−4 | 3.10 |
| mir-138 | Axon guidance | 13 | 8.5 × 10−4 | 2.34 |
| mir-139-5p | Purine metabolism | 4 | 3.9 × 10−3 | 3.60 |
| mir-140/140-5p/876-3p | Notch-signaling pathway | 4 | 4.3 × 10−3 | 4.49 |
| mir-142-3p | Regulation of actin cytoskeleton | 12 | 2.5 × 10−4 | 2.61 |
| mir-143 | Natural killer cell mediated cytotoxicity | 4 | 4.1 × 10−3 | 3.38 |
| mir-145 | Axon guidance | 17 | 1.0 × 10−4 | 2.55 |
| mir-146 | Toll-like receptor-signaling pathway | 3 | 1.0 × 10−4 | 10.47 |
| mir-148/152 | Basal transcription factors | 4 | 4.5 × 10−3 | 4.41 |
| mir-15/16/195/424/497 | Cell cycle | 17 | 1.0 × 10−4 | 2.19 |
| mir-150 | Wnt-signaling pathway | 7 | 6.9 × 10−3 | 2.80 |
| mir-155 | T cell receptor-signaling pathway | 10 | 1.0 × 10−3 | 3.17 |
| mir-185/882 | GnRH-signaling pathway | 4 | 8.6 × 10−3 | 3.28 |
| mir-190 | Cell adhesion molecules (CAMs) | 5 | 3.1 × 10−3 | 5.03 |
| mir-194 | TGF-beta-signaling pathway | 9 | 4.3 × 10−3 | 2.67 |
| mir-202/202-3p | ECM-receptor interaction | 11 | 1.3 × 10−3 | 2.69 |
| mir-203 | Insulin-signaling pathway | 14 | 3.0 × 10−3 | 1.98 |
| mir-205 | PPAR-signaling pathway | 4 | 3.3 × 10−3 | 4.84 |
| mir-208/208ab | Wnt-signaling pathway | 6 | 7.3 × 10−3 | 3.02 |
| mir-21/590-5p | Cytokine-cytokine receptor interaction | 11 | 1.0 × 10−4 | 4.33 |
| mir-217 | Gap junction | 4 | 8.6 × 10−3 | 2.98 |
| mir-218 | Heparan sulfate biosynthesis | 6 | 1.0 × 10−4 | 4.82 |
| mir-219/219-5p | Ether lipid metabolism | 3 | 4.3 × 10−3 | 7.21 |
| mir-23ab | Glycosphingolipid biosynthesis—lacto and neolacto series | 6 | 2.0 × 10−4 | 4.48 |
| mir-24 | Alanine and aspartate metabolism | 3 | 1.4 × 10−3 | 8.17 |
| mir-27ab | Neuroactive ligand-receptor interaction | 16 | 1.2 × 10−3 | 2.09 |
| mir-28/28-5p/708 | Jak-STAT-signaling pathway | 4 | 2.4 × 10−3 | 3.25 |
| mir-299/299-3p | Focal adhesion | 3 | 5.9 × 10−2 | 2.66 |
| mir-29abc | ECM-receptor interaction | 21 | 1.0 × 10−4 | 6.13 |
| mir-324-5p | TGF-beta-signaling pathway | 4 | 2.5 × 10−2 | 3.18 |
| mir-326/330/330-5p | Arachidonic acid metabolism | 3 | 2.5 × 10−4 | 17.91 |
| mir-33/33ab | Antigen processing and presentation | 3 | 7.7 × 10−3 | 5.98 |
| mir-339-5p | ErbB-signaling pathway | 3 | 3.2 × 10−2 | 3.03 |
| mir-346 | Wnt-signaling pathway | 5 | 1.0 × 10−2 | 3.32 |
| mir-34a/34b-5p/34c/34c-5p/449/449abc/699 | 4 | 1.3 × 10−3 | 4.62 | |
| mir-361/361-5p | Nucleotide excision repair | 3 | 1.0 × 10−4 | 23.84 |
| mir-365 | Apoptosis | 4 | 2.2 × 10−3 | 5.04 |
| mir-374/374ab | Retinol metabolism | 4 | 1.0 × 10−4 | 10.75 |
| mir-375 | Purine metabolism | 4 | 9.3 × 10−3 | 4.10 |
| mir-376/376ab/376b-3p | Neuroactive ligand-receptor interaction | 4 | 5.8 × 10−3 | 3.66 |
| mir-377 | Ubiquitin mediated proteolysis | 9 | 3.1 × 10−3 | 2.44 |
| mir-378/422a | Hedgehog-signaling pathway | 4 | 2.0 × 10−3 | 7.22 |
| mir-379 | Adherens junction | 3 | 2.9 × 10−2 | 3.92 |
| mir-384/384-3p | Lysine degradation | 4 | 5.5 × 10−4 | 8.32 |
| mir-410 | Heparan sulfate biosynthesis | 4 | 1.6 × 10−3 | 4.29 |
| mir-411 | Ubiquitin mediated proteolysis | 3 | 3.6 × 10−2 | 3.43 |
| mir-431 | Adherens junction | 3 | 8.5 × 10−2 | 2.37 |
| mir-433 | Cell cycle | 7 | 6.0 × 10−4 | 3.93 |
| mir-485/485-5p | Metabolic pathways | 17 | 3.5 × 10−4 | 2.49 |
| mir-486/486-5p | Focal adhesion | 6 | 1.8 × 10−3 | 3.28 |
| mir-490/490-3p | Adipocytokine-signaling pathway | 3 | 1.9 × 10−2 | 4.33 |
| mir-496 | CAMs | 3 | 7.3 × 10−3 | 5.05 |
| mir-503 | p53-signaling pathway | 6 | 1.0 × 10−4 | 5.85 |
| mir-543 | Circadian rhythm—mammal | 4 | 8.0 × 10−4 | 5.33 |
| mir-592/599 | mTOR-signaling pathway | 3 | 7.5 × 10−3 | 4.02 |
| mir-7/7ab | Purine metabolism | 5 | 1.8 × 10−3 | 3.55 |
| mir-758 | Toll-like receptor-signaling pathway | 3 | 5.7 × 10−3 | 5.31 |
| mir-874 | Calcium-signaling pathway | 5 | 1.0 × 10−2 | 2.99 |
| mir-875-5p | Cytokine–cytokine receptor interaction | 3 | 3.7 × 10−2 | 3.32 |
| mir-96/1271 | Glycosphingolipid biosynthesis—ganglio series | 4 | 5.0 × 10−4 | 5.86 |
| mir-99ab/100 | Melanogenesis | 3 | 1.1 × 10−3 | 8.65 |
Only the top prediction for each miRNA family and with FDR < 0.1 are shown. ‘Weight enrichment factor’ is the ratio between the total weight of the edges between the miRNA and the pathway genes in the bipartite graph G, and the average weight of such edges in 10 000 random graphs.
Refinement of known miRNA functions
| miRNA family | Known function (rank) | Proposed refined function (rank) |
|---|---|---|
| mir-146 | Immune response (−) | I-κB kinase/NF-κB cascade ( |
| miR-21/590-5p | Protein kinase cascade (178) | Negative regulation of MAP kinase activity ( |
| mir-192/215 | Regulation of cell cycle ( | Regulation of progression through cell cycle ( |
| mir-17-5p/20/93.mr/106/519.d | Regulation of cell cycle ( | Negative regulation of progression through cell cycle ( |
| mir-205 | Epithelial cell differentiation (−) | Establishment or maintenance of cell polarity ( |
| mir-141/200a | Epithelial cell differentiation (62) | Morphogenesis of embryonic epithelium ( |
| mir-1/206 | Glucose metabolism ( | Glucose catabolic process ( |
| mir-1/206 | Regulation of apoptosis ( | Anti-apoptosis ( |
| mir-9 | Neuron development ( | Peripheral nervous system development ( |
| mir-130/301 | Angiogenesis ( | Blood vessel morphogenesis ( |
| mir-29abc | Regulation of apoptosis (377) | Apoptotic program ( |
Relative ranks of known non-specific miRNA functions and proposed specific functions. (−) in the ‘Known function’ column indicates that the GO set corresponding to that function contained less than three miRNA targets, and thus this function was not ranked.
Figure 3.Performance of methods for enrichment detection on co-expression clusters. Out of the 1323 possible miRNA–cluster pairs, those with a correlation of r > 0.5 or r < −0.5 between the miRNA and the average mRNA expression were marked as ‘high’ (∼10% for each direction). The plots show the fraction of the 100 most significant miRNA–cluster pairs found by FAME and the HG test that fell into the ‘high’ category.
Figure 4.mRNA co-expression clusters and miRNA regulation in stem cell lines. (A) Enrichment and depletion of miRNA targets in co-expression clusters. Purple (green) squares indicate over- (under-) representation of miRNA targets in a cluster. Names of genomic clusters of miRNAs (Supplementary Table S1) are written in red. Only clusters with at least 30 genes that were enriched with targets of at least one miRNA with P < 3 × 10−3 and FDR < 0.1 are shown. Cluster-miRNA pairs with P > 0.05 are not shown (white squares). (B–F) Average expression levels of the mRNAs in co-expression clusters and of miRNAs in different families. The top rows in each subfigure show average mRNA expression of the co-expression clusters and the matrices below them show the expression of the miRNA families under the same conditions. The expression pattern of each miRNA and each mRNA were normalized to mean 0 and SD of 1. Fib., fibroblasts; CC, choriocarcinoma (placental cancer).
Figure 5.The 3′ UTR length bias. The 21 clusters in the SCD were divided into four bins, with five to six clusters in each bin, based on the average UTR length in each cluster. The total number of significant enrichments (FDR < 0.1) is shown for each bin.