| Literature DB >> 20920310 |
Ludwig Christian G Hinske1, Pedro A F Galante, Winston P Kuo, Lucila Ohno-Machado.
Abstract
BACKGROUND: miRNAs are small, non-coding RNA molecules that mainly act as negative regulators of target gene messages. Due to their regulatory functions, they have lately been implicated in several diseases, including malignancies. Roughly half of known miRNA genes are located within previously annotated protein-coding regions ("intragenic miRNAs"). Although a role of intragenic miRNAs as negative feedback regulators has been speculated, to the best of our knowledge there have been no conclusive large-scale studies investigating the relationship between intragenic miRNAs and host genes and their pathways.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20920310 PMCID: PMC3091682 DOI: 10.1186/1471-2164-11-533
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Classification of miRNAs in the Genome of Different Species
| Organism | Intragenic miRNAs | Intergenic miRNAs | Intragenic miRNAs | ||
|---|---|---|---|---|---|
| Intronic | Exonic | miRNAs on Host Gene Strand | miRNAs on Opposite Host Strand | ||
| 296 (42.6%) | 37 (5.3%) | 362 (52.1%) | 282 (84.7%) | 51 (15.3%) | |
| 171 (35.4%) | 30 (6.2%) | 282 (58.4%) | 163 (78.2%) | 38 (21.8%) | |
| 3 (1.5%) | 0 (0%) | 201 (98.5%) | 2 (66.7%) | 1 (33.3%) | |
| 50 (10.7%) | 1 (0.2%) | 418 (89.1%) | 46 (90.2%) | 5 (9.8%) | |
| 48 (15.0%) | 1 (0.3%) | 271 (84.7%) | 39 (79.6%) | 10 (20.4%) | |
| 65 (42.8%) | 2 (1.3%) | 85 (55.9%) | 53 (79.1%) | 14 (20.9%) | |
| 51 (33.1%) | 1 (0.6%) | 102 (66.2%) | 33 (63.6%) | 19 (36.5%) | |
Intragenic miRNAs are found in many different species. However, the distribution of intra- and intergenic miRNAs differs. These numbers are obtained by crossing miRNA genomic coordinates with known transcript coordinates (based on RefSeq sequences).
Properties of Host and Target Genes
| Property | Gene Set | Median[Range] Host/Target | Median[Range] Control | Ratio | p-Value |
|---|---|---|---|---|---|
| Total length (basepairs) | Host Genes | 84871.0 [2792-2220381] | 29324.5 [599-2304633] | 2.89 | < 2.2e-16 |
| Target genes | 83747.5 [2366-2220381] | 30232.5 [218-2220381] | 2.77 | < 2.2e-16 | |
| Introns | Host Genes | 13[1-88] | 8[1-105] | 1.62 | 4.3e-13 |
| Target genes | 10.5[0-78] | 8[0-311] | 1.31 | 9.77e-07 | |
| Length 5'UTR (basepairs) | Host Genes | 279.5[0-385608] | 298.5[0-1098107] | 0.94 | 0.25 |
| Target genes | 439.5[0-460277] | 282.5[0-1098107] | 1.56 | 2.32e-08 | |
| Length 3'UTR (basepairs) | Host Genes | 1218.5[0-535884] | 872[0-321862] | 1.4 | 4.71e-05 |
| Target genes | 1764[171-11799] | 872[0-72058] | 2.2 | < 2.2e-16 | |
| ARE (absolute) | Host Genes | 2.0[0-1794] | 1.0[0-592] | 2.0 | 3.89e-04 |
| Target genes | 5.0[0-47] | 2.0[0-187] | 2.5 | < 2.2e-16 | |
| ARE (per kb) | Host Genes | 1.9[0-2.74] | 1.49[0-0.045] | 1.26 | 0.012 |
| Target genes | 2.69[0-14.22] | 1.63[0-76.92] | 1.65 | < 2.2e-16 | |
| 5' UTR GC content | Host Genes | 0.6[0.31-0.95] | 0.59[0-1] | 1.06 | 0.015 |
| Target genes | 0.59[0.26-1] | 0.58[0-1] | 1.01 | 0.71 | |
Host and target genes display similar properties, compared to a set of control genes, including increased length, higher number of total introns, longer 3'UTRs and higher frequency of "AU-rich elements" (AREs).
Figure 1Intragenic miRNA Properties. (a) The sizes of host gene introns closest to the TSS are significantly larger than those of respective controls. (b) Intronic miRNAs appear to be unevenly distributed across the intronic regions of their host genes. More then half of intronic miRNAs are located within the first five 5' introns of their hosts. (c) The observed target coverage (diamond) is at the upper end of the random distribution of target coverages for different signaling pathways, indicating that intragenic miRNAs have more targets within the pathway than would be expected by chance. (d) Intragenic miRNAs may control their host in the setting of a negative feedback circuitry not only through direct regulation of the host's transcription, but also on the order of the interactome, by controlling other genes in the host's pathway.
Pathways with Overrepresentation of Genes Targeted by an Intronic miRNA
| Pathway | Host Genes in Pathway | Target Coverage | p-Value | q-Value |
|---|---|---|---|---|
| MAPK Signaling | ATF2; DDIT3; AKT2; FGF13; ARRB1; PPP3CA; PRKCA; CACNG8; RPS6KA2; MAP2K4; RPS6KA4 | 61.4% | < 0.001 | < 0.001 |
| Axon Guidance | PPP3CA; PTK2; SEMA4G; SEMA3F; SLIT3; ABLIM2; SLIT2 | 70.3% | < 0.001 | < 0.001 |
| Ubiquitin Mediated Proteolysis | HUWE1; WWP2; BIRC6; ITCH | 53.8% | < 0.001 | < 0.001 |
| Focal Adhesion | COL3A1; AKT2; PRKCA; PTK2; TLN2 | 49.5% | < 0.001 | < 0.001 |
| Glioma | AKT2; PRKCA | 52.3% | < 0.001 | < 0.001 |
| Melanoma | AKT2; FGF13 | 50.7% | < 0.001 | < 0.001 |
| Regulation of Actin Cytoskeleton | CHRM2; FGF13; SSH1; PTK2 | 41.0% | < 0.001 | < 0.001 |
| Chronic Myloid Leukemia | AKT2 | 38.2% | < 0.001 | < 0.001 |
| Colorectal Cancer | AKT2 | 35.7% | < 0.001 | < 0.001 |
| Prostate Cancer | AKT2 | 34.8% | 0.001 | 0.007 |
| Melanogenesis | PRKCA | 21.6% | 0.001 | 0.007 |
| Pancreatic Cancer | AKT2 | 35.6% | 0.002 | 0.01 |
| ErbB Signaling | ERBB4; AKT2; PRKCA; PTK2; MAP2K4 | 51.7% | 0.003 | 0.02 |
| Glycan Structures Biosynthesis | MGAT4B; FUT8; CSGLCA-T; GALNT10; HS3ST3A1 | 50.8% | 0.003 | 0.02 |
| Gap Junction | HTR2C; PRKCA; PRKG1 | 47.9% | 0.005 | 0.02 |
| Non-Small Cell Lung Cancer | AKT2; PRKCA | 42.6% | 0.007 | 0.03 |
| Small Cell Lung Cancer | AKT2; PTK2 | 35.6% | 0.013 | 0.05 |
| Long-Term Depression | PRKCA; PRKG1 | 33.3% | 0.014 | 0.05 |
| Insulin Signaling | AKT2; SREBF1 | 36.0% | 0.014 | 0.05 |
| Long-Term Potentiation | PPP3CA; PRKCA; RPS6KA2 | 27.1% | 0.005 | 0.06 |
| T-Cell Receptor Signaling | AKT2; PPP3CA | 32.3% | 0.016 | 0.09 |
| Wnt Signaling | PPP3CA; PRKCA | 21.6% | 0.020 | 0.09 |
22 out of 74 pathways containing host genes show a significant overrepresentation of targets within the pathway at a FDR of 10%. Host genes that were predicted targets of their own miRNA were removed from the count. Interestingly, the list of pathways contains many pathways crucial for development and signal transduction, or associated with neoplastic transformation.
Figure 2Correlation of Predicted miRNA Targets with Hosts. The PI3K - PKB/AKT pathway is believed to be a key component in cancer development. We compared correlation of miRNA predicted targets (prediction agreement ≥ 2) to the respective host in normal (N) and tumor (T) tissue. Several of the hypothesized targets display features predicted by our model, such as AR, PDGFB, PDGFRB, AKT3 and MAPK1.
Conservation of miRNA-Host Pairs
| Organism | miRNA-Host Pairs Predicted to Target own Host | miRNA-Host Pairs Not Predicted to Target own Host | p-Value | ||
|---|---|---|---|---|---|
| Conserved | Total | Conserved | Total | ||
| 18 (35.2%) | 51 | 41 (24.5%) | 167 | 0.18 | |
| 1 (2.12%) | 47 | 0 | 142 | 0.56 | |
| 5 (12.5%) | 40 | 1 (0.71%) | 139 | 0.001 | |
The subset of intragenic miRNA host pairs where the miRNA is predicted to target its own host shows a tendency to be more conserved. However, statistical significance can only be shown for conservation between human and chicken (2-sample test for equality of proportions with continuity correction).
Figure 3Prediction Agreement as a Measure of Confidence. When constructing an ROCs on protein measurements, there is an almost linear relationship of the resulting AUCs and prediction agreement. This is also true for mRNA measurements, though the slope is less steep.