| Literature DB >> 23325850 |
Leandro Castellano1, Justin Stebbing.
Abstract
MicroRNAs (miRNAs) are small RNA molecules that regulate gene expression. They are characterized by specific maturation processes defined by canonical and non-canonical biogenic pathways. Analysis of ∼0.5 billion sequences from mouse data sets derived from different tissues, developmental stages and cell types, partly characterized by either ablation or mutation of the main proteins belonging to miRNA processor complexes, reveals 66 high-confidence new genomic loci coding for miRNAs that could be processed in a canonical or non-canonical manner. A proportion of the newly discovered miRNAs comprises mirtrons, for which we define a new sub-class. Notably, some of these newly discovered miRNAs are generated from untranslated and open reading frames of coding genes, and we experimentally validate these. We also show that many annotated miRNAs do not present miRNA-like features, as they are neither processed by known processing complexes nor loaded on AGO2; this indicates that the current miRNA miRBase database list should be refined and re-defined. Accordingly, a group of them map on ribosomal RNA molecules, whereas others cannot undergo genuine miRNA biogenesis. Notably, a group of annotated miRNAs are Dgcr8 independent and DICER dependent endogenous small interfering RNAs that derive from a unique hairpin formed from a short interspersed nuclear element.Entities:
Mesh:
Substances:
Year: 2013 PMID: 23325850 PMCID: PMC3597668 DOI: 10.1093/nar/gks1474
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Schematic representation for the known biogenic pathway of miRNA processing and maturation.
Figure 2.A fraction of the annotated miRNAs is Dgcr8 independent DICER dependent or Dgcr8 independent DICER independent. (A) Number of canonical miRNAs (Dgcr8 dependent, DICER dependent), Dgcr8 independent miRNAs and DICER independent miRNAs from the miRBase release 18, expressed in mESCs. (B) Classification of genomic repeats containing miRNAs using RepeatMasker on precursors downloaded from the miRBase release 18.
Classification according to previous literature of Dgcr8 independent DICER dependent miRNAs retrieved by this study in mESCs
| miRNA | miRNA biogenesis mechanism | Description by Berezikov | Description by Babiarz | Description by Chiang | Description by Ender | Description by Westholm |
|---|---|---|---|---|---|---|
| miR-1186 | DICER dependent | |||||
| Dgcr8 independent | ||||||
| miR-1195 | DICER dependent | |||||
| Dgcr8 independent | ||||||
| miR-1196 | DICER dependent | |||||
| Dgcr8 independent | ||||||
| miR-1839 | DICER dependent | ACA45 snoRNA | ||||
| Dgcr8 independent | ||||||
| miR-1843 | DICER dependent | |||||
| Dgcr8 independent | ||||||
| miR-1843b | DICER dependent | |||||
| Dgcr8 independent | ||||||
| miR-1935 | DICER dependent | |||||
| Dgcr8 independent | ||||||
| miR-1965 | DICER dependent | |||||
| Dgcr8 independent | ||||||
| miR-1981 | DICER dependent | Mirtron | ||||
| Dgcr8 independent | ||||||
| miR-1982 | DICER dependent | 5′-tailed mirtron | ||||
| Dgcr8 independent | ||||||
| miR-1983 | DICER dependent | tRNA-shRNA | ||||
| Dgcr8 independent | ||||||
| miR-3062 | DICER dependent | Canonical | ||||
| Dgcr8 independent | ||||||
| miR-3068 | DICER dependent | Canonical | ||||
| Dgcr8 independent | ||||||
| miR-3084 | DICER dependent | shRNA | ||||
| Dgcr8 independent | ||||||
| miR-3102 | DICER dependent | Sequentially diced mirtron | ||||
| Dgcr8 independent | ||||||
| miR-320 | DICER dependent | shRNA | ||||
| Dgcr8 independent | ||||||
| miR-344 | DICER dependent | shRNA | ||||
| Dgcr8 independent | ||||||
| miR-484 | DICER dependent | shRNA | ||||
| Dgcr8 independent | ||||||
| miR-5132 | DICER dependent | Mirtron | ||||
| Dgcr8 independent | ||||||
| miR-664 | DICER dependent | shRNA | ||||
| Dgcr8 independent | ||||||
| miR-668 | DICER dependent | shRNA | ||||
| Dgcr8 independent | ||||||
| miR-702 | DICER dependent | Mirtron | ||||
| Dgcr8 independent | ||||||
| miR-877 | DICER dependent | Mirtron | ||||
| Dgcr8 independent |
Figure 3.miR-3062 belongs to a new class of mirtrons. Top: the predicted RNA secondary structure of miR-3062; bottom: the distribution of reads across the precursor. Comparison of rpm between WT, DICER KO and Dgcr8 KO mESCs and cortex indicates that their biogenesis is Dgcr8 independent and DICER dependent. Green boxes indicate acceptor and donor splicing sites.
Figure 4.Somatic tissues express endogenous siRNAs derived from sequential DICER cleavage (A) Annotated predicted RNA secondary structure of mmu-miR-1965 retrieved from miRBase database. Highlighted in red is the miR-1965 mapped on the predicted annotated structure. (B) Chromosomal localization of the predicted precursor of miR-1965. It overlaps with two convergent SINE elements that after transcription fold in a long double-stranded RNA structure that gets processed by DICER. (C) Predicted RNA secondary structure of the SINE elements that are sequentially processed by DICER. Number of reads derived from WT mESCs and corresponding annotated miRNAs are mapped on the RNA structure. The number represents exclusively the most represented reads. (D) Reads per million of miR-1965 in different mouse tissues. (E) Predicted secondary structure of the transcript derived from SINE elements located on two different chromosomal locations, with the most abundant small RNAs derived from bone marrow, pancreas, liver and NIH3T3 embryonic fibroblasts from mouse.
Description of the novel miRNAs identified in this study
| Provisional ID | Total read count | Precursor coordinate | Genomic location | Gene | Type |
|---|---|---|---|---|---|
| chr1_2655 | 853 | chr1:172994721..172994783:− | Intron | shRNA (DGCR8−/−) | |
| chr15_37558 | 788 | chr15:25877982..25878042:+ | Intron | Canonical | |
| chrX_47068 | 292 | chrX:132281255..132281309:+ | Intron | shRNA (DGCR8−/−) | |
| chr5_14110 | 254 | chr5:123972645..123972704:− | Intron | shRNA (DGCR8−/−) | |
| chr15_38333 | 227 | chr15:90054791..90054844:+ | 5′-UTR | DGCR8−/− | |
| chr6_17117 | 156 | chr6:128924857..128924913:− | Exon | Canonical | |
| chr9_23196 | 146 | chr9:15117055..15117114:+ | Intron | shRNA | |
| chr11_28442 | 136 | chr11:6301633..6301697:+ | Intron | Canonical | |
| chr2_3181 | 117 | chr2:29701167..29701232:+ | Intergenic | Canonical | |
| chr5_11864 | 108 | chr5:34916800..34916863:+ | Intron | Canonical | |
| chr11_28874 | 101 | chr11:55314728..55314784:+ | Intron | Canonical | |
| chr11_29306 | 106 | chr11:77992329..77992390:+ | Span 5′-UTR-CDS | Canonical | |
| chr8_21975 | 96 | chr8:8690092..8690162:− | CDS | Canonical | |
| chr2_3037 | 88 | chr2:17979473..17979531:+ | Intron | shRNA (DGCR8−/−) | |
| chr9_23976 | 76 | chr9:94652248..94652310:+ | Intron | Canonical | |
| chr3_8489 | 93 | chr3:122244542..122244599:− | 3′-UTR | DGCR8−/− | |
| chr4_11627 | 78 | chr4:155604167..155604224:− | CDS | Canonical | |
| chr15_39359 | 73 | chr15:79829399..79829456:− | Intron | Canonical | |
| chr10_27859 | 70 | chr10:77802894..77802952:− | Intron | Canonical | |
| chr6_15465 | 54 | chr6:115725675..115725738:+ | Intron | Canonical | |
| chr6_15427 | 62 | chr6:108439737..108439798:+ | Intron | Two-tailed mirtron | |
| chr10_27686 | 49 | chr10:61656914..61656975:− | Intron | Canonical | |
| chr10_27394 | 41 | chr10:22449177..22449238:− | Intron | Canonical | |
| chr2_3125 | 46 | chr2:24915331..24915394:+ | Intron | Canonical | |
| chr2_4218 | 45 | chr2:119843253..119843316:+ | Intron | 5′-tailed mirtron | |
| chr8_22282 | 700 | chr8:41393228..41393283:− | Intron | shRNA (DGCR8−/−) | |
| chr17_33821 | 390 | chr17:29732483..29732546:+ | Intron | Canonical | |
| chr7_14441 | 301 | chr7:30691528..30691587:+ | Intron | shRNA (DGCR8−/−) | |
| chr3_6937 | 176 | chr3:89815074..89815132:− | Intron | Canonical | |
| chr6_12274 | 109 | chr6:38457325..38457394:+ | Intron | Canonical | |
| chr11_24328 | 76 | chr11:53577098..53577156:− | Intergenic | Canonical | |
| chr14_30051 | 71 | chr14:27489032..27489094:− | Span 5′-UTR-CDS | Canonical | |
| chrX_38038 | 46 | chrX:91519913..91519977:+ | Intron | Canonical | |
| chr7_14452 | 53 | chr7:31374399..31374457:+ | AS intron | shRNA (DGCR8−/−) | |
| chr2_3012 | 50 | chr2:31906466..31906529:+ | Intron | 5′-tailed mirtron | |
| chr12_25218 | 42 | chr12:11306693..11306751:+ | Intron | Canonical | |
| chr15_38048 | 519 | chr15:78179029..78179091:+ | CDS | Canonical | |
| chr15_39273 | 519 | chr15:78115217..78115279:− | CDS | Canonical | |
| chr2_4044 | 178 | chr2:94081535..94081600:+ | Intergenic | Canonical | |
| chr7_18793 | 171 | chr7:3219198..3219260:− | Intergenic | Canonical | |
| chr10_27945 | 179 | chr10:80878984..80879042:− | Intron | Canonical | |
| chr18_44110 | 44 | chr18:61113527..61113587:+ | Intron | Canonical | |
| chr17_41247 | 24 | chr17:27228190..27228257:+ | Intron | 5′-tailed mirtron | |
| chr2_6270 | 36 | chr2:164180048..164180119:− | Intron | 5′-tailed mirtron | |
| chr11_30735 | 286 | chr11:82798504..82798567:− | Intron | Canonical | |
| chr16_40566 | 301 | chr16:21256085..21256139:− | Intron | Canonical | |
| chr19_45771 | 77 | chr19:55267168..55267225:+ | Intron | Canonical | |
| chrX_47993 | 293 | chrX:146981927..146981988:− | Intron | Canonical | |
| chrX_47598 | 369 | chrX:64075830..64075888:− | Intergenic | Canonical | |
| chr1_19965 | 56 | chr1:53906268..53906341:+ | AS intron | Canonical | |
| chr1_20693 | 305 | chr1:153289523..153289581:+ | Intron | Canonical | |
| chr1_8658 | 1116 | chr1:173571968..173572033:− | Intron | Canonical | |
| chr11_4307 | 48 | chr11:28646198.28646265:− | Intergenic | Canonical | |
| chr15_6934_1 | 29 | chr15:82224348..82224409:− | Span 5′-UTR-CDS | Canonical | |
| chr15_6934_2 | 29 | chr15:82443316..82443377:− | Intergenic | Canonical | |
| chr15_6934_3 | 29 | chr15:82612504..82612572:+ | Span 5′-UTR-CDS | Canonical | |
| chr17_15196 | 347 | chr17:34161957..34162019:+ | Intron | Canonical | |
| chr2_13190 | 80 | chr2:26446881..26446942:− | AS intron | Canonical | |
| chr2_23582 | 185 | chr6:28972622..28972685:− | Intergenic | Canonical | |
| chr2_25696 | 52 | chr2:158069240..158069301:− | Intergenic | Canonical | |
| chr7_32398 | 1608 | chr7:147143465..147143521:− | Intergenic | Canonical | |
| chr7_6169 | 71 | chr7:3218640..3218696:− | Intergenic | Canonical | |
| chr8_40622 | 649 | chr8:123062596..123062654:+ | Intron | Canonical | |
| chr18_17432 | 642 | chr18:56697793..56697853:− | Intron | shRNA (DGCR8−/−) | |
| chr10_1158 | 925 | chr10:92895852..92895909:+ | Intergenic | LTR ERV1 (DGCR8−/−) | |
| chr5_30975 | 581 | chr5:36644219..36644278:+ | AS intron | SINE Mirb (DGCR8−/−) |
Figure 5.Classification of the novel miRNAs. (A) Genomic location of novel miRNAs. (B) Classification of the transcripts that produce novel miRNAs. (C) Level of conservation of the newly identified miRNAs.
Figure 6.Validation of the novel candidate miRNAs. (A) Heat map showing the expression of the novel miRNAs in NIH3T3, AGO2 from NIH3T3, WT mESCs, DICER−/− mESCs, Dgcr8−/− mESCs and CLIP of AGO2 from mESCs from two repeats. The sum of the normalized reads is provided in the Supplementary Table S5. (B) Table showing normalized miRNA expression and P-values from deep sequencing of small RNAs after ectopic expression in cells of a group of miRNAs selected from this study. This experiment has been conducted in triplicate. (C) Top: the schematic representation of the cloning of six complementary sites to the novel miRNA chr7_32398 as a UTR of the firefly luciferase in the pMIR-REPORT vector, forming a chr7_32398-sensor construct; bottom: firefly luciferase relative levels from cells transfected with empty vector and sensor versus cells transfected with sequences coding for pri-chr7_32398 and chr7_32398-sensor construct. This experiment has been conducted in triplicate on three independent occasions.
Figure 7.Novel candidate miRNA expression profiles. Expression of normalized reads of novel discovered miRNAs across different embryonic and adult mouse tissues. The sum of the normalized reads is provided in the Supplementary Table S6.