| Literature DB >> 19038026 |
Harpreet K Saini1, Anton J Enright, Sam Griffiths-Jones.
Abstract
BACKGROUND: MicroRNAs (miRNAs) are important regulators of gene expression and have been implicated in development, differentiation and pathogenesis. Hundreds of miRNAs have been discovered in mammalian genomes. Approximately 50% of mammalian miRNAs are expressed from introns of protein-coding genes; the primary transcript (pri-miRNA) is therefore assumed to be the host transcript. However, very little is known about the structure of pri-miRNAs expressed from intergenic regions. Here we annotate transcript boundaries of miRNAs in human, mouse and rat genomes using various transcription features. The 5' end of the pri-miRNA is predicted from transcription start sites, CpG islands and 5' CAGE tags mapped in the upstream flanking region surrounding the precursor miRNA (pre-miRNA). The 3' end of the pri-miRNA is predicted based on the mapping of polyA signals, and supported by cDNA/EST and ditags data. The predicted pri-miRNAs are also analyzed for promoter and insulator-associated regulatory regions.Entities:
Mesh:
Substances:
Year: 2008 PMID: 19038026 PMCID: PMC2632650 DOI: 10.1186/1471-2164-9-564
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Distribution of conserved and non-conserved pre-miRNAs in the human, mouse and rat genomes, with respect to protein-coding gene annotation.
| 114 | 66 | 66 | |
| 20 | 28 | 7 | |
| 3 | 0 | 0 | |
| 18 | 4 | 4 | |
| 67 | 16 | ||
| 13 | 32 | ||
| 1 | 3 | ||
| 47 | 107 | ||
| 31 | 35 | ||
| 1 | 4 |
Figure 1Examples of inconsistent genomic contexts of conserved pre-miRNAs in human, mouse and rat. Connecting lines indicate orthology. (a) Human mir-22 is located in the exon of the RefSeq gene (accession NP_116284.2), which has no annotated ortholog in mouse and rat; (b) Human and mouse miR-148b are located in the first intron of the COPZ1 transcript. The terminal 5' exon of orthologous transcript NP_001101587.1 in rat is missing, locating miR-148b upstream of NP_001101587.1; (c) Rat miR-142 is located in the first exon of the alternatively spliced transcript Q9JIR0-2. The orthologous alternative transcripts in human and mouse lacks the 5' exon, placing human and mouse miR-142 upstream of the BZRAP1 transcript.
Paralogous families of human pre-miRNAs.
| Family | Members |
| 1 | mir-105-1 mir-105-2†m |
| 2 | mir-147 mir-147b†† |
| 3 | mir-153-1 mir-153-2†† |
| 4 | mir-199a-1 mir-199a-2†† mir-199b†m |
| 5 | mir-220a mir-220b |
| 6 | mir-329-2 mir-329-1†† |
| 7 | mir-374a mir-374b†† |
| 8 | mir-376a-2 mir-376a-1†† mir-376b†† mir-376c†† |
| 9 | mir-422a mir-378†† |
| 10 | mir-450a-2 mir-450a-1†† mir-450b†m |
| 11 | mir-487a mir-487b†† mir-539†† mir-154†† |
| 12 | mir-500 mir-501†† mir-502†† |
| 13 | mir-509-1 mir-509-2 mir-509-3 mir-514-1 mir-514-2 mir-514-3 mir-510 |
| 14 | mir-511-1 mir-511-2†m |
| 15 | mir-512-1 mir-512-2 |
| 16 | mir-513a-1 mir-513a-2 mir-513c mir-513b |
| 17 | mir-520e mir-520c mir-524 mir-515-2 mir-515-1 mir-525 mir-519b mir-519a-2 mir-520f mir-516b-2 mir-519d mir-516a-2 mir-526a-1 mir-517c mir-518e mir-521-2 mir-516a-1 mir-516b-1 mir-518c mir-527 mir-520g mir-526b mir-520b mir-517a mir-519e mir-526a-2 mir-518f mir-522 mir-517b mir-518a-2 mir-519c mir-523 mir-520d mir-521-1 mir-520h mir-518d mir-520a mir-518b mir-518a |
| 18 | mir-550-1 mir-550-2 |
| 19 | mir-570 mir-548c mir-548d-2 mir-548a-3 mir-548b mir-548d-1 mir-603 mir-548a-1 mir-548a-2 |
| 20 | mir-891a mir-891b |
| 21 | mir-941-1 mir-941-2 mir-941-3 mir-941-1 |
†† orthologs in mouse and rat. †m orthologs in mouse only.
Paralogous families of mouse pre-miRNAs.
| Family | Members |
| 1 | mir-135a-1 mir-135a-2†† mir-135b†† |
| 2 | mir-199a-1 mir-199b †h mir-199a-2†† |
| 3 | mir-291b mir-291a†r |
| 4 | mir-344-1 mir-344-2†h |
| 5 | mir-450a-2 mir-450a-1†† |
| 6 | mir-465a mir-465b-2 mir-465b-1 mir-465c-1 mir-465c-2†h |
| 7 | mir-466a mir-466e mir-466c mir-466b-1 mir-466b-3 mir-466b-2 mir-466d mir-466h mir-466g mir-669a-1 mir-669a-2 mir-669a-3 mir-466f-1 mir-669c mir-669b mir-297b mir-297c mir-466f-3 mir-466f-2 mir-297a-4 mir-297a-3 mir-297a-5 mir-297a-2 |
| 8 | mir-467a mir-467b mir-467c mir-467d mir-467e |
| 9 | mir-680-1 mir-680-2 mir-680-3 |
| 10 | mir-684-1 mir-684-2 |
| 11 | mir-692-1 mir-692-2 |
| 12 | mir-883b mir-883a†r |
†† orthologs in mouse and rat. †h orthologs in human only. †r orthologs in rat only.
Paralogous families of rat pre-miRNAs.
| Family | Members |
| 1 | mir-466b-1 mir-466b-2 |
| 2 | mir-664-1 mir-664-2 |
Polycistronic clusters of intergenic miRNAs with complete EST/cDNA coverage.
| mir-497~195 | Human: CR737132, DB266639, DA2895925, BI752321, AA631714 | Human: AK098506.1 |
| Rat: CV105515 | ||
| mir-144-451 | Human: R28106 | Mouse: AK158085.1 |
| Rat: AW919398, BF2869095, AI008234 | ||
| mir-99b~let-7e~mir-125a | Human: DB340912 | Human: AK125996 |
| mir-143~145 | Human: BM702257 | |
| mir-181a-1~181b-1 | Human: DA528985, BX355821 | |
| Mouse: BE332980, CA874578 | ||
| mir-29b-2~29c | Human: BF089238 | Mouse: AK081202, BC058715 |
| mir-298~296 | Human: W37080 | |
| mir-183~96~182 | Human: CV424506 | |
| mir-181c~181d | Human: AI801869, CB961518, CB991710, BU729805, CB996698, BM702754 | |
| Mouse: CJ191375 | ||
| mir-100~let-7a-2 | Human: DA545600, DA579531, DA474693, DA558986, DA600978 | Human: AK091713 |
| Mouse: BB657503, BM936455 | ||
| Rat: BF412891, BF412890, BF412889, BF412895 | Mouse: AK084170 | |
| mir-374b~421 | Human: DA706043, DA721080 | Human: AK125301 |
| Rat: BF559199, BI274699 | Mouse: BC027389, AK035525, BC076616, AK085125 | |
| mir-34b~34c | Human: BC021736 | |
| mir-15a-16-1 | Human: BG612167, BU932403, BG613187, BG500819 | Human: BC022349, BC022282, BC070292, BC026275, BC055417, AF264787 |
| Mouse: AI789372, BY718835 | Mouse: AK134888, AF380423, AF380425, AK080165 | |
| mir-193b~365-1 | Human: BX108536 | |
| hsa-mir-200c~141 | Human: AI969882, AI695443, AA863395, BM855863.1, AA863389 | |
| mir-374a~545 | Human: DA685273, AL698517, DA246751, DA755860, CF994086, DA932670, DA182706 | Human: AK057701 |
Figure 2Predicted pri-miRNAs, their lengths, and features that support the pri-miRNA prediction. The presence and absence of a feature is shown by colored and open boxes respectively.
Figure 3Annotation of pri-let-7i. (a) Transcription features mapped in the flanking regions surrounding let-7i in human; (b) Predicted promoter and insulator associated regulatory regions upstream of pri-let-7i and the corresponding tiling array and UCSC conservation scores; (c) RNAalifold-predicted secondary structure of pri-let-7i. The pairs conserved in human, mouse and rat are marked in red. The hairpin precursor and the mature sequence are boxed. The Drosha cleavage sites are marked by a dashed black arrow. The stem segments and basal single-stranded regions immediately flanking the pre-miRNA are highlighted in blue.
Figure 4Consensus display of transcription features mapped in the flanking regions surrounding the cluster mir-23a~27a~24-2 in human, mouse and rat. The distances are relative to the 5' end of the miR-23a precursor. The human, mouse and rat mir-23a~27a~24-2 features are shown in green, red and blue respectively.
Figure 5Transcription features mapped in the flanking regions surrounding mir-124-1 in human, mouse and rat. The lower panel shows the promoter region and UCSC conservation scores along the length of predicted pri-hsa-mir-124-1.
Figure 6Transcription features mapped in the flanking regions surrounding the cluster mir-15a~16-1 in human. The lower panel shows the promoter, CTCF binding regions upstream of predicted TSS and the corresponding UCSC conservation scores.
Figure 7Transcription features mapped in the flanking regions surrounding hsa-mir-550-2.
Figure 8Graphical display of transcription features mapped in the flanking regions surrounding miRNAs of Group IV; (a) hsa-mir-572 and (b) mmu-mir-715.
Figure 9Distribution of lengths of predicted pri-miRNAs of human, mouse and rat. The blue curve shows the fraction of human miRNAs clustered as a function of inter-miRNA distances. The red curve shows the distribution of inter-sequence distances of an equal number of randomly generated genomic positions.
Figure 10Conservation profiles of predicted pri-miRNAs. The predicted pri-miRNA is shown in red, the hairpin precursor sequence in green, and the 1 kb region upstream of the predicted 5' end of pri-miRNA in blue.