| Literature DB >> 17991681 |
Sam Griffiths-Jones1, Harpreet Kaur Saini, Stijn van Dongen, Anton J Enright.
Abstract
miRBase is the central online repository for microRNA (miRNA) nomenclature, sequence data, annotation and target prediction. The current release (10.0) contains 5071 miRNA loci from 58 species, expressing 5922 distinct mature miRNA sequences: a growth of over 2000 sequences in the past 2 years. miRBase provides a range of data to facilitate studies of miRNA genomics: all miRNAs are mapped to their genomic coordinates. Clusters of miRNA sequences in the genome are highlighted, and can be defined and retrieved with any inter-miRNA distance. The overlap of miRNA sequences with annotated transcripts, both protein- and non-coding, are described. Finally, graphical views of the locations of a wide range of genomic features in model organisms allow for the first time the prediction of the likely boundaries of many miRNA primary transcripts. miRBase is available at http://microrna.sanger.ac.uk/.Entities:
Mesh:
Substances:
Year: 2007 PMID: 17991681 PMCID: PMC2238936 DOI: 10.1093/nar/gkm952
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
The number of published hairpin precursor and mature miRNA sequences in selected model organisms
| Hairpin precursor loci | Mature miR sequences | ||||
|---|---|---|---|---|---|
| Total number | Clustered ≤10 kb from another miRNA | Overlap annotated transcripts | Distinct forms | Experimentally verified | |
| 533 | 190 (36%) | 267 (50%) | 555 | 546 (98%) | |
| 442 | 199 (45%) | 174 (39%) | 461 | 455 (99%) | |
| 337 | 151 (34%) | 41 (12%) | 193 | 183 (95%) | |
| 135 | 34 (25%) | 23 (17%) | 135 | 135 (100%) | |
| 93 | 34 (36%) | 36 (39%) | 88 | 85 (97%) | |
| 184 | 19 (10%) | 16 (9%) | 199 | 199 (100%) | |
| 215 | 42 (20%) | 9 (4%) | 215 | 55 (26%) | |
amiR* sequences are excluded from the mature miRNA count.
Figure 1.miRBase view of the distribution of genomic features around mmu-mir-135b on mouse chromosome 1, showing TSS, CpG island, EST, cDNA, DITAG (172B221 and 172B22) and polyA site support for a 15 kb primary transcript.