| Literature DB >> 21296742 |
Mathew W Wright1, Elspeth A Bruford.
Abstract
Previously, the majority of the human genome was thought to be 'junk' DNA with no functional purpose. Over the past decade, the field of RNA research has rapidly expanded, with a concomitant increase in the number of non-protein coding RNA (ncRNA) genes identified in this 'junk'. Many of the encoded ncRNAs have already been shown to be essential for a variety of vital functions, and this wealth of annotated human ncRNAs requires standardised naming in order to aid effective communication. The HUGO Gene Nomenclature Committee (HGNC) is the only organisation authorised to assign standardised nomenclature to human genes. Of the 30,000 approved gene symbols currently listed in the HGNC database (http://www.genenames.org/search), the majority represent protein-coding genes; however, they also include pseudogenes, phenotypic loci and some genomic features. In recent years the list has also increased to include almost 3,000 named human ncRNA genes. HGNC is actively engaging with the RNA research community in order to provide unique symbols and names for each sequence that encodes an ncRNA. Most of the classical small ncRNA genes have now been provided with a unique nomenclature, and work on naming the long (>200 nucleotides) non-coding RNAs (lncRNAs) is ongoing.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21296742 PMCID: PMC3051107 DOI: 10.1186/1479-7364-5-2-90
Source DB: PubMed Journal: Hum Genomics ISSN: 1473-9542 Impact factor: 4.639
A summary of the current nomenclature for human non-protein coding RNA genes
| Type of RNA | HGNC stem symbol | Name format |
|---|---|---|
| miRNA | microRNA # | |
| tRNA -- genomic | transfer RNA 'single letter amino acid code' # (anticodon XXX) | |
| -- mitochondrial | mitochondrially encoded tRNA 'single letter amino acid code' # | |
| rRNA -- 5S | RNA, 5S ribosomal # | |
| -- 5.8S | RNA, 5.8S ribosomal # | |
| -- 18S | RNA, 18S ribosomal # | |
| -- 28S | RNA, 28S ribosomal # | |
| Spliceosomal -- U1 | RNA, U1 small nuclear # | |
| -- U2 | RNA, U2 small nuclear # | |
| -- U4 | RNA, U4 small nuclear # | |
| -- U5 | RNA, U5 small nuclear # | |
| -- U6 | RNA, U6 small nuclear # | |
| -- U4atac | RNA, U4atac small nuclear # | |
| -- U6atac | RNA, U6atac small nuclear # | |
| -- U11 | RNA, U11 small nuclear | |
| -- U12 | RNA, U12 small nuclear # | |
| snoRNA -- H/ACA box | small nucleolar RNA, H/ACA box # | |
| -- C/D box | small nucleolar RNA, C/D box # | |
| -- Cajal body specific | small cajal body-specific RNA # | |
| piRNA -- cluster | piwi-interacting RNA cluster # | |
| -- individual | piwi-interacting RNA # | |
| RNase -- MRP | RNA component of mitochondrial RNA processing endoribonuclease | |
| -- P | ribonuclease P RNA component H1 | |
| U7 | RNA, U7 small nuclear # | |
| Vault | vault RNA # | |
| 7SK | RNA, 7SK small nuclear | |
| Y | RNA, Ro-associated Y # | |
| SRP/7SL | RNA, 7SL, cytoplasmic # | |
| Telomerase | telomerase RNA component | |
| lncRNA -- known function | Function-based name(eg | eg X (inactive)-specific transcript (non-protein coding) |
| -- antisense | '-AS' suffix* (eg | eg BOK antisense RNA #1 (non-protein coding) |
| -- intronic | '- IT' suffix* (eg | eg MAGI2-IT1 intronic transcript #1 (non-protein coding) |
| -- host gene of small ncRNA | 'HG' suffix (eg | eg small nucleolar RNA host gene 1 (non-protein coding) |
| -- intergenic | long intergenic non-protein coding RNA # |
*to a protein-coding gene symbol