| Literature DB >> 20211845 |
Abstract
Initial gene discovery efforts through analysis of genome sequences and identification and characterization of expressed RNAs have revealed that only a relatively small portion of the genome is transcribed into protein coding mRNAs in vertebrates. However, in contrast with this paucity of protein coding 'genes', there is an enormous complexity in transcription and the protein coding mRNAs contribute to a very small fraction of transcripts in comparison with the different varieties of non-coding RNAs (ncRNAs). This transcriptome complexity may be hypothesized to have a regulatory role that is required for the development and function of organisms as complex as vertebrates. At the same time, it raises the fundamental question of the unequivocal definition of a gene. It is intriguing to postulate that many ncRNAs might finely modulate gene activity by acting as regulatory elements. The emerging hypotheses suggest that the gene regulatory machinery may be deeply interconnected with the world of short RNAs. These RNAs may generally act for fine-tuning of the protein-coding transcriptome.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20211845 PMCID: PMC2853385 DOI: 10.1093/dnares/dsq006
Source DB: PubMed Journal: DNA Res ISSN: 1340-2838 Impact factor: 4.458
Definition of RNAs classes discussed in this review
| Short name of RNA classes | Full name of RNA classes | Notes | References |
|---|---|---|---|
| PALRs | Promoter-associated long RNAs | Hundreds nt long RNAs spanning regions on proximal promoters to the first exon | |
| PASRs | Promoter-associated short RNAs | 20–70 nt long RNAs spanning regions around core promoters | |
| TASRs | Termini-associated short RNAs | 20–70 nt long RNAs spanning regions around transcription termination sites | |
| PROMPTs | Promoter upstream transcripts | Unstable transcripts mapping 0.5–2 kb upstream the transcription starting sites | |
| TSSa-RNAs | Transcription start sites antisense RNAs | RNAs, generally short and non-coding, generated from bidirectional activity of mammalian RNA Polymerase II | |
| NRO-RNAs | Nuclear run-on assay derived RNAs | Short RNA detected by nuclear run-on assays, mapping 20 to 50 downstream to transcriptions starting sites of mRNAs | |
| RE RNAs | Retrotransposon-derived RNAs | A heterogeneous class of RNAs which starting sites overlap retrotransposon elements | |
| tiRNAs | Tiny transcription initiation RNAs | RNAs about 18 nt long, positioned about 20 bp after the transcription starting sites of highly expressed mRNAs | |
| snoRNAs | Small nucleolar RNAs | Small ncRNAs that guide chemical modifications of other non-coding RNAs | |
| siRNAs | Small interfering RNAs | Double-stranded RNA molecules, 20–25 nucleotides in length, that act in various silencing pathways | |
| miRNAs | microRNAs | Single-stranded RNA molecules of 21–24 nucleotides in length, which regulate gene expression | |
| LincRNAs | Large intervening non-coding RNAs | Large non-coding RNAs that map in intergenic locations | |
| ncRNAs | Non-coding RNAs | Generic definition for non-protein coding RNAs | |
| sRNA | Short RNAs | Generic definition for short RNAs | |
| snRNA | Small nuclear RNAs | Nuclear small non-coding RNAs involved in various functions including splicing | |
| piRNA | Piwi interacting RNAs | 26–31 nt long RNAs involved in transcriptional gene silencing, including retrotransposons |
Figure 1Complexity of transcriptome around a hypothetical gene. Green boxes represent exons of typical protein coding gene. Not all the sRNAs and genomic elements are in scale. CAGE tags (red lines; dots beyond the arrowheads indicate than the 3′ ends are unknown) identify TSSs or other capped molecules; the dashed lines on CAGE tags indicate that the 3′ end is not determined. Various types of transcripts are indicated by numbers: (1) protein coding mRNA transcript gene (green boxes: coding exons, gray lines, 5′ and 3′ untranslated regions); (2, a–c), antisense RNAs in various relation with the transcripts (3′–3′ overlap, full overlap, 5′–5′ overlap); (3) CAGE tags identify transcript in the 3′-UTRs, likely polyadenylated; (4) termination-associated sRNAs (TASRs); (5) exonic long-capped transcripts; (6) CAGE tags identifying TSS (exact location can vary) and may overlap PALRs; (7) PASRs (green) and tiny 18 nt long RNAs (tiRNAs, arrowhead only); (8) antisense transcription events detected by CAGE, often (but not limited to) the first exons and introns; (9) bidirectionally transcribed RNAs from core promoters; (10) ncRNA splicing isoforms only partially overlapping to coding mRNA sequences; (11) PALRs; (12) PROMPTs, unstable transcripts on upstream regulatory regions; (13) miRNAs and endogenous siRNAs (deriving mostly from other loci, not perfectly matching in most cases in animal cells); (14) other sRNAs associated to exonic-capped long transcripts. The list of different types of RNAs is continuously growing and subject to revisions and further classifications.