| Literature DB >> 11897027 |
Flavio Mignone1, Carmela Gissi, Sabino Liuni, Graziano Pesole.
Abstract
Gene expression is finely regulated at the post-transcriptional level. Features of the untranslated regions of mRNAs that control their translation, degradation and localization include stem-loop structures, upstream initiation codons and open reading frames, internal ribosome entry sites and various cis-acting elements that are bound by RNA-binding proteins.Entities:
Mesh:
Substances:
Year: 2002 PMID: 11897027 PMCID: PMC139023 DOI: 10.1186/gb-2002-3-3-reviews0004
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Figure 1The generic structure of a eukaryotic mRNA, illustrating some post-transcriptional regulatory elements that affect gene expression. Abbreviations (from 5' to 3'): UTR, untranslated region; m7G, 7-methyl-guanosine cap; hairpin, hairpin-like secondary structures; uORF, upstream open reading frame; IRES, internal ribosome entry site; CPE, cytoplasmic polyadenylation element; AAUAAA, polyadenylation signal.
Features of complete UTR sequences derived from genomic entries annotated in UTRdb [47,48,50].
| 5' UTR | 3' UTR | |||||||
| Number of sequences | Average length | Maximum length | Minimum length | Number of sequences | Average length | Maximum length | Minimum length | |
| Humans | 1,203 | 210.2 | 2,803 | 18 | 1,247 | 1,027.7 | 8,555 | 21 |
| Other mammals | 142 | 141.3 | 936 | 20 | 148 | 441.1 | 3,324 | 37 |
| Rodents | 638 | 186.3 | 1,786 | 16 | 457 | 607.3 | 3,354 | 19 |
| Aves | 59 | 126.4 | 620 | 17 | 56 | 651.9 | 3,990 | 21 |
| Other vertebrates | 105 | 164.0 | 1,154 | 15 | 111 | 446.5 | 2,858 | 31 |
| Invertebrates | 5,464 | 221.9 | 4,498 | 14 | 3,736 | 444.5 | 9,142 | 15 |
| Liliopsidae | 144 | 129.8 | 715 | 17 | 127 | 273.3 | 1,605 | 22 |
| Other Viridiplantae | 1,471 | 103.0 | 1,355 | 12 | 1,699 | 207.7 | 1,911 | 13 |
| Fungi | 388 | 134.0 | 1,088 | 16 | 326 | 237.1 | 1,142 | 25 |
Figure 2The percentage of complete UTR sequences in the different taxonomic classes that contain (a) introns or (b) upstream AUGs, upstream ORFs or IRES elements. Hum, human; mam, other mammals; rod, rodents; av, Aves; vrt, other vertebrates; lil, Liliopsidae; vir, other plants (Viridiplantae); inv, invertebrates; fun, fungi. Data are taken from UTRdb [47].
Examples of genes with 5' UTRs longer than average and with upstream ORFs and/or repeat elements
| 5' UTR features | |||||
| UTR* | EMBL† | Gene description | Length | Repeats | uORFs |
| 5HSA002333 | M13994 | B-cell leukemia/lymphoma 2 (Bcl-2) proto-oncogene | 1,458 | 2 | 7 |
| 5HSA017553 | X63547 | Tre oncogene | 2,858 | 1 | 10 |
| 5SSC000518 | AJ000928 | c-Myc proto-oncogene ( | 1,330 | 3 | 13 |
| 5HSA016490 | AF074913 | Transcription factor Pax-5 | 1,125 | 0 | 2 |
| 5HSA024311 | AJ297406 | Transcription factor II B-related factor | 1,437 | 2 | 7 |
| 5HSA001903 | AF006822 | Myelin transcription factor 2 (MYT2) | 1,155 | 0 | 8 |
| 5HSA004086 | M62302 | Growth/differentiation factor 1 (GDF-1) | 1,346 | 2 | 1 |
| 5HSA004101 | M22373 | Insulin-like growth factor (IGF-II) | 1,169 | 3 | 0 |
Accession numbers in the *UTRdb [47] and †EMBL databases [52]. Abbreviations: uORFs, upstream ORFs.
Examples of 5' UTR sequences with highly stable stem-loop structures
| UTR | EMBL | Gene description | UTR length | Stem length | ΔG |
| 5hsa007030 | X12949 | Ret | 963 | 129 | -125.4 |
| 5hsa034512 | AF274954 | PNAS-29 | 323 | 95 | -71.7 |
| 5hsa019215 | AF139980 | LW-1 (LW-1) | 716 | 72 | -66.1 |
| 5hsa019416 | AF152961 | Chromatin-specific transcription elongation factor | 291 | 97 | -61.9 |
| 5hsa022262 | S95936 | Transferrin | 79 | 65 | -60.1 |
| 5hsa000763 | U19144 | GAGE-3 protein | 99 | 72 | -54.6 |
| 5hsa022576 | AF116649 | PRO0566 | 2,011 | 72 | -51.6 |
Highly stable structures are defined as those with ΔG ≤ -50 kcal/mol. Free energy was calculated with 'foldrna' program (GCG [53]) on stem-loop elements found with the PatSearch program on human UTRdb [47]. Stem length represents the total number of nucleotides involved in the structure.
5' UTR sequences with experimentally proved IRES elements
| UTRdb* | EMBL† | Gene description | UTRsite | Reference |
| 5HSA004100 | J04513 | Human FGF-2 | Y | [54] |
| 5HSA007092 | X87949 | Human BiP | Y | [55] |
| 5DVI000022 | M95825 | [56] | ||
| 5HSA007484 | M12783 | Human PDGF2/c-sis | Y | [57] |
| 5HSA011699 | AF025841 | Acute myeloid leukemia 1 protein (AML1) | Y | [58] |
| 5HSA000138 | AF013263 | Apoptotic protease activating factor 1 (Apaf-1) | [59] | |
| 5HSA001903 | AF006822 | Myelin transcription factor 2 (MYT2) | [60] | |
| 5CGR000096 | M17169 | Chinese hamster glucose-regulated protein GRP78 | Y | [30] |
| 5BTA000471 | M13440 | Bovine basic fibroblast growth factor (FGF) | Y | [30] |
| 5RNO001555 | M22427 | Rat basic fibroblast growth factor (FGF) | Y | [30] |
| 5HSA015336 | D14838 | Human FGF-9 | [ | |
| 5HSA005291 | M17446 | Human Kaposi's sarcoma oncogene (fibroblast growth factor) | [30] |
The elements recognized by the IRES detection algorithm in UTRsite [47,48,50] are marked 'Y'. Accession numbers in the *UTRdb [47] and †EMBL databases [52].
Genes with experimentally characterized upstream ORFs in their 5' UTR
| Gene | Organism | Number of upstream ORFs | Effects | Reference |
| Mammalian | 1 | Spermidine-dependent translation control | [61] | |
| Yeast | 4 | Starvation-dependent translation/stability regulation | [62] | |
| Human | 3 | Glucose-mediated translation control | [63] | |
| Yeast | 2 | NMD-independent mRNA destabilization | [26] | |
| Yeast | 1 | Weak translation inhibition | [26] | |
| V(1b) Vasopressin receptor | Rat | 5 | Translation inhibition (no destabilization) | [64] |
| Connexin-41 | 3 | Translation inhibition | [65] | |
| Human | 2 | Translation inhibition | [66] |
Functional elements in UTRsite collection annotated in UTRdb entries
| Functional elements | Localization (UTR) | Number of annotated entries* |
| 15-lipoxygenase differentiation control element (15-LOX-DICE) | 3' | 94 |
| Adh mRNA down-regulation element | 3' | 61 |
| Amyloid precursor protein 3' UTR destabilizing element | 3' | 15 |
| Class 2 AU-rich elements (ARE2) | 3' | 70 |
| Bruno-responsive element (BRE) | 3' | 199 |
| Barley yellow dwarf virus (bydv) | 5' and 3' | 6 |
| Cytoplasmic polyadenylation element (CPE) | 3' | 5,186 |
| GLUT1 mRNA-stability control element | 5' and 3' | 66 |
| Histone mRNA 3' UTR stem loop | 3' | 38 |
| Iron responsive element (IRE) | 5' and 3' | 121 |
| Internal ribosome entry site (IRES) | 5' | 7,356 |
| Msl-2 3' UTR control element | 3' | 19 |
| Msl-2 5' UTR control element | 5' | 5 |
| Nanos translation control element | 3' | 1 |
| Ribosomal S12 mRNA translational control element | 5' | 2 |
| Selenocysteine insertion sequence type 1 (SECIS-1) | 3' | 1,773 |
| Selenocysteine insertion sequence type 2 (SECIS-2) | 3' | 355 |
| Tra-2 and GLI element (TGE) | 3' | 81 |
| TNF-α mRNA stability control element | 3' | 8 |
| Terminal oligopyrimidine tract (TOP) | 5' | 272 |
| Upstream ORF | 5' | 71,438 |
| Vimentin mRNA 3' UTR control element | 3' | 6 |
*The number of genes in UTRdb [47] in which the structure is found as of June 2001.