| Literature DB >> 12429065 |
Andrea N Ladd1, Thomas A Cooper.
Abstract
Alternative splicing of pre-mRNAs is central to the generation of diversity from the relatively small number of genes in metazoan genomes. Auxiliary cis elements and trans-acting factors are required for the recognition of constitutive and alternatively spliced exons and their inclusion in pre-mRNA. Here, we discuss the regulatory elements that direct alternative splicing and how genome-wide analyses can aid in their identification.Entities:
Mesh:
Substances:
Year: 2002 PMID: 12429065 PMCID: PMC244920 DOI: 10.1186/gb-2002-3-11-reviews0008
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Figure 1Functionally significant examples of different types of alternative splicing. (a) Alternative inclusion of a cassette exon is very common. Neuron-specific inclusion of the N1 exon in the c-src proto-oncogene generates an insertion in the SH3 protein-protein interaction domain that alters its binding to other proteins [34]. (b) Alternative exons may be mutually exclusive, such as exons IIIb and IIIc in the fibroblast growth factor receptor 2 (FGFR-2) gene. Use of IIIb produces a receptor with high affinity for keratinocyte growth factor (KGF), whereas use of IIIc produces a high-affinity FGF receptor. Loss of the IIIb isoform is thought to be important in prostate cancer [35]. (c) The choice of an alternative 5' splice site in the Wilms' Tumor suppressor gene Wt1 results in the insertion of the three amino acids lysine, threonine, and serine (KTS). The +KTS and -KTS forms play distinct roles in kidney and gonad formation, and shift of the balance toward the -KTS form is associated with Frasier syndrome [36]. (d) In the transformer (tra) gene in Drosophila, selection of a female-specific alternative 3' splice site produces a single long open reading frame that gives rise to a regulatory protein that controls female somatic sexual differentiation. In male flies, tra mRNAs lack a long open reading frame, and no protein is made [37]. (e) Alternative terminal exons in the gene encoding calcitonin and calcitonin-gene-related peptide (CGRP) give rise to a hormone involved in calcium homeostasis in the thyroid gland, or a neuropeptide involved in vasodilation in the nervous system [38]. (f) Alternative promoter usage in the myosin light chain (MLC) gene leads to different first exons, which pair with mutually exclusive downstream exons to give rise to distinct protein isoforms, namely MLC1 and MLC3 [39]. This type of alternative splicing pattern results primarily from transcriptional regulation, not from the regulation of splice-site choice per se. (g) Intron retention is one of the rarest forms of alternative splicing in humans. Retention of intron 2 in the human muscle-specific chloride channel 1 (ClC-1) mRNA in myotonic dystrophy (DM) patients introduces a premature stop codon and leads to downregulation of ClC-1 expression, contributing to problems in muscle relaxation (myotonia) [2].
Useful databases for identifying alternatively spliced RNAs and regulatory elementsof alternative splicing data
| Name | Description | URL | Published reference |
| Alternative Splicing database | Alternative exon database compiled from the literature | [ | |
| Alternative splicing databases | Several alternative splicing databases, including a database of splice variants of disease genes, a complete splice-site database, and a database of alternative splice forms for seven organisms | ||
| AltExtron | Transcript-confirmed human introns and exons; includes alternatively spliced data subsets | [ | |
| ASAP | Human alternative splicing database; part of the Alternative Splicing Annotation Project | [ | |
| ASDB | Alternatively spliced gene database; includes protein and nucleotide sequences for human, mouse, rat, | [ | |
| AsMamDB | Alternatively spliced mammalian genes; includes human, mouse, and rat | [ | |
| Gene Resource Locator | Gene map database; includes information on alternatively spliced transcripts | [ | |
| Intronerator | Introns in | [ | |
| ISIS | All introns identified in GenBank | [ | |
| PALS db | Putative alternative splicing predicted by EST alignments for mouse and human | [ | |
| SELEX-DB | [ | ||
| SpliceDB | Mammalian splice sites | [ | |
| SpliceNest | A database that maps GeneNest onto human genomic sequence and is integrated with GeneNest (EST clusters for human, mouse, zebrafish and | [ | |
| STACK | Putative human transcripts reconstructed from ESTs; includes the context of different tissues or pathological states | [ | |
| Yeast Intron Database | Introns in | [ |
Figure 2Typical features of alternative exons. Alternative exons are on average less than half the size of constitutive exons and have weak 5' and/or 3' splice sites. Auxiliary elements aid or prevent the recognition of these exons by binding trans-acting factors in different cellular contexts, and how often an exon is included in the mRNA depends on a balance between positive and negative regulation. Enhancer (+) and silencer (-) elements can be found within the alternative exon (yellow box in the center) or the flanking introns (lines). Splicing decisions are controlled by multiple elements, and for a given exon these can be different elements, multiple copies of the same element located at different sites, or a combination of the two (as indicated by the non-yellow colored boxes). Different alternative exons are regulated by different sets of auxiliary elements, but alternative exons that are regulated by the same trans-acting factors have some common elements. Intronic elements can be distal, but are more often located in the introns adjacent to the alternative exon (near the exon-intron boundary), and in some cases can overlap with, or be contained within, the consensus splice site sequences that are recognized by the basal spliceosomal machinery.
Known auxiliary intronic elements that regulate alternative splicing
| Exon or intron that is regulated | Type of splice site choice | Specificity of splicing* | Type of element (name) | Sequence† | Location of element (nucleotide number relative to alternative exon) | Associated | Reference |
| 4.1R exon 16 (human) | Cassette | Developmental stage-specific | Enhancer | AUUUAUUuuuAAAUAUuuuUUGGUUUGCCAAUUUUCAGUuuuUCCCG | Upstream (-106 to -59) | Unknown | [ |
| 4.1R exon 16 (human) | Cassette | Developmental stage-specific | Enhancer | UGUAUgaAcUUGAagcUUAUUUCAgUuGgUUGCCUGgAacCUUCUGCAUUcuUugCUGAUCCCCUUUUCUUCAUUCUgUgcugCAUUUGGUUUugCaUGCAAUUgCauGAAGGGACUUuAGgUuUAAAAUGCuUuUGcAUGUU | Downstream (+4 to 146) | Unknown | [ |
| α-actinin NM/SM exons (rat) | Mutually exclusive exons | Smooth muscle-specific | Silencer | Not defined | Upstream of SM exon | PTB‡ | [ |
| α-actinin NM/SM exons (rat) | Mutually exclusive exons | Smooth muscle-specific | Enhancer (BP-URE) | CCU | Upstream of NM exon | [ | |
| Agrin Y exon (exon 28, mouse) | Cassette | Neuron-specific | Enhancer | uuaccCAACUcaacucaCC | Downstream (+15 to 33) | Unknown | [ |
| Agrin Y exon (exon 28, mouse) | Cassette | Neuron-specific | Enhancer | aucuuuuguAGCCAUUCCuagagAGCccuuuccc | Downstream (+34 to 68) | Unknown | [ |
| AMP deaminase 1 exon 2 (human) | Cassette | Developmental Stage-specific | Enhancer (ExRE) | GCCCAGGCUGGAGUGCAGUA | Downstream (about +1.7 kb) | Unknown | [ |
| AMP deaminase 1 exon 2 (human) | Cassette | Developmental stage-specific | Enhancer (ExRE) | CCAUCGAAUGCAUUUACUU | Downstream (about +3 kb) | Unknown | [ |
| Amyloid precursor protein exon 8 (human) | Cassette | Neuron-specific | Enhancer | AUGUUU | Upstream (-47 to -42) | Unknown | [ |
| Amyloid precursor protein exon 8 (human) | Cassette | Neuron-specific | Silencer | UUU | Upstream (-38 to -36) | Unknown | [ |
| Calcitonin/CGRP exon 4 (human) | Cassette (contains alternative poly(A) site) | Cell type-specific | Enhancer (rep1) | CCAAGGGAAAgcaugGGUUAACCUA | Upstream (-254 to -230) | Unknown | [ |
| Calcitonin/CGRP exon 4 (human) | Cassette (contains alternative poly(A) site) | Cell type-specific | Enhancer (rep2) | CAGCCCUGGugcaugGCACUGCCUC | Upstream (-34 to -10) | Unknown | [ |
| Calcitonin/ CGRP exon 4 (human) | Cassette (contains alternative poly-A site) | Cell type-specific | Enhancer | Downstream (+204 to 238) | , [ | ||
| Cardiac troponin T exon 5 (chicken) | Cassette | Embryonic, muscle-specific | Silencer (MSE1) | Upstream (-36 to -1) | [ | ||
| Cardiac troponin T exon 5 (chicken) | Cassette | Embryonic, muscle-specific | Enhancer, Silencer (MSE2) | ACUGCAC | Downstream (+7 to +46) | [ | |
| Cardiac troponin T exon 5 (chicken) | Cassette | Embryonic, muscle-specific | Enhancer (MSE3) | UCGCGGGUCGG | Downstream (+53 to 87) | [ | |
| Cardiac troponin T exon 5 (chicken) | Cassette | Embryonic muscle-specific | Silencer (MSE4) | CGC | Downstream (+81 to 117) | [ | |
| Caspase-2 exon 9 (mouse) | Cassette | Brain and skeletal muscle-specific; associated with apoptosis | Silencer (In50 up) | GCUuauuaacUGCAAUGUAcuuuuuauuguuuuucauuccaGUUAAGG | Downstream (+141 to 188) | U2 snRNP | [ |
| Caspase-2 exon 9 (mouse) | Cassette | Brain and skeletal muscle-specific; associated with apoptosis | Silencer (In50dn) | AUUUCUGCUUGACUCUUCCAAAUCUUCC | Downstream (+189 to 244) | [ | |
| Chloride channel ClC-1 intron 2 (human) | Intron retention | Aberrant inclusion in myotonic dystrophy | Enhancer | UCUUGGGUAUAGCACCCAAAGUAAAGUAGUGACUCGUUAGCUGCUUUUCUCUCUCUCUCUCUCUCUCUCUCUCUCUCUCUGUCUCUACAUAUAUAUAUUUU | Upstream of exon 3 (-156 to -1) | [ | |
| Clathrin light chain B exon EN (rat) | Cassette | Neuron-specific | Silencer | CUCUUUCUCUUUAACCCUGUGCCUGCCUGUCUUGCUGUCUGUCUUCCCCCACCUAACUCCUUCUCCUAACGGUUUUCCUCAAG | Upstream (-85 to -1) | PTB‡, CUG-BP | [ |
| c- | Cassette | Neuron-specific | Silencer | GGGCCCUGUCUUCGCACCUCAGCCUCUCCUU | Upstream (-60 to -3) | [ | |
| c- | Cassette | Neuron-specific | Enhancer, silencer (DCS) | UGAGGCU | Downstream (+37 to 70) | [ | |
| Cystic fibrosis transmembrane regulator exon 9 (human) | Cassette | Aberrantly skipped in cystic fibrosis | Silencer (ISS) | Not published | Downstream (+117 to 264) | SR proteins | [ |
| Dopa decarboxylase exon B ( | Cassette | Tissue-specific | Silencer | Not published | Upstream (+84 to 528) | Unknown | [ |
| DMPK exon 16 (human) | Alternative terminal exon | Developmental stage-specific; skipped in DM | Enhancer | (CUG)n | Upstream (about 35 nucleotides) | Unknown | [ |
| Fibroblast growth factor receptor 1 α exon (human) | Cassette | Skipped in glial cell tumors | Silencer (ISS-1) | AAugcugcUACAG | Upstream (-242 to -281) | [ | |
| Fibroblast growth factor receptor 1 α exon (human) | Cassette | Skipped in glial cell tumors | Silencer (ISS-2) | AUCUGCCCCCACUCUGCUUCAGAAACugcugcCCACUAACAUUGCUCCCugccugcCGCGUG | Downstream (+98 to 159) | Unknown | [ |
| Fibroblast growth factor receptor 2 BEK/K-SAM (IIIb/IIIc) exons (human) | Mutually exclusive exons | Cell-type specific | Enhancer (IAS1) | CUUUCAUUUUUGUCUUUUUUU | Downstream of K-SAM (+10 to 29) | TIA-1‡ | [ |
| Fibroblast growth factor receptor 2 BEK/K-SAM (IIIb/IIIc) exons (human) | Mutually exclusive exons | Cell-type specific | Enhancer (IAS2) | CCAUGGAAAAAUGCC CACAAC | Downstream of K-SAM (+168 to 188) | Unknown | [ |
| Fibroblast growth factor receptor 2 BEK/K-SAM (IIIb/IIIc) exons (human) | Mutually exclusive exons | Cell-type specific | Enhancer (IAS3) | UGUGGGUUGAUUUUUUCCAUGCGUUUGAUUGCGUGCAUGUGUAGGAGGUGAAGCCGGUGU | Downstream of K-SAM (+963 to 1021) | Unknown | [ |
| Fibroblast growth factor receptor 2 IIIb/IIIc exons (rat) | Mutually exclusive exons | Cell-type specific | Enhancer (IIIb), silencer (IIIc) (ISAR) | CAAACAAAUUCAAAGAGAACGGACUCUGUgggcugauuuuuccauguGUUCAAUCGC | Downstream of IIIb (+915 to 978) | Unknown | [ |
| Fibroblast growth factor receptor 2 IIIb/IIIc exons (rat) | Mutually exclusive exons | Cell-type specific | Silencer (ISS1) | CUCAUUGUGA | Upstream of IIIb (-143 to -112) | [ | |
| Fibroblast growth factor receptor 2 IIIb/IIIc exons (rat) | Mutually exclusive exons | Cell-type specific | Silencer (ISS2) | UGGUGGGACCAUAGGCAGCAC | Upstream of IIIb | Unknown | [ |
| Fibronectin exon EIIIB (rat) | Cassette | Cell-type specific | Enhancer (ICR) | UCUUAAAGGUUCUCUGCCCugcaugGGAAGAAACAUUGCUGAGAACCACUGugcaugAACCCCCUCACUUGUGAUACGAGUUCACUGAAUGACAUUACGGCAAUGCAGUAGUGUGUAGAUAC | Downstream (+519 to 640) | Unknown | [ |
| GABAA receptor subunit γ2 24 nucleotide exon (rat) | Cassette | Neuron-specific | Silencer (rs1) | UGUUUCUCUUUCUCUCCUUU | Upstream (-89 to -70) | PTB‡ | [ |
| GABAA receptor subunit γ2 24 nucleotide exon (rat) | Cassette | Neuron-specific | Silencer (rs2) | CCUUUUCCUUCUUCUUAUU | Upstream (-69 to -51) | PTB‡ | [ |
| GABAA receptor subunit γ2 24 nucleotide exon (rat) | Cassette | Neuron-specific | Silencer (rs3) | GCAAUUCUCUUUUCUGUCU | Upstream (-31 to -13) | PTB‡ | [ |
| GABAA receptor subunit γ2 24 nucleotide exon (rat) | Cassette | Neuron-specific | Enhancer | ACAAAUCCA | Upstream (-12 to -4) | Unknown | [ |
| Spliced versus unspliced | Infection stage-specific | Silencer (NRS) | CCGGAGUG | Downstream of 5 splice site (about 300 nucleotides) | [ | ||
| Glycine receptor α2 exon 3A (human) | Cassette | Neuron-specific | Enhancer | Upstream (about 80 nucleotides) | [ | ||
| hnRNP A1 exon 7B (mouse) | Cassette | Auto-regulated | Enhancer (CE1) | UAAGGAGUCUGUAAGUAAUGGUUUCUGGAAAACCUG | Upstream (-194 to -86) | [ | |
| hnRNP A1 exon 7B (mouse) | Cassette | Auto-regulated | Enhancer (CE4) | AGCUAGAUUAGACUU CUAGAGUUU | Downstream (+35 to 58) | hnRNP A1‡ | [ |
| hnRNP A1 exon 7B (mouse) | Cassette | Auto-regulated | Silencer (CE6IO) | CCUGCUCUGCUGUGCUACCUCCUCCUGGCUUUAAGCUGGGGCCGCCUCCCCAAAAUAAGUAGGUGAAUGAGUGGUUA | Downstream (+128 to 204) | Unknown | [ |
| hnRNP A1 exon 7B (mouse) | Cassette | Auto-regulated | Silencer (CE9) | CUGGAUUAUUAACUGAAUGCCUCACUCAGAGAAUGAA | Downstream (+372 to 418) | Unknown | [ |
| Major late region 1 (L1) 52,55K/IIIa (adenovirus) | Alternative 3' splice sites | Infection stage-specific | Silencer | GCGUGGAGGAAUAUGACGAGGACGAUGAGUACGAGCCAGAGGACGGCGA | Upstream of IIIa splice site (-76 to -28) | SR proteins | [ |
| Major late region 1 (L1) 52,55K/IIIa (adenovirus) | Alternative 3' splice sites | Infection stage-specific | Enhancer | AGUACUAAGCggugauguuucugaucag | Upstream of IIIa splice site (-28 to -1) | Unknown | [ |
| MVM NS2-specific exon and small intron (parvovirus) | Cassette, alternative 5' and 3' splice sites | Infection stage-specific | Enhancer (IES) | GUUUAAGggAUgGUUgGUUgGUGgggU | Downstream of NS2 exon, small intron D2 5 splice site | Unknown | [ |
| Myosin heavy chain exon 11e ( | Mutually exclusive exons 11a-e | Muscle type-specific | Silencer (CIE1) | AUGUACCAUGUACC | Downstream of exon 11e | Unknown | [ |
| Myosin heavy chain exon 11e ( | Mutually exclusive exons 11a-e | Muscle type-specific | Enhancer (CIE2) | AGUGCUGUGU | Downstream of exon 11e | Unknown | [ |
| Myosin heavy chain exon 11e ( | Mutually exclusive exons 11a-e | Muscle type-specific | Enhancer (CIE3) | AUCCAUCCCUCUAUCUAUCCCAGUUCAACCUGUCCAAAAGUGUUUUUGAAGAACCGCUUAAGCAUAAGCAAAAAAA | Upstream of exon 12 (about -200 nucleotides) | Unknown | [ |
| NCAM MSDb exon (mouse) | Cassette | Muscle-specific, developmental stage-specific | Silencer (DSRE) | GACCUCACUGACCCAGCUGGGCCUUUGUCACAUUGGGGGAGCCUGGUUCUUUUCCUCUCAUUCUCUUAACUGUCUAUACUUUAGUUCUUUGAAUCUGGAGGAGUGGAACAGGUCCACUCUUUGGAAACUUGAACCUGGCUUUCUAGU | Downstream (+626 to 771) | Unknown | [ |
| NMDA exon 5 (rat) | Cassette | Neuron-specific | Silencer | GCUUUAGCGCCGUCAUUUUCAACCGUUUAUAAUCUUCUUCUGUGUCUGCAUAUUUUCUCUGUGCACAUUAUUCAUCAG | Upstream (-78 to -1) | PTB‡, CUG-BP | [ |
| Non-muscle myosin II heavy chain B exon N30 (human) | Cassette | Neuron-specific | Enhancer (IDDE) | GAUUugcaugucguacugcauguGUCCACUGUGCACACAAUC | Downstream (+1.5 kb) | Unknown | [ |
| Nova-1 exon H (mouse) | Cassette | Neuron-specific | Enhancer | GCCAAUCAGGUACAGUA | Downstream (+8 to 62) | [ | |
| Cassette | Male-specific | Silencer | UUUUUUUU | Upstream (-6 to -13) | Sxl‡ | [ | |
| Slo K+ channel STREX exon (mouse) | Cassette | Ca2+-dependent in neurons | Silencer (CaRRE) | CCUUGCCAUUAACCGCGcUcUUCcUcUCCUCCCAUCCACCACAUGGUUAUAG | Upstream (-53 to -1) | Unknown | [ |
| Intron retention | Variable | Silencer (ISS) | UAGUGAAUAGAGUUAGGCAGGGA | Upstream of SA7 splice site (-44 to -22) | hnRNP A1‡ | [ | |
| Cassette | Aberrantly spliced in disease | Silencer (ISS) | UCACACGU | Downstream (+11 to 18) | Unknown | [ | |
| Cassette | Aberrantly spliced in disease | Enhancer (ISM) | CCCAUGCG | Downstream (+19 to 26) | Unknown | [ | |
| Thyroid hormone receptor exon 9 (rat) | Alternative 5' splice site | Tissue and developmental stage-specific | Enhancer (SEα2) | GGCGGCCAGAgggUGUGCggagcugguggggaggagCUUggagagaagggaCAAAGCUgggggCUgagggagaaCCCCCA | Downstream (+130 to 210) | SF2/ASF‡, hnRNP H‡ | [ |
| Alternative 3' splice site | Sex-specific | Enhancer | Not published | Downstream of non-sex-specific 3 splice site (about 40 nucleotides) | Unknown | [ | |
| Alternative 3' splice site | Sex-specific | Silencer of non-sex-specific 3 splice site | UUUUUUUU | Upstream of non-sex-specific 3 splice site (-4 to -11) | Sxl‡, U2 snRNP‡ | [ | |
| α-tropomyosin exons 2/3 (rat) | Mutually exclusive exons | Smooth muscle-specific | Silencer (URE) | aucacgcugccugcugcaCCCCACCCCCUUCCCCCUUCCUUCCCCCCACCCCCGUACUCCACUGCCAACUCCC | Upstream of exon 3 (-75 to -1) | Unknown | [ |
| α-tropomyosin exons 2/3 (rat) | Mutually exclusive exons | Smooth muscle-specific | Silencer (DRE) | CUGGAUGCCGCCUCUGCUGCUGCGCACAUUUCAUUUAUAUUCUGUCCUUUCCCC | Downstream of exon 3 (+142 to 231) | [ | |
| β-tropomyosin exons 6A/6B (chicken) | Mutually exclusive exons | Skeletal muscle-specific | Enhancer (S4) | CCCCUCUCUCUAUCGCUGUCUCUUGAGCCACGC | Downstream of exon 6A (+37 to 69) | ASF/SF2, SC35, PTB, hnRNP K | [ |
| β-tropomyosin exons 6A/6B (chicken) | Mutually exclusive exons | Skeletal muscle-specific | Enhancer (IVS B7) | GCU | Downstream of exon 6B (+19 to 76) | [ | |
| β-tropomyosin exons 6/7 (rat) | Mutually exclusive exons | Striated muscle-specific | Silencer (IRE) | AACCCCACCCCCUCACCCCGUCGUCGCGCCACCCCACUGUCUCACCUCACUGUGCCCUCACGCUCCAUCCUGCCACACGCCCCUGCAG | Upstream of exon 7 (-88 to -1) | PTB‡, Sam68, FBP | [ |
*Elements listed may not have been shown to regulate the specificity of alternative splicing. † Sequences that have been shown to be important for element function but that have not yet been demonstrated to bind to specific trans-acting factors are shown in lower-case letters. ‡ Indicates that the factor has been shown to bind directly to the element (for example, by ultraviolet-induced crosslinking) and has had binding confirmed (for example, by mutational analysis). Specific binding sequences for trans-actingfactors are indicated in bold, italics or underlined when known.