| Literature DB >> 15461793 |
Gene Yeo1, Dirk Holste, Gabriel Kreiman, Christopher B Burge.
Abstract
BACKGROUND: Alternative pre-mRNA splicing (AS) is widely used by higher eukaryotes to generate different protein isoforms in specific cell or tissue types. To compare AS events across human tissues, we analyzed the splicing patterns of genomically aligned expressed sequence tags (ESTs) derived from libraries of cDNAs from different tissues.Entities:
Mesh:
Substances:
Year: 2004 PMID: 15461793 PMCID: PMC545594 DOI: 10.1186/gb-2004-5-10-r74
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Figure 1Levels of alternative splicing in 16 human tissues with moderate or high EST sequence coverage. Horizontal bars show the average fraction of alternatively spliced (AS) genes of each splicing type (and estimated standard deviation) for random samplings of 20 ESTs per gene from each gene with ≥ 20 aligned EST sequences derived from a given human tissue. The different splicing types are schematically illustrated in each subplot. (a) Fraction of AS genes containing skipped exons, alternative 3' splice site exons (A3Es) or 5' splice site exons (A5Es), (b) fraction of AS genes containing skipped exons, (c) fraction of AS genes containing A3Es, (d) fraction of AS genes containing A5Es.
Figure 2Examples of tissue-specific AS events in human genes with evidence of splice conservation in orthologous mouse genes. (a) Human fragile X mental retardation syndrome-related (FXR1) gene splicing detected in brain-derived EST sequences. FXR1 exhibited two alternative mRNA isoforms differing by skipping/inclusion of exons E15 and E16. Exclusion of E16 creates a shift in the reading-frame, which is predicted to result in an altered and shorter carboxy terminus. The exon-skipping event is conserved in the mouse ortholog of the human FXR1 gene, and both isoforms were detected in mouse brain-derived ESTs. (b) Human betaine-homocysteine S-methyltransferase (BHMT) gene splicing detected in liver-derived ESTs. BHMT exhibited two alternative isoforms differing by alternative 5' splice site usage in exon E4. Sequence comparisons indicate that the exon and splice site sequences involved in both alternative 5' splice site exon events are conserved in the mouse ortholog of the human BHMT gene. (c) Human cytochrome P450 2C8 (CYP2C8) gene splicing. CYP2C8 exhibited two alternative mRNA isoforms differing in the 3' splice site usage for exon E4 (detected in ESTs derived from several tissues), where the exclusion of a 71-base sequence creates a premature termination codon in exon E4b. Exons and splice sites involved in the AS event are conserved in the mouse ortholog of CYP2C8.
Human genes expressed in the liver with alternative 3' splice site exons (A3Es) or alternative 5' splice site exons (A5Es)
| Splicing type | Ensembl gene ID | Gene name | Exon numbers | Fold-change above median expression, HG-U95A | Fold-change above median expression, MG-U74A |
| A5E;A3E | 091513 | Serotransferrin precursor, | 8, 9; 4 | 100 | 100 |
| A5E;A3E | 115414 | Fibronectin precursor, | 36; 31 | 10 | - |
| A5E;A3E | 117601 | Antithrombin-III precursor, | 5; 4 | 100 | 100 |
| A5E;A3E | 136872 | Fructose-bisphosphate aldolase, | 3, 8; 4 | 100 | 10 |
| A5E;A3E | 140833 | Haptoglobin-related protein precursor, | 3 | 100 | 10 |
| A5E;A3E | 151790 | Tryptophan 2,3-dioxygenase, TDO2 | 3, 5; 4 | 10 | 100 |
| A5E;A3E | 171759 | Phenylalanine-4-hydroxylase, | 6; 4,10 | - | 100 |
| A5E | 047457 | Ceruloplasmin precursor, | 14, 16 | 3 | - |
| A5E | 055957 | Inter-alpha-trypsin inhibitor heavy chain H1 precursor, | 21 | 100 | 10 |
| A5E | 111275 | Aldehyde dehydrogenase, | 12 | 3 | 3 |
| A5E | 132386 | Pigment epithelium-derived factor precursor, | 4 | 10 | 10 |
| A5E | 138356 | Aldehyde oxidase, | 27, 29 | 3 | 3 |
| A5E | 138413 | Isocitrate dehydrogenase, | 3 | 1 | - |
| A5E | 145692 | Betaine-homocysteine | 4 | 10 | 100 |
| A5E | 160868 | Cytochrome P450, | 5 | 10 | 10 |
| A5E | 171766 | Glycine amidinotransferase, | 8 | 3 | 3 |
| A3E | 080618 | Carboxypeptidase, | 10 | - | - |
| A3E | 080824 | Heat shock protein HSP 90-alpha, | 8 | - | - |
| A3E | 096087 | Glutathione | 4, 6 | 10 | 10 |
| A3E | 106927 | Protein precursor, | 5, 9 | 100 | 100 |
| A3E | 110958 | Telomerase-binding protein P23, | 5 | <1 | 1 |
| A3E | 134240 | Hydroxymethylglutaryl-CoA synthase, | 8 | 10 | - |
| A3E | 138115 | Cytochrome P450, | 4 | 100 | 10 |
| A3E | 145192 | Alpha-2-HS-glycoprotein precursor, | 6 | 100 | 100 |
| A3E | 163631 | Serum albumin precursor, | 9 | 100 | 100 |
| A3E | 171557 | Fibrinogen gamma chain precursor, | 4 | 100 | 100 |
| A3E | 174156 | Glutathione S-transferase, | 4, 6 | 10 | 10 |
Examples of human AS genes found to exhibit A3E and/or A5E splicing with both isoforms detected in liver-derived ESTs. AS types are listed in the first column, followed by the last six digits of the Ensembl gene number, the gene name and alternative exon numbers. The last two columns list expression levels in human liver and mouse liver tissues, respectively, expressed in terms of the fold-change relative to the median expression level in other tissues (from the DNA microarray data of [43] and [45], respectively).
Figure 3Correlation of mRNA expression levels of 20 known splicing factors (see Materials and methods) across 26 human tissues (lower diagonal: data from Affymetrix HU-133A DNA microarray experiment [45]; upper diagonal: data from Affymetrix HU-95A DNA microarray experiment [43]). Small squares are colored to represent the extent of the correlation between the mRNA expression patterns of the 20 splicing factor genes in each pair of tissues (see scale at top of figure).
Sequence motifs enriched in skipped exons (SEs) and alternative 5' splice site exons (A5Es)
| AS type /tissue (motif name) | Oligonucleotides | Occurrences | Consensus (% of exons containing) |
| SE/brain (BR1) | 169 | ||
| 323 | |||
| 264 | |||
| 945 | |||
| 137 | |||
| 363 | |||
| 1021 | |||
| 136 | |||
| 375 | |||
| 122 | |||
| 118 | |||
| 108 | |||
| SE/brain (BR2) | 97 | ||
| 411 | |||
| 116 | |||
| SE/brain (BR3) | 324 | ||
| 948 | |||
| 426 | |||
| 171 | |||
| SE/brain (BR4) | 58 | ||
| 176 | |||
| 840 | |||
| SE/brain (BR5) | 925 | ||
| 206 | |||
| 122 | |||
| 102 | |||
| 79 | |||
| 126 | |||
| SE/brain (BR6) | 269 | ||
| 106 | |||
| 96 | |||
| 51 | |||
| 47 | |||
| 54 | |||
| SE/brain (BR7) | 186 | ||
| 63 | |||
| 24 | |||
| SE/testis (TE1) | 99 | ||
| 33 | |||
| Core A5E/liver (LI1) | 42 | ||
| 18 | |||
| Core A5E/liver (LI2) | 29 | ||
| 5 |
Sequence motifs of length four to six bases that are significantly over-represented (p < 0.002) in SEs relative to constitutively spliced exons from brain- or testis-derived ESTs are shown, followed by the number of occurrences in SEs in these tissues. Sequence motifs are grouped/aligned by similarity, and shared tetramers are shown in bold and listed in the last column, followed by the fraction of SEs that contain the given tetramer. Sequence motifs significantly over-represented (p < 0.01) in the core of A5Es from human liver-derived ESTs are shown at the bottom, followed by the number of A5E occurrences and the fraction of A5Es that contain the given tetramer. Statistical significance was evaluated as described in Materials and methods.
Figure 4Computation of splice junction difference ratio (SJD). The SJD value for a pair of transcripts is computed as the number of splice junctions in each transcript that are not represented in the other transcript, divided by the total number of splice junctions in the two transcripts, in both cases considering only those splice junctions that occur in portions of the two transcripts that overlap (see Materials and methods for details). SJD value calculations for different combinations of the transcripts shown in the upper part of the figure are also shown.
Figure 5Comparison of alternative mRNA isoforms across 25 human tissues. (a) Color-coded representation of SJD values between pairs of tissues (see Figure 4 and Materials and methods for definition of SJD). (b) Hierarchical clustering of SJD values using average-linkage clustering. Groups of tissues in clusters with short branch lengths (for example, thyroid/ovary, B-cell/bone) have highly similar patterns of AS. (c) Mean SJD values (versus other 24 tissues) for each tissue.