| Literature DB >> 18841251 |
Lan Lin1, Shihao Shen, Anne Tye, James J Cai, Peng Jiang, Beverly L Davidson, Yi Xing.
Abstract
Exonization of Alu elements is a major mechanism for birth of new exons in primate genomes. Prior analyses of expressed sequence tags show that almost all Alu-derived exons are alternatively spliced, and the vast majority of these exons have low transcript inclusion levels. In this work, we provide genomic and experimental evidence for diverse splicing patterns of exonized Alu elements in human tissues. Using Exon array data of 330 Alu-derived exons in 11 human tissues and detailed RT-PCR analyses of 38 exons, we show that some Alu-derived exons are constitutively spliced in a broad range of human tissues, and some display strong tissue-specific switch in their transcript inclusion levels. Most of such exons are derived from ancient Alu elements in the genome. In SEPN1, mutations of which are linked to a form of congenital muscular dystrophy, the muscle-specific inclusion of an Alu-derived exon may be important for regulating SEPN1 activity in muscle. Realtime qPCR analysis of this SEPN1 exon in macaque and chimpanzee tissues indicates human-specific increase in its transcript inclusion level and muscle specificity after the divergence of humans and chimpanzees. Our results imply that some Alu exonization events may have acquired adaptive benefits during the evolution of primate transcriptomes.Entities:
Mesh:
Year: 2008 PMID: 18841251 PMCID: PMC2562518 DOI: 10.1371/journal.pgen.1000225
Source DB: PubMed Journal: PLoS Genet ISSN: 1553-7390 Impact factor: 5.917
The nucleotide substitution rates of three classes of exons during primate evolution.
| Human VS | Exons | Number of nucleotides | ||
| conserved | substituted | % substituted | ||
| Chimpanzee | Alu-derived exons | 59812 | 813 | 1.34 |
| cassette exons | 670739 | 4925 | 0.73 | |
| constitutive exons | 1486944 | 7822 | 0.52 | |
| Orangutan | Alu-derived exons | 51102 | 1957 | 3.69 |
| cassette exons | 620988 | 11452 | 1.81 | |
| constitutive exons | 1371194 | 18245 | 1.31 | |
| Macaque | Alu-derived exons | 37122 | 2801 | 7.02 |
| cassette exons | 619754 | 21563 | 3.36 | |
| constitutive exons | 1428543 | 36227 | 2.47 | |
| Marmoset | Alu-derived exons | 23863 | 3146 | 11.65 |
| cassette exons | 509368 | 31128 | 5.76 | |
| constitutive exons | 1132952 | 50968 | 4.31 | |
RT-PCR analysis of Alu-derived exons whose Exon array probe intensities correlate with overall gene expression levels.
| Gene | Cluster | Probeset | Target exon location | Target exon size (bp) | PCR Skipping (bp) | PCR inclusion (bp) | Alu type | Alu strand/mRNA | Splicing pattern | Impact on mRNA/protein | Gene name | GO processes/known features |
| FAM55C | 2634058 | 2634065 | chr3:102,984,195–102,984,314 | 120 | None | 250 | AluJb | Antisense | Constitutive inclusion | 5′UTR | Family with sequence similarity 55, member C | Unknown |
| NLRP1 | 3742783 | 3742834 | chr17:5,377,348–5,377,428 | 78 | None | 193 | ALuJb | Antisense | Constitutive inclusion | Coding with alternative 3′ splice site | NLR family, pyrin domain containing 1 | ATP binding, caspase activation, apoptosis |
| ZNF611 | 3869714 | 3869736 | chr19:57923795–57923895 | 101 | None | 204 | AluJb | Sense | Constitutive inclusion | 5′UTR | Zinc finger protein 611 | Regulation of transcription |
| ADAL | 3591365 | 3591369 | chr15:41412801–41412924 | 124 | None | 356/456/457 | AluJb | Sense | Constitutive inclusion | 5′UTR | Adenosine deaminase-like | Nucleotide metabolic process, deaminase activity |
| RPP38 | 3236538 | 3236542 | chr10:15184223–15184341 | 119 | None | 255/264 | AluJb | Antisense | Constitutive inclusion, only detected in kidney and testes | 5′UTR | Ribonuclease P/MRP 38 kDa subunit | tRNA processing, hydrolase activity |
| RSPH10B | 3037100 | 3037137 | chr7:5973475–5973594 | 120 | None | 334 | AluJb | Antisense | Constitutive inclusion, only detected in testes | 5′UTR | Radial spoke head 10 homolog B (Chlamydomonas) | Unknown |
| EFCAB5 | 3716259 | 3716293 | chr17:25424227–25424342 | 116 | 197 | 313 | AluJo | Antisense | Alternative major form, no tissue specificity detected | Coding with alternative 3′/stop codon/3′UTR | EF-hand calcium binding domain 5 | Calcium ion binding |
| GOLGA8A | 3617458 | 3617512 | chr15:32470616–32470919 | 142/228/229 | 178 | 320/406/407 | AluSx | Antisense | Alternative major form, no tissue specificity detected | 5′UTR | Golgi autoantigen, golgin subfamily a, 8A | Golgi apparatus protein |
| FLJ42842 | 3727033 | 3727035 | chr17:46771967–46772084 | 118 | 198 | 316 | AluJb | Antisense | Alternative major form, no tissue specificity detected | 3′UTR | Unknown | Unknown |
| ADARB1 | 3924041 | 3924084 | chr21:45428817–45428936 | 120 | 142 | 262 | AluJb | Antisense | Alternative major form, no tissue specificity detected | Coding region, decrease catalytic activity | Adenosine deaminase, RNA-specific, B1 | mRNA processing, adenosine deaminase activity |
| C16orf61/DC13 | 3701384 | 3701391 | chr16:79571876–79571985 | 110 | 152 | 262 | AluJo | Antisense | Alternative medium form, no tissue specificity detected | Exon in a non-coding transcript | Homo sapiens chromosome 16 open reading frame 61 | Unknown |
| SHMT1 | 3748323 | 3748350 | chr17:18204453–18204591 | 139 | 148 | 287 | ALuJb | Antisense | Alternative medium form, no tissue specificity detected | 5′UTR | Serine hydroxymethyltransferase 1 (soluble) | L-serine metabolic process, transferase activity |
| CLEC7A | 3444009 | 3444018 | chr12:10168953–10169039 | 87 | 103/222 | 190/309 | AluJb | Antisense | Alternative medium form, no tissue specificity detected | Coding region with in frame pre-mature termination codon OR 3′ UTR | C-type lectin domain family 7, member A | T-cell activation, inflammatory response, MHC protein binding |
| MIPOL1 | 3532935 | 3532943 | chr14:36758509–36758647 | 139 | 230 | 369/470/609 | AluJo | Antisense | Alternative minor form, no tissue specificity detected | 5′UTR | Mirror-image polydactyly 1 | Unknown |
| CAMKK2 | 3474885 | 3474928 | chr12:120215348–120215460 | 123 | 133 | 256 | AluSq | Antisense | Alternative minor form, no tissue specificity detected | 5′UTR | Calcium/calmodulin-dependent protein kinase kinase 2, beta | MAPKKK cascade, calmodulin-dependent protein kinase activity |
| CCDC53 | 3468225 | 3468250 | chr12:100968097–100968199 | 103 | 147 | 250 | AluJo | Antisense | Alternative minor form, no tissue specificity detected | Coding with premature termination codon | Coiled-coil domain containing 53 | Unknown |
| SLFN11 | 3753500 | 3753521 | chr17:30718078–30718195 | 118 | 156 | 253/274/324/372 | AluJb | Antisense | Pancreas specific minor form, alternative major form in most tissues | 5′UTR. | Schlafen family member 11 | ATP binding, nucleotide binding |
| NOX5 | 3599561 | 3599599 | chr15:67054122–67054224 | 103 | 128 | 231/276 | AluJb | Antisense | Liver, pancreas and testes specific skipping, constitutive inclusion in other tissues | 5′UTR | NADPH oxidase, EF-hand calcium binding domain 5 | NADPH oxidase that generates superoxide and functions as a H+ channel in a Ca(2+)-dependent manner |
| B3GALNT1 | 2703377 | 2703394 | chr3:162290426–162290544 | 119 | 253 | 372 | AluSx | Antisense | Cerebellum, heart and testes specific minor form | 5′UTR | beta-1,3-N-acetylgalactosaminyltransferase 1 | Protein amino acid glycosylation, galactosyltransferase activity |
Figure 1Examples of “Correlated” Exons analyzed by Exon Array analysis, semi-quantitative RT-PCR and sequencing.
A. Exon array analysis of NLRP1. B. RT-PCR analysis of Alu-derived exon in NLRP1. C. Exon array analysis of FAM55C. D. RT-PCR analysis of Alu-derived exon in FAM55C. E. RT-PCR analysis of Alu-derived exon in SLFN11. F. RT-PCR analysis of Alu-derived exon in NOX5. In Exon Array analysis, the bold line represents the overall gene expression levels across all 11 tissues, each with 3 replicates; each of the fine lines represents the background corrected intensities of a probe targeting the Alu-derived exon. The Pearson correlation coefficient of the individual probe's intensities with the estimated gene expression levels in 11 tissues is shown at the top right corner of each graph. In each gel figure, solid arrows show sequencing analysis confirmed Alu exon inclusion forms. Hollow arrows show sequencing analysis confirmed Alu exon skipping forms.
Detection of tissue-specific splicing in candidate Alu-derived exons selected from Exon array analysis and published literature.
| Gene | Cluster | Probeset | Target exon location | Target exon size (bp) | PCR Skipping (bp) | PCR inclusion (bp) | Alu type | Alu strand/mRNA | Observed Splicing pattern | Impact on mRNA/protein | Prior evidence of Exon Splicing | Gene name | GO processes/known features |
| ICA1 | 3038065 | 3038156 | chr7:8233793–8234006 | 214 | 156 | 370 | AluJo | Sense | Testes specific inclusion | Pre-mature stop or alternative start | Testes-specific inclusion based on Exon array data | Islet cell autoantigen 1 | Neurotransmitter transport, autoantigen in insulin-dependent diabetes mellitus and primary Sjogren's syndrome |
| ZNF254 | 3827427 | 3827448 | Chr19: 24023579–24023656 | 78 | 190 | 268 | AluJb | Antisense | Cerebellum specific major form, pancreas specific skipping, alternative medium form in other tissues | 5′UTR | Cerebellum-specifc inclusion based on Exon array data | Zinc finger protein 254 | DNA binding, negative regulation of transcription from RNA polymerase II promoter |
| SEPN1 | 2326126 | 2326133 | chr1:26001094–26001195 | 102 | 127 | 229 | AluJb | Antisense | Muscle specific major form, alternative minor form in most tissues | Coding sequence, but no protein detection in previous report | Alternative minor form predicted from ESTs and RT-PCR | Selenoprotein N, 1 | Calcium ion binding,mutations in this gene cause the classical phenotype of multiminicore disease and congenital muscular dystrophy with spinal rigidity and restrictive respiratory syndrome. |
| PKP2 | 3450234 | 3450257 | chr12: 32887387–32887503 | 132 | 196 | 328 | AluSg | Sense | Pancreas specific inclusion | Coding | Alternative minor form in HT29 cell line by RT-PCR | Plakophilin 2 | Cell-cell adhesion |
| RPE | 2525852 | 2525861 | chr2:210589173–210589226 | 54 | 233 | 287 | AluJ/FRAM | Antisense | Alternative minor form, no conclusive evidence for tissue-specificity | Coding | Alternative minor form in placenta by RT-PCR | Ribulose-5-phosphate-3-epimerase | Carbohydrate metabolic process, ribulose-phosphate 3-epimerase activity |
| SUGT1 | N/A | N/A | chr13:52133611–52133706 | 96 | 98 | 194 | AluSx | Antisense | Alternative medium form, no conclusive evidence for tissue-specificity | Coding | Uterus, pancreas, muscle-specific based on EST data | Suppressor of G2 allele of SKP1 | Mitosis |
| FAM79B (TPRG1) | 2657546 | 2657554 | chr3:190201257–190201394 | 138 | 153 | 291 | AluJo | Antisense | Alternative major form, no conclusive evidence for tissue-specificity | 5′UTR | Kidney-specific inclusion based on Exon array data | Tumor protein p63 regulated 1 | Unknown |
| BCL2L13 | 3936256 | 3936278 | chr22:16547375–16547472 | 98 | 152 | None | AluSp | Antisense | No detectable inclusion in any tested tissue | Coding with premature stop codon | Alternative minor form in heart, HeLa cell line, lymphocyte by RT-PCR | BCL2-like 13 (apoptosis facilitator) | Caspase activation,induction of apoptosis |
Figure 2Examples of tissue-specific Alu-derived exons analyzed by Exon Array analysis, semi-quantitative RT-PCR and sequencing.
A. Exon array analysis of ICA1 indicates a testes specific inclusion of Alu-derived exon. B. RT-PCR analysis of Alu-derived exon in ICA1. C. Exon array analysis of ZNF254 indicates a cerebellum specific inclusion of Alu-derived exon. D. RT-PCR analysis of Alu-derived exon in ZNF254. E. RT-PCR analysis of Alu-derived exon in PKP2. F. RT-PCR analysis of Alu-derived exon in SEPN1. In Exon Array analysis, the bold line represents the overall gene expression levels across all 11 tissues, each with 3 replicates; each of the fine lines represents the background corrected intensities of a probe targeting the Alu-derived exon. In each gel figure, solid arrows show sequencing analysis confirmed Alu exon inclusion forms. Hollow arrows show sequencing analysis confirmed Alu exon skipping forms. Dashed arrows show sequencing analysis confirmed non-specific PCR products.
Figure 3Evolution of SEPN1 Alu-exon splicing in primates.
A. The splicing pattern of SEPN1 Alu-derived exon. B. RT-PCR analysis of the SEPN1 Alu-derived exon in human, chimpanzee and macaque tissues. The RT-PCR primer was designed from the upstream and downstream constitutive exon on the human gene and matched perfectly to chimpanzee and macaque transcripts. C. Realtime qPCR primers that specifically amplify exon inclusion and skipping forms. The reverse PCR primer for the skipping form was designed from the junction of upstream and downstream constitutive exons. These PCR primers perfectly matched both human and chimpanzee transcripts. D. The ratio of exon inclusion/skipping in human tissues and tissues of two chimpanzees estimated by realtime qPCR. The SEPN1 exon showed strong exon inclusion in human muscle but not in chimpanzee muscle. C, cerebellum; K, kidney; L, liver; M, muscle.
Figure 4Most Alu exons with substantial transcript inclusion levels are derived from ancient Alu elements in the human genome.
Plotted here are distributions of AluJ Class and AluS Class in the human genome, in Alu-derived internal exons, and in Alu-derived exons with substantial transcript inclusion levels based on our RT-PCR results. AluJ class is indicated by white column; AluS class is indicated by hatched column.