| Literature DB >> 25254147 |
Hidehito Kuroyanagi1, Satomi Takei1, Yutaka Suzuki2.
Abstract
Mutually exclusive selection of one exon in a cluster of exons is a rare form of alternative pre-mRNA splicing, yet suggests strict regulation. However, the repertoires of regulation mechanisms for the mutually exclusive (ME) splicing in vivo are still unknown. Here, we experimentally explore putative ME exons in C. elegans to demonstrate that 29 ME exon clusters in 27 genes are actually selected in a mutually exclusive manner. Twenty-two of the clusters consist of homologous ME exons. Five clusters have too short intervening introns to be excised between the ME exons. Fidelity of ME splicing relies at least in part on nonsense-mediated mRNA decay for 14 clusters. These results thus characterize all the repertoires of ME splicing in this organism.Entities:
Keywords: NMD; alternative splicing; intron; mutually exclusive exons; pre-mRNA processing; proteome diversity
Year: 2014 PMID: 25254147 PMCID: PMC4165533 DOI: 10.4161/worm.28459
Source DB: PubMed Journal: Worm ISSN: 2162-4046
Table 1. Summary of experimental validation of the putative ME exon clusters
| Name and ID in WS239 | Position in WS235 | Supporting Reads*a | Sequence Identity | Intervening Intron(s) [nt]*b | NMD- | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| AS Type | Chr | Gene | WBGene ID | Str | Exon | Left end | Right end | Exon Length | Nucleotide | Amino Acid | ||||
| Homologous ME | I | WBGene00009276 | + | 4a | 7837844 | 7837926 | 83 | 3n +2 | 9 | 41.0% | 17.9% | 683 | Yes | |
| 4b | 7838610 | 7838692 | 83 | 3n +2 | 12 | |||||||||
| WBGene00002978 | - | 7b | 14623010 | 14623148 | 139 | 3n +1 | 1615 | 71.2% | 78.3% | 422 | No | |||
| 7a*d | 14623571 | 14623709 | 139 | 3n +1 | 83 | |||||||||
| 5b | 14624030 | 14624147 | 118 | 3n +1 | 478 | 81.4% | 74.4% | 155, 123 | No | |||||
| 5c | 14624303 | 14624420 | 118 | 3n +1 | 94 | 47.5% | 35.9% | |||||||
| 5a | 14624544 | 14624661 | 118 | 3n +1 | 706 | 83.9% | 71.8% | |||||||
| II | WBGene00004921 | + | 6B*d | 6815048 | 6815187 | 140 | 3n +2 | 0 | 64.3% | 55.3% | 116 | No | ||
| 6A | 6815304 | 6815437 | 134 | 3n +2 | 63 | |||||||||
| WBGene00002251 | + | 3a | 8903388 | 8903494 | 107 | 3n +2 | 24 | 55.8% | 45.9% | 112 | Partially | |||
| 3b | 8903607 | 8903719 | 113 | 3n +2 | 72 | |||||||||
| III | WBGene00002915 | + | 19a | 2676547 | 2676664 | 118 | 3n +1 | 44 | 40.4% | 33.3% | 97 | No | ||
| 19b | 2676762 | 2676870 | 109 | 3n +1 | 34 | |||||||||
| WBGene00001631 | - | 8b | 4325419 | 4325523 | 105 | 3n | 26 | 44.1% | 28.9% | - | ||||
| 8a | 4325535 | 4325642 | 108 | 3n | 49 | |||||||||
| WBGene00015735 | - | 8b | 6636402 | 6636556 | 155 | 3n +2 | 77 | 53.4% | 30.8% | 336 | Partially | |||
| 8a | 6636893 | 6637017 | 125 | 3n +2 | 24 | |||||||||
| WBGene00000762 | + | 6a | 6937600 | 6937733 | 134 | 3n +2 | 13 | 47.0% | 40.0% | 678 | Yes | |||
| 6b | 6938412 | 6938545 | 134*e | 3n +2 | 12 | |||||||||
| WBGene00006768 | + | 7a | 8909233 | 8909355 | 123 | 3n | 46 | 47.2% | 35.4% | 725 | - | |||
| 7b | 8910081 | 8910221 | 141 | 3n | 56 | |||||||||
| WBGene00001630 | + | 9b | 13200771 | 13200881 | 111 | 3n | 16 | 77.7% | 54.1% | - | ||||
| 9c | 13200916 | 13201020 | 105 | 3n | 13 | 66.1% | 54.1% | |||||||
| 9a | 13202487 | 13202588 | 102 | 3n | 6 | 71.4% | 48.6% | |||||||
| WBGene00012306 | - | 2b | 13730072 | 13730243 | 172 | 3n +1 | 80 | 35.5% | 17.5% | 129 | Yes | |||
| 2a | 13730373 | 13730526 | 154 | 3n +1 | 28 | |||||||||
| IV | WBGene00008706 | + | 6a | 17478400 | 17478647 | 248 | 3n +2 | 78 | 75.4% | 80.5% | 207 | Yes | ||
| 6b | 17478855 | 17479102 | 248 | 3n +2 | 82 | |||||||||
| V | WBGene00006796 | + | 7a*d | 4504975 | 4505120 | 146 | 3n +2 | 4 | 49.1% | 30.2% | 238 | No | ||
| 7b | 4505359 | 4505516 | 158 | 3n +2 | 162 | |||||||||
| WBGene00001526 | + | 4a | 8952725 | 8952879 | 155 | 3n +2 | 19 | 52.2% | 48.1% | 138 | Partially | |||
| 4b | 8953018 | 8953175 | 158 | 3n +2 | 31 | |||||||||
| WBGene00000102 | + | 6a | 10250841 | 10251032 | 192 | 3n | 136 | 62.8% | 63.8% | 294 | - | |||
| 6b | 10251327 | 10251533 | 207 | 3n | 41 | |||||||||
| WBGene00000228 | + | 4a | 12480728 | 12480888 | 161 | 3n +2 | 147 | 61.4% | 57.1% | 225 | Partially | |||
| 4b | 12481114 | 12481196 | 83 | 3n +2 | 26 | |||||||||
| WBGene00004830 | + | 10a | 18499582 | 18499694 | 113 | 3n +2 | 21 | 65.5% | 64.1% | 856 | Partially | |||
| 10b | 18500551 | 18500663 | 113 | 3n +2 | 11 | |||||||||
| X | WBGene00003407 | - | 13D | 579498 | 579653 | 156 | 3n | 31*f | 68.6% | 53.8% | 334, 154, 435 | - | ||
| 13C | 579988 | 580143 | 156 | 3n | 35 | 76.9% | 59.6% | |||||||
| 13B | 580298 | 580453 | 156 | 3n | 37 | 74.4% | 53.8% | |||||||
| 13A | 580889 | 581044 | 156 | 3n | 2 | 69.2% | 50.0% | |||||||
| WBGene00000367 | - | 18b | 7854051 | 7854201 | 151 | 3n +1 | 8 | 53.0% | 44.0% | 97 | Partially | |||
| 18a | 7854299 | 7854428 | 130 | 3n +1 | 20 | |||||||||
| WBGene00010199 | + | 3a | 10571708 | 10571827 | 120 | 3n | 35 | 45.8% | 35.0% | - | ||||
| 3b | 10571865 | 10571984 | 120 | 3n | 32 | |||||||||
| WBGene00002280 | - | 10 | 16386613 | 16386723 | 111 | 3n | 164 | 65.8% | 56.8% | - | ||||
| 9 | 16386754 | 16386861 | 108 | 3n | 104 | |||||||||
| Non-Homologous ME | I | WBGene00004154 | - | 6 | 3274267 | 3274812 | 546 | 3n | 53 | 37.0% | 21.6% | 117 | - | |
| 5 | 3274930 | 3275010 | 81 | 3n | 3 | |||||||||
| WBGene00006594 | + | 14a | 5552880 | 5553369 | 490 | 3n +1 | 172 | 50.0% | 16.4% | 1371 | Partially | |||
| 14b | 5554741 | 5554864 | 124 | 3n +1 | 72 | |||||||||
| WBGene00008765 | - | 5b | 7301663 | 7301774 | 112 | 3n +1 | 16 | 41.8% | 17.1% | 132 | Partially | |||
| 5a | 7301907 | 7302027 | 121 | 3n +1 | 54 | |||||||||
| III | WBGene00006768 | + | 4a | 8906524 | 8906678 | 155 | 3n +2 | 34 | 60.0% | 17.3% | 288, 237 | Partially | ||
| 4b | 8906967 | 8907073 | 107 | 3n +2 | 49 | 51.6% | 18.0% | |||||||
| 4c | 8907311 | 8907432 | 122 | 3n +2 | 26 | 49.7% | 23.1% | |||||||
| IV | WBGene00001403 | + | 5D | 9542035 | 9542292 | 258 | 3n | 57 | 35.0% | 22.1% | 435 | - | ||
| 5C | 9542728 | 9542868 | 141 | 3n | 10 | |||||||||
| V | WBGene00011891 | + | 5 | 10574706 | 10574733 | 28 | 3n +1*g | 4 | 53%*h | 60%*h | 200 | Yes | ||
| 6 | 10574934 | 10575098 | 165 | 3n | 230 | |||||||||
| X | WBGene00001184 | + | 5B | 11017580 | 11017930 | 351 | 3n | 38 | 40.1% | 10.3% | - | |||
| 5A | 11017945 | 11018139 | 195 | 3n | 5 | |||||||||
| Tandem CE | I | WBGene00006594 | + | 17 | 5557960 | 5557971 | 12 | 3n | - | - | - | - | - | |
| Tandem CE | 18 | 5558453 | 5558479 | 27 | 3n | - | - | - | - | - | ||||
| Tandem CE | I | WBGene00002978 | - | 9b | 14622304 | 14622511 | 208 | 3n +1*g | - | - | - | - | - | |
| Tandem CE | 9a | 14622658 | 14622743 | 86 | 3n +2*g | - | - | - | - | - | ||||
| Single CE | II | WBGene00001340 | + | 4 | 166071 | 166150 | 80 | 3n +2 | - | - | - | - | - | |
| Tandem CE | II | WBGene00006997 | + | 8 | 4952490 | 4952531 | 42 | 3n*g | - | - | - | - | - | |
| Tandem CE | 9 | 4952647 | 4952694 | 48 | 3n | - | - | - | - | - | ||||
| Tandem CE | II | WBGene00016415 | + | 10 | 5205040 | 5205075 | 36 | 3n | - | - | - | - | - | |
| Tandem CE | 11 | 5205444 | 5205632 | 189 | 3n | - | - | - | - | - | ||||
| Tandem CE | III | WBGene00000542 | + | 3 | 7981673 | 7981810 | 138 | 3n | - | - | - | - | - | |
| Tandem CE | 4 | 7982464 | 7982538 | 75 | 3n | - | - | - | - | - | ||||
| Constitutive | IV | WBGene00006780 | + | 13 | 5984796 | 5984980 | 185 | 3n +2 | - | - | - | - | - | |
| Single CE | 14 | 5985093 | 5985170 | 78 | 3n | - | - | - | - | - | ||||
| Undetected | 15 | 5985577 | 5986355 | 779 | 3n +2 | - | - | - | - | - | ||||
| Alternative Acceptors | IV | WBGene00006779 | - | 14S | 10329097 | 10329195 | 99*i | 3n | - | - | - | - | - | |
| Tandem CE | V | WBGene00010742 | - | 5L | 11194203 | 11194342 | 140*j | 3n +2*g | - | - | - | - | - | |
| Tandem CE | 4 | 11194574 | 11194627 | 54 | 3n | - | - | - | - | - | ||||
| Tandem CE | V | WBGene00013481 | - | 7 | 18661622 | 18661711 | 90 | 3n | - | - | - | - | - | |
| Tandem CE | 6 | 18662119 | 18662313 | 195 | 3n | - | - | - | - | - | ||||
| Constitutive | X | WBGene00009888 | + | 5 | 9554299 | 9554547 | 249 | 3n | - | - | - | - | - | |
| Tandem CE | 6 | 9554692 | 9554823 | 132 | 3n | - | - | - | - | - | ||||
| Constitutive | X | WBGene00020732 | + | 16 | 17679922 | 17680035 | 114 | 3n | - | - | - | - | - | |
ME, mutually exclusive exons; CE, cassette exon. *aTotal number of sequence reads mapped to each ME exon and its junctions with the flanking exons out of 10.3 million mapped reads in RNA-seq analysis of polyA+ RNA from synchronized smg-2 L1 larvae. *bIntervening introns shorter than 40 nt are underlined. *cYes, NMD isoform(s) were significantly increased in the smg-2 mutant (P < 0.05 in a modified χ2 test of the RT-PCR products). Partially, NMD isoform(s) were slightly increased in the smg-2 mutant (difference in the amount of the splice variant > 0.4 [% of sum] in molar concentration). No, no apparent difference in the RT-PCR patterns between N2 and the smg-2 mutant. *dThese exons were expressed at very low levels at the L1 stage. At the young adult stage, lev-11 exon 7a and snt-1 exon 6B were still rare, while unc-62 exon 7a was readily detected. *eOut-of-frame tandem acceptor sites were frequently used for coq-2 exon 6b. *fmrp-1 exon 13D is aberrantly annotated in the RefSeq model used for mapping. *gThese exons carry in-frame termination codons. *hNucleotide and amino acid sequence identities of del-6 exons 5/6 appear high because of their short lengths. *iIn-frame tandem acceptor sites were frequently used for unc-43 exon 14. *jTandem donor sites were frequently used for K10D6.2 exon 5.

Figure 1. RT-PCR analyses of the putative ME exons in the wild-type (N2) and the smg-2 (yb979) mutant. (A) clp-1 exons 3/4. (B) del-6 exons 5/6. del-6 exon 5 is unique in that it carries a natural termination codon. (C) bet-2 exons 3a/3b. Note that the intervening intron is retained instead of double ME exon inclusion for this cluster. (D) unc-32 exons 7a/7b. (E) akt-1 exons 6a/6b. A non-productive exon 6a isoform utilizing an aberrant acceptor site is detected in the smg-2 mutant. (F) fbl-1 exons 5D/5C. (G) F30F8.9 exons 4a/4b. (H) coq-2 exons 6a/6b. (I) gck-1 exons 4a/4b. (J and K) lev-11 exons 5a/5c/5b. (L) let-805 exons 19a/19b. Splicing patterns are schematically indicated. Coding regions are in orange. Arrows indicate predicted positions of undetected isoforms indicated on the right. Asterisks indicate non-specific bands.

Figure 2. Statistics of the ME exons and their flanking introns. (A and B) Size distributions of the 63 verified ME exons (A) and their 92 flanking introns, including the intervening introns (B). The mean and median sizes are also indicated. (C) Sequence logos of the splice acceptor and donors sites of the 63 ME exons.
Table 2. Gene ontology analysis of 25 genes with the ME exons and GO terms
| Ontology type | GO term ID | Fold enrichment | Count in 25 Genes with ME Exons and GO terms | Count in all genes with GO Terms (12,834) | Term | |
|---|---|---|---|---|---|---|
| Biological_process | GO:0046928 | 257 | 2 | 4 | 2.18E-05 | regulation of neurotransmitter secretion |
| GO:0030163 | 16 | 4 | 131 | 1.11E-04 | protein catabolic process | |
| GO:0007166 | 79 | 2 | 13 | 2.80E-04 | cell surface receptor linked signal transduction | |
| GO:0040011 | 3.5 | 9 | 1327 | 5.77E-04 | locomotion | |
| GO:0034765 | 49 | 2 | 21 | 7.48E-04 | regulation of ion transmembrane transport | |
| GO:0043050 | 45 | 2 | 23 | 8.99E-04 | pharyngeal pumping | |
| Cellular_component | GO:0005865 | 114 | 2 | 9 | 1.30E-04 | striated muscle thin filament |
| GO:0016021 | 2.9 | 12 | 2143 | 2.77E-04 | integral to membrane | |
| GO:0016020 | 3.1 | 11 | 1847 | 3.37E-04 | membrane | |
| GO:0005604 | 57 | 2 | 18 | 5.47E-04 | basement membrane | |
| GO:0005578 | 47 | 2 | 22 | 8.22E-04 | proteinaceous extracellular matrix | |
| Molecular_function | GO:0005201 | 257 | 2 | 4 | 2.18E-05 | extracellular matrix structural constituent |
| GO:0005244 | 49 | 2 | 21 | 7.48E-04 | voltage-gated ion channel activity |