| Literature DB >> 32265287 |
Tatsuo Kanno1, Peter Venhuizen2, Ming-Tsung Wu3, Phebe Chiou1, Chia-Liang Chang1, Maria Kalyna4, Antonius J M Matzke1, Marjori Matzke1.
Abstract
To investigate factors influencing pre-mRNA splicing in plants, we conducted a forward genetic screen using an alternatively-spliced GFP reporter gene in Arabidopsis thaliana This effort generated a collection of sixteen mutants impaired in various splicing-related proteins, many of which had not been recovered in any prior genetic screen or implicated in splicing in plants. The factors are predicted to act at different steps of the spliceosomal cycle, snRNP biogenesis pathway, transcription, and mRNA transport. We have described eleven of the mutants in recent publications. Here we present the final five mutants, which are defective, respectively, in RNA-BINDING PROTEIN 45D (rbp45d), DIGEORGE SYNDROME CRITICAL REGION 14 (dgcr14), CYCLIN-DEPENDENT KINASE G2 (cdkg2), INTERACTS WITH SPT6 (iws1) and CAP BINDING PROTEIN 80 (cbp80). We provide RNA-sequencing data and analyses of differential gene expression and alternative splicing patterns for the cbp80 mutant and for several previously published mutants, including smfa and new alleles of cwc16a, for which such information was not yet available. Sequencing of small RNAs from the cbp80 mutant highlighted the necessity of wild-type CBP80 for processing of microRNA (miRNA) precursors into mature miRNAs. Redundancy tests of paralogs encoding several of the splicing factors revealed their functional non-equivalence in the GFP reporter gene system. We discuss the cumulative findings and their implications for the regulation of pre-mRNA splicing efficiency and alternative splicing in plants. The mutant collection provides a unique resource for further studies on a coherent set of splicing factors and their roles in gene expression, alternative splicing and plant development.Entities:
Keywords: Arabidopsis thaliana; CBP80; miRNAs; mutant screen; pre-mRNA splicing
Mesh:
Substances:
Year: 2020 PMID: 32265287 PMCID: PMC7263676 DOI: 10.1534/g3.119.400998
Source DB: PubMed Journal: G3 (Bethesda) ISSN: 2160-1836 Impact factor: 3.542
Figure 1Alternatively-spliced GFP reporter gene system. (A) The T-DNA construct introduced into Arabidopsis comprises a GFP reporter gene under the transcriptional control of a minimal promoter (TATA) and upstream viral (EPRV) enhancer. In the WT T line, however, the expected transcription initiation site (TSS) (light gray arrow) is not used. Rather transcription of GFP pre-mRNA initiates at an unanticipated upstream TSS (black bar and arrow). Alternative splicing yields three GFP splice variants: an unspliced transcript, a transcript resulting from splicing of a canonical GU-AG intron, and a transcript arising from splicing a U2-type intron with non-canonical AU-AC splice sites, which are considered weak compared to GU-AG splice sites (Crotti ). The unspliced and GU-AG transcripts contain numerous premature termination codons (black asterisks). Hence only the AU-AC transcript can be translated into GFP protein. The coding sequence of GFP protein (green bars) uniquely contains a 27 amino acid extension (short green bars with black outline) compared to standard GFP (Fu ; Kanno ). Arrowheads denote a tandem repeat cluster upstream of the cryptic promoter. The black AUG designates the major translation initiation codon. The 3′ AC splice site is only three nucleotides downstream of the 3′ AG splice site (Kanno et al. 2008; 2016, 2017a, b). (B) Schematic of forward screen and GFP fluorescence phenotypes of newly germinated seedlings. The wild-type Target (T) line has an intermediate level of GFP fluorescence visible primarily in the stem (hypocotyl) and shoot and root apices. Mutants generated by EMS treatment of the T line exhibit either reduced (Weak) or enhanced (Hyper) GFP fluorescence relative to the T line. Cotyledons (first leaves appearing from a germinating seedling) appear red owing to auto-fluorescence of chlorophyll at the excitation wavelength for GFP. Figure adapted from Figure 1 in Kanno et al. (2018a) with permission from the Genetics Society of America.
Mutants identified in a forward genetic screen based on an alternatively-spliced GFP reporter gene
| Name | AGI number | Predicted function in splicing | No. of alleles | Effect of mutation on development | Reference | |
|---|---|---|---|---|---|---|
| coilin | At1g13030 | marker protein for Cajal bodies, which facilitate snRNP maturation | 12 | negligible | ||
| CWC16a | At1g25682 | step I factor | 3 | negligible | ||
| SMU1 | At1g73720 | recruited prior to B* complex formation; | 1 | negligible | ||
| SMFA | At4g30220 | small nuclear ribonucleoprotein | 1 | negligible | ||
| PRP39A | At1g04080 | U1 snRNP component | 5 | negligible | ||
| RBP45D | At5g19350 | U1 snRNP component | 2 | negligible | this study | |
| DG CR14-related | At3g07790 | spliceosomal C complex | 2 | negligible | this study | |
| CDKG2 | At1g67580 | splicing-related protein kinase | 1 | early flowering | this study | |
| IWS1 | At1g32130 | transcription elongation | 2 | negligible | this study | |
| AtRTF2 | At5g58020 | contributes to ubiquitin-based regulation of the spliceosome? | 2 | embryo lethal | ||
| PRP8A | At1g80070 | U5 snRNP component; acts at catalytic core of spliceosome | 3 | embryo lethal | ||
| RBM25 | At1g60200 | U1 snRNP component | 2 | low seed set | ||
| PRP18A | At1g03140 | step II factor | 1 | short roots, small siliques | Kanno | |
| PRP4KA | A3g25840 | recruited prior to B* complex formation; needed for catalytic activation of spliceosome | 5 | broad rosettes, late flowering, tall stature, low seed set | ||
| SAC3A | At2g39340 | mRNA export factor | 5 | negligible | ||
| CBP80 | At2g13540 | multiple | 1 | Serrated leaves, early flowering | this study | |
| T line (WT) | n.a. | n.a. | Wild-type line expressing | n.a. | normal |
The mutants retrieved in a forward genetic screen based on an alternatively-spliced GFP reporter gene in Arabidopsis (Figure 1) include a predicted core spliceosomal protein (SMFa); putative components of the U1 snRNP (PRP39a, RBM25, RBP45d) and U5 snRNP (PRP8); putative step I and step II factors transiently associated with the spliceosome (CWC16a and PRP18a, respectively); a predicted complex C protein (DGCR14); putative splicing regulatory proteins (RTF2, SMU1, PRP4ka, CDKG2); one structural protein presumed to be important for snRNP maturation (coilin), putative mRNA export factors (SAC3a, CBP80) and a predicted transcription elongation factor (IWS1). Developmental phenotypes are primarily observed in six (of seven) identified gfw mutations, two of which are embryo-lethal.
Further screening of the M2 population after publication of the first alleles of coilin, PRP39a and CWC16a has identified three new alleles of coilin (R9H; first intron, 3′ splice site; second intron, 5′ splice site), one new prp39a allele (R226*) and two new cwc16a alleles (W18*; fifth intron, 3′ splice site). These unpublished alleles are counted in the number of alleles shown here.
Abbreviation: SRA, Sequence Read Archive (NCBI); ABRC, Arabidopsis Biological Resource Center; T (or ST) refers to the WT T line harboring the alternatively-spliced GFP reporter gene. If the sequencing data from T line has a separate SRA number, it is noted in the table; n.d., not done; n.a., not available.
Figure 2Spliceosomal cycle and factors identified in the forward genetic screen. Splicing is catalyzed by the spliceosome, a large and dynamic ribonucleoprotein (RNP) machine located in the nucleus. Spliceosomes comprise five small nuclear (sn) RNPs, each containing a heptameric ring of Sm or Like-Sm proteins and a different snRNA (U1, U2, U4, U5 or U6), as well as numerous other non-snRNP proteins. During the splicesomal reaction cycle, the five snRNPs act sequentially on the pre-mRNA with a changing assemblage of non-snRNP proteins to form a series of complexes that catalyze two consecutive trans-esterification reactions. In complex E, U1 and U2 snRNPs first recognize the 5′ and 3′ splice sites branch points of introns and interact to form pre-spliceosomal complex A. The subsequent addition of preformed U4/U5/U6 tri-snRNP creates pre-catalytic complex B. Ensuing reorganization steps induce release of U1 and U4 snRNPs and conversion of complex B to complex B*, which catalyzes the first reaction yielding the free 5′ exon and lariat 3′-exon intermediates. The newly formed C complex catalyzes the second reaction to achieve intron lariat excision and exon ligation. Lastly, dismantling of the spliceosome frees individual components to assemble anew at the next intron. The positions of factors identified in our screen, as predicted by their orthologs in yeast or metazoans, are indicated by colored rectangles (green, Hyper-GFP; dull green, GFP-weak). Adapted by permission from Springer Nature, Nat. Rev. Mol. Cell Biol. 15: 108–121, A day in the life of the spliceosome, A. G. Matera and Z. Wang, 2014.
Figure 3snRNP maturation pathway and factors identified in the forward genetic screen snRNAs are transcribed by RNA polymerase II in the nucleus and transported to the Cajal Body (CB), where they undergo quality control steps before export to the cytoplasm. In the cytoplasm, the monomethyl guanosine 5′ cap is converted to trimethyl guanosine by the enzyme TGS1. The snRNAs are encircled by a heteromeric ring complex comprising seven Sm core proteins, which protect the snRNA and together with the trimethyl guanosine cap, act as nuclear import signals. Back in the nucleus, the snRNP transits again through the CB where it undergoes further maturation steps before being released into the nucleoplasm to assemble into spliceosomes. Predicted positions of factors identified in our screen are shown by colored rectangles (bright green, Hyper-GFP; dull green, GFP-weak). Adapted by permission from Springer Nature, Nat. Rev. Mol. Cell Biol. 15: 108–121, A day in the life of the spliceosome, A. G. Matera and Z. Wang, 2014.
Numbers of DEGs and DAS events in different splicing-related factors recovered in the forward genetic screen
| Differentially Expressed Genes | ||||||
|---|---|---|---|---|---|---|
| Direction of change | ||||||
| Up | 2105 | 334 | 676 | 3121 | 1093 | |
| Down | 567 | 435 | 265 | 1937 | 1299 | |
| Total Number of DEGs | 2672 | 769 | 941 | 5058 | 2392 | |
| Supplemental Table | Table S4 | Table S6 | Table S8 | Table S9 | Table S9 | |
| AA | 168 (8.1) | 129 (44.9) | 45 (14.2) | 292 (13.4) | 69 (19.5) | |
| AD | 199 (9.6) | 17 (5.9) | 27 (8.5) | 244 (11.2) | 37 (10.5) | |
| CE | 158 (7.7) | 20 (7.0) | 20 (6.3) | 307 (14.1) | 29 (8.2) | |
| EI | 83 (4.0) | 13 (4.5) | 26 (8.2) | 158 (7.2) | 37 (10.5) | |
| RI | 1456 (70.5) | 108 (37.6) | 200 (62.9) | 1183 (54.2) | 181 (51.3) | |
| Total Number of DAS events | 2064 (100) | 287 (100) | 318 (100) | 2184 (100) | 353 (100) | |
| Total Number of DAS genes | 1507 | 255 | 240 | 1465 | 287 | |
| Supplemental Table | Table S3 | Table S5 | Table S7 | Table S10 | Table S10 | |
Abbreviations: AA, alternative acceptor/alternative 3′ splice site; AD, alternative donor/alternative 5′ splice site; CE, cassette exon; EI, exitron; RI, retained intron.
Only the DEGs/DAS events changed in all alleles, with the same direction of change, are listed here. For the full overview of alternative splicing events per allele, see the respective supplementary tables.