Literature DB >> 30297453

PRP4KA, a Putative Spliceosomal Protein Kinase, Is Important for Alternative Splicing and Development in Arabidopsis thaliana.

Tatsuo Kanno1, Peter Venhuizen2, Tuan-Nan Wen1, Wen-Dar Lin1, Phebe Chiou1, Maria Kalyna3, Antonius J M Matzke4, Marjori Matzke4.   

Abstract

Splicing of precursor messenger RNAs (pre-mRNAs) is an essential step in the expression of most eukaryotic genes. Both constitutive splicing and alternative splicing, which produces multiple messenger RNA (mRNA) isoforms from a single primary transcript, are modulated by reversible protein phosphorylation. Although the plant splicing machinery is known to be a target for phosphorylation, the protein kinases involved remain to be fully defined. We report here the identification of pre-mRNA processing 4 (PRP4) KINASE A (PRP4KA) in a forward genetic screen based on an alternatively spliced GFP reporter gene in Arabidopsis thaliana (Arabidopsis). Prp4 kinase is the first spliceosome-associated kinase shown to regulate splicing in fungi and mammals but it has not yet been studied in plants. In the same screen we identified mutants defective in SAC3A, a putative mRNA export factor that is highly coexpressed with PRP4KA in Arabidopsis Whereas the sac3a mutants appear normal, the prp4ka mutants display a pleiotropic phenotype featuring atypical rosettes, late flowering, tall final stature, reduced branching, and lowered seed set. Analysis of RNA-sequencing data from prp4ka and sac3a mutants identified widespread and partially overlapping perturbations in alternative splicing in the two mutants. Quantitative phosphoproteomic profiling of a prp4ka mutant detected phosphorylation changes in several serine/arginine-rich proteins, which regulate constitutive and alternative splicing, and other splicing-related factors. Tests of PRP4KB, the paralog of PRP4KA, indicated that the two genes are not functionally redundant. The results demonstrate the importance of PRP4KA for alternative splicing and plant phenotype, and suggest that PRP4KA may influence alternative splicing patterns by phosphorylating a subset of splicing regulators.
Copyright © 2018 Kanno et al.

Entities:  

Keywords:  Arabidopsis thaliana; PRP4 kinase; SAC3A; alternative splicing; protein phosphorylation

Mesh:

Substances:

Year:  2018        PMID: 30297453      PMCID: PMC6283158          DOI: 10.1534/genetics.118.301515

Source DB:  PubMed          Journal:  Genetics        ISSN: 0016-6731            Impact factor:   4.562


PRECURSOR messenger RNA (pre-mRNA) splicing, which entails removal of introns and joining of exons, is an essential step in the expression of most eukaryotic genes. Splicing is catalyzed in two consecutive transesterification steps by the spliceosome, a large, dynamic ribonucleoprotein machine located in the nucleus (Will and Lührmann 2011; Matera and Wang 2014; Meyer 2016). In constitutive splicing, the same splice sites are always used, generating a single processed messenger RNA (mRNA) from a given gene. By contrast, alternative splicing involves varying splice-site usage, thus yielding multiple mRNA isoforms from a single primary transcript. Alternative splicing greatly expands transcriptome and proteome diversity. Although rare in Saccharomyces cerevisiae (budding yeast) (Gould ), alternative splicing occurs at low frequency in Schizosaccharomyces pombe (fission yeast) (Fair and Pleiss 2017) and is common in plants and metazoans (Nilsen and Graveley 2010; Marquez ; Naftelberg ). Major modes of alternative splicing include intron retention (IR), exon skipping (ES), alternative 5′ (donor) splice site, and alternative 3′ (acceptor) splice site. Splicing of exonic introns (exitrons), which are alternatively spliced internal regions of reference protein-coding exons, represents a noncanonical splicing event and occurs in ∼7% of Arabidopsis and 4% of human protein-coding genes (Marquez ; Staiger and Simpson 2015; Sibley ; Zhang ). ES is the most frequent mode of alternative splicing in animal cells, whereas it is rarely observed in plants (Marquez ). IR predominates in plants and is also widespread in animals (Marquez ; Braunschweig ). In plants, alternative splicing has important roles in development and in responses to the environment (Staiger and Brown 2013; Filichkin ; Szakonyi and Duque 2018). The recognition of alternative splice sites and modulation of splicing events is guided by a splicing code, which involves a complex interplay among trans-acting factors, cis-acting RNA regulatory elements, and other RNA and chromatin features (Barash ; Baralle and Baralle 2018). Trans-acting splicing factors include serine/arginine-rich (SR) proteins and heterogeneous nuclear ribonucleoproteins (hnRNPs), which respectively bind exonic and intronic cis-regulatory elements, which are termed splicing enhancers and silencers (Barta ; Matera and Wang 2014). Because splicing is coupled to transcription, chromatin structure can influence alternative splicing patterns by influencing the rate of transcription, exon definition, and recruitment of splicing factors through chromatin binding proteins (Naftelberg ). Post-translational modifications of splicing proteins (such as phosphorylation, acetylation, ubiquitination, and sumoylation) contribute to the regulation of both constitutive and alternative splicing (Will and Lührmann 2011; Pozzi ). In particular, reversible phosphorylation of SR proteins and other splicing-related factors has an essential role in splicing (Fluhr 2008; Stamm 2008; Will and Lührmann 2011). SR proteins, which are present in organisms with more complex splicing patterns (fission yeast, plants, and metazoans), feature one or two RNA recognition motifs at their N terminus and an arginine/serine-rich (RS) domain at their C terminus. Phosphorylation/dephosphorylation in the RS domain can alter the ability of SR proteins to interact with other proteins and RNA, which in turn modifies pre-mRNA splicing outcomes (Barta ; Will and Lührmann 2011). The plant spliceosomal machinery is a major target of phosphorylation, as illustrated by a previous phosphoproteomic investigation in Arabidopsis thaliana (Arabidopsis), which identified 22 phosphoproteins with a putative role in RNA metabolism. The set of phosphoproteins included 11 out of 18 SR proteins encoded in the Arabidopsis genome (de la Fuente van Bentem ). In diverse organisms, SR proteins can be phosphorylated by several distinct families of conserved protein kinases (Fluhr 2008; Zhou and Fu 2013). Kinases found previously to be important for phosphorylating SR proteins in Arabidopsis include SR protein kinases (de la Fuente van Bentem ; Rosembert 2017), Cdc2-like or LAMMER-type kinases (Golovkin and Reddy 1999; Savaldi-Goldstein ), and mitogen-activated protein kinases (Feilner ; de la Fuente van Bentem , 2008). Pre-mRNA processing 4 (PRP4) kinases, which are dual-specificity kinases (Lehti-Shiu and Shiu 2012), represent another general class of protein kinase involved in phosphorylating SR proteins and other splicing factors (Will and Lührmann 2011; Lützelberger and Käufer 2012). Prp4 kinases are present in all eukaryotes examined except the fungal group Hemiascomycetes, which includes budding yeast. Prp4 kinase was discovered in fission yeast as a temperature-sensitive mutant defective in pre-mRNA splicing at the restrictive temperature (Alahari ). Although Prp4 kinase is the first kinase shown to regulate pre-mRNA splicing in fungi and mammals (Lützelberger and Käufer 2012), it has not yet been studied for its role in splicing in plants (Lehti-Shiu and Shiu 2012). We report here the recovery of mutants defective in PRP4 kinase A (PRP4KA) (At3g25840) in a forward genetic screen designed to identify factors that influence splicing of an alternatively spliced GFP reporter gene in Arabidopsis. In the same screen, we also retrieved mutants impaired in SAC3A (suppressor of actin; Novick ) (At2g39340), a putative mRNA export factor that is highly coexpressed with PRP4KA in Arabidopsis. We describe the phenotypes of prp4ka and sac3a mutants as well as findings from RNA-sequencing (RNA-seq) analyses to determine differential gene expression and alternative splicing profiles in the two mutants. We present results from a quantitative phosphoproteomic investigation of a prp4ka mutant to identify potential substrates of this kinase. Finally, we describe tests of a mutant defective in PRP4KB (At1g13350), the paralog of PRP4KA, to address possible functional redundancy of the paralogous PRP4K genes (Al-Ayoubi ).

Materials and Methods

Plant material

The Arabidopsis transgenic T line containing an alternatively spliced GFP reporter gene (referred to here as “wild type”) and the prp4ka and sac3a mutants generated by ethyl methanesulfonate (EMS) mutagenesis of the T line are in the Col-0 ecotype (Kanno , 2017a,b). Seeds of a prp4kb transfer-DNA (T-DNA) insertion mutant (SALK_035104C) were provided by the Nottingham Arabidopsis Stock Center. The T-DNA is inserted into the middle of the ninth exon. To our knowledge, this is the first report of the prp4kb T-DNA insertion mutant, which we will refer to as prp4kb-1. The prp4kb-1 allele appears to be a complete knockout (Supplemental Material, Figure S1). All plants were cultivated under long-day conditions (22–23°, 16 hr light, 8 hr dark). The terminology used for different plant generations is as follows: The M2 generation refers to progeny resulting from self-fertilization (selfing) of the original M1 mutant plant grown from seeds treated with EMS. M1 progeny are heterozygous for a given mutation. Thus, M2 is the first generation when a recessive mutation can be homozygous. Further selfing of the M2 plants leads to generations M3, M4, and so on. Backcrossing an M2 plant with the parental wild-type T line produces the BC1 generation, which is again heterozygous for the respective mutation. Selfing of BC1 plants produces the BC1F2 generation, 25% of which are again homozygous for the respective mutation. BC1F2 plants contain fewer EMS-induced mutations than the original M2 plant. Further selfing of BC1F2 plants produces generations BC1F3, BC1F4, and so forth. Crossing two strains that are homozygous for different mutations produces the F1 generation, which is heterozygous for the two mutations. Selfing an F1 plant produces the F2 generation, which is segregating the two mutations in a Mendelian manner.

Forward genetic screen, phenotype analysis, and complementation

The forward genetic screen based on an alternatively spliced GFP reporter gene in the wild-type T line has been described previously (Kanno , 2017a,b). The mutagen EMS generates almost exclusively G/A to C/T transition mutations (Kim ). Screening of putative mutants was performed in the M2 generation. The gfw5 and gfw6 mutants described here were identified by the GFP-weak phenotype of M2 seedlings cultivated under sterile conditions on solid Murashige and Skoog medium viewed using a Leica M165FC fluorescence stereomicroscope. The first alleles in the PRP4KA gene (At3g25840) and in the SAC3A gene (At2g39340) in gfw5 and gfw6 mutants, respectively, were identified by next generation mapping (NGM) (James ). NGM involves sequencing of pooled DNA isolated from at least 50 BC1F2 seedlings that display a GFP-weak phenotype (Kanno ,b). Additional prp4ka and sac3a alleles were identified by Sanger sequencing of the PRP4KA and SAC3A genes considered as possible candidates for mutations in unnamed mutants. Phenotypic analysis of prp4ka and sac3a mutants (two alleles of each) was performed on the BC1F3 generation. A total of 12 plants from each genotype (wild-type T line, prp4ka-2, prpk4a-4, sac3a-3, and sac3a-6 mutants) were grown side by side on soil under long-day conditions (22–23°, 16 hr light, 8 hr dark) and observed during the entire vegetative growth, reproductive phases, and into senescence. The phenotypic characters that were scored included flowering time (time to bolting), rosette diameter, final height of adult plant, number of main and auxiliary stems/branches, seed weight from individual plants, and (in some cases) numbers of siliques per plant and seeds per silique. For complementation tests, the prp4ka-4 and sac3a-6 mutants were transformed with a construct containing either the PRP4KA or SAC3A wild-type coding sequence under the transcriptional control of the 35S promoter and terminator sequences (Pietrzak ). The constructs were introduced into the respective mutant plants (BC1F3 generation) using the floral dip method (Clough and Bent 1998) and Agrobacterium binary vector BV-MpPATot SalI (Matzke ), which encodes resistance to phosphinothricin (PPT). T1 transformants were selected on solid Murashige and Skoog medium containing 20 µg/ml PPT and 200 µg/ml cefotaxime to destroy agrobacteria. Successful complementation was indicated by a return to an intermediate GFP phenotype (similar to that observed in the wild-type T line) in seedlings growing on solid Murashige and Skoog medium and, in the prp4ka mutants, restoration of a wild-type phenotype in soil-grown plants. The presence of the respective prp4ka-4 and sac3a-6 mutations in complemented lines was confirmed by Sanger sequencing.

Western blotting using a GFP antibody

Western blotting to determine levels of GFP protein in the prp4ka-4 and sac3a-6 mutants compared to wild-type T line was carried out as described previously (Fu ; Kanno , 2017a,b). Total protein was isolated from 2-week-old seedlings growing on solid Murashige and Skoog medium under a 16 hr light/8 hr dark cycle at 24°. Monoclonal antibodies to GFP were purchased from Roche (catalog no. 11814 460001). For a loading control, a duplicate gel containing the same samples was run and stained with Coomassie brilliant blue.

Semiquantitative RT-PCR to assess levels of GFP RNA splicing variants

Semiquantitative RT-PCR was used to gauge the levels of the three GFP RNA splice variants in prp4ka-4 and sac3a-6 mutants relative to the wild-type T line following a published protocol (Sasaki ; Kanno ,b). Total RNA was isolated from 2-week-old seedlings of the wild-type T line, the prp4ka-4 mutant, and the sac3a-6 mutant (BC1F3 generation for both mutants) growing on solid Murashige and Skoog medium as described above using a Plant Total RNA Miniprep Kit (GeneMark, Taichung, Taiwan). Primers for GFP and actin are listed in Table S1.

RNA-seq

Total RNA was isolated from 2-week-old seedlings of the wild-type T line, the prp4ka-4 mutant, and the sac3a-6 mutant (BC1F3 generation for both mutants) cultivated on Murashige and Skoog medium as described above. Preparation of libraries and RNA-seq were performed (biological triplicates for each sample) as described previously (Sasaki ; Kanno ). Whole-genome resequencing of the prp4ka-4 and sac3a-6 mutants was conducted to identify any remaining EMS-induced, second-site mutations that change splice sites. These mutations were then removed from the analysis of alternative splicing.

RNA-seq analyses for differentially expressed genes and alternative splicing events

Differential expression analysis:

To determine differential expression of the prp4ka and sac3a mutants compared to the wild type, we considered the transcript per million (TPM) estimated with Salmon (version 0.8.0; Patro ) for the Reference Transcript Dataset for Arabidopsis 2 (AtRTD2)-Quantification of Alternatively Spliced Isoforms (QUASI) (AtRTD2-QUASI) annotation (Zhang ), and used tximport (Soneson ) to group transcript read counts per gene. Differential genes were determined using edgeR (version 3.18.1; Robinson ). Genes were considered differentially expressed for a false discovery rate <0.05.

Alternative splicing analysis:

Alternative splicing events were generated using SUPPA (Alamancos ) from the AtRTD2-QUASI reference transcriptome annotation file (Zhang ). The ES, IR, and exitron events were extracted using variable boundaries, whereas the alternative 3′ and 5′ splicing events (A3 and A5, respectively) were defined with strict boundaries. The percent spliced-in (PSI) inclusion values were calculated based on the transcript TPM quantification. Differential splicing, the ΔPSI, was calculated using the event PSI and Salmon TPM values as input. Events were considered significantly changed for an absolute ΔPSI ≥ 0.1 and a P-value of <0.01. Introns with a U12 signature were derived from the analysis performed by Zhang .

SNP/indel calling:

SNPs and indels were identified using the Genome Analysis Toolkit (GATK) pipeline (Van der Auwera ). Picard (version 2.10.9, http://broadinstitute.github.io/picard) was used to generate the sequence dictionary for the TAIR10 genome release. Reads were aligned to the TAIR10 genome using BWA-MEM (0.7.16a-r1181; Li 2013), with the added -M flag. The resulting SAM file was converted to BAM format, sorted, and duplicates were marked using Picard tools. The GATK (version 3.8-0-ge9d806836) haplotypeCaller was used to obtain the raw variants and the SelectVariants function was used to extract the SNPs and indels. SNPs were filtered using the following filter expression: “QD < 2.0 || FS > 60.0 || MQ < 40.0 || MQRankSum < −12.5 || ReadPosRankSum < −8.0.” The filter expression for indels was as follows: “QD < 2.0 || FS > 200.0 || ReadPosRankSum < −20.0.” SNPs and indels were intersected with the AtRTD2 annotated transcripts and the SUPPA events using in-house scripts. Any events with a SNP and/or indel overlapping with either the 5′ splice site or the 3′ splice site were removed from the final output. For the 5′ splice site, the last 3 exonic and the first 10 intronic bases were taken; for the 3′ splice the last 14 intronic bases and the first 3 exonic bases were used.

Analysis of alternative introns differentially regulated in prp4ka and sac3a mutants:

Introns were analyzed per alternative splicing event type, comparing the features of the differentially spliced introns against the introns of the same event type not changed in the mutants. Due to the limited size of the shared and same subgroups, the different types of alternative splicing events were grouped and compared to all events in the prp4ka and sac3a mutants. Splice-site strengths were evaluated by using position weight matrices (Sheth ).

PRP4KA-dependent first intron splicing:

The differentially regulated IR events in the prp4ka mutant were divided into two categories: the first introns of a transcript and all other remaining introns. Splice-site strengths for the first and the remaining introns were evaluated by using position weight matrices (Sheth ). The degrees of retention (expressed as PSI values) of the first and remaining introns were compared for the wild-type and prp4ka genotypes.

Isobaric tags for relative and absolute quantification analysis

Protein preparations and liquid chromatography–mass spectrometry:

Total protein was isolated from 1 g of 2-week-old seedlings growing on solid Murashige and Skoog medium as described above following a previously described protocol (Vélez-Bermúdez ). Protein treatment, protease digestion, and labeling prior to liquid chromatography (LC)–mass spectrometry (MS) (LC-MS) analysis were performed as described previously (Lan ) with minor modifications. Protein concentrations were measured using a Pierce 660 nm protein Assay kit (Thermo Scientific). Proteins in 8 M urea, 50 mM Tris-HCl, pH 8.5, were reduced in 10 mM DTT for 1 hr at 37° and Cys were alkylated in 50 mM iodoacetamide at room temperature for 30 min in the dark. The protein solution was then diluted to contain 4 M urea with 50 mM Tris-Cl, pH 8.5; digested with 250 units/ml Benzonase (Sigma-Aldrich, St. Louis, MO) at room temperature for 2 hr; followed by Lys-C (Wako, Osaka, Japan) digestion [1:200 weight by weight (w/w)] at room temperature for 4 hr. The protein solution was further diluted to contain <2 M urea with 50 mM Tris-Cl, pH 8.0, and incubated with modified trypsin (1:50 w/w; Promega, Madison, WI) at 37° overnight. The digested solution was acidified with 10% trifluoroacetic acid, desalted using an Oasis HLB cartridge (Waters Associates, Milford, MA), and dried using a SpeedVac. For phosphoproteome analysis, prior to isobaric tags for relative and absolute quantification (iTRAQ) labeling, phosphopeptides were enriched from digested proteins (3.5 mg) using TiO2 affinity chromatography (Titansphere Phos-TiO; GL Sciences) according to the method described by the vendor.

Peptide labeling with isobaric tags and fractionation:

Dissolution of dried peptides in dissolution buffer and labeling with iTRAQ reagents (Multiplex kit; AB Sciex) were performed according to the manufacturer’s instructions. Tryptic peptides from two different samples, prp4ka-4 and the wild-type T line, were labeled with iTRAQ 116 and 117 reagents, respectively. For the phosphoproteome analysis, the prp4ka-4 and T-line samples were labeled with 114 and 115 reagents, respectively, after the phosphopeptides were enriched using TiO2 affinity chromatography. The labeling reactions with iTRAQ reagents were incubated for 1 hr at room temperature. Following the reaction, solutions from different iTRAQ labels were combined and further fractionated on a strong cation-exchange (SCX) (PolySulfoethyl A, 4.6 × 200 mm, 5 µm, 200 Å; PolyLC) HPLC. The SCX chromatography was performed with an initial equilibrium buffer A containing 10 mM KH2PO4, 25% acetonitrile, pH 2.65, followed by a 0–15% buffer B (1 M KCl in buffer A, pH 2.65) gradient for 20 min, a 15–30% buffer B gradient for 10 min, a 30–50% buffer B gradient for 5 min, a 50–100% buffer B gradient for 1 min, and 100% buffer B for 5 min. The flow rate was 1 ml/min. The chromatography was recorded with absorbance 214 nm UV light. Fractions (0.5 min/fraction) were collected and pooled into 16 final fractions. Fractions were desalted using an Oasis HLB Cartridge (Waters) prior to LC-MS/MS analysis. Enriched phosphopeptide sample was fractionated using hydrophilic interaction liquid chromatography (HILIC) (TSKgel Amide-80 HR, 4.6 × 250 mm, 5 μm; Tosoh). The HILIC was performed in solvent containing acetonitrile and 0.1% trifluoroacetic acid with decreasing acetonitrile gradients: 90–85% in 5 min, 85–60% in 50 min, and 60–0% in 5 min, at flow rate of 0.5 ml/min. Ten fractions were collected for LC-MS/MS analysis.

LC-MS/MS analysis:

Peptides in each fraction were redissolved in 0.1% formic acid and the LC-MS/MS was performed using the Q Exactive Mass Spectrometer equipped with the Dionex UltiMate 3000 RSLCnano LC system or the LTQ-Orbitrap Fusion Lumos Mass Spectrometer equipped with the EASY-nLC system. A C18 capillary column (Acclaim PepMap RSLC, 75 μm × 250 mm; Thermo Scientific) was used to separate peptides with a 120-min linear gradient (from 3 to 35%) of solvent B (0.1% formic acid in acetonitrile) at a flow rate of 300 nl/min on the LC system. The MS was operated in the data-dependent mode with the top 10 (Q Exactive) or top 20 (Fusion Lumos) ions (charge states ≥2) for MS/MS analysis following an MS survey scan for each acquisition cycle. The selected ions were isolated in the quadrupole and subsequently fragmented using higher-energy C-trap dissociation (HCD) and then analyzed in the Orbitrap cell. The MS was set as follows on Q Exactive: mass-to-charge ratio range of 380–1800, resolving power of 70,000, automatic gain control (AGC) target of 3e6, and maximum ion trap (IT) of 30 ms. For the Fusion Lumos Mass Spectrometer, the MS was set as follows: resolving power of 120,000, AGC target of 4e5, maximum IT of 50 ms. The MS/MS was set as follows on Q Exactive: resolving power of 17,500, AGC target of 1e5, maximum IT of 200 ms, and HCD collision energy (NCE) of 30%. For the Fusion Lumos Mass Spectrometer, the MS was set as follows: resolution power of 15,000, AGC target of 5e4, maximum IT of 100 ms, and HCD NCE of 35%. For phosphopeptides, the HCD was set at 30 with 10% stepped NCE on Q Exactive, or at 35% NCE with 5% stepped NCE on the Fusion Lumos Mass Spectrometer.

Data analysis for protein identification and quantification:

Peptide identification was performed using the Proteome Discoverer software (version 2.1; Thermo Scientific) with the Sequest HT and Mascot (version 2.5; Matrix Sciences) search engines. MS data were searched against the AtRTD2 translation (Zhang ) database. Search conditions were set as follows: full trypsin digestion, two maximum missed cleavage allowed, precursor mass tolerance of 10 ppm, fragment mass tolerance of 20 mmu, dynamic modifications of oxidation (M), protein N-terminal acetylation, iTRAQ4plex (Y), static modifications of carbamidomethyl (C), and iTRAQ4plex (N terminus and K). The peptide spectrum matches (PSMs) were validated using the Percolator validator algorithm, which automatically conducted a decoy database search and rescored PSMs using q-values and posterior error probabilities. All PSMs were filtered with a q-value threshold of 0.05 (5% false discovery rate, FDR) or 0.01 (1% FDR) for proteome or phosphoproteome analysis, respectively. A q-value threshold of at least 0.01 was finally used to filter protein FDR for the proteome analysis. For comparative peptide:protein quantification, the ratios (114:115 and 116:117) of iTRAQ reporter ion intensities in MS/MS spectra of PSMs were used to calculate the fold changes between samples.

Statistical analysis of iTRAQ data

For each phosphoproteome analysis table made by the Proteome Discoverer software (biological replicates 1–3), phosphorylation sites were notated in the form of amino acid:protein:position according to reported peptide sequences, modifications, and master protein accessions. For example: (1) peptide ILSSLSR and modification Phospho [S3(100)] were interpreted as Phospho:S:AT1G01050.P2:24, (2) DEPAEESDGDLGFGLFD and Phospho [S7(100)] were interpreted as Phospho:S:AT1G01100.P2:102:AT4G00810.c1:103:AT5G47700.2:103 because of multiple master proteins, and (3) QSDTSPPPSPASK and Phospho [T/S] were interpreted as S:AT1G01320.1:146/149/153/156|T:AT1G01320.1:148 because of no explicit position of S. In so doing, all records related with the same phosphorylation site in the phosphoproteome analysis tables of biological replicates would result in the same notation, and its normalized abundances in the control and the treatment samples were added up for each replicate. In each phosphoproteome analysis table, log ratios of phosphorylation sites were computed based on the abundance sums and then transferred into Z-scores. In so doing, a phosphorylation site was associated with as many Z-scores as many times of detection in biological replicates. Assuming Z-scores close to zero as for unchanged abundance between the control and the treatment samples, every phosphorylation site with two or more Z-scores was tested for its deviation from “unchanged abundance” by testing the deviation of zero from its Z-scores using a model of the standard normal distribution. Note that the underlying null hypothesis assumes that the abundance of a phosphorylation site was not changed in all three replicates, and a significant P-value indicates altered abundance in the mutant samples. A similar method was applied to the proteome analysis tables for detecting proteins with altered abundance in the three replicates. Finally, the two statistical results were joined according to reported phosphorylation–protein relationships.

Gene ontology classification

Gene ontology (GO) classification of the genes affected in prp4ka and sac3a mutants and GO term overrepresentation tests were done using PANTHER software tools (Thomas ) (version 13.1 released February 3, 2018) available at http://pantherdb.org.

Effects of a prp4kb mutation on GFP expression and development

To test whether a homozygous mutation in PRP4KB (At1g13350), the paralog of PRP4KA, would affect GFP expression, we crossed the wild-type T line (T/T;B/B) with a homozygous prp4kb T-DNA insertion mutant (−/−;b/b) (SALK_035104C). In the prp4kb mutant, the T-DNA is inserted in the ninth exon, thus disrupting the PRP4 kinase domain. Self-fertilization of the F1 plants generated from this cross (genotype T/-;B/b; the dash indicates hemizygosity for the transgenic T locus) yielded a segregating F2 population. F2 seeds were germinated on solid Murashige and Skoog medium and screened ∼2 weeks later under a fluorescence stereomicroscope for GFP expression, which is observed with a genotype of either T/T or T/- [collectively written hereafter as T/(T)]. A subset of GFP-positive F2 seedlings was transferred to soil for genotyping to identify T/(T);b/b plants. To assess the effects of a prp4kb mutation on development, the prp4kb homozygous mutant was grown on soil next to age-matched prp4ka mutants and the wild-type T line. All plants were observed during the entire growth and reproductive phases and into senescence. Characters noted included flowering time, rosette diameter, final height of adult plant, stem/branch number, and silique number. To investigate the viability of double homozygous mutant plants (a/a; b/b), we crossed the homozygous prp4ka-4 mutant (T/T;a/a;B/B) to a prp4kb-1 homozygous plant (A/A;b/b). Both of these alleles are presumably nulls (Figure S1). Self-fertilization of the F1 plants resulting from this cross (genotype T/-;A/a;B/b) produced a segregating F2 population. The F2 seeds were germinated on solid Murashige and Skoog medium and prescreened under a fluorescence stereomicroscope for a GFP-weak phenotype [indicating a genotype of T/(T);a/a with the b allele segregating in the F2 population]. Selected GFP-weak F2 progeny were transferred to soil for genotyping to identify T/(T);a/a;b/b plants. Primers for detecting prp4kb-1 are listed in Table S1.

Data availability

Seeds of the homozygous T line can be acquired from the Arabidopsis Biological Resource Center (ABRC), Ohio State University, under the stock number CS69640. Seeds of the prp4ka and sac3a mutants will be submitted to ABRC upon acceptance of the article and are presently available on request from the Matzke laboratory. RNA and DNA sequencing data are available at the National Center for Biotechnology Information Sequence Read Archive under accession number SRP117313. The mass spectrometry proteomics data have been deposited to the ProteomeXchange Consortium via the PRIDE (Vizcaíno ) partner repository with the data set identifier PXD008580. Figure S1 shows phenotypes of sac3a, prp4ka, and prp4kb mutants; Figure S2 shows amino acid sequence alignments of PRP4K proteins in selected plant species; Figure S3 shows amino acid alignments of PRP4K proteins in model organisms; Figure S4 shows amino acid alignments of SAC3A proteins in selected plant species; Figure S5 shows amino acid alignments of SAC3A proteins in model organisms; Figure S6 shows a statistical analysis of features of introns affected by differential alternative splicing (DAS) in the prp4ka mutant; Figure S7 shows a statistical analysis of features of introns affected by DAS in the sac3a mutant; Figure S8 contains an analysis of alternative introns differentially regulated in both prp4ka and sac3a mutants; Figure S9 contains an analysis of first intron splicing in wild type and the prp4ka mutant; Figure S10 is a figure of the spliceosomal cycle and predicted positions of mutated factors identified in the screen; Table S1 shows primers used in this study; Table S2 shows mutants identified so far in the forward genetic screen; Table S3 shows spliceosomal and NineTeen Complex (NTC)-associated genes/proteins changing in expression, alternative splicing, and/or phosphorylation in prp4ka; Table S4 shows spliceosomal and NTC-associated genes changing in expression and/or alternative splicing in the sac3a mutant; Table S5 shows differentially expressed genes (DEGs) in the prp4ka and sac3a mutant; Table S6 shows IR events affected in the prp4ka and sac3a mutants; Table S7 shows ES events affected in the prp4ka and sac3a mutants; Table S8 shows alternative 5′ and 3′ splice-site events affected in the prp4ka or sac3a mutants; Table S9 shows exitron splicing events affected in prp4ka and sac3a mutants; Table S10 lists phosphorylation changes in the prp4ka mutant; Table S11 shows a GO analysis for genes affected in the prp4ka mutant; Table S12 shows a GO analysis for genes affected the in sac3a mutant; Table S13 shows a GO analysis for the shared set of genes affected in the prp4ka and sac3a mutants; and Table S14 lists flowering genes affected in the prp4ka mutant. Supplemental material available at Figshare: https://doi.org/10.25386/genetics.7171694.

Results

Alternatively spliced GFP reporter gene system

The alternatively spliced GFP reporter gene used in the forward genetic screen to identify splicing factors has been described previously (Sasaki ; Kanno , 2017a,b). Of three major transcripts issuing from the GFP reporter gene, only one, which results from splicing a U2-type intron with noncanonical AT–AC splice sites, corresponds to a translatable GFP mRNA (Figure 1). The AT–AC intron does not contain the highly conserved 5′ splice-site sequence and branch-point sequence typical of U12 introns (recognized by the minor U12 spliceosome) (Sasaki ) and hence is most likely spliced by the major U2 spliceosome, which is known to splice AT–AC introns in addition to canonical GT–AG introns (Burge ; Turunen ). Mutations in genes encoding splicing proteins can change the ratio of the three transcripts, giving rise to either a GFP-weak (gfw) or Hyper-GFP (hgf) phenotype relative to the intermediate level of GFP observed in the wild-type T line (Sasaki ; Kanno , 2017a,b). So far, we have reported five hgf and four gfw mutants, all of which are deficient in splicing-related factors predicted to act at various stages of the spliceosomal cycle and small nuclear ribonucleoprotein (snRNP) maturation pathway (Table S2). Here we describe two new mutants in the GFP-weak category: gfw5 and gfw6.
Figure 1

Alternatively-spliced GFP reporter gene used in genetic screen. Top: The T-DNA construct introduced into Arabidopsis comprises a GFP reporter gene under the transcriptional control of a minimal promoter (TATA) and upstream viral (EPRV) enhancer. In the wild-type T line, however, the expected transcription initiation site (gray arrow) is not used. Rather, transcription of GFP pre-mRNA initiates at a cryptic upstream promoter (black bar and arrow). Alternative splicing yields three GFP splice variants: an unspliced transcript, a transcript resulting from splicing of a canonical GT–AG intron, and a transcript arising from splicing a U2-type intron with noncanonical AT–AC splice sites, which are weakly recognized by the U2 spliceosome compared to canonical GT–AG splice sites (Crotti ). The unspliced and GT–AG transcripts contain numerous premature termination codons (*). Hence only the AT–AC transcript can be translated into GFP protein. The coding sequence of GFP protein (green bars) uniquely contains a 27 amino acid extension (short stippled green bars) compared to standard GFP (Fu ; Kanno ). Arrowheads denote a tandem repeat cluster upstream of the cryptic promoter. AUG designates the major translation initiation codon. The 3′ AT splice site is only 3 nt downstream of the 3′ AG splice site (Kanno , 2016, 2017a,b). Figure adapted from figure 1 in Kanno .

Alternatively-spliced GFP reporter gene used in genetic screen. Top: The T-DNA construct introduced into Arabidopsis comprises a GFP reporter gene under the transcriptional control of a minimal promoter (TATA) and upstream viral (EPRV) enhancer. In the wild-type T line, however, the expected transcription initiation site (gray arrow) is not used. Rather, transcription of GFP pre-mRNA initiates at a cryptic upstream promoter (black bar and arrow). Alternative splicing yields three GFP splice variants: an unspliced transcript, a transcript resulting from splicing of a canonical GT–AG intron, and a transcript arising from splicing a U2-type intron with noncanonical AT–AC splice sites, which are weakly recognized by the U2 spliceosome compared to canonical GT–AG splice sites (Crotti ). The unspliced and GT–AG transcripts contain numerous premature termination codons (*). Hence only the AT–AC transcript can be translated into GFP protein. The coding sequence of GFP protein (green bars) uniquely contains a 27 amino acid extension (short stippled green bars) compared to standard GFP (Fu ; Kanno ). Arrowheads denote a tandem repeat cluster upstream of the cryptic promoter. AUG designates the major translation initiation codon. The 3′ AT splice site is only 3 nt downstream of the 3′ AG splice site (Kanno , 2016, 2017a,b). Figure adapted from figure 1 in Kanno .

Recovery of prp4ka (gfw5) and sac3a (gfw6) mutants

The gfw5 and gfw6 mutants were identified by their GFP-weak phenotypes in a population of M2 seedlings (Figure 2A). NGM (James ) using pooled DNA isolated from at least 50 GFP-weak BC1F2 seedlings of the gfw5 and gfw6 mutants revealed homozygous recessive mutations in genes encoding PRP4KA and SAC3A, respectively. Subsequent Sanger sequencing of PRP4KA and SAC3A genes in additional unnamed gfw mutants identified a total of five prp4ka alleles and five sac3a alleles. The prp4ka alleles are the first to be isolated and hence are named prp4ka-1 to prp4ka-5 (Figure 3A). In view of two T-DNA insertion alleles previously published for sac3a (Lu ), the new sac3a alleles are designated sac3a-3 to sac3a-7 (Figure 3B). Complementation of the prp4ka-4 and sac3a-6 mutants with the respective wild-type coding sequences resulted in restoration of intermediate, wild-type levels of GFP fluorescence (Figure 2A), thus confirming that the prp4ka and sac3a mutations were responsible for the GFP-weak phenotypes of the respective mutants.
Figure 2

Molecular basis of GFP-weak phenotypes of prp4ka and sac3a mutants. (A) GFP-weak fluorescence in seedlings of prp4a and sac3a mutants (prp4ka-2/gfw5-2 and sac3a-3/gfw6-1). (B) Left: Semiquantitative RT-PCR to detect the three GFP splice variants (unspliced, GT–AG transcripts, and AT–AC transcripts) in prp4ka and sac3a mutants. Wild-type T line and nontransgenic Col-0 represent positive and negative controls, respectively; actin is the constitutively expressed control. Right: Percentages of the three major GFP RNA splice variants derived from an analysis of RNA-seq data (Table S5). The average of three biological replicates is shown for each sample. A two-sample t-test using the percentages of GFP RNA isoforms found a statistically significant difference between the amount of AT–AC and unspliced transcripts between the wild-type T line and the two mutants (P < 0.05). The total amount of GFP transcripts did not change significantly in prp4ka and sac3a mutants. (C) Western blotting to detect GFP protein in prp4ka and sac3a mutants. Total protein isolated from the indicated plant lines was separated by SDS-PAGE, blotted onto a membrane, and probed with a monoclonal antibody to GFP protein (top). The Coomassie brilliant blue-stained gel is shown as a loading control. The prominent ∼56-kDa band is presumed to be the large subunit of ribulose bisphosphate carboxylase. CBB, Coomassie brilliant blue; gDNA (T), genomic DNA of T line; RT−, without reverse transcriptase; RT+, with reverse transcriptase; T, wild-type T line (GFP-intermediate control); WT, wild type.

Figure 3

PRP4KA and SAC3A gene structures, positions of mutations, and protein domains. (A) The pre-mRNA of PRP4KA (At3g25840) is alternatively spliced. Two splice variants are annotated in TAIR (http://www.arabidopsis.org/index.jsp) and 14 splice variants are annotated in AtRTD2 (Zhang ). The reference transcript isoform At3g25840.1 encodes a 935 amino acid protein that contains RS protein (RSRP) superfamily domain and a catalytic domain of the serine/threonine kinase, pre-mRNA processing factor 4 (STKc_PRP4) domain (https://www.ncbi.nlm.nih.gov/). We identified the following prp4ka alleles in our screen: prp4ka-1 (M1I), prp4ka-2 (R237*), prp4ka-3 (W360*), prp4ka-4 (Q546*), and prp4ka-5 (splice-site acceptor, sixth intron) (Figure S2). All of these alleles, which encode defective mRNAs or, in one case, abolishes initiation of translation at the normal ATG start codon (the next methionine codon is 300 bp downstream), are likely to be nulls. (B) SAC3A (At2g39340) encodes a 1066 amino acid protein containing a conserved SAC3/GANP/THP3 domain (http://www.arabidopsis.org/). In budding yeast, this domain in the SAC3 protein integrates interactions between other proteins in the TREX complex that couples transcription and mRNA export (https://www.ebi.ac.uk/interpro/). We identified the following sac3a alleles: sac3a-3 (Q375*), sac3a-4 (W509*), sac3a-5 (splice-site donor, sixth intron), sac3a-6 (splice-site acceptor, eighth intron), and sac3a-7 (splice-site donor, 10th intron). The sac3a alleles all encode defective mRNAs and are likely to be nulls. Chr3, chromosome 3; 3′_ss, alternative 3′ splice-site acceptor; 5′_ss, alternative 5′ splice-site donor.

Molecular basis of GFP-weak phenotypes of prp4ka and sac3a mutants. (A) GFP-weak fluorescence in seedlings of prp4a and sac3a mutants (prp4ka-2/gfw5-2 and sac3a-3/gfw6-1). (B) Left: Semiquantitative RT-PCR to detect the three GFP splice variants (unspliced, GT–AG transcripts, and AT–AC transcripts) in prp4ka and sac3a mutants. Wild-type T line and nontransgenic Col-0 represent positive and negative controls, respectively; actin is the constitutively expressed control. Right: Percentages of the three major GFP RNA splice variants derived from an analysis of RNA-seq data (Table S5). The average of three biological replicates is shown for each sample. A two-sample t-test using the percentages of GFP RNA isoforms found a statistically significant difference between the amount of AT–AC and unspliced transcripts between the wild-type T line and the two mutants (P < 0.05). The total amount of GFP transcripts did not change significantly in prp4ka and sac3a mutants. (C) Western blotting to detect GFP protein in prp4ka and sac3a mutants. Total protein isolated from the indicated plant lines was separated by SDS-PAGE, blotted onto a membrane, and probed with a monoclonal antibody to GFP protein (top). The Coomassie brilliant blue-stained gel is shown as a loading control. The prominent ∼56-kDa band is presumed to be the large subunit of ribulose bisphosphate carboxylase. CBB, Coomassie brilliant blue; gDNA (T), genomic DNA of T line; RT−, without reverse transcriptase; RT+, with reverse transcriptase; T, wild-type T line (GFP-intermediate control); WT, wild type. PRP4KA and SAC3A gene structures, positions of mutations, and protein domains. (A) The pre-mRNA of PRP4KA (At3g25840) is alternatively spliced. Two splice variants are annotated in TAIR (http://www.arabidopsis.org/index.jsp) and 14 splice variants are annotated in AtRTD2 (Zhang ). The reference transcript isoform At3g25840.1 encodes a 935 amino acid protein that contains RS protein (RSRP) superfamily domain and a catalytic domain of the serine/threonine kinase, pre-mRNA processing factor 4 (STKc_PRP4) domain (https://www.ncbi.nlm.nih.gov/). We identified the following prp4ka alleles in our screen: prp4ka-1 (M1I), prp4ka-2 (R237*), prp4ka-3 (W360*), prp4ka-4 (Q546*), and prp4ka-5 (splice-site acceptor, sixth intron) (Figure S2). All of these alleles, which encode defective mRNAs or, in one case, abolishes initiation of translation at the normal ATG start codon (the next methionine codon is 300 bp downstream), are likely to be nulls. (B) SAC3A (At2g39340) encodes a 1066 amino acid protein containing a conserved SAC3/GANP/THP3 domain (http://www.arabidopsis.org/). In budding yeast, this domain in the SAC3 protein integrates interactions between other proteins in the TREX complex that couples transcription and mRNA export (https://www.ebi.ac.uk/interpro/). We identified the following sac3a alleles: sac3a-3 (Q375*), sac3a-4 (W509*), sac3a-5 (splice-site donor, sixth intron), sac3a-6 (splice-site acceptor, eighth intron), and sac3a-7 (splice-site donor, 10th intron). The sac3a alleles all encode defective mRNAs and are likely to be nulls. Chr3, chromosome 3; 3′_ss, alternative 3′ splice-site acceptor; 5′_ss, alternative 5′ splice-site donor. The positions of the prp4ka and sac3a mutations are shown in Figure 3. PRP4KA, which encodes a protein 935 amino acids in length, has two paralogs in Arabidopsis: PRP4KB (At1g13350; 834 amino acids) and At3g53640, which is an intronless, unexpressed pseudogene. Both PRP4KA and PRP4KB are ubiquitously expressed, with PRP4KA having a higher expression level than PRP4KB (http://bar.utoronto.ca/efp/cgi-bin/efpWeb.cgi). SAC3A has two expressed paralogs in Arabidopsis, SAC3B (At3g06290) and SAC3C (At3g54380), which are more closely related to each other than to SAC3A (Lu ). Coexpression analysis using both the ATTED database version 9.2 (http://atted.jp/; CoExSearch function) and the Expression Angler tool (http://bar.utoronto.ca/; AtGenExpress Plus – Extended Tissue Compendium) indicated that PRP4KA and SAC3A are highly coexpressed in Arabidopsis. Amino acid sequence alignments of PRP4KA and SAC3A orthologs in selected plant species and model organisms are shown in Figures S2–S5, respectively.

Characterization of prp4ka and sac3a mutants

Semiquantitative RT-PCR was used to investigate the splicing pattern of GFP pre-mRNA in prp4ka-4 and sac3a-6 mutants. Relative to the wild-type T line, the mutants accumulated reduced amounts of the translatable AT–AC GFP transcript and increased levels of the unspliced, untranslatable transcript (Figure 2B). Western blot analysis demonstrated decreased levels of GFP protein in prp4ka and sac3a mutants (Figure 2C). These results are consistent with the GFP-weak phenotype of M2 seedlings. The morphological phenotypes of the prp4ka and sac3a mutants were examined during vegetative and reproductive phases. Whereas the sac3a mutants were largely indistinguishable from wild-type plants, the prp4ka mutants displayed a pleiotropic phenotype typified by somewhat flat, darker green rosettes, late flowering, tall final stature, lowered seed set, and reduced branching (Figure 4, A and B, and Figure S1, A and B). The lowered seed set in prp4ka mutants reflected both fewer seeds per silique and fewer siliques per plant (Figure S1B). The aberrant traits of prp4ka mutants returned to more wild-type levels in complemented plants (Figure 4C and Figure S1, A and B), indicating that the prp4ka mutations are indeed largely responsible for the abnormal phenotype of the corresponding mutants.
Figure 4

Phenotypic analysis of prp4ka and sac3a mutants. (A and B) The sac3a mutants appear largely normal by the measured criteria: transition to flowering (bolting), final height of adult plant, seed weight, and branch (stem) number. The prp4ka mutants feature delayed flowering, lowered seed set, reduced branching, a tall final stature, and somewhat flat, darker green rosettes [numerical values in Figure S1A (prp4ka and sac3a experiment) and Figure S1B (prp4kb experiment)]. (B) Tall stature is not visible in the photograph, which shows age-matched, wild-type (WT) and mutant plants, but is apparent in fully grown prp4ka plants (Figure S1A). (C) Complementation of the prp4ka mutants with a 35Spro-PRP4KA transgene restores a normal phenotype. Particularly visible in the age-matched samples shown here is the late transition to flowering and somewhat flat, darker green rosettes in the prp4ka mutant (left) compared to the complemented lines (right), which also have normal branching patterns (Figure S1B).

Phenotypic analysis of prp4ka and sac3a mutants. (A and B) The sac3a mutants appear largely normal by the measured criteria: transition to flowering (bolting), final height of adult plant, seed weight, and branch (stem) number. The prp4ka mutants feature delayed flowering, lowered seed set, reduced branching, a tall final stature, and somewhat flat, darker green rosettes [numerical values in Figure S1A (prp4ka and sac3a experiment) and Figure S1B (prp4kb experiment)]. (B) Tall stature is not visible in the photograph, which shows age-matched, wild-type (WT) and mutant plants, but is apparent in fully grown prp4ka plants (Figure S1A). (C) Complementation of the prp4ka mutants with a 35Spro-PRP4KA transgene restores a normal phenotype. Particularly visible in the age-matched samples shown here is the late transition to flowering and somewhat flat, darker green rosettes in the prp4ka mutant (left) compared to the complemented lines (right), which also have normal branching patterns (Figure S1B).

RNA-seq analysis

To analyze the genome-wide effects of homozygous prp4ka and sac3a mutations on differential gene expression and alternative splicing, we carried out RNA-seq using total RNA isolated from 2-week-old seedlings of the homozygous prp4ka-4 and sac3a-6 mutants (BC1F3 generation) and the wild-type T line. All samples were run in biological triplicate. The RNA-seq data confirmed the findings obtained from semiquantitative RT-PCR, showing reduced splicing efficiency of GFP pre-mRNA. The amount of AT–AC transcript decreased significantly in prp4ka-4 and sac3a-6 mutants compared to the wild type. By contrast, the level of unspliced, untranslatable transcript increased significantly in prp4ka-4 and sac3a-6 mutants relative to the wild type (Figure 2B). The findings from a genome-wide analysis of DEGs and DAS are summarized in Table 1. A number of splicing-related factors were identified in this analysis and are compiled separately for prp4ka (Table S3) and sac3a (Table S4). DEGs numbered 1571 in the prp4ka and 3046 in the sac3a mutant (Table 1). Of these, around a quarter (407) was shared between the prp4ka and sac3a mutants, but the direction of change (up or down) was not always the same in both mutants (Table 1 and Table S5). Upregulated genes in the prp4ka mutant included SAC3A and the putative U1 snRNP component PRP39A (Table S5), which was identified previously in the same genetic screen that retrieved the prp4ka and sac3a mutants (Table S2).
Table 1

Summary of DEGs and DAS events in the prp4ka and sac3a mutants

Ref, reference; 5′_ss, alternative 5′ splice-site donor; 3′_ss, alternative 3′ splice-site acceptor.

Number of DEGs in the sac3a and prp4ka mutants using an FDR <0.05.

The major alternative splicing events are illustrated to the right. Regions included or excluded due to alternative splicing are shown in gray. The numbers of DAS events observed in each mutant are indicated in the middle columns. Overlap columns show the numbers of DEGs and DAS events shared between the prp4ka and sac3a mutants.

Ref, reference; 5′_ss, alternative 5′ splice-site donor; 3′_ss, alternative 3′ splice-site acceptor. Number of DEGs in the sac3a and prp4ka mutants using an FDR <0.05. The major alternative splicing events are illustrated to the right. Regions included or excluded due to alternative splicing are shown in gray. The numbers of DAS events observed in each mutant are indicated in the middle columns. Overlap columns show the numbers of DEGs and DAS events shared between the prp4ka and sac3a mutants. DAS was detected for 1225 and 533 genes in the prp4ka and sac3a mutants, respectively (Tables S6–S9). The numbers of overlapping genes affected by both DEG and DAS events in the mutants are shown in Figure 5. Whereas about a quarter (390) of prp4ka DEGs were also differentially spliced, only ∼3.7% (113) of sac3a DEGs displayed changes in alternative splicing (Figure 5). A total of 206 genes showed DAS in both mutants (Tables S6–S9).
Figure 5

Venn diagrams showing distribution of genes affected in the prp4ka and sac3a mutants. (A) Venn diagram for genes affected in the prp4ka mutant (DEGs, DAS genes, genes encoding proteins with changes in phosphorylation). (B) Venn diagram for genes affected in the sac3a mutant (DEGs and DAS genes). (C) Venn diagram for all genes affected in the prp4ka (DEGs, DAS genes, changes in phosphorylation) and sac3a (DEGs and DAS genes) mutants.

Venn diagrams showing distribution of genes affected in the prp4ka and sac3a mutants. (A) Venn diagram for genes affected in the prp4ka mutant (DEGs, DAS genes, genes encoding proteins with changes in phosphorylation). (B) Venn diagram for genes affected in the sac3a mutant (DEGs and DAS genes). (C) Venn diagram for all genes affected in the prp4ka (DEGs, DAS genes, changes in phosphorylation) and sac3a (DEGs and DAS genes) mutants. In total, 1905 and 788 instances of DAS were detected in the prp4ka and sac3a mutants, respectively (Tables S6–S9). IR represented the most common DAS event, comprising 1402 cases of differential IR in the prp4ka mutant and 484 in the sac3a mutant (Table 1 and Table S6). Of these, 123 IR events were shared and the direction of the change for ∼71.5% of them (88 IRs) was the same in both mutants. The vast majority (95.8%) of IRs affected in the prp4ka mutant and ∼64% of IRs in the sac3a mutant showed higher retention in comparison to controls (Table S6). ES was represented by relatively few events: 38 in the prp4ka and 26 in sac3a, 4 of which were shared by both mutants although the direction of change was not always the same (Table 1 and Table S7). Several hundred events involving alternative 5′ and 3′ splice-site selection were detected in both mutants (Table 1 and Table S8). A total of 32 alternative 5′ splice-site selection and 11 alternative 3′ splice-site selection events were shared between prp4ka and sac3a mutants, and the direction of the change was the same for 24 (75%) and 7 (∼63%) of them, respectively (Table S8). Exitrons were represented by 44 and 42 cases in the prp4ka and sac3a mutants, respectively (Table 1 and Table S9). Eight exitrons overlapped in the two mutants, and all but one, At1g77080, was regulated in the same direction (Table S9). Of all DAS events, only 13 in the prp4ka and 3 in the sac3a mutants involved a U12 intron; the remainder (1892 in prp4ka and 785 in sac3a) entailed U2 introns.

Analysis of alternative introns differentially regulated in prp4ka and sac3a mutants

We analyzed the guanine-cytosine (GC) content, length, and splice-site strength of introns affected in the prp4ka (Figure S6) and sac3a (Figure S7) mutants and in two subgroups of alternative splicing events regulated in both mutants: the “shared”-subgroup of 178 events regulated in both mutants, and the “same”-subgroup of 128 events regulated in the same direction (Figure S8). This analysis (Figures S6 and S7) revealed that, in both the prp4ka and sac3a mutants, the more retained-introns have significantly lower 5′ splice-site scores (i.e., weaker 5′ splice sites; two-sample t-test, P < 0.001), whereas the 3′ splice-site strength of more-retained introns in prp4ka is slightly, but significantly, higher (two-sample t-test, P < 0.001). Additionally, the more-retained introns of both mutants exhibit a slight, but significant, increase in GC content and the more-retained introns in prp4ka are significantly longer (two-sample t-test, P < 0.001). The alternatively regulated 5′ and 3′ splice-site events in both mutants exhibited no significant difference in splice-site strength. However, the selection of the lesser-used alternative 5′ or 3′ splice site results in significantly longer introns (two-sample t-test, P < 0.001). The introns regulated in both the prp4ka and sac3a mutants (shared- and same-subgroups) are significantly (Wilcoxon test, P < 0.05) longer than the introns regulated in either one of the separate mutants (Figure S8A). The GC content of the affected introns in prp4ka is lower than those regulated in the sac3a mutant (Wilcoxon test, P < 0.05) and in both mutants (Figure S8B; Wilcoxon test, P < 0.05). The regulated introns in sac3a have on average a slightly higher 5′ splice-site score (i.e., stronger 5′ splice sites) (Figure S8C; Wilcoxon test, P < 0.001), whereas their 3′ splice-site score is slightly lower than the regulated introns in prp4ka (Wilcoxon test, P < 0.001) and the regulated introns in both mutants (Figure S8D; Wilcoxon test, P = 0.04).

Analysis of first intron splicing in the prp4ka mutant

A previous study in fission yeast found that Prp4 kinase is required for recognition and splicing of a subset of first introns that have weak 5′ splice sites and branch-point sequences (Eckert ). We evaluated whether the same trend might be observed with the significantly differentially retained introns identified in the prp4ka mutant (Figure S9). The regulated introns were split up into those affecting the first introns of a gene and all other (the remaining) introns. There was no significant difference in the 5′ splice-site strength in the first and the remaining introns, whereas the 3′ splice-site score of the first introns was slightly, but significantly, higher (Wilcoxon test, P = 0.014). The first introns in the prp4ka mutant exhibit significantly higher retention rates compared to the remaining introns (Wilcoxon test, P < 0.001), whereas this behavior was not observed for the first and remaining retained introns in wild type. Out of all differentially retained introns in the prp4ka mutant, 41.7% (585 out of 1402) comprised first introns. Evaluation of all IR events annotated in the AtRTD2 transcriptome revealed that 27.2% describe the retention of a first intron, which is substantially lower (Chi-square goodness-of-fit test, P < 2.2e−16) than the 41.7% observed in the prp4ka mutant.

Peptide phosphorylation changes detected in the prp4ka mutant

To identify potential substrates of PRP4KA, we used the iTRAQ method (Lan ) to perform a quantitative phosphoproteomic analysis on total protein isolated from 2-week-old seedlings of the prp4ka-4 mutant and from the wild-type T line. The experiments were performed using three independent biological replicates. Search of the mass spectrometry data against the AtRTD2 translation (Zhang ) database identified 1059 peptides in proteins encoded by 396 genes. Peptides showing statistically significant changes in phosphorylation in at least two of the three experiments are listed in Table S10. The numbers of overlapping genes/proteins affected by DEG, DAS, or phosphorylation changes in the prp4ka mutant are shown in Figure 5A. Twenty splicing-related factors, including five SR proteins (At-SR30, At-RS41, At-RS40, At-SCL33, and At-SCL30A), were identified in the iTRAQ analysis as were a number of other RNA-binding proteins (Table 2 and Table S10). Two splicing factors, AtGRP7 and FLK, showed changes in both phosphorylation levels and alternative splicing in the prp4ka mutant (Table S3).
Table 2

Phosphorylation changes in selected splicing factors and RNA-binding proteins in the prp4ka-4 mutant

Peptide sequenceaIdentifierbNameFunctionReference
Lose phosphorylation
 [-].MESTESYAAGSPEELAK.[R]AT5G04430.ID11BTR1LNOVA-like RNA-binding proteinde la Fuente van Bentem et al. (2006)
 [-].MESTESYAAGSPEELAKR.[S]AT5G04430.JS1
 [-].MESTESYAAGSPEELAKRSPEPHDSSEADSAEKPTHIR.[F]AT5G04430.JS4
AT5G04430.P1
AT5G04430.P2
 [K].EGGGYSFFPSPSANGAQGALTYQ.[-]AT3G21215.P1RNA-binding (RRM/RBD/RNP motifs) family proteinA putative RNA splicing protein similar to mec-8AceView (https://www.ncbi.nlm.nih.gov/IEB/Research/Acembly/)
 [K].RKEGGGYSFFPSPSANGAQGALTYQ.[-]
 [R].AASPQIRSTPEIDSSQYLTELLAEHQK.[L]AT2G38610.P2RNA-binding KH domain-containing proteinSimilar to human QKI proteins; regulation of pre-mRNA splicing, export of target RNAs from the nucleus, translation of proteins, and RNA stabilityAceView (https://www.ncbi.nlm.nih.gov/IEB/Research/Acembly/)
Quaking-like 3
 [R].SVPSSPGPNWLNSPGSSSGLIAK.[R]AT5G56140.P1RNA-binding KH domain-containing protein
 [R].SVPSSPGPNWLNSPGSSSGLIAKR.[T]Quaking-like 2
 [K].IFVGGISYSTDEFGLR.[E]AT1G74230.ID1GR-RBP5Glycine-rich RNA-binding protein; hnRNP familyTable S1 in Koncz et al. (2012)
AT1G74230.ID4
 [R].SGGGGGYSGGGGSYGGGGGR.[R]AT2G21660.P1CCR2/ATGRP7Glycine-rich RNA-binding proteinTable S1 in Koncz et al. (2012)
 [R].SGGGGGYSGGGGSYGGGGGRR.[E]
 [K].VVVAYGGTPIHQQLR.[E]AT3G58510.3DEA(D/H)-box RNA helicase family proteinB complex associatedTable S1 in Koncz et al. (2012)
 [R].FSPSVDR.[Y]AT1G09140.CR4At-SR30SR protein, SR subfamilyBarta et al. (2010); Table S1 in Koncz et al. (2012)
 [R].FSPSVDRYSSSYSASR.[A]AT1G09140.ID154
AT1G09140.ID155
AT1G09140.ID157
AT1G09140.ID85
AT1G09140.P1
AT1G09140.P2
AT1G09140.P3
 [K].DDDSRGNGYSPER.[R]AT5G52040.ID8At-RS41SR proteins, plant-specific RS subfamilyde la Fuente van Bentem et al. (2006); Barta et al. (2010); Table S1 in Koncz et al. (2012)
 [R].GNGYSPER.[R]AT5G52040.ID14
 [R].GNGYSPERR.[R]
 [RK].ERTSPDYGR.[GR]
 [RK].ERTSPDYGR.[GR]AT4G25500.4At-RS40
 [M].SGGLDMSLDDIIK.[S]AT5G02530.2ALY2TREX complexTable S1 in Koncz et al. (2012)
AT5G02530.P1
 [R].NTFDENVDSNNNLSPSASQGIGAPSPYSYAAVLGSSLSR.[N]AT2G29200.P1Pumilio 1 (PUM1)PUF proteins regulate both mRNA stability and translation through sequence-specific binding to 3′ UTRs of target mRNA transcripts
 [R].SGSAPPTVDGSVSAAGGLFSGGGGAPFLEFGGVNK.[G]
 [R].SGSAPPTVDGSVSAAGGLFSGGGGAPFLEFGGVNKGNGFGGDDEEFR.[K]
 [K].NNLSPSASQGIGAPSPYSYAAVLGSSLSR.[N]AT2G29190.2Pumilio 2 (PUM2)
 [R].SGSAPPTVDGSVSAAGGLFSGGGGAPFLEFGGGNK.[G]
 [R].SGSAPPTVDGSVSAAGGLFSGGGGAPFLEFGGGNKGNGFGGDDEEFR.[K]
 [K].SIADMIQRPHSAGNRPIAQDIHAISSDTSSEHAR.[R]AT3G20250.c2Pumilio 5 (PUM5)
AT3G20250.c3
AT3G20250.c4
AT3G20250.ID3
AT3G20250.ID4
AT3G20250.JC6
 [R].DSPTSQPVPIVALATR.[L]AT1G49760.2Poly(A) binding protein 8 (PAB8)mRNA binding proteinTable S1 in Koncz et al. (2012)
 [R].DVNTMPGPTQNMLSVPYDVSSGGGVHHRDSPTSQPVPIVALATR.[L]
 [R].ELSPTGLDSSPR.[D]AT3G51950.ID3Zinc finger (CCCH-type) family protein/RNA recognition motif (RRM)-containing proteinRibonuclease activityAddepalli and Hunt (2008)
 [R].ELSPTGLDSSPRDVLGGR.[G]
 [R].SGSCVLDGLGYGGDSDLGFGGVPCSYFAR.[G]
 [K].FRIPSPGDVYNR.[T]AT5G60170.1RNA-binding (RRM/RBD/RNP motifs) family proteinSimilar to human CCR4-NOT transcription complex, subunit 4AceView (https://www.ncbi.nlm.nih.gov/IEB/Research/Acembly/)
 [R].NLSPSLNDPYGFSSR.[L]AT5G60170.ID35
Gain phosphorylation
 [K].GNPLLNTPTSFSVK.[R]AT3G13200.P1Splicing factor Cwf15/Cwc15NTC-associatedTable S1 in Koncz et al. (2012)
 [K].KSLNRSPPSYGSHPR.[G]AT3G13224.P2RNA-binding glycine-rich protein D3 (RBGD3)Glycine-rich RNA-binding protein; hnRNP family
 [K].SLNRSPPSYGSHPR.[G]AT3G13224.P3
 [K].NGSVSGTELVEDDHER.[A]AT5G16260.c1Early flowering 9 (ELF9)17S U2 snRNPTable S1 in Koncz et al. (2012)
 [R].LKNGSVSGTELVEDDHER.[A]AT5G16260.P1
 [K].VEDEEGIPEHLESLQK.[S]AT3G04610.ID6Flowering locus KH domain (FLK)hnRNP E1/E2Table S1 in Koncz et al. (2012)
AT3G04610.JC12
AT3G04610.JC3
AT3G04610.P2
AT3G04610.P4
AT3G04610.s3
 [R].EEGSPMSGSISPYNSLGMK.[R]AT4G26480.P1RNA-binding KH domain-containing proteinSimilar to human QKI proteins; regulation of pre-RNA splicing, export of target RNAs from the nucleus, translation of proteins, and RNA stabilityAceView (https://www.ncbi.nlm.nih.gov/IEB/Research/Acembly/)
Quaking-like 1
 [R].STPEIDSSQYLTELLAEHQK.[L]AT2G38610.P2RNA-binding KH domain-containing protein
Quaking-like 3
 [R].EEGSPMSGSVSPYNSLGMK.[R]AT5G56140.P1RNA-binding KH domain-containing protein
Quaking-like X3
 [R].SYTPSPPR.[G]AT1G55310.2At-SCL33SR proteins, plant-specific SCL subfamilyde la Fuente van Bentem et al. (2006); Barta et al. (2010); Table S1 in Koncz et al. (2012)
AT1G55310.3
AT1G55310.c2
AT1G55310.c3
AT1G55310.CR7
AT1G55310.ID1
AT1G55310.ID3
AT1G55310.P1
AT1G55310.P3
 [R].SYTPSPPRGYGR.[R]AT3G13570.c1At-SCL30A
AT3G13570.CR2
AT3G13570.P1
AT3G13570.SR1
 [R].MLQSGMPLDDRPEGQRSPSPEPVYDNMGIR.[I]AT5G51300.1atSF1/BBP splicing factor 1Splice site selectionde la Fuente van Bentem et al. (2006); Table S1 in Koncz et al. (2012)
 [R].SPSPEPVYDNMGIR.[I]
 [R].SGSAPPTVDGSVSAAGGLFSGGGGAPFLEFGGVNK.[G]AT2G29200.P1pumilio 1 (PUM1)PUF proteins regulate both mRNA stability and translation through sequence-specific binding to 3′ UTRs of target mRNA transcripts
 [R].SGSAPPTVDGSVSAAGGLFSGGGGAPFLEFGGGNK.[G]AT2G29190.2Pumilio 2 (PUM2)
 [R].DAALGSQLSRPASCNTFR.[D]AT3G10360.JC2Pumilio 4 (PUM4)
 [R].GNFSPGSSPSGMDSR.[D]AT3G21100.2RNA-binding (RRM/RBD/RNP motifs) family protein
AT3G21100.ID3
AT3G21100.ID8
 [K].DSNVTPDDDVSGMRSPSAFFK.[H]AT3G13300.P1Varicose (VCS)Involved in mRNA decapping
 [K].VFCSQVSNLSTEMAR.[D]AT3G13300.P2
 [R].DCYPSTEGTFIPGESK.[A]AT3G13300.P3
 [K].SSSAADSYVGSLISLTSK.[S]AT1G26110.2Decapping 5 (DCP5)mRNA decapping
AT1G26110.ID1
AT1G26110.ID2
AT1G26110.ID5
AT1G26110.P1
 [K].SPVATTQQLPK.[V]AT1G79280.1Nuclear pore anchor (NUA)mRNA exportTable S1 in Koncz et al. (2012)
 [R].VPSSTPLIKSPVATTQQLPK.[V]AT1G79280.2
 [R].VPSSTPLIK.[S]AT1G79280.3
 [K].VVMTPDTPSK.[G]AT3G62800.2Double-stranded-RNA-binding protein 4 (DRB4)A nuclear dsRNA-binding protein DRB4 that interacts specifically with DCL4
AT3G62800.P3
 [R].DGPGPLHSPAVSK.[S]AT5G57870.2Eukaryotic translation initiation factor isoform 4G1 (eIFiso4G1)RNA metabolic process
 [R].RDGPGPLHSPAVSK.[S]AT5G57870.P1

NOVA-1, a mammalian, neuron-specific regulator of alternative splicing containing three K homology domains; mec-8, a Caenorhabditis elegans protein that regulates alternative splicing of unc-52; KH, K homology.

Serines (S), threonines (T), and tyrosines (Y) in bold font and which are underlined indicate the phosphorylated residues detected by iTRAQ. The peptides listed showed statistically significant changes in phosphorylation in at least two out of three separate iTRAQ experiments (Table S10). The amino acids before and after the tryptic peptide in the protein sequence are annotated by brackets and separated by dots.

Gene models (identifiers) are according to the AtRTD2 transcriptome annotation (Zhang ). Reference gene models are shown in bold font. For a fuller list of RNA metabolism-related proteins identified in the iTRAQ analysis see Tables S2 and S10 (see “Keyword RNA”).

NOVA-1, a mammalian, neuron-specific regulator of alternative splicing containing three K homology domains; mec-8, a Caenorhabditis elegans protein that regulates alternative splicing of unc-52; KH, K homology. Serines (S), threonines (T), and tyrosines (Y) in bold font and which are underlined indicate the phosphorylated residues detected by iTRAQ. The peptides listed showed statistically significant changes in phosphorylation in at least two out of three separate iTRAQ experiments (Table S10). The amino acids before and after the tryptic peptide in the protein sequence are annotated by brackets and separated by dots. Gene models (identifiers) are according to the AtRTD2 transcriptome annotation (Zhang ). Reference gene models are shown in bold font. For a fuller list of RNA metabolism-related proteins identified in the iTRAQ analysis see Tables S2 and S10 (see “Keyword RNA”).

GO analyses of genes affected in prp4ka and sac3a mutants

The overrepresentation test of GO terms for all 2768 genes whose expression is affected at different levels (DE, DAS, and changes in phosphorylation) in the prp4ka mutant (Figure 5A) shows enrichment of RNA-processing and splicing-related terms (Table S11). Although these terms were not overrepresented in the prp4ka DEGs (1571 genes), they were among the most highly enriched GO terms for DAS genes (1225). Similarly, for the set of proteins with phosphorylation changes (396 genes), RNA-processing and splicing-related terms were also overrepresented (Table S11). For the 3466 genes affected (DE and DAS) in the sac3a mutant (Figure 5B), the overrepresented terms included “RNA binding protein” and “spliceosomal complex” (Table S12). For the 3046 DEGs in sac3a, similar terms were overrepresented with the exception of spliceosomal complex. By contrast, significant enrichment of splicing-related terms was observed for the 533 DAS genes (Table S12). For genes/proteins affected by DEG, DAS, or phosphorylation in prp4ka (2768) and DEG or DAS in sac3a (3466), 731 were shared (Figure 5C). GO analysis showed enrichment of the terms “nuclear speckle” and spliceosomal complex (Table S13).

Tests of a prp4kb mutation on GFP expression and plant phenotype

To investigate whether a homozygous mutation in PRP4KBwould confer a GFP-weak phenotype similar to prp4ka mutations, we performed the breeding scheme described in the Materials and Methods. Of 23 GFP-intermediate F2 plants descending from a cross between a homozygous prp4kb-1 mutant (−/−; b/b) and the wild-type T line (T/T; B/B), four (17.4%, expected percentage 25%) were found to be homozygous for the prp4kb-1 mutation. The finding of homozygous b/b F2 plants with intermediate, wild-type levels of GFP fluorescence demonstrates that a prp4ka mutation does not weaken GFP expression. Homozygous prp4kb-1 plants appear normal, in contrast to the aberrant phenotype of prp4ka mutants (Figure S1, B and C). To assess the viability of plants homozygous for both the prp4ka-4 and prp4kb-1 mutations, we performed the breeding strategy described in the Materials and Methods section. F3 progeny of a T/(T);a/a;B/b plant were prescreened for a GFP-weak phenotype [indicating homozygosity of the prp4ka-4 allele or T/(T);a/a]. We genotyped 54 GFP-weak F3 progeny for the prp4kb-1 allele and found 5 that were heterozygous for the prp4kb-1 mutation [T(T);a/a;B/b]. However, no doubly homozygous F3 progeny [T/(T);a/a;b/b] were identified. If the double homozygous mutant is viable, the expected number of T/(T);a/a;b/b F3 progeny in a population of 54 plants would be 13–14 (25%). These results suggest that the double homozygous mutant is not capable of survival. However, the number of B/b heterozygotes obtained in the F3 population (5 out of 54 or 9.25%) was also lower than expected (27 out of 54 or 50%), which may indicate that the b allele is not transmitted well in the a/a mutant background.

Discussion

In a forward genetic screen for mutants showing modified splicing of an alternatively spliced GFP reporter gene in Arabidopsis, we recovered loss-of-function mutations in the genes encoding the dual-specificity protein kinase PRP4KA and the putative mRNA nuclear export factor SAC3A. Both the prp4ka and sac3a mutants were identified by their GFP-weak phenotypes, which are due—at least in part—to diminished splicing efficiency of GFP pre-mRNA. PRP4KA and SAC3A have not been identified in any prior forward genetic screen in Arabidopsis or studied previously for their roles in pre-mRNA splicing in plants. It is unclear why this particular screen repeatedly retrieved mutants defective in these two genes, but the findings clearly demonstrate the contributions of PRP4KA and SAC3A to GFP pre-mRNA splicing and to GFP expression.

PRP4KA

PRP4K-related proteins are present in most eukaryotes with the prominent exception of the fungal group Hemiascomycetes, which contains budding yeast. Prp4 kinase is an essential gene in fission yeast (Alahari ; Lützelberger and Käufer 2012) and in metazoans (Dellaire ). By contrast, our study indicates that PRP4KA is not essential in Arabidopsis. Prp4 kinase is also not necessary for growth in the wheat scab fungus Fusarium graminaerum but it is needed for efficient splicing (Gao ). Although the prp4ka alleles we identified are most likely genetic nulls, the respective mutants are viable and fertile. Nevertheless, they show an obvious pleiotropic phenotype, the molecular basis of which remains to be established. The DEG, DAG, and protein phosphorylation lists may suggest candidate genes for follow-up studies; for example, these lists contain a number of flowering-related genes, which may contribute to the late-flowering phenotype (Table S14). The failure of a prp4kb mutation to visibly affect either GFP expression or plant morphology and development rules out extensive functional redundancy of PRP4KA and PRP4KB. This conclusion is supported by the fact that we retrieved five independent mutant alleles of prp4ka in our screen but not a single mutant allele of prp4kb. The inability to recover prp4ka prp4kb double mutants indicates that at least one wild-type copy of a PRP4K gene is essential for plant viability. However, this possibility needs to be examined more thoroughly in the future by reassessing the apparent weak inheritance of the prp4kb-1 allele in the homozygous prp4ka-4 mutant, which itself displays reduced fertility as evidenced by a lowered seed set. Genetic studies in fission yeast (Bottner ) and F. graminaerum (Gao ) as well as biochemical analyses in human cells (Schneider ; Boesler ) established that Prp4 kinase transiently associates with the spliceosome as a component of the precatalytic B complex and facilitates the transition to the catalytically active B* (or Bact) complex (Schneider ). Determining whether PRP4KA has a similar role in splicing in Arabidopsis will require the development of methods for isolating the cognate spliceosomal complexes from plant cells. Although detailed biochemical analyses of plant spliceosomes await further technical advances, a PRP4KA–GFP fusion protein in Arabidopsis was localized to nuclear speckles, which are enriched in splicing factors, thus further substantiating a role for PRP4KA in splicing (Koroleva ). Both PRP4KA and another splicing factor identified previously in this screen, SMU1 (Kanno ), are placed in the category “recruited prior to Bact” (the spliceosomal complex preceding catalytic B*) in a compilation of known and predicted splicing factors in Arabidopsis (table S1 in Koncz ). In human cells, Smu1, like Prp4k, is most abundant in the precatalytic B complex (Wahl and Lührmann 2015) and has been proposed to act by recognizing splicesomal targets for ubiquitination (Higa ). Based on these findings from human cells, one can speculate that PRP4KA and SMU1 in Arabidopsis are likewise components of the precatalytic B complex and are involved in triggering different post-translational modifications (phosphorylation and ubiquitination, respectively) important for assembly of Bact and the catalytically active B* complex (Figure S10).

SAC3A

Sac3 proteins are evolutionarily conserved members of the transcription-export (TREX) complex, which was first defined in budding yeast as a complex coupling transcription to mRNA export from the nucleus (Strässer ). In budding yeast, Sac3 is not an essential gene (Bauer and Kölling 1996) and, likewise, SAC3A is dispensable in Arabidopsis. The sac3a mutants we identified are viable, fertile, and generally appear indistinguishable from wild-type plants. A previous study also reported that a T-DNA insertion mutant of sac3a does not have a morphological mutant phenotype (Lu ). In Arabidopsis, SAC3A and another member of the SAC3 family, SAC3B, have been detected as constituents of the TREX-2 complex (Lu ). Unexpectedly, however, triple (presumably) null mutations in sac3a, sac3b, and sac3c did not seem to impair mRNA transport (Lu ). Confirming this finding requires more extensive examination of mRNA transport in the triple mutant, including tests of additional alleles in transport studies.

Roles of PRP4KA and SAC3A in splicing

Although the splicing pattern of GFP pre-mRNA is not dramatically changed in prp4ka-4 and sac3a-6, both mutants clearly exhibit reduced splicing efficiency of the noncanonical AT–AC intron in GFP pre-mRNA. This reduction likely contributes to the GFP-weak phenotypes of the mutants by decreasing the level of translatable GFP mRNA and, hence, GFP protein. The finding of diminished levels of the translatable AT–AC transcript, which results from splicing at splice sites that are less efficiently used by the U2-dependent spliceosome than canonical GT–AG splice sites (Crotti ), is consistent with recent results from fission yeast showing that Prp4 kinase facilitates recognition of introns with weak splice sites (Eckert ). On a genome-wide scale, the prp4ka and sac3a mutants exhibit widespread perturbations in alternative splicing. These results demonstrate the functional relevance of PRP4KA and SAC3A for pre-mRNA splicing, a conclusion that is further supported by the overrepresentation of RNA-processing and splicing-related terms in the GO analyses of DEG and DAS genes in the two mutants as well as proteins undergoing phosphorylation changes in the prp4ka mutant. As expected, IR was the most frequently observed alternative splicing event, but an appreciable number of changes in other categories, particularly alternative 5′ and 3′ splice selection, was also detected. The overlap between the DEGs and DAS events in the two mutants was only partial despite their similar patterns of GFP pre-mRNA splicing and high levels of coexpression, which suggested that they may function in the same process (Usadel ). These findings reinforce the complex nature of alternative splicing and are in accord with earlier findings in budding yeast that mutations in given splicing factors have quite different effects on individual genes (Pleiss ). The genome-wide analysis of introns more retained in prp4ka and sac3a mutants revealed a tendency toward somewhat weaker 5′ splice sites and an increased GC content. Alternatively regulated introns common to both the prp4ka and sac3a mutants are significantly longer than the introns regulated in either of the single mutants. Strikingly, ∼42% of the more-retained introns in the prp4ka mutant were found to be first introns. The exact role of PRP4KA in the splicing of first introns remains to be determined.

Potential substrates of PRP4KA

Prp4 kinase in fission yeast phosphorylates the SR protein Srp2 (Lützelberger and Käufer 2012) and in human cells the splicing factors Prp6 (STA1, At4g43030 in Arabidopsis) and Prp31 (PRP31A, At1g60170 in Arabidopsis) during formation of the catalytically active B* complex (Schneider ). We did not detect the Arabidopsis orthologs of these proteins in the iTRAQ analysis of the prp4ka mutant. However, our findings are generally in agreement with the previous studies in that we identified 5 SR proteins and 15 other splicing-related factors that change in phosphorylation level in the prp4ka mutant. Ten of these splicing-related proteins significantly lose phosphorylation and hence are potentially direct substrates of PRP4KA activity. A number of additional RNA-binding proteins not yet implicated in splicing similarly lose phosphorylation in the prp4ka mutant, suggesting they may also be directly targeted for phosphorylation by PRP4KA. Splicing factors and other proteins that gain phosphorylation in the prp4ka mutant are presumably responding indirectly to a reduction in PRP4KA activity, perhaps through another protein kinase or phosphatase that is itself modified by PRP4KA. Potential phosphorylation substrates of nonsplicing factors that were identified in the iTRAQ analysis could reflect additional roles for PRP4KA. For example, Prp4k in human cells has been implicated in coupling pre-mRNA splicing with chromatin remodeling events that regulate transcription (Dellaire ) and in mitosis (Montembault ). Some of the phosphorylated residues we identified in SR proteins and other splicing-related factors were also detected in a previous phosphoproteomic analysis of proteins involved in RNA metabolism in Arabidopsis (de la Fuente van Bentem ). In the prior study, it was noted that phosphorylation in splicing factors often occurs at a serine or threonine followed by a proline (pSP or pTP). We observed a similar trend, suggesting that PRP4KA may frequently target SP and TP sites.

General comments and speculation

As discussed above, PRP4KA and SAC3A are highly coexpressed with each other, and the respective mutants have similar GFP-weak phenotypes and patterns of GFP pre-mRNA splicing. These observations drew our attention to the possibility of a novel functional relationship between the two proteins. Whereas splicing regulation is a recognized function of Prp4 kinases, Sac3 proteins have not been directly associated with splicing but rather with the aforementioned role in mRNA export. However, given the known coupling between transcription and splicing (Naftelberg ), and the subsequent requirement to export mature mRNAs out of the nucleus, it is conceivable that PRP4KA and SAC3A cooperate during the transition between these consecutive processes. In fission yeast, Prp4k has been proposed to act as a checkpoint kinase that only permits properly spliced transcripts to exit the nucleus (Lützelberger and Käufer 2012). Extrapolating from this suggestion, it is conceivable that PRP4KA and SAC3A cooperate functionally to link splicing quality control and nuclear export. Under this hypothesis, the prp4ka and sac3a mutations would affect not only splicing of GFP pre-mRNA but also retard the efflux of mature GFP mRNA from the nucleus to the cytoplasm. Both of these deficiencies would contribute additively to the GFP-weak phenotype of the mutants. In this context, it is interesting to note that the iTRAQ analysis identified several nuclear pore and nuclear transport proteins as potential substrates of PRP4KA. Clearly, further work is required to understand the functional relationship between PRP4KA and SAC3A, and to define potentially expanded roles for these proteins. The prp4ka and sac3a mutants we identified and the easily monitored alternatively spliced GFP reporter gene system should be useful tools for these investigations.
  75 in total

Review 1.  Regulation of splicing by protein phosphorylation.

Authors:  R Fluhr
Journal:  Curr Top Microbiol Immunol       Date:  2008       Impact factor: 4.291

Review 2.  Alternative splicing at the intersection of biological timing, development, and stress responses.

Authors:  Dorothee Staiger; John W S Brown
Journal:  Plant Cell       Date:  2013-10-31       Impact factor: 11.277

3.  Characterization of the SAC3 gene of Saccharomyces cerevisiae.

Authors:  A Bauer; R Kölling
Journal:  Yeast       Date:  1996-08       Impact factor: 3.239

4.  Suppressors of yeast actin mutations.

Authors:  P Novick; B C Osmond; D Botstein
Journal:  Genetics       Date:  1989-04       Impact factor: 4.562

5.  The spliceosome-activating complex: molecular mechanisms underlying the function of a pleiotropic regulator.

Authors:  Csaba Koncz; Femke Dejong; Nicolas Villacorta; Dóra Szakonyi; Zsuzsa Koncz
Journal:  Front Plant Sci       Date:  2012-01-26       Impact factor: 5.753

6.  Transcript specificity in yeast pre-mRNA splicing revealed by mutations in core spliceosomal components.

Authors:  Jeffrey A Pleiss; Gregg B Whitworth; Megan Bergkessel; Christine Guthrie
Journal:  PLoS Biol       Date:  2007-04       Impact factor: 8.029

7.  SUMO conjugation to spliceosomal proteins is required for efficient pre-mRNA splicing.

Authors:  Berta Pozzi; Laureano Bragado; Cindy L Will; Pablo Mammi; Guillermo Risso; Henning Urlaub; Reinhard Lührmann; Anabella Srebrow
Journal:  Nucleic Acids Res       Date:  2017-06-20       Impact factor: 16.971

8.  PRP4 is a spindle assembly checkpoint protein required for MPS1, MAD1, and MAD2 localization to the kinetochores.

Authors:  Emilie Montembault; Stéphanie Dutertre; Claude Prigent; Régis Giet
Journal:  J Cell Biol       Date:  2007-11-12       Impact factor: 10.539

9.  2016 update of the PRIDE database and its related tools.

Authors:  Juan Antonio Vizcaíno; Attila Csordas; Noemi del-Toro; José A Dianes; Johannes Griss; Ilias Lavidas; Gerhard Mayer; Yasset Perez-Riverol; Florian Reisinger; Tobias Ternent; Qing-Wei Xu; Rui Wang; Henning Hermjakob
Journal:  Nucleic Acids Res       Date:  2015-11-02       Impact factor: 16.971

10.  A high quality Arabidopsis transcriptome for accurate transcript-level analysis of alternative splicing.

Authors:  Runxuan Zhang; Cristiane P G Calixto; Yamile Marquez; Peter Venhuizen; Nikoleta A Tzioutziou; Wenbin Guo; Mark Spensley; Juan Carlos Entizne; Dominika Lewandowska; Sara Ten Have; Nicolas Frei Dit Frey; Heribert Hirt; Allan B James; Hugh G Nimmo; Andrea Barta; Maria Kalyna; John W S Brown
Journal:  Nucleic Acids Res       Date:  2017-05-19       Impact factor: 19.160

View more
  10 in total

1.  Dynamics of Protein Phosphorylation during Arabidopsis Seed Germination.

Authors:  Emmanuel Baudouin; Juliette Puyaubert; Patrice Meimoun; Mélisande Blein-Nicolas; Marlène Davanture; Michel Zivy; Christophe Bailly
Journal:  Int J Mol Sci       Date:  2022-06-24       Impact factor: 6.208

2.  MDF is a conserved splicing factor and modulates cell division and stress response in Arabidopsis.

Authors:  Cloe de Luxán-Hernández; Julia Lohmann; Eduardo Tranque; Jana Chumova; Pavla Binarova; Julio Salinas; Magdalena Weingartner
Journal:  Life Sci Alliance       Date:  2022-10-20

3.  Genome-scale analysis of Arabidopsis splicing-related protein kinase families reveals roles in abiotic stress adaptation.

Authors:  M C Rodriguez Gallo; Q Li; M Devang; R G Uhrig
Journal:  BMC Plant Biol       Date:  2022-10-22       Impact factor: 5.260

4.  Integrative Proteome and Phosphoproteome Profiling of Early Cold Response in Maize Seedlings.

Authors:  Jiayun Xing; Jinjuan Tan; Hanqian Feng; Zhongjing Zhou; Min Deng; Hongbing Luo; Zhiping Deng
Journal:  Int J Mol Sci       Date:  2022-06-10       Impact factor: 6.208

5.  Daily temperature cycles promote alternative splicing of RNAs encoding SR45a, a splicing regulator in maize.

Authors:  Zhaoxia Li; Jie Tang; Diane C Bassham; Stephen H Howell
Journal:  Plant Physiol       Date:  2021-06-11       Impact factor: 8.340

6.  A novel strategy to uncover specific GO terms/phosphorylation pathways in phosphoproteomic data in Arabidopsis thaliana.

Authors:  Denise S Arico; Paula Beati; Diego L Wengier; Maria Agustina Mazzella
Journal:  BMC Plant Biol       Date:  2021-12-14       Impact factor: 4.215

7.  Low molecular weight protein phosphatase APH mediates tyrosine dephosphorylation and ABA response in Arabidopsis.

Authors:  Yanyan Du; Shaojun Xie; Yubei Wang; Yu Ma; Bei Jia; Xue Liu; Jingkai Rong; Rongxia Li; Xiaohong Zhu; Chun-Peng Song; W Andy Tao; Pengcheng Wang
Journal:  Stress Biol       Date:  2022-05-18

8.  PRP4KA phosphorylates SERRATE for degradation via 20S proteasome to fine-tune miRNA production in Arabidopsis.

Authors:  Lin Wang; Xingxing Yan; Yanjun Li; Zhiye Wang; Shweta Chhajed; Baoshuan Shang; Zhen Wang; Suk Won Choi; Hongwei Zhao; Sixue Chen; Xiuren Zhang
Journal:  Sci Adv       Date:  2022-03-25       Impact factor: 14.136

9.  A Collection of Pre-mRNA Splicing Mutants in Arabidopsis thaliana.

Authors:  Tatsuo Kanno; Peter Venhuizen; Ming-Tsung Wu; Phebe Chiou; Chia-Liang Chang; Maria Kalyna; Antonius J M Matzke; Marjori Matzke
Journal:  G3 (Bethesda)       Date:  2020-06-01       Impact factor: 3.542

Review 10.  Alternative Splicing and DNA Damage Response in Plants.

Authors:  Barbara Anna Nimeth; Stefan Riegler; Maria Kalyna
Journal:  Front Plant Sci       Date:  2020-02-19       Impact factor: 6.627

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.