Literature DB >> 17537823

Proteomic analysis of in vivo-assembled pre-mRNA splicing complexes expands the catalog of participating factors.

Yen-I G Chen1, Roger E Moore, Helen Y Ge, Mary K Young, Terry D Lee, Scott W Stevens.   

Abstract

Previous compositional studies of pre-mRNA processing complexes have been performed in vitro on synthetic pre-mRNAs containing a single intron. To provide a more comprehensive list of polypeptides associated with the pre-mRNA splicing apparatus, we have determined the composition of the bulk pre-mRNA processing machinery in living cells. We purified endogenous nuclear pre-mRNA processing complexes from human and chicken cells comprising the massive (>200S) supraspliceosomes (a.k.a. polyspliceosomes). As expected, RNA components include a heterogeneous mixture of pre-mRNAs and the five spliceosomal snRNAs. In addition to known pre-mRNA splicing factors, 5' end binding factors, 3' end processing factors, mRNA export factors, hnRNPs and other RNA binding proteins, the protein components identified by mass spectrometry include RNA adenosine deaminases and several novel factors. Intriguingly, our purified supraspliceosomes also contain a number of structural proteins, nucleoporins, chromatin remodeling factors and several novel proteins that were absent from splicing complexes assembled in vitro. These in vivo analyses bring the total number of factors associated with pre-mRNA to well over 300, and represent the most comprehensive analysis of the pre-mRNA processing machinery to date.

Entities:  

Mesh:

Substances:

Year:  2007        PMID: 17537823      PMCID: PMC1919476          DOI: 10.1093/nar/gkm347

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


INTRODUCTION

Eukaryotic RNA polymerase II (RNA Pol II) transcripts are matured through a highly coordinated program of processing steps prior to export from the nucleus to the cytoplasm where they are translated into protein by the ribosome (1–5). These pre-mRNA processing events include 5′ end modification by 7-methyl-guanosine cap addition and binding of the nuclear cap binding complex, intron removal by the spliceosome, 3′ end cleavage and poly-adenosine tail addition, transcript-specific modifications such as adenosine deamination and binding of specific protein factors to regulate and promote mature mRNA export from the nucleus. Coordination of these events involves interaction between the machineries involved in each process. For example, the RNA Pol II transcription complex communicates and interacts extensively with the 5′ end capping, pre-mRNA splicing and 3′ end processing machineries (1). While native pre-mRNAs contain multiple, often extremely large introns, in vitro pre-mRNA splicing reactions are carried out using synthetic pre-mRNA fragments containing a single, efficiently spliced intron of a size compatible with acrylamide gel electrophoresis analysis. Although the core pre-mRNA processing machinery will likely be very similar between different transcripts as well as for the multiple introns contained within a single transcript, the bulk pre-mRNA processing machinery purified from its native context is likely to contain a more comprehensive sample of the polypeptides required for or participating in the splicing of pre-mRNA in vertebrate cells. Several groups have purified and characterized spliceosomes formed on model vertebrate pre-mRNAs in vitro (6–8) and shown that they contain a remarkably large number of associated polypeptides. Nevertheless, as these synthetic precursors have generally been modified from their natural state by internal deletions within the intron and truncations of the exons, the pattern of associated proteins is inevitably less complex than on the full-length, generally multi-intronic precursors that exist in vivo. In addition, the spliceosomes purified from in vitro reactions were assembled on pre-mRNAs derived from either the adenovirus major late or β-globin loci. Thus, it is likely that there exist a number of factors that are required for or participate in pre-mRNA processing in vivo, yet are not present in previously purified splicing complexes because they are specific to one or more of the thousands of other pre-mRNAs present in metazoan cells. Finally, the pathway by which pre-mRNA processing complexes are assembled in vitro using salt-extracted nuclear fractions most likely bypasses many interactions relevant to this process in vivo. Thus spliceosomes purified following in vivo assembly are expected to contain additional components that reflect the native pathway, but are not required to effect model intron removal in vitro. Additionally, factors that assist in inter-spliceosome interactions in multi-intron substrates will be absent from mono-spliceosome purifications, and should be present in in vivo-purified complexes. Consistent with this view, other investigators have shown that endogenous pre-mRNA is processed in extremely large ribonucleoprotein particles, called supraspliceosomes (9–11) or polyspliceosomes (12). Biochemical and structural analyses of these complexes have demonstrated the presence of RNA Pol II transcripts (13,14) and the pre-mRNA splicing machinery components (15,16) as well as functional interactions that mirror those in active splicing complexes assembled in vitro (12). The higher order particles formed in vivo partly reflect the presence of multiple introns, an average of eight per pre-mRNA (17) with some transcripts possessing as many as 147 introns [Nebulin (18)], that need to be faithfully removed prior to nuclear export. In Figure 1, we present a schematic model of the pre-mRNA processing pathway in vivo that encompasses the concept of the supra/polyspliceosome. Whether the individual ‘spliceosome’ moieties are formed via stepwise snRNP assembly on individual introns (2) or via pre-formed penta-snRNPs (19) in vertebrates is still a matter of considerable debate, although recent chromatin immunoprecipitation experiments in human cells provide support for the penta-snRNP model (20).
Figure 1.

Model describing the role of vertebrate supraspliceosomes in gene expression. (A) Co-transcriptional assembly of spliceosomes, 5′ end modification machinery and other pre-mRNA binding factors on RNA polymerase II transcripts. (B) The released transcript is partially spliced and bound by numerous spliceosome moieties as well as the 5′ cap-binding complex and 3′ end processing factors. (C) The mature mRNA is associated in the nucleus with RNA binding proteins, 5′- and 3′-end stabilizing factors (the CBP heterodimer and poly(A)-binding protein), and proteins that promote export to the cytoplasm.

Model describing the role of vertebrate supraspliceosomes in gene expression. (A) Co-transcriptional assembly of spliceosomes, 5′ end modification machinery and other pre-mRNA binding factors on RNA polymerase II transcripts. (B) The released transcript is partially spliced and bound by numerous spliceosome moieties as well as the 5′ cap-binding complex and 3′ end processing factors. (C) The mature mRNA is associated in the nucleus with RNA binding proteins, 5′- and 3′-end stabilizing factors (the CBP heterodimer and poly(A)-binding protein), and proteins that promote export to the cytoplasm. With the goal of expanding our understanding of pre-mRNA splicing as it occurs in intact cells, we have purified the endogenous pre-mRNA processing machines from HeLa cells and from chicken DT40 pre-B cells (21) on a preparative scale and have defined their RNA and polypeptide compositions. We have chosen the chicken DT40 system to compare with the HeLa system for a number of reasons. First, we have shown that working with this rapidly growing cell type, which possesses high rates of homologous recombination, allows for downstream experimental flexibility in epitope-tagging of other genes (22). The evolutionary distance between human and chicken will also allow us to assess the evolutionary conservation of the machinery as well as validating novel co-purifying factors. We show that these pre-mRNA processing complexes contain spliced and unspliced mRNAs, all five spliceosomal snRNAs and polypeptides involved in all aspects of pre-mRNA processing from transcription to nuclear export. Although our strategy may not be sensitive enough to identify very low abundance pre-mRNA-specific factors, it has allowed us to probe more deeply into the general pre-mRNA processing machinery present in vertebrate cells. Indeed, when combined with the data from in vitro-assembled spliceosome characterization and known splicing factors not detected in any complexes previously purified, we show there are at least 305 polypeptides involved in or present during the processing of nuclear pre-mRNA.

MATERIALS AND METHODS

Purification of HeLa supraspliceosomes

Ten liters of HeLa cells (purchased from the National Cell Culture Center) were processed essentially as described (11). Briefly, cells were washed in PBS (137 mM NaCl, 2.7 mM KCl, 10 mM Na2HPO4, 2 mM KH2PO4) and disrupted by mechanical breakage in a glass dounce (20 strokes, pestle ‘B’) in a hypotonic solution (30 mM Tris–Cl pH 7.5, 10 mM KCl, 5 mM MgCl2, 10 mM 2-mercaptoethanol) at 4°C. Nuclei were pelleted at 1000 × g at 4°C for 5 min through the hypotonic buffer containing 25% glycerol. The nuclei were washed three times in the hypotonic buffer containing 0.5% Triton X-100 and once with detergent-free hypotonic buffer. Nuclei were re-suspended in a low-salt buffer (LS+; 10 mM Tris–Cl pH 7.5, 100 mM KCl, 2 mM MgCl2, 10 mM 2-mercaptoethanol, 0.15 mM spermine, 0.05 mM spermidine) and sonicated twice for 20 s at the maximum microtip setting. The resulting nuclear debris was pelleted at 14 000 × g for 10 s, and the supernatant was layered onto a 15–45% glycerol gradient (11ml Beckman SW41) made isotonic to LS- buffer (LS buffer without polyamines) and sedimented at 40 000 × g for 90 min. Fractions (420 μl) were collected from the top. Protein and nucleic acid were separated by phenol/chloroform extraction and precipitation with acetone (23) (protein) or ethanol (nucleic acid). Fractions corresponding to the supraspliceosomes were pooled from six velocity gradients run in parallel fashion, diluted to ∼8% glycerol with LS- buffer and incubated with 20 mg Y12 antibody which had been covalently attached to 1 g CnBr-sepharose (GE Biosciences) according to the manufacturer's instructions. After incubation with rotation for 2 h at 4°C, the sepharose matrix was washed with 200 ml LS- buffer by gravity flow in a column and supraspliceosome material was eluted with 0.2 M glycine. Protein and nucleic acids were separated by phenol/chloroform extraction as described above.

Purification of chicken supraspliceosomes

Six liters of SmD3-TAP DT40 cells (22) were grown in Dulbecco's modified Eagle media supplemented with 5% chicken serum and 2.5% Fetalplex (Gemini Bio-Products) to a density of 7.5 × 105 cells/ml for TAP purification. Cells were harvested by centrifugation (1000 x g for 5 min), washed twice with ice-cold PBS, allowed to swell in 10 ml of TM buffer (10 mM Tris–Cl pH 7.5, 3 mM MgCl2) with 0.2 mM PMSF, 1 µg/ml leupeptin, and 1 µg/ml pepstatin for 10 min ice, and lysed with 25 strokes of a Dounce homogenizer at 4°C. The nuclei were pelleted and washed twice with 10 ml of TM buffer containing 0.1% NP40, re-suspended in 5 ml of low salt buffer (30 mM Tris–Cl, 125 mM KCl, 5 mM MgCl2, 0.5% Triton-X100), and sonicated at the maximum output, twice for 20 s on ice with 1 min in ice between sonications. The sonicated mixture was centrifuged at 14 000 x g for 1 min and the supernatant was used for TAP purification. TAP-tagged protein material for SmD3-TAP DT40 cells was affinity purified by the TAP procedure (24). The TEV eluate was layered onto glycerol gradients and fractionated as described above for the human supraspliceosomes.

Immunoprecipitations

Polyclonal antisera directed against the carboxyl-terminal 15 amino acids of KIAA0332 and NP_035897 (NCBI accession numbers) were produced by Genemed Synthesis and the IgG fraction was partially purified by ammonium sulfate precipitation at 50% saturation. Antiserum or non-immune serum was incubated for 1 h at 4°C with the sample(s) of interest prior to addition of 50 µl Protein-A agarose beads. This mixture was incubated one further hour with rotation at 4°C prior to washing with 4 × 15 ml IPP150. Proteins and nucleic acids were released from the matrix by incubation in IPP150 at 100°C for 5 min. The supernatant was collected and phenol extracted as described above to harvest, separate and precipitate the nucleic acids and proteins.

Northern blot analysis

Nucleic acids were transferred to Brightstar membranes (Ambion) and hybridized with snRNA probes consisting of antisense chicken snRNAs transcribed with α32P-GTP using T7 RNA polymerase (U5) or SP6 RNA polymerase (U1, U2, U4, U6 snRNAs) from plasmids containing cDNA versions of the chicken snRNAs.

Western blot analysis

Polypeptides were resolved in 10% polyacrylamide gels (25), transferred to nitrocellulose membranes (Biorad) and blotted with antiserum as described in the text. The secondary antibody used was horseradish peroxidase-conjugated goat anti-rabbit IgG (Rockland) and the signal was detected by enhanced chemiluminescence (Perkin Elmer).

Mass spectrometry peptide identification

Pooled supraspliceosome protein fractions were separated by polyacrylamide gel electrophoresis and stained with Coomassie Blue G-250 (26). Discrete gel slices were dissected from the top of the gel lane to the bottom and all regions were subjected to trypsin digestion. Mass spectrometry and database searching was performed as previously described (27–29).

RESULTS AND DISCUSSION

Purification of endogenous human pre-mRNA processing complexes

We discovered that in gently sonicated nuclei treated with low salt (11), the majority of the snRNA, as judged by visual inspection of ethidium bromide stained gels (Figure 2A), was engaged in very large (>80S) ribonucleoprotein complexes that closely resemble supraspliceosomes in sedimentation values and other properties (9,10,30). These particles may also be related to the polyspliceosomes described in salt-extracted nuclei, which sedimented as complexes slightly smaller than our supra/polyspliceosome, likely reflecting salt-induced factor loss during nuclear extraction (12).
Figure 2.

Human supraspliceosome-associated polypeptides and snRNAs. RNA (A) and protein (B) were extracted from preparative glycerol gradient fractions and electrophoretically resolved through urea-PAGE (A) or SDS-PAGE (B) gels stained with silver (RNA) or coomassie blue (protein). Bar below B represents the fractions of the material pooled for immunopurification with Y12 antibody. (C) Affinity-purified supraspliceosomal proteins run under two SDS-PAGE conditions to resolve either large or small polypeptides. Gels were aligned to show all polypeptides in the affinity-purified fractions and are delineated by the marking between them. The entire gel lanes shown from the two gels in (C) were dissected and each gel slice was subjected to mass spectrometry protein identification. The proteins identified are reported under the Hs PS column in Tables 1–3 and in Supplemental Table S1.

Human supraspliceosome-associated polypeptides and snRNAs. RNA (A) and protein (B) were extracted from preparative glycerol gradient fractions and electrophoretically resolved through urea-PAGE (A) or SDS-PAGE (B) gels stained with silver (RNA) or coomassie blue (protein). Bar below B represents the fractions of the material pooled for immunopurification with Y12 antibody. (C) Affinity-purified supraspliceosomal proteins run under two SDS-PAGE conditions to resolve either large or small polypeptides. Gels were aligned to show all polypeptides in the affinity-purified fractions and are delineated by the marking between them. The entire gel lanes shown from the two gels in (C) were dissected and each gel slice was subjected to mass spectrometry protein identification. The proteins identified are reported under the Hs PS column in Tables 1–3 and in Supplemental Table S1. Comparisons of snRNP, snRNP biogenesis and known spliceosome associated proteins (SAPs) profiles from supraspliceosomes purified from human or chicken cells and mono-spliceosomes purified from three in vitro-assembled preparations aData available at: http://www.ensembl.org. bHUGO Gene Nomenclature Committee designation, a dash represents a polypeptide which has not yet been assigned a systematic name. cCommon name. dGallus gallus supraspliceosome. eHomo sapiens (HeLa) supraspliceosome. fNeubauer et al. spliceosome data (8). gRappsilber et al. spliceosome data (6). hZhou et al. spliceosome data (7). f,g,hWere compiled from Jurica and Moore (104). Polypeptides demonstrated or predicted by sequence homology to interact with the pre-mRNA, mRNA or the spliceosome and comparisons of those identified in the supraspliceosomes with those of spliceosomes formed in vitro aData available at: http://www.ensembl.org. bHUGO Gene Nomenclature Committee designation, a dash represents a polypeptide which has not yet been assigned a systematic name. cCommon name. dGallus gallus supraspliceosome; eHomo sapiens (HeLa) supraspliceosome. fNeubauer et al. spliceosome data (8). gRappsilber et al. spliceosome data (6). hZhou et al. spliceosome data (7). f,g,hWere compiled from Jurica and Moore (104). Structural, nucleoporin and novel polypeptides present in the supraspliceosomes and in vitro-assembled splicing complexes aData available at: http://www.ensembl.org. bHUGO Gene Nomenclature Committee designation, a dash represents a polypeptide which has not yet been assigned a systematic name. cCommon name. dGallus gallus supraspliceosome. eHomo sapiens (HeLa) supraspliceosome. fNeubauer et al. spliceosome data (8). gRappsilber et al. spliceosome data (6). hZhou et al. spliceosome data (7). f,g,hWere compiled from Jurica and Moore (104). For purification of the human supraspliceosomes, we gently sonicated nuclei in a buffered low-salt solution and purified the material in the ∼200S region (Figures 2A and B) as previously described (11). This material was immunopurified using the antibody Y12 on a solid matrix. Remarkably, this treatment nearly quantitatively retained the detectable material from this region of the glycerol gradient as judged by coomassie gel staining, even after extensive washing, indicating that the majority of the nuclear contents of this size are Sm-antigen-containing complexes. The lack of polyribosomes in the rapidly sedimenting material (as judged by the absence of 5S and 5.8S rRNAs and ribosomal proteins by mass spectrometry) indicates that the nuclei we prepare are not contaminated with cytoplasm. The material purified on a preparative scale was separated by SDS-PAGE gels of two compositions to provide optimal resolution of the large number of polypeptides present (Figure 2C). To demonstrate the specificity of these purifications, gradient-separated supraspliceosomes were subjected to affinity chromatography using identical beads and identical washing and elution conditions, but lacking the Y12 antibody. In Figure S1, we show that from the mock purification, there is no detectable coomassie-stained material in the resulting protein gel (panel A) and no snRNAs present (panel B).

Purification of endogenous chicken pre-mRNA processing complexes

Using our recently developed CLEP tagging procedure (22), we tagged the SmD3 polypeptide in chicken DT40 cells by introducing a TAP tag (24) at the native genomic locus. For each experiment, 6 l of SmD3-TAP-DT40 cells were harvested and processed as described for purification of supraspliceosomes from HeLa cells. Affinity chromatography was performed according to the TAP procedure (24) and the TEV eluate was sedimented through a glycerol gradient. The material corresponding to the supraspliceosomes was isolated; proteins and nucleic acids are shown in Figures 3A and B, respectively. In Figure S1, we show the proteins (panel C) and RNA (panel D) resulting from an identical affinity purification procedure performed using extracts from untagged DT40 cells. The absence of proteins, beyond the contaminating TEV protease, and the absence of snRNAs indicates that the purification is specific and the proteins identified by mass spectrometry are likely to be bona fide supraspliceosome components. Additional confidence is provided in that there is size-selection as well as one or two steps of affinity chromatography.
Figure 3.

Chicken supraspliceosome-associated polypeptides and snRNAs. Fractions corresponding to the CLEP-tag purified, glycerol gradient-sedimented chicken supraspliceosomes were separated into protein (A) and RNA (B) fractions and electrophoresed through SDS-PAGE (A) or urea-PAGE (B) gels and stained with coomassie blue (protein) or ethidium bromide (RNA). The identities of the snRNAs are indicated on the right of panel B. The entire gel lane from (A) was dissected and each gel slice was subjected to mass spectrometry for protein identification. The proteins identified are reported under the Gg PS column in Tables 1–3 and in Supplemental Table S1.

Chicken supraspliceosome-associated polypeptides and snRNAs. Fractions corresponding to the CLEP-tag purified, glycerol gradient-sedimented chicken supraspliceosomes were separated into protein (A) and RNA (B) fractions and electrophoresed through SDS-PAGE (A) or urea-PAGE (B) gels and stained with coomassie blue (protein) or ethidium bromide (RNA). The identities of the snRNAs are indicated on the right of panel B. The entire gel lane from (A) was dissected and each gel slice was subjected to mass spectrometry for protein identification. The proteins identified are reported under the Gg PS column in Tables 1–3 and in Supplemental Table S1.

RNA content of the supraspliceosomes

RNAs corresponding to the supraspliceosome fractions from human and chicken cells are shown in Figures 2B and 3B, respectively. Identities of the RNAs were confirmed by northern blotting (data not shown). The presence of all five spliceosomal snRNAs in this material indicated that it contained a mixture of pre-mRNA splicing complexes in varying stages of assembly and activity, as both U1 and U4 snRNAs have been shown to be released from the spliceosome before the first catalytic step of the splicing reaction in vitro (31–33). Alternatively, the presence of both U1 and U4 may reflect functional differences between our preparations and the spliceosomes assembled in vitro; for example, it is possible that the U1 and U4 snRNAs do not completely dissociate in conjunction with catalytic activation in vivo, but are only destabilized and maintained locally. The presence of all five snRNAs in roughly equivalent amounts also lends experimental evidence to the participation of the penta-snRNP in these functional complexes (19).

Mass spectrometry analysis of supraspliceosome-associated polypeptides

Polyacrylamide gel lanes from the entire human and chicken supraspliceosome fraction were dissected and the material analyzed by tandem mass spectrometry (27–29). Though a wide variety of polypeptides were identified, it is notable that we detected very little background contamination of factors known to be unrelated to gene expression. In Tables 1–4, we categorize the identified polypeptides according to function. Remarkably, we detected 222 distinct polypeptides in the chicken supraspliceosomes, and 177 distinct polypeptides in the HeLa supraspliceosomes. These numbers are significantly higher than the number of polypeptides detected in any one of the three previously published spliceosome purifications (6–8). Known pre-mRNA processing proteins not present in any purified splicing complex aData available at: http://www.ensembl.org. bHUGO Gene Nomenclature Committee designation, a dash represents a polypeptide which has not yet been assigned a systematic name. cCommon name. The polypeptides identified by mass spectrometry were validated by analyzing the percent-coverage for each protein (Supplemental Table 1). Although there is a distribution of coverage for the identified proteins, we note that while many snRNP-associated factors had a large percentage of their sequence identified, some snRNP-associated proteins had <10% coverage. Differences in coverage may reflect differences in abundance, a paucity of appropriately-sized trypsin fragments or to peculiarities in the mass spectrometry detection of a particular peptide. The low-coverage of some of the novel polypeptides that may reasonably be implied to function in pre-mRNA processing (i.e. contain RNA binding motifs) may reflect their association with a smaller subset of pre-mRNAs than a general RNA binding protein such as an hnRNP. In the case of one novel factor, ZFR, the percent-coverage was low (3.4% for chicken, 8% for human) but its association with the splicing machinery was verified independently (see below).

Known snRNP-associated polypeptides

By mass spectrometry, we identified nearly all of the known pre-mRNA splicing snRNP-associated polypeptides (Table 1). We were initially surprised by the apparent absence in our preparations of a subset of snRNP-associated proteins found in most or all of the previously purified spliceosomes. However, upon closer inspection, we observed that for each polypeptide not represented in our mass spectrometry results, the inability to be detected correlated with the presence of an abundant hnRNP protein of similar molecular weight. The coverage of the major spliceosomal snRNP proteins was more complete for the chicken supraspliceosomes. Indeed, the CLEP tagging and purification procedure was sensitive enough to detect the presence of two minor AT–AC spliceosome components, the U11/U12–65K and U11/U12–48K polypeptides (34) in the chicken fractions, whereas no AT–AC specific splicing components were detected in the human complexes. The ability to detect all of the Sm proteins, but not all of the LSM proteins may reflect the difficulty in detecting all of these proteins in splicing complexes as shown previously (35,36), the 5-fold abundance differences of the two classes of proteins or perhaps due to the LSM proteins leaving the spliceosome during the process of pre-mRNA splicing (37).
Table 1.

Comparisons of snRNP, snRNP biogenesis and known spliceosome associated proteins (SAPs) profiles from supraspliceosomes purified from human or chicken cells and mono-spliceosomes purified from three in vitro-assembled preparations

ENSEMBL accession #aHGNCbPolypeptidecGg PSdHs PSeNfRgZh
U1 snRNP
ENSG00000104852SNRP70U1–70K
ENSG00000077312SNRPAU1A
ENSG00000124562SNRPCU1-C
U2 snRNP
ENSGALG00000008038SF3B1SF3b155
ENSG00000087365SF3B2SF3b145
ENSGALG00000002531SF3B3SF3b130
ENSGALG00000000581DDX42SF3b125
ENSGALG00000007937SF3A1SF3a120
ENSGALG00000021679SF3A2SF3a66
ENSGALG00000001540SF3A3SF3a60
ENSGALG00000013352SF3B4SF3b49
ENSGALG00000008729SNRPB2U2B′′
ENSGALG00000007170SNRPA1U2A′
ENSGALG00000016501SF3b14
ENSGALG00000020000SF3B5SF3b10
U2-snRNP associated
ENSGALG00000014395DHX15PRP43/DDX15
ENSGALG00000002612SR140
ENSGALG00000006332RBM17SPF45
ENSGALG00000008561SMNDC1SPF30
ENSGALG00000003824CHERPCHERP
U5, U4/U6 & U4/U6•U5 snRNP
ENSGALG00000002943PRPF8U5–220K
ENSGALG00000003477ASCC3L1U5–200K
ENSGALG00000000988EFTUD2U5–116K
ENSG00000175467SART1U4/U6•U5–110K
ENSGALG00000006001C20ORF14U5–102K
ENSG00000174243DDX23U5–100K
ENSGALG00000000465PRPF3U4/U6–90K
ENSG00000168883USP39U4/U6•U5–65K
ENSG00000105618PRPF31U4/U6•U5–61K
ENSGALG00000008857PRPF4U4/U6–60K
ENSGALG00000004874PPIHUSA–CYP
ENSGALG00000000615WDR57U5–40K
ENSGALG00000011931NHP2L1U4/U6•U5–15.5k
ENSGALG00000017396TXNL4AU5–15K
AT/AC
ENSGALG00000005162RNPC3U11/U12–65K
ENSGALG00000013005C6ORF151U11/U12–48K
Sm/LSM
ENSGALG00000007250SNRPBSmB/B'
ENSGALG00000011842SNRPD1SmD1
ENSG00000125743SNRPD2SmD2
ENSGALG00000006596SNRPD3SmD3
ENSGALG00000000137SNRPESmE
ENSGALG00000011409SNRPFSmF
ENSG00000143977SNRPGSmG
ENSG00000111987LSM2LSM2
ENSG00000170860LSM3LSM3
ENSGALG00000003385LSM4LSM4
ENSG00000106355LSM5LSM5
ENSGALG00000009985LSM6LSM6
ENSG00000130332LSM7LSM7
ENSGALP00000014820LSM8LSM8
snRNP biogenesis
ENSGALG00000010154SIP1SIP1
ENSG00000119953SMNDC1SMNrp30
ENSGALG00000003158COILCoilin
SAPs
ENSGALG00000002514SFPQPSF
ENSGALG00000002060FUSTLS/FUS
ENSGALG00000012468PRPF39PRP39
ENSGALG00000010501SNW1SKIP/PRP45
ENSGALG00000016704CDC5LCDC5
ENSGALG00000005012RAB43ISY1
ENSGALG00000008429CRNKL1CRN1
ENSGALG00000009257PRLG1Prp46/PRL1
ENSGALG00000015061CDC40CDC40/PRP17
ENSGALG00000002002BCAS2SPF27
ENSGALG00000013919PRPF19PRP19
ENSGALG00000010627PRPF38APRP38
ENSGALG00000005507NONOp54nrb
ENSGALG00000004555RBM22ECM2/RBM22
ENSGALG00000001247SYF2SYF2
ENSGALG00000000726ELAVL1ELAV/Hu
ENSGALG00000009001CWC22
ENSGALG00000008149EWSR1EWSR1 (RBP)
ENSGALG00000004705BUD31BUD31
ENSG00000076924XAB2SYF1
ENSGALG00000001962PTBP1PTB
ENSGALG00000011857LUC7L2LUC7/CROP
ENSG00000196504PRPF40APRP40
ENSGALG00000010167PNNPinin
ENSGALG00000012813PRPF4BPrp4K
ENSGALG00000000833IKRED
ENSGALG00000001500SLU7
ENSG00000063244U2AF2U2AF65
ENSGALG00000016198U2AF1U2AF35
ENSG00000168066SF1SF1
ENSGALG00000001034C20ORF4AAR2
ENSGALG00000005525SFRS1SF2/ASF
ENSG00000102241HTATSF1TAT-SF1
ENSGALG00000002087NCBP1CBC80
ENSGALG00000006843NCBP2CBC20
ENSG00000087087ASR2B
ENSG00000100296THOC5KIAA0983
ENSG00000159086C21ORF66C21ORF66
ENSG00000126803HSPA2HSP70-2
ENSGALG00000006512HSPA8HSP71
ENSGALG00000009838AQRAquarius
ENSGALG00000002014SMU1SMU1
ENSGALG00000005623TFIP11SPP382
ENSG00000137656CWC26
ENSG00000126698DNAJC8SPF31
ENSG00000105705SF4SF4
ENSG00000113649TCERG1CA150
ENSGALG00000004626RBM5E1B-AP5
ENSG00000100056DGCR14DGCR14
ENSG00000105298C19ORF29C19ORF29
ENSG00000109536FRG1FRG1
ENSG00000171824EXOSC10RRP6
ENSG00000160799CCDC12CCDC12/CWF18
ENSGALG00000011678DNAJC13DnaJ
ENSG00000100813ACIN1Acinus
ENSG00000131051RNPC2HCC
ENSG00000084463WBP11WBP11
ENSG00000196419XRCC6Ku70

aData available at: http://www.ensembl.org.

bHUGO Gene Nomenclature Committee designation, a dash represents a polypeptide which has not yet been assigned a systematic name.

cCommon name.

dGallus gallus supraspliceosome.

eHomo sapiens (HeLa) supraspliceosome.

fNeubauer et al. spliceosome data (8).

gRappsilber et al. spliceosome data (6).

hZhou et al. spliceosome data (7).

f,g,hWere compiled from Jurica and Moore (104).

By western blotting of chicken cell nuclear extracts with the Y12 monoclonal antibody, we determined that the Sm antigens in chicken cells were not reactive with the Y12 antibody (data not shown), eliminating the possibility of a direct comparison between Y12-immunopurified material from chicken and human cells.

snRNP biogenesis factors

Previous spliceosome purifications did not yield polypeptides known to be involved in snRNP biogenesis. We note in Table 1 that there are three of these present in the chicken supraspliceosomes (SIP1, SMNrp30 and Coilin). These factors, which are involved in the de novo assembly of snRNPs, are contained in Cajal bodies (CBs), nuclear organelles enriched for pre-mRNA splicing factors (38). We hypothesize that the CLEP tagging procedure may allow purification of a subset of snRNPs in the process of being re-targeted to the CBs. We did not detect this class of polypeptides in the HeLa supraspliceosomes. Other factors present in the chicken supraspliceosome but not those from HeLa cells were cyclophillins and chromatin remodeling proteins, which may be related to procedural differences in the purification methods.

Known spliceosome associated proteins (SAPs)

In the second half of Table 1, we compile a list of the 59 SAPs identified in one or more of the spliceosome purifications. The in vivo-assembled complexes contained pre-mRNA-interacting factors such as U2AF (39,40), PTB (41,42), and the cap binding complex proteins (43), which were present in some but not all of the in vitro-assembled spliceosomes. We detected the majority of PRP19-complex (NTC) related components as well, including homologues of Prp19p (44,45), Syf1p (45), Syf2p (45), Syf3/Clf1p (45,46), Isy1p (47), SKIP/Prp45p (48,49) and CDC5/Cef1p (50,51), which were also detected in vitro, and BCAS2/SPF27, which was only detected in supraspliceosomes assembled in vivo. Interestingly, our preparations included a number of polypeptides that are snRNP-associated in yeast but have not been identified in purified metazoan snRNPs or spliceosomes. Among these factors are putative orthologues of yeast Prp38p (52), Prp39p (53), Prp40p (54), Aar2p (55) and Luc7p (56). Other polypeptides exclusively contained in the purified supraspliceosomes include, NONO/p54nrb, PNN/Pinin, CWC22, which have been previously implicated in splicing (57–60). Conversely, 13 polypeptides were exclusively associated with the in vitro purified spliceosomes, possibly reflecting specificity for the pre-mRNAs upon which these were assembled. Alternatively, the differences in composition may result from differences in the procedures. Most striking, however, is the absence of SF1/BBP from our supraspliceosome preparations. This may be due to the fact that SF1/BBP interacts very early with the pre-mRNA, remains associated for a short time and is replaced by U2 snRNP at the branchpoint sequence (61).

RNA helicase-like proteins

We identified a large number of DExH/D proteins in the preparations of supraspliceosome assembled in vivo. In addition to the RNA helicase-like polypeptides known to function in pre-mRNA splicing, such as DDX15/Prp43p (62–64), DHX8/Prp22p (65–67), DDX46/Prp5p (68–70), DDX5/p68 and DDX17/p72 (71), UAP56/Sub2p (72–74), DDX16/Prp2p (75,76) and the snRNP-associated helicases U5-200K/Brr2p (77–79) and U5-100K/Prp28p (80–82), we noted a number not previously implicated in pre-mRNA splicing and absent from the in vitro purified spliceosomes [Table 2 (‘Gg’ and ‘Hs’ to ‘N’, ‘R’ and ‘Z’)]. These include 13 additional polypeptides with sequence motifs indicative of DEAD, DEAH or Ski2p-like helicase family members, most of which were absent from spliceosomes assembled in vitro. Although we do not as yet have evidence that these proteins function in pre-mRNA splicing, the complete absence of DNA helicases in our preparations indicates a specificity (i.e. a specificity for RNA processing complexes and against chromatin components), which minimally suggests that they function in some aspect of RNA Pol II transcript processing.
Table 2.

Polypeptides demonstrated or predicted by sequence homology to interact with the pre-mRNA, mRNA or the spliceosome and comparisons of those identified in the supraspliceosomes with those of spliceosomes formed in vitro

ENSEMBL accession #aHGNCbPolypeptidecGg PSdHs PSeNfRgZh
RNA Helicase-like
ENSGALG00000016461DDX1DDX1
ENSGALG00000016231DDX3XDDX3
ENSGALG00000003532DDX5DDX5/p68
ENSGALG00000012247DDX17DDX17/p72
ENSGALG00000012147DDX18DDX18
ENSG00000102786DDX26DDX26/HDB
ENSGALG00000006974DDX27DDX27
ENSGALG00000003030DDX41DDX41/ABSTRAKT
ENSG00000123136DDX39DDX39
ENSG00000145833DDX46DDX46/PRP5
ENSGALG00000008530DDX48DDX48
ENSGALG00000004144DDX50DDX50/Gu-ß
ENSGALG00000001186DHX8DHX8/PRP22
ENSG00000135829DHX9DHX9/HELICASEA
ENSG00000137333DHX16DHX16/PRP2
ENSGALG00000005027DHX30DHX30
ENSGALG00000003658DHX35DHX35
ENSG00000174953DHX36DHX36
ENSGALG00000014709SKIV2L2SKIV2L2
ENSG00000198563BAT1UAP56
hnRNP
ENSGALG00000006160HNRNPA0hnRNPA0
ENSGALG00000011036HNRNPA2B1hnRNPA2/B1
ENSG00000135486HNRPA1hnRNPA1
ENSGALG00000009250HNRNPA3hnRNPA3
ENSGALG00000014381HNRNPABhnRNPAB
ENSG00000092199HNRPChnRNPC1/C2
ENSG00000138668HNRNPDhnRNPD0/AUF1
ENSGALG00000011184HNRNPDLhnRNPD0
ENSG00000169813HNRPFhnRNPF
ENSGALG00000006457RBMXhnRNPG
ENSGALG00000005955HNRNPH1hnRNPH1
ENSGALG00000003947HNRNPH3hnRNPH3
ENSGALG00000012591HNRNPKhnRNPK
ENSG00000104824HNRNPLhnRNPL
ENSGALG00000000377HNRNPMhnRNPM
ENSGALG00000015830SYNCRIPhnRNPQ
ENSGALG00000000814HNRNPRhnRNPR
ENSGALG00000010671HNRNPUhnRNPU
ENSGALG00000018665hnRNP novel
ENSG00000126457HRMT1L2HRMT1L2
SR family
ENSG00000133226SRRM1SRm160
ENSG00000167978SRRM2SRm300
ENSGALG00000005525SFRS1SF2p33
ENSG00000161547SFRS2SC35
ENSGALG00000000533SFRS3SFRS3SRp20
ENSG00000116350SFRS4SRp75
ENSGALG00000009484SFRS5SRp40
ENSGALG00000000990SFRS6SRp55
ENSGALG00000013825SFRS79G8
ENSGALG00000002487SFRS8SFRS8
ENSG00000111786SFRS9SRp30
ENSGALG00000006531SFRS10SFRS10
ENSG00000116754SFRS11SRp54
ENSG00000153914SFRS12SFRS12
ENSGALG00000004133FUSIP1SRrp35
Cyclophillins
ENSGALG00000013383PPIECYP-E
ENSGALG00000004874PPIHUSA-CYP
ENSGALG00000014747SDCCAG10CYP16
ENSG00000137168PPIL1PPIL1/CWF27
ENSG00000100023PPIL2PPIL2/CYP60
ENSG00000115934PPIL3PPIL3B
ENSG00000113593PPWD1PPWD1
RBP
ENSG00000102317RBM3RBM3
ENSGALG00000018992RBM4BRBM4B/Lark
ENSGALG00000004626RBM5RBM5
ENSGALG00000007017RBM7RBM7
ENSG00000131795RBM8ARBM8B
ENSGALG00000013411RBM14RBM14
ENSG00000162775RBM15RBM15
ENSGALG00000002372RBM15BRBM15B
ENSG00000197676RBM16RBM16
ENSGALG00000004555RBM22RBM22
ENSG00000119707RBM25RBM25
ENSG00000091009RBM27RBM27
ENSGALG00000011038CBX3RNPS1
ENSG00000033030ZCCHC8ZFP8
ENSGALG00000006113ZNF326ZFP326
ENSG00000179950PUF60
ENSG00000197381ADARB1ADAR1
ENSG00000160710ADARADAR2
ENSGALG00000010952Requiem
ENSGALG00000011570ILF2NFAT45
ENSG00000129351ILF3NFAT90
ENSG00000169564PCBP1PolyrCBP
ENSGALG00000012225CIRBPCIRBP
ENSG00000056097ZFRZFR
ENSGALG00000009427TIAL1TIA1
ENSGALG00000001187STRBPSTRBP
ENSG00000136231IGF2BP3IMP3
ENSG00000060138CSDACSDA
ENSG00000121774KHDRBS1SAM68
ENSG00000126254-Novel RRM
ENSG00000132773TOE1TOE1
ENSG00000142864SERBP1SERBP1
Export/transcription/NMD
ENSGALG00000014915THOC1THO1/HPR1
ENSGALG00000008507THOC2THO2
ENSG00000051596THOC3THO3/TEX1
ENSGALG00000007237THOC4THO4/ALY
ENSGALG00000004571PPP1CAGLC7/PPP1CA
ENSGALG00000002569RANRan
ENSG00000119392GLE1LGLE1L
ENSGALG00000007653RAE1GLE2/RAE1
ENSGALG00000003220RENT1UPF1/RENT
ENSG00000131795RBM8AY14/RBM8A
ENSGALG00000002144THRAP3TRAP150
ENSG00000172660TAF15TAF15/RBP56
ENSG00000162231NXF1TAP
ENSG00000065978YBX1YBX1
ENSGALG00000010689MAGOHMago nashi
3′ end proc.
ENSG00000071894CPSF1CPSF1
ENSGALG00000010783CPSF2CPSF2
ENSGALG00000016424CPSF3CPSF3
ENSGALG00000004714CPSF4CPSF4
ENSGALG00000003084CPSF5CPSF5
ENSG00000111605CPSF6CPSF6
ENSGALG00000011685CSTF3CSTF-77
ENSGALG00000013943FIP1L1FIP1
ENSG00000172239PAIP1PAIP1
ENSG00000100836PABPN1PABPN1
ENSGALG00000003800PABPC4PABPC4

aData available at: http://www.ensembl.org.

bHUGO Gene Nomenclature Committee designation, a dash represents a polypeptide which has not yet been assigned a systematic name.

cCommon name.

dGallus gallus supraspliceosome;

eHomo sapiens (HeLa) supraspliceosome.

fNeubauer et al. spliceosome data (8).

gRappsilber et al. spliceosome data (6).

hZhou et al. spliceosome data (7).

f,g,hWere compiled from Jurica and Moore (104).

Supraspliceosome-associated hnRNPs

Polypeptides termed hnRNPs are highly abundant nuclear proteins known to interact with hnRNA. We detected virtually all of the known hnRNP proteins in both human and chicken cells, as well as other hnRNP-like proteins annotated in genome databases (Table 2). We note that in the spliceosomes purified from in vitro extracts, some hnRNPs were identified; however, perhaps owing to specific binding of some hnRNPs to the bulk hnRNA and not the single transcript used in the in vitro spliceosome assembly reactions, a greater number of these polypeptides were detected in the supraspliceosomes.

SR proteins

Many splicing factors are rich in arginine and serine residues including long stretches of alternating dipeptides termed SR domains. These factors function at multiple steps in the pre-mRNA splicing pathway (83), and were constituents of the in vitro-purified splicing complexes. We detected 10 different SR family members in our purified supraspliceosomes from both chicken and human cells (Table 2), though not a complete set. One possible conclusion is that, due to the means by which these complexes were purified and analyzed, these SR proteins represent the major SR proteins functioning in these cells, and that those not detected in our preparations function in the splicing of a smaller subset of pre-mRNAs.

Cyclophillins

In addition to the known snRNP-associated USA–CYP (84,85), we detected five additional potential proline cis-trans isomerases co-purifying with spliceosomes from chicken, but interestingly, not from HeLa cells. Several of those from the chicken purification were also present in spliceosomes assembled in vitro (Table 3). As it is likely that these proteins also function in HeLa cells, their absence may represent operational differences in the ways in which the chicken and human cells were handled and the ways in which the complexes were purified and analyzed.
Table 3.

Structural, nucleoporin and novel polypeptides present in the supraspliceosomes and in vitro-assembled splicing complexes

ENSEMBL accession #aHGNCbPolypeptidecGg PSdHs PSeNfRgZh
Structural
ENSGALG00000012533MYH9Myosin
ENSGALG00000009126TTNTitin
ENSGALG00000001381ACTG1Actin
ENSGALG00000002478MATR3MATRIN3
ENSGALG00000008677VIMVimentin
ENSG00000117245KIF17Kinesin KIF17
ENSGALG00000002197NPM1NUMATRIN
ENSGALG00000014692LMNB1Lamin B
ENSGALG00000013505SYNE1NuSpectrin
ENSG00000140259MFAP1MFAP1
Chromatin modification
ENSGALG00000000360ARID1AARID1A-SWI/SNF
ENSGALG00000013683ARID1BARID1B-SWI/SNF
ENSGALG00000010164SMARCA2SMARCA2
ENSG00000127616SMARCA4Brahma/SMARCA4
ENSGALG00000009913SMARCA5SMARCA5
ENSGALG00000005983SMARCB1SMARCB1
ENSGALG00000005048SMARCC2BRG1-SWI/SNF
ENSGALG00000005048SMARCC1SMARCC1
ENSGALP00000010010SMARCD1SMARCD1
ENSGALG00000000363SMARCD2SMARCD2
ENSGALG00000002100SMARCE1SMARCE1
Nucleoporins
ENSGALG00000005714PKD1PKD1 (NUP assoc)
ENSGALG00000003830NUP214NUP214
ENSGALG00000012720NUP153NUP153
ENSGALG00000005078NUP210NUP210
ENSG00000102900NUP93NUP93
ENSG00000108559NUP88NUP88
ENSG00000111581NUP107NUP107
ENSG00000110713NUP98NUP98
ENSG00000138750NUP54NUP54
ENSG00000163002NUP35NUP35
ENSG00000069248NUP133NUP133
ENSG00000155561NUP205NUP205
Novel or unknown to splicing
ENSGALG00000011351HSP90AA1HSP90α
ENSGALG00000010175HSP90AA2HSP90ß
ENSGALG00000012726HSP90B1HSP108
ENSGALG00000009967LRPPRCLRP130
ENSGALG00000003693MACF1Macrophin
ENSGALG00000007705NCLNucleolin
ENSGALG00000015933C21ORF66GCRBF
ENSGALG00000008454NOP58
ENSGALG00000009061ACTL6ABAF53A
ENSGALG00000007520SSRP1FACT80
ENSGALG00000015821CCT8TCP1-theta
ENSGALG00000014500NOL5ANol5A/NOP56
ENSGALG00000005624MED12TRAP230
ENSGALG00000010973TRA2ATRA2α
ENSGALG00000008372XRN2RAT1
ENSG00000197157SND1
ENSGALG00000001948SAFB2SAFB/HSP27
ENSGALG00000003177BRD8BRD8
ENSGALG00000010699FOG
ENSGALG00000005177C9ORF10C9ORF10
ENSGALG00000002653ELG
ENSGALG00000016949WBP4WBP4
ENSGALG00000004133FUSIP1Fus IP
ENSGALG00000017384ERHERH
ENSG00000079246XRCC5Ku80
ENSG00000182562ATAD3AATAD3A (AAA ATPase)
ENSGALG00000001515ATAD3BATAD3B (AAA ATPase)
ENSG00000108588CCDC47CCDC47

aData available at: http://www.ensembl.org.

bHUGO Gene Nomenclature Committee designation, a dash represents a polypeptide which has not yet been assigned a systematic name.

cCommon name.

dGallus gallus supraspliceosome.

eHomo sapiens (HeLa) supraspliceosome.

fNeubauer et al. spliceosome data (8).

gRappsilber et al. spliceosome data (6).

hZhou et al. spliceosome data (7).

f,g,hWere compiled from Jurica and Moore (104).

Other RNA binding proteins

There were 32 polypeptides identified among all of the splicing complex purifications possessing sequence homology to polypeptides believed to interact with RNA by virtue of containing RNA Recognition Motifs (RRM), double-stranded RNA binding domains (dsRBD) or other motifs implicated in RNA binding. Some were identified previously, such as the ELAV/Hu protein that binds AU-rich elements in both cytoplasmic (86) and nuclear RNAs (87), the U2AF-related PUF60 protein (88), the dsRBD-motif-containing NFAT45 and NFAT90 and RNA adenosine deaminases; these were previously shown to exist in large nuclear complexes (16) and believed to function in RNA Pol II transcript metabolism. Only a single predicted RNA binding protein was found in the in vitro-assembled splicing complexes but not in either the chicken or HeLa supraspliceosomes, while 22 were exclusively found in supraspliceosomes, but not in the in vitro-formed complexes. We believe this is most likely due to the use of a single pre-mRNA in vitro, while a broader spectrum of RNA binding proteins will be associated with bulk pre-mRNA. In Figure 4 we present a graphical representation of the 16 presumptive RNA binding proteins novel to our study and highlight the sequence motifs contained in each.
Figure 4.

Novel spliceosome-associated polypeptides with predicted RNA binding motifs. Polypeptides from Table 2 with no known function in the pre-mRNA processing pathway are shown with graphical representations of the various RNA interaction or other noted motifs listed at the bottom of the Figure.

Novel spliceosome-associated polypeptides with predicted RNA binding motifs. Polypeptides from Table 2 with no known function in the pre-mRNA processing pathway are shown with graphical representations of the various RNA interaction or other noted motifs listed at the bottom of the Figure.

Non-spliceosomal pre-mRNA processing factors

Cap-binding proteins (CBC80 and CBC20) are present in both spliceosomes assembled in vitro and supraspliceosomes assembled in vivo (Table 1). In Table 2, we report the presence in our preparations of many 3′ end processing factors (CSPF, CSTF and poly-A binding proteins), a comprehensive set of proteins shown to be involved in mRNA export including the TREX complex (THO1/HPR1, THO2, THO3/TEX1, UAP56 and ALY) (89), export factors such as TAP (90,91), GLE1 (92), GLE2 (93) and GLC7 (94), and exon junction complex constituents including Y14 and Magoh. We also note that a single component of the nonsense-mediated decay (NMD) pathway (UPF1) (95) was identified in the chicken material. As NMD is likely to be active only in a very small subset of pre-mRNA processing complexes (96), we were surprised to observe even a single polypeptide implicated in this process.

Nuclear matrix and filament proteins

Recent data from several laboratories suggest a functional interaction between the structural proteins of the nuclear matrix and the gene expression machinery (97). Consistent with this model, we detected a number of nuclear matrix proteins in our endogenously formed pre-mRNA splicing complex preparations including actin, spectrin, matrin3, numatrin, lamin B and a matrix associated protein MAP1. Although we cannot confirm the functional relevance of these associations, we note that a few structural proteins also co-purified with in vitro-assembled spliceosomes. We also note that a number of hnRNPs and other known splicing factors such as Prp19p (98) were initially termed nuclear matrix-associated proteins, indicating an intimate relationship between the pre-mRNA processing machinery and the nuclear matrix. Indeed, it is an attractive hypothesis that pre-mRNA and mRNA are trafficked to the nuclear pore via the nuclear matrix.

Nuclear pore complex proteins

A substantial number of nucleoporins (NUPs) are present in the purified supraspliceosome complexes from human cells but not in spliceosomes assembled in vitro, which may perhaps be due to our use of sonication to release complexes from the purified nuclei versus salt extraction for preparation of splicing extracts. In the chicken supraspliceosomes, we detected a smaller set of NUPs, which may be due to their release by the detergent NP40 present during purification of these complexes. NP40 was absent during the purification of the human complexes, which likely maintained the integrity of hydrophobic interactions believed to stabilize the interaction of export complexes with NUPs.

Polypeptides novel to pre-mRNA splicing–SWI/SNF proteins and associated factors

A recent report from Muchardt and colleagues (99) has demonstrated that the SWI/SNF component Brahma/SMARCA4 (Brm) associates with the splicing apparatus and its presence favors the inclusion of alternatively spliced exons. The Prp4 kinase, which is present in both the human and chicken supraspliceosomes, has been reported to phosphorylate both Brm and the splicing factor U5–102K/hPrp6 (100) providing further evidence that it functions in Pol II transcript maturation. In the purified chicken supraspliceosomes, Brahma/SMARCA4, and a number of other SWI/SNF-related polypeptides were identified (Table 3), all with high degrees of confidence given the depth of the peptide identification. As our mass spectrometry data neither included structural proteins of chromatin- such as histones, nor the DNA replication machinery or other DNA binding proteins, Brahma and other polypeptides with chromatin-related functions must specifically associated with the pre-mRNA processing complexes.

Novel polypeptides present in native supraspliceosomes

Our mass spectrometry peptide data revealed several novel and intriguing polypeptides in the supraspliceosome complexes (Table 3). We found chicken homologs of the yeast splicing factors Prp38p, Prp39p and Aar2p, previously unannotated in purified pre-mRNA splicing complexes. The other polypeptides of interest in the endogenous splicing complexes include the 5′ to 3′ exonuclease XRN2/Rat1p (101–103), which has been implicated in linking transcription termination with polyadenylation. XRN2/Rat1p was found in both the human and chicken preparations, as were two uncharacterized AAA ATPases, ATAD3A and ATAD3B. The identities of 25 additional novel polypeptides are reported in Table 3.

Co-immunoprecipitation of spliceosomal components by antiserum directed against a novel polypeptide

To demonstrate the authenticity and functional relevance of a novel polypeptide that co-purified with endogenous spliceosomes, we generated antiserum against ZFR (Table 2) and used it to specifically immunopurify ZFR-associated components. In Figure 5A, we show that the ZFR polypeptide is present in very high molecular weight complexes that co-migrate with supraspliceosomal material. In Figure 5B, we show that the anti-ZFR antiserum, but not the pre-immune serum or the Protein-A beads, immunoprecipitates the U1, U2, U4, U5 and U6 snRNAs. As a positive control, we showed that antiserum directed against the known spliceosomal protein SR140, prepared and analyzed under identical conditions, also immunoprecipitated all of the snRNAs. We also tested for the presence of another pre-mRNA splicing factor, hPrp43 (DHX15), in the material immunopurified with anti-ZFR; Figure 5C shows that the specific antiserum, but not the pre-immune serum or the Protein-A beads, immunoprecipitates hPrp43p/DDX15. This demonstrates that the novel spliceosome-associated factor ZFR is indeed associated with spliceosomal snRNAs and other spliceosomal proteins.
Figure 5.

The novel Zn finger protein ZFR is a bona fide spliceosomal component. (A) ZFR sediments with the 200S particle in glycerol gradients. HeLa nuclear extract was subjected to glycerol velocity gradient sedimentation analysis as in Figure 2. Proteins from the indicated fractions were electrophoresed through SDS-PAGE gels and subjected to western blot analysis using anti-ZFR antiserum. The bar below the gel denotes the 200S region. (B) ZFR is specifically associated with spliceosomal snRNAs. Equal amounts of HeLa nuclear extract were incubated with protein-A beads (beads), pre-immune serum and protein-A beads (pre-immune), anti-ZFR antiserum and protein-A beads (anti-ZFR) or anti-SR140 antiserum and protein-A beads (anti-SR140) according to the Materials and Methods. Recovered nucleic acids were subjected to northern blot analysis and probed with antisense probes to human snRNAs (identities noted to the right of the Figure). (C) ZFR is specifically associated with complexes containing spliceosomal proteins. Immunoprecipitation conditions and lanes are as described in (B). Proteins were subjected to western blot analysis using hPrp43 antiserum.

The novel Zn finger protein ZFR is a bona fide spliceosomal component. (A) ZFR sediments with the 200S particle in glycerol gradients. HeLa nuclear extract was subjected to glycerol velocity gradient sedimentation analysis as in Figure 2. Proteins from the indicated fractions were electrophoresed through SDS-PAGE gels and subjected to western blot analysis using anti-ZFR antiserum. The bar below the gel denotes the 200S region. (B) ZFR is specifically associated with spliceosomal snRNAs. Equal amounts of HeLa nuclear extract were incubated with protein-A beads (beads), pre-immune serum and protein-A beads (pre-immune), anti-ZFR antiserum and protein-A beads (anti-ZFR) or anti-SR140 antiserum and protein-A beads (anti-SR140) according to the Materials and Methods. Recovered nucleic acids were subjected to northern blot analysis and probed with antisense probes to human snRNAs (identities noted to the right of the Figure). (C) ZFR is specifically associated with complexes containing spliceosomal proteins. Immunoprecipitation conditions and lanes are as described in (B). Proteins were subjected to western blot analysis using hPrp43 antiserum. We note that there are snRNA stoichiometry differences between the two immunoprecipitations. Although we cannot be certain of exactly why this is, it may be that the ZFR polypeptide is associated with a number of different complexes, as is almost certain from its distribution in the glycerol gradient. As the immunoprecipitations were performed using nuclear extracts, and not size-selected complexes, the snRNA representation reflects the ZFR-associated material from the entire nucleus and not solely from the polyspliceosomes. In this work, we report the composition of the endogenous pre-mRNA processing machinery from human and chicken cells and provide a comparison between native supraspliceosome complexes and spliceosomes assembled on a model single-intron substrate in vitro from salt-extracted nuclear material. These supraspliceosomes have recently been shown to be functional in add-back experiments using micrococcal nuclease-treated extracts for in vitro splicing by Sperling and colleagues (21), further enhancing the functional relevance of our findings. In addition to confirming the set of factors known to interact with Pol II transcripts during splicing, we discovered an extensive array of novel factors by purifying supraspliceosomes from two types of vertebrate cells. Many of these have been implicated in pre-mRNA maturation including a subset of the SWI/SNF chromatin remodeling complex proteins, recently shown to influence alternative splicing patterns and to co-purify with pre-mRNA splicing factors. The novel polypeptides discovered in the endogenous complexes will provide a rich source of new proteins to investigate, ultimately enhancing our understanding of this incredibly complex macromolecular machine.

Comparison with in vitro-assembled spliceosomes

In Tables 1–3 we present a comparison of the polypeptides present in the supraspliceosomes purified in this work with those purified in previous in vitro spliceosome preparations. The core machinery (snRNPs, SAPs, SR proteins, etc.) is well represented in the material derived from all of the purification schemes. However, a great many other factors are present in all of the preparations as well, highlighting the amazing number of proteins required to remove even the single intron used in the in vitro experiments. The methods used for the purification of all of the complexes represented in Tables 1–3 were operationally distinct and the in vivo-assembled spliceosomes contained a larger number of proteins than did the spliceosome preparations formed in vitro. What is perhaps most remarkable about our results is the fact that, despite the operationally distinct purification strategies, the basal pre-mRNA processing machinery required to effect the removal of a single intron in vitro is not vastly different than that purified from complex mixtures of all of the pre-mRNAs in a vertebrate or human nucleus. The major differences in composition between the previous purifications and the one described herein involve (i) polypeptides predicted by sequence homology to interact with the pre-mRNA (ii) the depth of coverage for polypeptides involved in export and 3′ end processing and (iii) polypeptides that may require that the pre-mRNA in these complexes follow the path of RNA Pol II transcription and nuclear trafficking, such as the SWI/SNF complexes, structural proteins and NUPs.

Pre-mRNA processing factors not present in any spliceosome purification

To complete the catalog of polypeptides that participate in the nuclear pre-mRNA processing pathway, we have compiled a list of factors know to function in processing of RNA Pol II transcripts, but not present in any of the five spliceosome preparations listed in Tables 1–3. In Table 4, we outline this relatively short list of 14 factors. Table 4 does not include factors implicated in yeast splicing, but for which no identifiable human or vertebrate homologue in the genomic databases. By adding together all of the polypeptides listed in Tables 1–4, we arrive at an estimate of at least 305 for the number of proteins that co-purify with endogenous pre-mRNA splicing complexes. This is by far the largest cataloguing of factors potentially required for or participating in this process to date.
Table 4.

Known pre-mRNA processing proteins not present in any purified splicing complex

ENSEMBL accession #aHGNCbPolypeptidec
ENSG00000095485CWF19L1CWF19
ENSG00000152404CWF19L2CWF19
ENSG00000165630PRPF18PRP18
ENSG00000140829DHX38PRP16
ENSG00000149532CFI-59K
ENSG00000165494PCF11PCF11
ENSG00000172409CLP1CLP1
ENSG00000111880HCAP1HCE/CEG1
ENSG00000146007ZMAT2SNU23
ENSG00000108296CCDC49CWC25
ENSG00000101138CSTF1CSTF-50
ENSG00000101811CSTF2CSTF-64
ENSG00000161981C16ORF33U11/U12–25K
ENSG00000184209U11/U12–35K

aData available at: http://www.ensembl.org.

bHUGO Gene Nomenclature Committee designation, a dash represents a polypeptide which has not yet been assigned a systematic name.

cCommon name.

Two possible classes of polypeptides may exist that are not detected in our preparations. First are those that are underrepresented because they may interact with only a small number of pre-mRNAs, such as intron- or exon-specific binding proteins. Other classes of proteins which may participate in pre-mRNA splicing but as absent from our analyses might include tissue-type developmental stage-specific factors which would not be present in our bulk supraspliceosome preparations due to the use of only two cell types. To date, such factors have generally been elucidated via genetic or molecular strategies focused on individual pre-mRNAs. However, with the introduction of the CLEP tagging technology to more cell types (22), it may be possible to rapidly enumerate factors that function in regulated or alternative splicing. Although we cannot completely eliminate the possibility that there may be contaminants present in our preparations, the basic strategy we adopted is validated by the presence proteins such as Brahma that were not detected in the previously characterized in vitro-assembled spliceosomes yet have clearly been implicated in splicing through other means. Our analyses in aggregate greatly expand our knowledge of the protein factors that function both in basal and regulated splicing in vertebrate cells.

SUPPLEMENTARY DATA

Supplementary Data are available at NAR Online.
  103 in total

1.  PUF60: a novel U2AF65-related splicing activity.

Authors:  P S Page-McCaw; K Amonlirdviman; P A Sharp
Journal:  RNA       Date:  1999-12       Impact factor: 4.942

2.  Composition and functional characterization of the yeast spliceosomal penta-snRNP.

Authors:  Scott W Stevens; Daniel E Ryan; Helen Y Ge; Roger E Moore; Mary K Young; Terry D Lee; John Abelson
Journal:  Mol Cell       Date:  2002-01       Impact factor: 17.970

3.  The human 18S U11/U12 snRNP contains a set of novel proteins not found in the U2-dependent spliceosome.

Authors:  Cindy L Will; Claudia Schneider; Markus Hossbach; Henning Urlaub; Reinhard Rauhut; Sayda Elbashir; Thomas Tuschl; Reinhard Lührmann
Journal:  RNA       Date:  2004-06       Impact factor: 4.942

Review 4.  Actin and myosin I in the nucleus: what next?

Authors:  Primal de Lanerolle; Terazina Johnson; Wilma A Hofmann
Journal:  Nat Struct Mol Biol       Date:  2005-09       Impact factor: 15.369

5.  Rat1p and Xrn1p are functionally interchangeable exoribonucleases that are restricted to and required in the nucleus and cytoplasm, respectively.

Authors:  A W Johnson
Journal:  Mol Cell Biol       Date:  1997-10       Impact factor: 4.272

6.  Requirement of the RNA helicase-like protein PRP22 for release of messenger RNA from spliceosomes.

Authors:  M Company; J Arenas; J Abelson
Journal:  Nature       Date:  1991-02-07       Impact factor: 49.962

7.  Proteomics analysis reveals stable multiprotein complexes in both fission and budding yeasts containing Myb-related Cdc5p/Cef1p, novel pre-mRNA splicing factors, and snRNAs.

Authors:  Melanie D Ohi; Andrew J Link; Liping Ren; Jennifer L Jennings; W Hayes McDonald; Kathleen L Gould
Journal:  Mol Cell Biol       Date:  2002-04       Impact factor: 4.272

8.  Prp43 is an essential RNA-dependent ATPase required for release of lariat-intron from the spliceosome.

Authors:  Arnold Martin; Susanne Schneider; Beate Schwer
Journal:  J Biol Chem       Date:  2002-03-08       Impact factor: 5.157

9.  A cold-sensitive mRNA splicing mutant is a member of the RNA helicase gene family.

Authors:  E J Strauss; C Guthrie
Journal:  Genes Dev       Date:  1991-04       Impact factor: 11.361

10.  Mammalian PRP4 kinase copurifies and interacts with components of both the U5 snRNP and the N-CoR deacetylase complexes.

Authors:  Graham Dellaire; Evgeny M Makarov; Jeff J M Cowger; Dasa Longman; Heidi G E Sutherland; Reinhard Lührmann; Joseph Torchia; Wendy A Bickmore
Journal:  Mol Cell Biol       Date:  2002-07       Impact factor: 4.272

View more
  56 in total

1.  Drosophila acinus encodes a novel regulator of endocytic and autophagic trafficking.

Authors:  Adam S Haberman; Mohammed Ali Akbar; Sanchali Ray; Helmut Krämer
Journal:  Development       Date:  2010-05-26       Impact factor: 6.868

Review 2.  Adenovirus E1B 55-kilodalton protein: multiple roles in viral infection and cell transformation.

Authors:  Andrew N Blackford; Roger J A Grand
Journal:  J Virol       Date:  2009-02-11       Impact factor: 5.103

3.  The conserved 3' end domain of U6atac snRNA can direct U6 snRNA to the minor spliceosome.

Authors:  Rosemary C Dietrich; Richard A Padgett; Girish C Shukla
Journal:  RNA       Date:  2009-04-16       Impact factor: 4.942

Review 4.  Spliceosomal immunophilins.

Authors:  Annia Mesa; Jason A Somarelli; Rene J Herrera
Journal:  FEBS Lett       Date:  2008-06-09       Impact factor: 4.124

5.  Adaptor Aly and co-adaptor Thoc5 function in the Tap-p15-mediated nuclear export of HSP70 mRNA.

Authors:  Jun Katahira; Hitomi Inoue; Ed Hurt; Yoshihiro Yoneda
Journal:  EMBO J       Date:  2009-01-22       Impact factor: 11.598

6.  Release of SF3 from the intron branchpoint activates the first step of pre-mRNA splicing.

Authors:  Rea M Lardelli; James X Thompson; John R Yates; Scott W Stevens
Journal:  RNA       Date:  2010-01-20       Impact factor: 4.942

Review 7.  The structure of human cleavage factor I(m) hints at functions beyond UGUA-specific RNA binding: a role in alternative polyadenylation and a potential link to 5' capping and splicing.

Authors:  Qin Yang; Gregory M Gilmartin; Sylvie Doublié
Journal:  RNA Biol       Date:  2011-09-01       Impact factor: 4.652

8.  U1 small nuclear RNA variants differentially form ribonucleoprotein particles in vitro.

Authors:  Jason A Somarelli; Annia Mesa; Carol E Rodriguez; Shalini Sharma; Rene J Herrera
Journal:  Gene       Date:  2014-02-26       Impact factor: 3.688

9.  Novel splicing factor RBM25 modulates Bcl-x pre-mRNA 5' splice site selection.

Authors:  Anyu Zhou; Alexander C Ou; Aeri Cho; Edward J Benz; Shu-Ching Huang
Journal:  Mol Cell Biol       Date:  2008-07-28       Impact factor: 4.272

10.  The N-terminus of Prp1 (Prp6/U5-102 K) is essential for spliceosome activation in vivo.

Authors:  Martin Lützelberger; Claudia A Bottner; Wiebke Schwelnus; Susanne Zock-Emmenthal; Aleh Razanau; Norbert F Käufer
Journal:  Nucleic Acids Res       Date:  2009-12-09       Impact factor: 16.971

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.