| Literature DB >> 23241926 |
Yang Wang1, Xinshu Xiao, Jianming Zhang, Rajarshi Choudhury, Alex Robertson, Kai Li, Meng Ma, Christopher B Burge, Zefeng Wang.
Abstract
To better understand splicing regulation, we used a cell-based screen to identify ten diverse motifs that inhibit splicing from introns. Motifs were validated in another human cell type and gene context, and their presence correlated with in vivo splicing changes. All motifs exhibited exonic splicing enhancer or silencer activity, and grouping these motifs according to their distributions yielded clusters with distinct patterns of context-dependent activity. Candidate regulatory factors associated with each motif were identified, to recover 24 known and new splicing regulators. Specific domains in selected factors were sufficient to confer intronic-splicing-silencer activity. Many factors bound multiple distinct motifs with similar affinity, and all motifs were recognized by multiple factors, which revealed a complex overlapping network of protein-RNA interactions. This arrangement enables individual cis elements to function differently in distinct cellular contexts, depending on the spectrum of regulatory factors present.Entities:
Mesh:
Substances:
Year: 2012 PMID: 23241926 PMCID: PMC3537874 DOI: 10.1038/nsmb.2459
Source DB: PubMed Journal: Nat Struct Mol Biol ISSN: 1545-9985 Impact factor: 15.369
Figure 1A fluorescence-activated screen for intronic splicing silencers (FAS-ISS) identifies 102 unique ISS 10mers
(a) Test of the reporter system. Two known ISS sequences: Test 1 (PTB binding site TCCTCCTCCA) and Test 2 (hnRNP A1 binding site, TAGGTAGGTA), and an arbitrary control sequence (ACCTCAGGCG) were inserted into the GFP-based reporter (pZW11) and transiently transfected into 293T cells. The fraction of GFP-positive cells was measured with flow cytometry 24 h after transfection (mean and standard deviation of 3 or more replicates shown). (b) Measurement of splicing with RT-PCR using the RNA templates purified from the transfected cells and primers targeting exons 1 and 3 of reporter. (c) Chromatogram from sequencing of the random 10mer region of minigene. DNAs amplified by PCR from stably transfected 293FlpIn cells shows the randomness of the initial library prior to screening. Sequences around the insertion region (positions 480–489) are shown. (d) Diagram of screening strategy and resulting ISS motifs. The 102 unique ISS 10mers identified through FAS-ISS were clustered based on the sequence similarity using ClustalW2. The 16 ISS 10mers that were reinserted into the original construct and retested for ISS activity by transient transfection are indicated (red asterisks).
Figure 2Identification of core ISS motifs and exemplars and validation of activity
(a) Over-represented oligonucleotide k-mers were extracted from FAS-ISS 10mers and clustered based on sequence similarity. The consensus motifs, the exemplars (representative 6- or 7-mers) and tested mutations for each group are listed at right. (b) The ISS exemplar and mutant from each group were inserted into the pZW11 plasmid, transiently transfected into 293T cells, and splicing was assessed by flow cytometry as the fraction of GFP positive cells, or (c) by semi-quantitative RT-PCR (sqRT-PCR). The percent-spliced-in (PSI) values of three independent experiments are shown beneath a representative gel. Error bars, s.d.; n=3. (d)Validation of ISS activity in a second cell type. HeLa cells were transfected with pZW11 containing ISS exemplars and their mutants, and the inclusion of the test exon was determined by sqRT-PCR. Error bars, s.d.; n=3. (d) Validation of ISS activity in a heterologous intron context. Two copies of ISS exemplars (Supplementary Table 3) from each group were inserted downstream of a cassette exon in a different splicing reporter (pZW2C, see Methods) and transfected into 293T cells, followed by sqRT-PCR as above. Four neutral sequences that did not match known SREs (Supplementary Table 4) were used as negative controls. Error bars, s.d.; n=3.
Figure 3ISS motifs regulate splicing from an exonic context
(a) FAS-ISS motifs can function as ESEs or ESSs. Two copies of ISS exemplars were inserted into the test exon of a modular splicing reporter [55], and transfected into HeLa and 293T cells. The resulting PSI values divided by the mean of 3 negative controls are shown as “relative exon inclusion” levels (Supplementary Table 4). The mean and s.d. of three independent transfections are shown. (b) FAS-ISS motifs can regulate alternative 5′SS. A single copy of the exemplar motif from each group was inserted between two alternative 5′SS of a minigene reporter [23] and transfected into HeLa and 293T cells. The observed PSI value – defined as the fractional use of the proximal (downstream) 5′SS – was measured by sqRT-PCR and divided by that observed in controls to give relative proximal 5'SS usage (Supplementary Table 4). Error bars, s.d.; n=3.
Figure 4Classification of ISSs based on genomic distribution yields clusters with similar context-dependent activity
(a) Principal component analysis (PCA) of relative enrichment in 13 pairs of pre-mRNA regions. Relative enrichment between each pair of regions (e.g., SE relative to CE) were converted to a vector of 13 z-scores for each ISS motif, which were analyzed by standard PCA. ISS groups were clustered based on their positions in the space of the first two principal components, which explain 86% of the variance. Both axes were labeled with z-scores (unitless). (b) Summary of the distributional biases and context dependent activity of each ISS group. Diagrams representing constitutive exons, skipped exons and introns are shown adjacent to each motif in order from highest (above) to lowest motif abundance, followed by diagrams representing the distribution of the motif from 5' to 3' ends of exons. The activities of each ISS motif in different pre-mRNA locations based on data from Figure 3 and Supplementary Table 5 are listed at right (the ½ indicates that motif B had ESE activity in 1 of 2 cell lines tested). The ISS motif labels are color-coded according to the PCA clusters shown in panel A.
Figure 5Identification and validation of ISS-associated splicing repressors
(a) Diagram of protein identification procedure. (b) Affinity purification using motif U (CA-rich) and its mutant as bait incubated with extracts from 293T and HeLa cells. Four specific bands were excised for mass spectrometry, yielding peptides from the proteins listed at right. (c) RNAi analysis of the putative splicing regulatory factors. Derepression of the target exon is observed following knockdown of hnRNP L and/or YB-1. Results for reporters containing another ISS (motif D) as a specificity control are shown at right. The mean and s.d. of PSI values measured by semi-quantitative RT-PCR from three independent experiments are shown below a representative gel in all panels c, d and f (error bars, s.d.; n=3). (d) RT-PCR experiments showing the over-expression of trans-factors specifically inhibits exon inclusion in reporters with the cognate ISS. Reporters contain ISSs of other group were used as a specificity control. (e) Co-immunoprecipitation in 293T cells using anti-Flag antibody for precipitation and antibodies against endogenous proteins for detection. The cells were lysed in the absence or presence of RNase A. (f) RT-PCR experiments showing that over-expression of YB-1 reversed the splicing derepression phenoptype caused by RNAi of hnRNP L. The splicing reporters containing cognate ISS (group U) or control ISS (group D) were used as a specificity control.
Putative trans-factors of FAS-ISSs identified through affinity purification.
| Associated Proteins | |||
|---|---|---|---|
| ISS | Representative | Putative splicing factors | Other RNA binding |
| A | CTCCTC | hnRNP I/PTB | |
| B | TACAGCT | Nono (2 RRM) | La protein |
| YB-1 (CSD) | |||
| hnRNP I/PTB (4 RRM + Ala-rich) | |||
| C | CTTCAG | SFPQ (2 RRM + Gln/Glu/Pro-rich) | Nucleolin |
| hnRNP UL1 (Gly-rich + Pro-rich) | |||
| hnRNP H1 (3 RRM + Gly-rich) | |||
| hnRNP A1 (2 RRM + Gly-rich) | |||
| hnRNP A2/B1 (2 RRM + Gly-rich) | |||
| D | GAACAG | GRSF-1 (3 RRM + Ala-rich) | CSDE1 |
| hnRNP F (3 RRM) | |||
| SFRS1 (2 RRM + Gly-rich) | |||
| RBM45/CELF-3 homolog (3 RRM) | |||
| E | CAAAGGA | SFRS1 (2 RRM + Gly-rich) | several U1 snRNP core |
| hnRNP Q (3 RRM) | |||
| hnRNP A1 (2 RRM + Gly-rich) | |||
| hnRNP A2/B1 (2 RRM + Gly-rich) | |||
| hnRNP H1 (3 RRM + Gly-rich) | |||
| hnRNP F (3 RRM) | |||
| FUS | |||
| F | AGATATT | KHSRP (4 KH + Gly-rich) | La protein |
| DAZAP1 (2 RRM + Pro-rich) | |||
| hnRNP D0 (2 RRM + Gly-rich) | |||
| hnRNP A0 (2 RRM + Gly-rich) | |||
| hnRNP A1 (2 RRM + Gly-rich) | |||
| hnRNP A2/B1 (2 RRM + Gly-rich) | |||
| hnRNP A3 (2 RRM + Gly-rich) | |||
| CIRBP (1 RRM + Gly-rich) | |||
| G3BP-1 (1 RRM + Gly-rich) | |||
| G3BP-2 (1 RRM + Gly-rich) | |||
| G | ACATGA | G3BP1 (1 RRM + Gly-rich) | GTF2I |
| hnRNP L (3 RRM + Gly-rich + Pro-rich) | |||
| H | AATTTA | DAZAP1 (2 RRM + Pro-rich) | La protein |
| hnRNP A0 (2 RRM + Gly-rich) | |||
| hnRNP D0 (2 RRM + Gly-rich) | |||
| hnRNP DL (2 RRM + Gly-rich) | |||
| hnRNP A1 (2 RRM + Gly-rich) | |||
| hnRNP A2/B1 (2 RRM + Gly-rich) | |||
| hnRNP A3 (2 RRM + Gly-rich) | |||
| I | AGTAGG | DAZAP1 (2 RRM + Pro-rich) | DHX36 |
| hnRNP A1 (2 RRM + Gly-rich) | |||
| hnRNP A2/B1 (2 RRM + Gly-rich) | |||
| hnRNP A3 (2 RRM + Gly-rich) | |||
| hnRNP H1 (3 RRM + Gly-rich) | |||
| hnRNP D0 (2 RRM + Gly-rich) | |||
| hnRNP DL (2 RRM + Gly-rich) | |||
| FUS | |||
| U | CACACCA | hnRNP L (3 RRM + Gly-rich + Pro-rich) | IGF mRNA binding |
| YB-1 (CSD) | |||
Characteristic RNA binding or splicing regulatory protein motifs are indicated (RRM: RNA Recognition Motif; CSD: Cold shock domain; Gly-rich: Glycine rich motif; etc.).
Based on the literature rather than affinity purification/mass spec.
Figure 6ISSs are recognized by a complex, overlapping network of factors
(a) Interactions between putative splicing factors and all ISS groups were represented by solid lines (dashed lines represent interactions inferred from the literature), yielding a many-to-many connectivity map encompassing all 10 ISSs and their cognate factors. (b) Surface plasmon resonance presenting direct RNA-protein binding between ISS exemplars and proteins of the hnRNP A and D families. Three different protein concentrations (50, 100 and 200 nM, from bottom to top in each panel) were injected at a flow rate of 50 µl/min for 120 s. For hnRNP A1, 20, 40, and 80 nM were used to compensate for the strong binding signal. (c) RT-PCR results showing that tethering RS and Gly-rich to a downstream intronic location can repress splicing. Fusion proteins consisting of a PUF domain fused to an RS or Gly-rich domain were assayed for their effects on splicing. Two target 8mers, UGUAUGUA and UUGAUAUA recognized respectively by PUF(3-2) and PUF(6-2), were inserted into the splicing reporter pZW2C and co-transfected with the indicated fusion proteins into 293T cells. All combinations of fusion proteins and targets were tested, with matching PUF-target pairs indicated by same colors (blue or black). The PUF domains alone and empty vector served as negative controls. Right panel, fusions between PUF(6-2) and a variety of RS and Gly-rich domains were tested. Error bars, s.d.; n=3.