Literature DB >> 23638305

Counting on co-transcriptional splicing.

Mattia Brugiolo1, Lydia Herzel1, Karla M Neugebauer1.   

Abstract

Splicing is the removal of intron sequences from pre-mRNA by the spliceosome. Researchers working in multiple model organisms - notably yeast, insects and mammalian cells - have shown that pre-mRNA can be spliced during the process of transcription (i.e. co-transcriptionally), as well as after transcription termination (i.e. post-transcriptionally). Co-transcriptional splicing does not assume that transcription and splicing machineries are mechanistically coupled, yet it raises this possibility. Early studies were based on a limited number of genes, which were often chosen because of their experimental accessibility. Since 2010, eight studies have used global datasets as counting tools, in order to quantify co-transcriptional intron removal. The consensus view, based on four organisms, is that the majority of splicing events take place co-transcriptionally in most cells and tissues. Here, we discuss the nature of the various global datasets and how bioinformatic analyses were conducted. Considering the broad differences in experimental approach and analysis, the level of agreement on the prevalence of co-transcriptional splicing is remarkable.

Entities:  

Year:  2013        PMID: 23638305      PMCID: PMC3619158          DOI: 10.12703/P5-9

Source DB:  PubMed          Journal:  F1000Prime Rep        ISSN: 2051-7599


Introduction

Transcription and pre-mRNA splicing are carried out by two distinct macromolecular machines, RNA polymerase II (Pol II) and the spliceosome, which are capable of functioning independently of one another. Polyadenylation of transcripts occurs upon co-transcriptional cleavage of the nascent RNA chain and, thereby, indicates that transcription is complete. Although most mature poly A+ transcripts are spliced, early studies yielded mixed results as to whether splicing is completed before polyadenylation [1-3]. Subsequently, a number of key studies in several model systems supported the notion that splicing at least begins co-transcriptionally, i.e. on nascent RNA tethered to chromatin by elongating Pol II [4-12]. Co-transcriptional splicing is not surprising: nucleoplasm is filled with spliceosomal components that should be able to associate with nascent RNA, much as ribosomes are capable of translating RNA co-transcriptionally in bacteria. Moreover, in vivo rates for the splicing reaction are fast: in the order of 30 seconds to 3 minutes from the time the intron is complete, depending on the species and the method used [5,12-15]; splicing takes significantly less time than gene transcription [16]. Nevertheless, some dramatic examples of post-transcriptional splicing – for example, in anucleate platelets upon activation and in developing fern gametes – remind us of the potential importance of not splicing co-transcriptionally in certain biological contexts [17,18]. The regulatory potential of co-transcriptional splicing is what the field has found so exciting. We started to consider the possibilities that Pol II and the spliceosome could interact physically, that transcription might influence splicing and vice versa, and that co-transcriptional RNA processing could yield a more “fit” mRNP [16,19]. We asked ourselves: how frequent is co-transcriptional splicing among genes and introns? Early studies were based on specific genes, which were handpicked because of special properties such as transcriptional inducibility, gene length, and accessibility to light and/or EM microscopy [20]. Since 2010, eight global studies, investigating multiple tissues and cell types in four organisms, have been published. The consensus view is that co-transcriptional splicing is widespread (see Table 1). These important studies are the subject of this review.
Table 1.

Global studies on co-transcriptional splicing in four different organisms, listed in chronological order of publication

Carrillo Oesterreich et al. 2010 [21], Ameur et al. 2011 [22], Khodor et al. 2011 [23], Bhatt et al. 2012 [28], Girard et al. 2012 [26], Tilgner et al. 2012 [24], Windhager et al. 2012 [25] and Khodor et al. 2012 [27]. Co-transcriptional splicing frequencies are given as a value on the scale 0 to 1 (0% and 100% co-transcriptional splicing, respectively) in form of the median (Carrillo Oesterreich et al. 2010 [21], Khodor et al. 2011 [23] & 2012 [27], Tilgner et al. 2012 [24]), as average value determined by spliceosomal protein quantification in the chromatin fraction compared to nucleoplasm (Girard et al. 2012 [26]) or as fraction of highly expressed exons surrounded by long introns showing high co-transcriptional splicing (Ameur et al. 2011 [22]). Windhager et al. 2012 [25] calculated the co-transcriptional splicing frequency as the ratio of observed reads in the set of analyzed genes to the number of reads calculated when expecting a uniform read distribution without splicing and weighted for the transcript expression.

Global studies of co-transcriptional splicing

The first global study on co-transcriptional splicing was undertaken in budding yeast, where it became clear that most introns are spliced co-transcriptionally; 50% of introns are >74% spliced before transcription termination [21]. Table 1 shows that co-transcriptional splicing frequencies are similarly high in fly and human cell lines and tissues [22-26]. Despite general agreement, there are important experimental and analytical differences among the studies. Carrillo Oesterreich et al. [21], Khodor et al. [23,27] and Tilgner et al. [24] all based their analyses on RNA isolated from biochemically purified chromatin, whereas Ameur et al. [22] and Windhager et al. [25] used total RNA sequencing and 4-thio-uridine (4sU)-labeling, respectively, to quantify co-transcriptional splicing. Even active spliceosomes and their nuclear location can be monitored, using protein biochemistry and immunofluorescence, showing that the majority of active spliceosomes is associated with chromatin [26]. That said, not all intron removal is co-transcriptional. In particular, terminal introns are least well removed co-transcriptionally, and 20% of activated spliceosomes in the cell are not chromatin-associated [24,26]. Table 1 and Figure 1 illustrate the key points of each global study and how co-transcriptional splicing frequencies were quantified. Due to different sample preparations and RNA pools analyzed, one can rarely apply the same method of analysis to a given dataset. For example, Ameur et al. performed total RNA sequencing; because exon reads could come from nascent or mature RNA, analysis was restricted to intron reads [22]. Gene architecture differences (e.g. short and few introns in yeast versus long and many introns in humans) also play a role. Three modes of analysis have been employed, using either junction reads – reads representing spliced (exon-exon) and unspliced (exon-intron and intron-exon) – or intron and/or exon coverage for calculating a splicing score:
Figure 1.

Schematic representations of how co-transcriptional splicing scores are calculated in global studies

(a) Calculation of co-transcriptional splicing frequency (∆) for each intron, by determining the relative difference between observed and extrapolated intronic probe intensities, using high density tiling arrays. (b) Determination of co-transcriptional splicing frequency around a given exon, by subtracting the read coverage over 2kb of the upstream intron (∑us) from read coverage over 2kb of the 5’ end of the downstream intron (∑ds). (c) Determination of co-transcriptional splicing frequency for each intron, by calculating the ratio of reads around the 3’ splice site: read coverage over the last 25 bp of a given intron, a, is divided by read coverage over the first 25 bp of the downstream exon, b. (d) Determination of co-transcriptional splicing frequency for each gene, by dividing the read coverage over exons by the read coverage over the whole locus. (e) Determination of co-transcriptional splicing frequency (completed splicing index, coSI) around a given exon, using exon-intron and exon-exon junction reads; c, d, and e represent exon-exon junctions; f and g represent exon-intron junctions.

Exon-centered splicing score [22,24]. Pro: Optimal for alternative cassette exon usage. Con: First and terminal exons have to be analyzed differently. Intron-centered splicing score [21,23,27]. Pro: First and terminal exons can be included in the analysis. Con: Short exons are a disadvantage. Gene-based splicing score [24,28]. Pro: Inclusion of many reads reduces noise. Con: No independent splicing values for individual introns; no information about alternative splicing events per gene; stable RNAs encoded within introns artificially lower splicing, if not filtered out; because RNAseq data are often biased towards 3’ ends, results could depend on transcript length.

Global studies on co-transcriptional splicing in four different organisms, listed in chronological order of publication

Carrillo Oesterreich et al. 2010 [21], Ameur et al. 2011 [22], Khodor et al. 2011 [23], Bhatt et al. 2012 [28], Girard et al. 2012 [26], Tilgner et al. 2012 [24], Windhager et al. 2012 [25] and Khodor et al. 2012 [27]. Co-transcriptional splicing frequencies are given as a value on the scale 0 to 1 (0% and 100% co-transcriptional splicing, respectively) in form of the median (Carrillo Oesterreich et al. 2010 [21], Khodor et al. 2011 [23] & 2012 [27], Tilgner et al. 2012 [24]), as average value determined by spliceosomal protein quantification in the chromatin fraction compared to nucleoplasm (Girard et al. 2012 [26]) or as fraction of highly expressed exons surrounded by long introns showing high co-transcriptional splicing (Ameur et al. 2011 [22]). Windhager et al. 2012 [25] calculated the co-transcriptional splicing frequency as the ratio of observed reads in the set of analyzed genes to the number of reads calculated when expecting a uniform read distribution without splicing and weighted for the transcript expression. Abbreviations: 4sU4-thio-uridine, I – intensity, ds – downstream, us – upstream, NA – not applicable; for score calculation schemes refer to Figure 1.

Schematic representations of how co-transcriptional splicing scores are calculated in global studies

(a) Calculation of co-transcriptional splicing frequency (∆) for each intron, by determining the relative difference between observed and extrapolated intronic probe intensities, using high density tiling arrays. (b) Determination of co-transcriptional splicing frequency around a given exon, by subtracting the read coverage over 2kb of the upstream intron (∑us) from read coverage over 2kb of the 5’ end of the downstream intron (∑ds). (c) Determination of co-transcriptional splicing frequency for each intron, by calculating the ratio of reads around the 3’ splice site: read coverage over the last 25 bp of a given intron, a, is divided by read coverage over the first 25 bp of the downstream exon, b. (d) Determination of co-transcriptional splicing frequency for each gene, by dividing the read coverage over exons by the read coverage over the whole locus. (e) Determination of co-transcriptional splicing frequency (completed splicing index, coSI) around a given exon, using exon-intron and exon-exon junction reads; c, d, and e represent exon-exon junctions; f and g represent exon-intron junctions. Differences among the methods could well influence the numerical results obtained and/or the interpretations, sometimes making it hard to compare the studies. For example, intron length negatively correlates with co-transcriptional splicing frequency in Drosophila, mouse and human cells [23,24,27]. However, Ameur et al., focusing their analysis in human brain on highly expressed genes with long introns, conclude the opposite [22]. Experimental validation of RNA sequencing and array data by RT-qPCR strengthens and extends results from these approaches [21,22]. Though Ameur et al. could not calculate co-transcriptional splicing frequencies for short introns genome-wide, their RT-PCR results suggest that high co-transcriptional splicing observed for long introns can also be inferred for shorter ones [22]. Remarkably, numerous studies agree that constitutive splicing is more co-transcriptional than alternative splicing [22-24,27]. Last year, one study conducted in induced mouse macrophages reported that full-length, polyA cleaved RNAs accumulate on chromatin in a partially spliced state [28]. The inference that splicing is completed post-transcriptionally in this cell type has been rather hastily interpreted as evidence against co-transcriptional splicing in general [29,30]. However, no overall numerical values for co-transcriptional splicing are provided [28]. Direct comparison to the other global studies is more difficult, because this analysis calculates splicing values on a per gene basis, using coverage over the whole gene (see Table 1 and Figure 1); since splicing values vary from intron to intron (see above), calculations for individual splicing events are more informative. It is possible that the gene-based frequency of splicing yields an underestimate of intron removal; for example, summation of the coverage data from Tilgner et al. (see Table S2 in [24]) also yields a lower co-transcriptional splicing frequency than usage of splice junction reads (Table 1) [24]. Introns contain a number of stable RNAs, such as snoRNAs, which contribute reads that do not represent unspliced transcripts [31]. Gene-based calculations may be influenced by coverage biases that can reflect differences in nucleotide content (e.g. fraction GC), directionality of sequencing (e.g. from the 3’ end) or RT-priming, which all influence the preparation of RNAseq libraries [32]. The average co-transcriptional splicing frequencies obtained for each gene will likely be influenced by gene length and the total number of introns within the gene; terminal exons are long and generally full of reads, and terminal introns tend to be least well spliced co-transcriptionally [6,11,24]. Nevertheless, it is clear from this and previous studies (referenced within) that processing may be delayed in these cells, such that a higher proportion of splicing is post-transcriptional. It would be fascinating to know which introns are being retained and, indeed, whether all introns within the same transcript are retained. The relatively low co-transcriptional splicing frequencies from both mouse studies contrast sharply with the high co-transcriptional splicing frequencies from yeast, fly, and human (Table 1). Perhaps the easiest means of addressing this would be to analyze directly comparable human and mouse cell types. It is difficult to resolve differences among studies when validation of co-transcriptional splicing frequencies by an independent method, such as RT-PCR, is omitted. Unfortunately, most current studies do not include validation. Validation acknowledges that something can be unexpected in either the experiment or the analysis, such as differences in biochemical purification, library biases or genome annotation [32]. For example, chromatin preparations can be contaminated with mRNA, which is highly abundant and could lead to an over-estimate in the degree of co-transcriptional splicing. A co-transcriptional process is one that occurs before polyA cleavage, so one would ideally like to incorporate this property into the validation. Due to fluctuations in read densities, the degree of polyA cleavage in the RNA sample can be difficult to ascertain from RNAseq. A prominently used assay employs reverse transcription to specifically copy only uncleaved transcripts, by utilizing a reverse primer placed downstream of the polyA cleavage site; subsequent PCR can query the spliced or unspliced status of the nascent RNA [11,12,21,33]. This method can be difficult in mammals, where polyA cleavage sites are hard to predict. Nevertheless, an independent, small-scale study focusing on 22 human genes was able to validate the high frequencies of co-transcriptional splicing seen in the global data, even among terminal and alternative introns [12].

Summary and future directions

Taken together, this array of high quality global studies enables us to reach a consensus on co-transcriptional splicing: it is widespread, albeit not 100%. Future challenges encompass the relative importance of co- and post-transcriptional splicing in terms of the fate of the RNA, on the one hand, and/or transcriptional activity of the gene, on the other hand. For example, histone modifications, which would have a bearing on co-transcriptional but not post-transcriptional splicing, can directly or indirectly recruit splicing factors and modify alternative splicing [34,35]. Moreover, transcription elongation rates are influenced by nucleosome positioning and histone modifications, which influence alternative splice site choice [16,35,36]. Co-transcriptional splicing may also have long-lasting effects on the RNA's lifetime, by ensuring proper assembly of export-competent mRNPs [19]. These examples show that co-transcriptional splicing is important for mRNA biogenesis. Co-transcriptional splicing has also emerged as an important regulator of transcriptional activity. It has long been known that the presence of promoter-proximal introns can stimulate gene expression [37-39]. Recent work shows that splicing feeds back to transcription through a distance-dependent enhancer-like activity of the first 5’ splice site [33]. Thus, genes and gene expression machinery have evolved coordinately to take advantage of crosstalk between transcription and splicing. If specific biological situations – such as the activation of transcriptional programs in macrophages or the repression of splicing in platelets – circumvent co-transcriptional processes, then perhaps there are additional regulatory reasons. In this sense, it is important to recognize that no study claims 100% of introns are 100% co-transcriptionally removed. Advances in high-throughput sequencing that enable sequencing of longer DNA molecules (in the kilobase range) will provide clarity and facilitate analysis, as well as providing insight into the order of intron removal and co-transcriptional dynamics of alternative splicing. Those introns, such as alternative introns, that are spliced post-transcriptionally may be subject to different regulatory mechanisms [19,40]. A challenge for the future will be to more fully explore the significance of post-transcriptional splicing for gene expression.
  40 in total

Review 1.  Pre-mRNA splicing: where and when in the nucleus.

Authors:  Joonhee Han; Ji Xiong; Dong Wang; Xiang-Dong Fu
Journal:  Trends Cell Biol       Date:  2011-04-21       Impact factor: 20.808

2.  New insights from existing sequence data: generating breakthroughs without a pipette.

Authors:  Alex M Plocik; Brenton R Graveley
Journal:  Mol Cell       Date:  2013-02-21       Impact factor: 17.970

3.  Removal of retained introns regulates translation in the rapidly developing gametophyte of Marsilea vestita.

Authors:  Thomas C Boothby; Richard S Zipper; Corine M van der Weele; Stephen M Wolniak
Journal:  Dev Cell       Date:  2013-02-21       Impact factor: 12.270

4.  Transcription, splicing, and release: are we there yet?

Authors:  Ranjan Sen; Sebastian D Fugmann
Journal:  Cell       Date:  2012-07-20       Impact factor: 41.582

5.  RNP particles at splice junction sequences on Drosophila chorion transcripts.

Authors:  Y N Osheim; O L Miller; A L Beyer
Journal:  Cell       Date:  1985-11       Impact factor: 41.582

6.  Steps in the processing of Ad2 mRNA: poly(A)+ nuclear sequences are conserved and poly(A) addition precedes splicing.

Authors:  J R Nevins; J E Darnell
Journal:  Cell       Date:  1978-12       Impact factor: 41.582

Review 7.  Pre-mRNA splicing during transcription in the mammalian system.

Authors:  Amy Pandya-Jones
Journal:  Wiley Interdiscip Rev RNA       Date:  2011-05-02       Impact factor: 9.957

8.  The ovalbumin gene family: hormonal control of X and Y gene transcription and mRNA accumulation.

Authors:  M LeMeur; N Glanville; J L Mandel; P Gerlinger; R Palmiter; P Chambon
Journal:  Cell       Date:  1981-02       Impact factor: 41.582

9.  Introns increase transcriptional efficiency in transgenic mice.

Authors:  R L Brinster; J M Allen; R R Behringer; R E Gelinas; R D Palmiter
Journal:  Proc Natl Acad Sci U S A       Date:  1988-02       Impact factor: 11.205

10.  A wave of nascent transcription on activated human genes.

Authors:  Youichiro Wada; Yoshihiro Ohta; Meng Xu; Shuichi Tsutsumi; Takashi Minami; Kenji Inoue; Daisuke Komura; Jun'ichi Kitakami; Nobuhiko Oshida; Argyris Papantonis; Akashi Izumi; Mika Kobayashi; Hiroko Meguro; Yasuharu Kanki; Imari Mimura; Kazuki Yamamoto; Chikage Mataki; Takao Hamakubo; Katsuhiko Shirahige; Hiroyuki Aburatani; Hiroshi Kimura; Tatsuhiko Kodama; Peter R Cook; Sigeo Ihara
Journal:  Proc Natl Acad Sci U S A       Date:  2009-10-13       Impact factor: 11.205

View more
  56 in total

1.  Repetitive elements regulate circular RNA biogenesis.

Authors:  Jeremy E Wilusz
Journal:  Mob Genet Elements       Date:  2015-05-21

2.  Design (and) principles of nuclear dynamics in Stockholm.

Authors:  Yaron Shav-Tal; Jan Lammerding
Journal:  Nucleus       Date:  2015       Impact factor: 4.197

Review 3.  Environmental influences on RNA processing: Biochemical, molecular and genetic regulators of cellular response.

Authors:  Athma A Pai; Francesca Luca
Journal:  Wiley Interdiscip Rev RNA       Date:  2018-09-14       Impact factor: 9.957

Review 4.  A 360° view of circular RNAs: From biogenesis to functions.

Authors:  Jeremy E Wilusz
Journal:  Wiley Interdiscip Rev RNA       Date:  2018-04-14       Impact factor: 9.957

Review 5.  A day in the life of the spliceosome.

Authors:  A Gregory Matera; Zefeng Wang
Journal:  Nat Rev Mol Cell Biol       Date:  2014-02       Impact factor: 94.444

Review 6.  Coupling of RNA Polymerase II Transcription Elongation with Pre-mRNA Splicing.

Authors:  Tassa Saldi; Michael A Cortazar; Ryan M Sheridan; David L Bentley
Journal:  J Mol Biol       Date:  2016-04-20       Impact factor: 5.469

Review 7.  Control of mammalian gene expression by selective mRNA export.

Authors:  Vihandha O Wickramasinghe; Ronald A Laskey
Journal:  Nat Rev Mol Cell Biol       Date:  2015-06-17       Impact factor: 94.444

8.  FUS functions in coupling transcription to splicing by mediating an interaction between RNAP II and U1 snRNP.

Authors:  Yong Yu; Robin Reed
Journal:  Proc Natl Acad Sci U S A       Date:  2015-06-29       Impact factor: 11.205

Review 9.  Nascent RNA and the Coordination of Splicing with Transcription.

Authors:  Karla M Neugebauer
Journal:  Cold Spring Harb Perspect Biol       Date:  2019-08-01       Impact factor: 10.005

10.  Influenza Virus NS1 Protein-RNA Interactome Reveals Intron Targeting.

Authors:  Liang Zhang; Juan Wang; Raquel Muñoz-Moreno; Min Kim; Ramanavelan Sakthivel; Wei Mo; Dandan Shao; Aparna Anantharaman; Adolfo García-Sastre; Nicholas K Conrad; Beatriz M A Fontoura
Journal:  J Virol       Date:  2018-11-27       Impact factor: 5.103

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.