Literature DB >> 32103605

Organismal benefits of transcription speed control at gene boundaries.

Xueyuan Leng1, Maxim Ivanov1, Peter Kindgren1, Indranil Malik2, Axel Thieffry3,4, Peter Brodersen4, Albin Sandelin3,4, Craig D Kaplan2,5, Sebastian Marquardt1.   

Abstract

RNA polymerase II (RNAPII) transcription is crucial for gene expression. RNAPII density peaks at gene boundaries, associating these key regions for gene expression control with limited RNAPII movement. The connections between RNAPII transcription speed and gene regulation in multicellular organisms are poorly understood. Here, we directly modulate RNAPII transcription speed by point mutations in the second largest subunit of RNAPII in Arabidopsis thaliana. A RNAPII mutation predicted to decelerate transcription is inviable, while accelerating RNAPII transcription confers phenotypes resembling auto-immunity. Nascent transcription profiling revealed that RNAPII complexes with accelerated transcription clear stalling sites at both gene ends, resulting in read-through transcription. The accelerated transcription mutant NRPB2-Y732F exhibits increased association with 5' splice site (5'SS) intermediates and enhanced splicing efficiency. Our findings highlight potential advantages of RNAPII stalling through local reduction in transcription speed to optimize gene expression for the development of multicellular organisms.
© 2020 The Authors. Published under the terms of the CC BY NC ND 4.0 license.

Entities:  

Keywords:  NET-seq; speed; splicing; stalling; transcription

Mesh:

Substances:

Year:  2020        PMID: 32103605      PMCID: PMC7132196          DOI: 10.15252/embr.201949315

Source DB:  PubMed          Journal:  EMBO Rep        ISSN: 1469-221X            Impact factor:   8.807


Introduction

A decisive step during gene expression is the conversion of the DNA sequences of a gene into pre‐mRNA by RNA polymerase II (RNAPII) transcription. Profiles of RNAPII transcription across genes in eukaryotes revealed two main RNAPII localization peaks at gene boundaries, near gene transcription start sites (TSSs) and poly‐adenylation sites (PASs) 1. At the 3′ end of genes, RNAPII peaks promote nascent RNA 3′ end processing and transcriptional termination in mammals 2, 3. The function of RNAPII peaks at promoter‐proximal regions near TSSs is actively debated. On the one hand, “pause‐release” of RNAPII can facilitate rapid induction of gene expression 4; on the other hand, imaging of Drosophila and human RNAPII at promoter‐proximal positions revealed rapid turnover, arguing against stable “pausing” of the same population of RNAPII complexes over time 5, 6. In metazoans, the negative elongation factor (NELF) complex promotes promoter‐proximal pausing of RNAPII by limiting RNAPII mobility 7. However, NELF is conspicuously absent in yeast and plants, which implies that many organisms use alternative mechanisms to stall RNAPII at promoter‐proximal region (i.e., RNAPII stalling) 8. In gene bodies, RNAPII accumulates at exon–intron boundaries and exhibits distinct accumulation profiles for exons with alternative splicing (AS) outcomes 9, 10. The efficiency of splicing may hence be coupled to the local speed of RNAPII elongation at exon–intron boundaries 11. In summary, peaks of accumulated RNAPII represent sites with reduced RNAPII forward movement, which may facilitate the integration of cellular signals to control gene expression post‐initiation by co‐transcriptional RNA processing 12. RNAPII forward movement depends on the dynamics of the trigger loop (TL), a central structure in the RNAPII active center 13, 14, 15. In addition, RNAPII backtracking induced by weak RNA–DNA hybrids (i.e., nucleotide misincorporation) limits RNAPII forward movement 16, 17, 18. A “gating tyrosine” in the RNAPII second largest subunit RPB2 (i.e., Y769 in budding yeast Rpb2) stacks with the first backtracked nucleotide and is proposed to prevent further backtracking 19 and is also positioned to interact with the TL when in its closed, catalysis‐promoting state. Point mutations in budding yeast Rpb1 TL residues and Rpb2 TL‐interacting residues alter the RNAPII elongation speed in vivo 20, 21, 22, 23, 24. Such “kinetic RNAPII mutants” have informed greatly on the effects of altered transcription speed on gene expression and transcription‐related phenotypes. For example, the budding yeast rpb2‐P1018S slow transcription mutant (i.e., rpb2‐10) promotes RNAPII arrest and reduces transcription processivity 25, 26. Moreover, kinetic RNAPII mutants displaying accelerated transcription favor the use of upstream TSSs, while mutants displaying slow transcription tend to use downstream TSSs 27. Variations of transcription speed alter profiles of co‐transcriptional chromatin signatures and of RNAPII C‐terminal domain (CTD) phosphorylation that impact pre‐mRNA processing 28, 29, 30. These observations indicate a profound effect of RNAPII transcription elongation speed on gene expression. The important question of whether growth and differentiation programs in a multicellular organism can be executed when RNAPII carries kinetic point mutations remains largely unclear. Here, we altered RNAPII transcription activity in Arabidopsis through point mutations in NRPB2, the second largest subunit of Arabidopsis RNAPII. A mutant accelerating RNAPII transcription triggered phenotypes consistent with auto‐immunity, but was able to execute key steps of pattern formation and organogenesis. A mutation predicted to decrease RNAPII transcription speed was inviable. Nascent RNAPII transcription profiling revealed that the mutant accelerating transcription resulted in reduced RNAPII stalling at both gene boundaries. Our findings highlight mechanistic connections between the intrinsic speed of RNAPII and RNAPII stalling at both gene boundaries that coordinate gene expression in the context of a multicellular organism.

Results

Altering transcription activity of RNAPII by targeted mutagenesis of NRPB2

To alter the in vivo RNAPII transcription activity in whole plants, we generated point mutations in Arabidopsis RNAPII. The target residues were identified in Rpb2, the second largest budding yeast RNAPII subunit. The Rpb2 proline 1018 to serine substitution (rpb2‐P1018S) represents the classic slow transcription mutant rpb2‐10, and the tyrosine 769 to phenylalanine substitution (rpb2‐Y769F) represents a mutation which might influence backtracking and TL function (Fig 1A) 19, 31, 32. Sequence alignments identified P979S and Y732F in the highly conserved regions of NRPB2, the second largest subunit of Arabidopsis RNAPII as the equivalent positions to budding yeast P1018S (rpb2‐10) and Y769F, respectively (Fig 1B). We generated these point mutations in constructs carrying the genomic NRPB2 sequence fused to a C‐terminal FLAG‐tag driven by the endogenous NRPB2 promoter and integrated them into the nrpb2‐2 null mutant background 33 (Fig EV1A). To investigate whether these point mutations affected NRPB2 protein accumulation, we performed Western blotting on FLAG‐tagged NRPB2P979S‐FLAG, NRPB2Y732F‐FLAG, and wild‐type NRPB2‐FLAG (NRPB2WT‐FLAG; Fig 1C). We identified several individual transformant lines with comparable steady‐state protein levels; thus, any differences we detected in the characterization of these lines would have to be attributed to the effects of the point mutations on RNAPII activity.
Figure 1

Altering transcription activity of RNAPII by targeted mutagenesis in NRPB2

Schematic drawing of Saccharomyces cerevisiae RNAPII transcription active center. Trigger loop is shown in blue. TL‐interacting Rpb2 domain is shown in beige. Proline 1018 (P1018, green) and gating tyrosine 769 (Y769, red) are highlighted. The schematic drawing is based on PDB: 2e2h 15.

Protein sequence alignment of RNAPII Rpb2 Y769 and P1018 regions in S. cerevisiae and Arabidopsis thaliana. P979S and Y732F are the yeast equivalent point mutations in Arabidopsis. The color scheme indicates conservation from variable (blue) to conserved (red).

Detection of NRPB2WT‐FLAG, NRPB2P979S‐FLAG, and NRPB2Y732F‐FLAG protein by Western blotting in NRPB2 ‐FLAG Col‐0, NRPB2 ‐FLAG Col‐0, and NRPB2Y ‐FLAG Col‐0 plants. Untagged NRPB2 (Col‐0) was used as a negative control. Histone H3 was used as an internal control, and total protein level detected by stain‐free blot was used as a loading control. Quantification was done by normalizing to the loading control and anti‐H3 blot based on three independent replicates.

Transmission rate of nrpb2‐2 allele in nrpb2‐2 line (n = 197) and nrpb2‐ lines combined with homozygous NRPB2 ‐FLAG (n = 280), NRPB2 ‐FLAG (n = 240), and NRPB2 ‐FLAG (n = 210), respectively. Fisher's exact test was used as a statistical test, three asterisks denote P < 0.001 between samples, and n.s. stands for not significant.

Image of homozygous mutant nrpb2‐2 fully complemented by NRPB2 ‐FLAG (top, NRPB2 nrpb2‐2 ) and partially complemented by NRPB2 ‐FLAG (bottom, NRPB2 nrpb2‐2 ). Plants were grown for 4 weeks in soil. Scale bars represent 1 cm.

Source data are available online for this figure.

Figure EV1

Generation and characterization of Arabidopsis NRPB2 point mutations (related to Fig 1)

Schematic overview of a work flow to generate NRPB2 nrpb2‐2 , NRPB2 nrpb2‐2 , and NRPB2 nrpb2‐2 Arabidopsis. First, constructs harboring NRPB2 (blue), NRPB2 (red), and NRPB2 (green) transgene expression cassette were transformed into wild‐type (Col‐0) Arabidopsis via Agrobacterium‐mediated transformation; T3 transformant plants with homozygous transgenes are crossed with nrpb2‐2 (gray) heterozygous Arabidopsis; then, plants positive for both transgenes and nrpb2‐2 allele were selected for propagation into F3 generation to screen for homozygous double mutants of transgene and nrpb2‐2.

Phenotype of Arabidopsis siliques of wild‐type (Col‐0), NRPB2 Col‐0, and NRPB2 nrpb2‐2 plants. Scale bars represent 10 mm.

Silique length of wild‐type (Col‐0), NRPB2 Col‐0, and NRPB2 nrpb2‐2 plants (n > 20 for each genotype). Two‐sided Student's t‐test was used for statistical test, *** denotes P < 0.001. The solid horizontal lines and box limits represent median, lower and upper quartiles of data values in each group. The upper and lower whiskers extend to the largest or smallest value, respectively, no further than 1.5*IQR from the relevant quartile.

Opened siliques from wild‐type (Col‐0), NRPB2 Col‐0, and NRPB2 nrpb2‐2 plants. Red arrows indicate aborted ovules.

Phenotype of alternative transformation events to lines presented in Fig 1E. Homozygous mutant nrpb2‐2 was fully complemented by NRPB2‐FLAG (top) and partially complemented by NRPB2 ‐FLAG (bottom). Plants were grown for 4 weeks in soil. Scale bars represent 1 cm.

Relative expression level of PR1, PR2, and PR5 in NRPB2 nrpb2‐2 and NRPB2 nrpb2‐2 by RT–qPCR. Error bars represent SEM from three independent replicates. **denotes P < 0.01 by two‐sided Student's t‐test.

Source data are available online for this figure.

Altering transcription activity of RNAPII by targeted mutagenesis in NRPB2

Schematic drawing of Saccharomyces cerevisiae RNAPII transcription active center. Trigger loop is shown in blue. TL‐interacting Rpb2 domain is shown in beige. Proline 1018 (P1018, green) and gating tyrosine 769 (Y769, red) are highlighted. The schematic drawing is based on PDB: 2e2h 15. Protein sequence alignment of RNAPII Rpb2 Y769 and P1018 regions in S. cerevisiae and Arabidopsis thaliana. P979S and Y732F are the yeast equivalent point mutations in Arabidopsis. The color scheme indicates conservation from variable (blue) to conserved (red). Detection of NRPB2WT‐FLAG, NRPB2P979S‐FLAG, and NRPB2Y732F‐FLAG protein by Western blotting in NRPB2 ‐FLAG Col‐0, NRPB2 ‐FLAG Col‐0, and NRPB2Y ‐FLAG Col‐0 plants. Untagged NRPB2 (Col‐0) was used as a negative control. Histone H3 was used as an internal control, and total protein level detected by stain‐free blot was used as a loading control. Quantification was done by normalizing to the loading control and anti‐H3 blot based on three independent replicates. Transmission rate of nrpb2‐2 allele in nrpb2‐2 line (n = 197) and nrpb2‐ lines combined with homozygous NRPB2 ‐FLAG (n = 280), NRPB2 ‐FLAG (n = 240), and NRPB2 ‐FLAG (n = 210), respectively. Fisher's exact test was used as a statistical test, three asterisks denote P < 0.001 between samples, and n.s. stands for not significant. Image of homozygous mutant nrpb2‐2 fully complemented by NRPB2 ‐FLAG (top, NRPB2 nrpb2‐2 ) and partially complemented by NRPB2 ‐FLAG (bottom, NRPB2 nrpb2‐2 ). Plants were grown for 4 weeks in soil. Scale bars represent 1 cm. Source data are available online for this figure.

Generation and characterization of Arabidopsis NRPB2 point mutations (related to Fig 1)

Schematic overview of a work flow to generate NRPB2 nrpb2‐2 , NRPB2 nrpb2‐2 , and NRPB2 nrpb2‐2 Arabidopsis. First, constructs harboring NRPB2 (blue), NRPB2 (red), and NRPB2 (green) transgene expression cassette were transformed into wild‐type (Col‐0) Arabidopsis via Agrobacterium‐mediated transformation; T3 transformant plants with homozygous transgenes are crossed with nrpb2‐2 (gray) heterozygous Arabidopsis; then, plants positive for both transgenes and nrpb2‐2 allele were selected for propagation into F3 generation to screen for homozygous double mutants of transgene and nrpb2‐2. Phenotype of Arabidopsis siliques of wild‐type (Col‐0), NRPB2 Col‐0, and NRPB2 nrpb2‐2 plants. Scale bars represent 10 mm. Silique length of wild‐type (Col‐0), NRPB2 Col‐0, and NRPB2 nrpb2‐2 plants (n > 20 for each genotype). Two‐sided Student's t‐test was used for statistical test, *** denotes P < 0.001. The solid horizontal lines and box limits represent median, lower and upper quartiles of data values in each group. The upper and lower whiskers extend to the largest or smallest value, respectively, no further than 1.5*IQR from the relevant quartile. Opened siliques from wild‐type (Col‐0), NRPB2 Col‐0, and NRPB2 nrpb2‐2 plants. Red arrows indicate aborted ovules. Phenotype of alternative transformation events to lines presented in Fig 1E. Homozygous mutant nrpb2‐2 was fully complemented by NRPB2‐FLAG (top) and partially complemented by NRPB2 ‐FLAG (bottom). Plants were grown for 4 weeks in soil. Scale bars represent 1 cm. Relative expression level of PR1, PR2, and PR5 in NRPB2 nrpb2‐2 and NRPB2 nrpb2‐2 by RT–qPCR. Error bars represent SEM from three independent replicates. **denotes P < 0.01 by two‐sided Student's t‐test. Source data are available online for this figure. The Arabidopsis nrpb2‐2 null allele is female gametophytic lethal, but can be transmitted through the male germline with reduced transmission rate 33. We could hence assay complementation of the gametophytic phenotypes to gain insights into the effects of RNAPII mutants. We assayed the transmission rate of the nrpb2‐2 null allele in the plants carrying homozygous NRPB2 , NRPB2 , or NRPB2 transgenes in nrpb2‐2 background (Fig EV1A). We would predict increased transmission rate of the nrpb2‐2 allele if the gametophytic defects could be complemented. As predicted, NRPB2 can fully (i.e., to the expected level of 50%) complement the transmission of nrpb2‐2 compared to non‐transformed controls (Fig 1D). Interestingly, NRPB2 could almost fully complement nrpb2‐2 transmission, while NRPB2 did not significantly increase transmission rate compared to non‐transformed controls (Fig 1D). These data suggest that NRPB2 fails to provide the RNAPII activity necessary for germline development. Indeed, silique dissection revealed that the germline defects in NRPB2 nrpb2‐2 were associated with reduced fertility and ovule abortion (Fig EV1B–D). Consistently, we identified plants homozygous for both NRPB2 transgene and nrpb2‐2 mutant (NRPB2 nrpb2‐2 ) while NRPB2 nrpb2‐2 genotype could not be recovered. Remarkably, when all RNAPII complexes carried the NRPB2 mutation (i.e., NRPB2 nrpb2‐2 ) we observed viable plant growth and development. These plants exhibited a dwarfed stature (Figs 1E and EV1E), but resembled Arabidopsis seedlings concerning basic patterning and organ formation. The dwarfed stature was reminiscent of mutants displaying auto‐immunity, which is often associated with increased expression of pathogen‐related (PR) genes 34. Indeed, we detected elevated expression of PR1, PR2, and PR5 in NRPB2 nrpb2‐2 compared to NRPB2 nrpb2‐2 (Fig EV1F). These data highlight important roles of the ability to control the speed of RNAPII transcription during plant growth and development. In summary, Arabidopsis RNAPII harboring the NRPB2 point mutation failed to provide viable RNAPII activity during gametogenesis. However, the NRPB2 mutation can partly rescue the germline defects in nrpb2‐2 null mutants and allow plant growth and basic aspects of development.

NRPB2 accelerates RNAPII transcription in vivo

To investigate the effect of NRPB2 on RNAPII transcription speed, we first tested whether the equivalent rpb2‐Y769F mutant in budding yeast classifies as a fast or slow RNAPII transcription mutant by assaying its sensitivity towards mycophenolic acid (MPA) and Mn2+ 35, 36. Budding yeast RNAPII mutants conferring enhanced catalytic activity (RNAPII fast mutants) are more sensitive towards Mn2+ than the RNAPII slow mutants 20. In budding yeast, RNAPII fast mutants are sensitive to MPA due to deficient expression of IMD2 gene, which counteracts the inhibition of GTP synthesis by MPA. RNAPII slow mutants tend to be resistant to MPA due to the constitutive IMD2 expression 21. rpb2‐Y769F exhibited strong growth defects towards MPA and Mn2+ while we observed no effect for rpb2‐P1018S (Fig EV2A). rpb2‐Y769F thus shows a growth phenotype consistent with fast RNAPII transcription mutants 20. Interestingly, the rpb2‐Y769F/P1018S double mutant exhibited mild sensitivity towards MPA compared to either single mutant (Fig EV2A), consistent with a complementary effect on transcription speed as seen across many RNAPII active site mutations in budding yeast 27. Primer extension analyses of alternative TSSs usage of the ADH1 gene represent an additional assay for RNAPII catalytic rate and therefore putative elongation speed 27, where catalytically hyperactive RNAPII mutants exhibit an upstream shift of TSS. In agreement with previously characterized fast RNAPII transcription mutants, rpb2‐Y769F shifts the ADH1 TSS upstream compared to wild type or other Y769 substitutions (Fig EV2B). We next tested the combinations of rpb2‐Y769F with TL residue mutants previously demonstrated to alter RNAPII transcription speed. rpb2‐Y769F was synthetically lethal with previously characterized fast RNAPII transcription mutants such as rpb1‐L1101S, rpb1‐E1103G, and rpb1‐G1097D (Fig EV2C), suggesting that these combinations synergistically accelerated RNAPII transcription and supporting the interaction between Y769 and TL residues. Conversely, rpb2‐Y769F suppressed the growth defect of previously characterized slow RNAPII transcription mutants such as rpb1‐F1086S, rpb1‐H1085Q, and rpb1‐H1085Y 27, 35 (Fig EV2C), suggesting compensatory effects on transcription speed when combining these “slow” mutations with rpb2‐Y769F. In conclusion, our results characterized budding yeast rpb2‐Y769F as a mutation conferring phenotypes consistent with hyperactive RNAPII mutants which increase RNAPII transcription speed.
Figure EV2

Molecular and phenotypic characterization of the rpb2‐Y769F mutation in budding yeast and Arabidopsis equivalent NRPB2Y732F (related to Fig 2)

Differential sensitivity of various budding yeast rpb2 mutants towards Mn2+ and MPA in SC‐Leu media.

Primer extension analyses for ADH1 transcription start site usage in rpb2 mutants in budding yeast.

Genetic interaction between rpb2‐Y769F and Rpb1 TL mutations. Growth was assayed at day 1 and day 5. Ability to grow on SC‐Leu + 5FOA indicates that rpb2‐Y769F counteracts Rpb1 TL mutations. Red box indicates the phenotype of rpb2‐Y769F crossed with Rpb1 TL mutations.

A work flow of immunoprecipitation (IP) of FLAG‐tagged NRPB2 protein by anti‐FLAG followed by nascent RNA isolation, RT–qPCR analyses, and plaNET‐seq (left). Western blotting (right) of NRPB2WT‐FLAG and NRPB2Y732F‐FLAG as IP input (input), after IP (unbound), and after elution by FLAG peptides (eluted). Upper panel shows representative anti‐FLAG blots. Lower panel shows total proteins as loading control for indicated fractions.

Nascent RNA profile of AT5G41740. Nascent RNA RT–qPCR assay measuring RNAPII signal at three positions (dark red bars: probes 1, 2, and 3) on gene upon flagellin 22 treatment in a 0‐, 2‐, 3‐, and 4‐min time course. Nascent RNA signal values were normalized to reference gene ACT2. Error bars represent SEM from 3 independent replicates. The statistical significance of differences between NRPB2 and NRPB2 at the same time point was assessed by two‐sided Student's t‐test. n.s. denotes not significant; * denotes P < 0.05; and ** denotes P < 0.01. Scale bar (black) represent 0.5 kb.

Source data are available online for this figure.

Molecular and phenotypic characterization of the rpb2‐Y769F mutation in budding yeast and Arabidopsis equivalent NRPB2Y732F (related to Fig 2)

Differential sensitivity of various budding yeast rpb2 mutants towards Mn2+ and MPA in SC‐Leu media. Primer extension analyses for ADH1 transcription start site usage in rpb2 mutants in budding yeast. Genetic interaction between rpb2‐Y769F and Rpb1 TL mutations. Growth was assayed at day 1 and day 5. Ability to grow on SC‐Leu + 5FOA indicates that rpb2‐Y769F counteracts Rpb1 TL mutations. Red box indicates the phenotype of rpb2‐Y769F crossed with Rpb1 TL mutations. A work flow of immunoprecipitation (IP) of FLAG‐tagged NRPB2 protein by anti‐FLAG followed by nascent RNA isolation, RT–qPCR analyses, and plaNET‐seq (left). Western blotting (right) of NRPB2WT‐FLAG and NRPB2Y732F‐FLAG as IP input (input), after IP (unbound), and after elution by FLAG peptides (eluted). Upper panel shows representative anti‐FLAG blots. Lower panel shows total proteins as loading control for indicated fractions. Nascent RNA profile of AT5G41740. Nascent RNA RT–qPCR assay measuring RNAPII signal at three positions (dark red bars: probes 1, 2, and 3) on gene upon flagellin 22 treatment in a 0‐, 2‐, 3‐, and 4‐min time course. Nascent RNA signal values were normalized to reference gene ACT2. Error bars represent SEM from 3 independent replicates. The statistical significance of differences between NRPB2 and NRPB2 at the same time point was assessed by two‐sided Student's t‐test. n.s. denotes not significant; * denotes P < 0.05; and ** denotes P < 0.01. Scale bar (black) represent 0.5 kb. Source data are available online for this figure. To investigate the in vivo RNAPII transcription speed of Arabidopsis RNAPII carrying the NRPB2 mutation, we developed an assay to monitor nascent RNAPII elongation after rapid transcription induction. To avoid time‐consuming sample handling and processing issues associated with RNAPII chromatin immunoprecipitation from plants (RNAPII‐ChIP) 25, 37, we analyzed nascent RNA attached to RNAPII to monitor RNAPII elongation 38. We identified three pathogen resistance‐related Toll/interleukin receptor (TIR)‐type NB‐LRR genes AT4G19520, AT5G41740, and AT5G41750 39, 40 that are rapidly induced by flagellin 22 treatment. To monitor the “waves” of RNAPII elongation on these three genes after transcriptional induction, we performed a time course experiment during flagellin 22 treatment and determined the RNAPII signal by analyzing nascent RNA attached to RNAPII 38. We chose NRPB2 ‐FLAG Col‐0 and NRPB2 ‐FLAG Col‐0 as material for this assay since we detected no differences in growth and immune response in this background. In brief, FLAG‐tagged NRPB2WT and NRPB2Y732F proteins were immunoprecipitated by anti‐FLAG antibody; RNAPII‐associated RNA was purified and used in RT–qPCR analyses of three locations spanning these genes (Fig 2A). When gene induction is well synchronized, fast transcription is expected to show higher nascent RNA level in the gene body and towards the 3′ end of candidate genes during flagellin 22 treatment. We found that the candidate genes were rapidly induced by flagellin 22 treatment, as we detected an increase in nascent RNA level at probe 1 of these genes from 0 to 4 min after treatment (Figs 2B and C, and EV2E). Furthermore, data for the probe capturing RNAPII transcription shortly after induction (i.e., probe 1) suggest that these genes were induced with similar kinetics and to similar levels in NRPB2 and NRPB2 . Interestingly, we found that NRPB2 showed higher nascent RNA level than NRPB2 at probe 2 and probe 3 located further into the gene, from 3 min of flagellin 22 treatment onwards (Figs 2B and C, and EV2E). These data suggest that although wild‐type RNAPII and mutant RNAPII were equally induced near the 5′ ends of genes, the NRPB2 RNAPII reaches the 3′ ends of genes earlier than NRPB2 , supporting faster RNAPII transcription in the NRPB2 mutants. In summary, we detect evidence that the Arabidopsis NRPB2 mutant exhibits accelerated RNAPII transcription in vivo.
Figure 2

accelerates RNAPII transcription in vivo

Schematic drawing of the experimental design to investigate RNAPII transcription speed in vivo. In brief, Arabidopsis seedlings of NRPB2 ‐FLAG Col‐0 and NRPB2 ‐FLAG Col‐0 were grown on MS media for 12 days and then were transferred to MS liquid media for 2 days. Flagellin peptides (flagellin 22) were added into media, and treated samples were collected in a 0‐min (no treatment), 2‐, 3‐ and 4‐min time course. The nascent RNA was isolated and used for reverse transcription and qPCR analyses to reveal RNAPII accumulation at different region in genes. See Materials and Methods for technical details.

Nascent RNA profile of AT5G41750. Nascent RNA RT–qPCR assay measuring RNAPII signal at three positions (dark red bars: probes 1, 2, and 3) on the gene upon flagellin 22 treatment in a 0‐, 2‐, 3‐, and 4‐min time course. Nascent RNA signal values were normalized to reference gene ACT2. Error bars represent SEM from three independent replicates. The statistical significance of differences between NRPB2 and NRPB2 at the same time point was assessed by a two‐sided Student's t‐test. n.s. denotes not significant; * denotes P < 0.05; and ** denotes P < 0.01. Scale bar (black) represent 0.5 kb.

Nascent RNA profile of AT4G19520. Nascent RNA RT–qPCR assay measuring RNAPII signal at three positions (dark red bars: probes 1, 2, and 3) on the gene upon flagellin 22 treatment in a 0‐, 2‐, 3‐, and 4‐min time course. Nascent RNA signal values were normalized to reference gene ACT2. Error bars represent SEM from three independent replicates. The statistical significance of differences between NRPB2 and NRPB2 at the same time point was assessed by a two‐sided Student's t‐test. n.s. denotes not significant; * denotes P < 0.05; and ** denotes P < 0.01. Scale bar (black) represents 0.5 kb.

Source data are available online for this figure.

accelerates RNAPII transcription in vivo

Schematic drawing of the experimental design to investigate RNAPII transcription speed in vivo. In brief, Arabidopsis seedlings of NRPB2 ‐FLAG Col‐0 and NRPB2 ‐FLAG Col‐0 were grown on MS media for 12 days and then were transferred to MS liquid media for 2 days. Flagellin peptides (flagellin 22) were added into media, and treated samples were collected in a 0‐min (no treatment), 2‐, 3‐ and 4‐min time course. The nascent RNA was isolated and used for reverse transcription and qPCR analyses to reveal RNAPII accumulation at different region in genes. See Materials and Methods for technical details. Nascent RNA profile of AT5G41750. Nascent RNA RT–qPCR assay measuring RNAPII signal at three positions (dark red bars: probes 1, 2, and 3) on the gene upon flagellin 22 treatment in a 0‐, 2‐, 3‐, and 4‐min time course. Nascent RNA signal values were normalized to reference gene ACT2. Error bars represent SEM from three independent replicates. The statistical significance of differences between NRPB2 and NRPB2 at the same time point was assessed by a two‐sided Student's t‐test. n.s. denotes not significant; * denotes P < 0.05; and ** denotes P < 0.01. Scale bar (black) represent 0.5 kb. Nascent RNA profile of AT4G19520. Nascent RNA RT–qPCR assay measuring RNAPII signal at three positions (dark red bars: probes 1, 2, and 3) on the gene upon flagellin 22 treatment in a 0‐, 2‐, 3‐, and 4‐min time course. Nascent RNA signal values were normalized to reference gene ACT2. Error bars represent SEM from three independent replicates. The statistical significance of differences between NRPB2 and NRPB2 at the same time point was assessed by a two‐sided Student's t‐test. n.s. denotes not significant; * denotes P < 0.05; and ** denotes P < 0.01. Scale bar (black) represents 0.5 kb. Source data are available online for this figure.

Accelerated RNAPII transcription reduces promoter‐proximal RNAPII stalling

To study the genome‐wide effects of accelerated RNAPII transcription speed in NRPB2 , we performed plant Native Elongating Transcript sequencing (plaNET‐seq) to monitor nascent RNAPII transcription 8. Two independent replicates of plaNET‐seq were performed for NRPB2 nrpb2‐2 mutant and NRPB2 nrpb2‐2 control (Fig EV3A and B). Nascent RNA profiling in Arabidopsis revealed RNAPII stalling peaks near the beginning of transcription units in promoter‐proximal regions. The positioning of the first nucleosome correlates well with the position of promoter‐proximal RNAPII stalling in Arabidopsis 8. To address the role of transcription speed in regulating promoter‐proximal stalling, we investigated the RNAPII signal in promoter‐proximal regions from plaNET‐seq in NRPB2 nrpb2‐2 and NRPB2 nrpb2‐2 . Visual inspection suggested that NRPB2 reduced peaks of RNAPII near the 5′ ends of genes when compared to NRPB2 (Fig 3A). A metagene plot showing plaNET‐seq RNAPII signal in a 1 kb region centered at the +1 nucleosomes 41 revealed that NRPB2 reduced promoter‐proximal RNAPII stalling centered at the +1 nucleosome position genome‐wide compared to NRPB2 (Fig 3B). The metagene‐level reduction in RNAPII stalling in NRPB2 was confirmed when the plaNET‐seq signal was anchored at TSSs (Fig EV3C). To further quantify this effect, we calculated the RNAPII stalling index for well‐expressed genes (plaNET‐seq signal FPKM > 10, n = 6,596), which represents relative enrichment of RNAPII signal at promoter‐proximal regions compared to the whole gene body. This analysis quantified a 35% reduction in the median value of RNAPII promoter‐proximal stalling index in NRPB2 compared to NRPB2 (Fig 3C). These data illustrate that a restriction of RNAPII transcription speed contributes strongly to the formation of characteristic promoter‐proximal RNAPII peaks.
Figure EV3

Genome‐wide effects of on nascent RNAPII transcription by plaNET‐seq compared to (related to Fig 3)

Scatterplot showing the biological reproducibility of plaNET‐seq experiment in NRPB2 nrpb2‐2 . TPM‐normalized plaNET‐seq signal was summarized within 10‐bp bins genome‐wide. Pearson R = 0.987.

Scatterplot showing the biological reproducibility of plaNET‐seq experiment in NRPB2 nrpb2‐2 . TPM‐normalized plaNET‐seq signal was summarized within 10‐bp bins genome‐wide. Pearson R = 0.987.

Metagene profile of plaNET‐seq mean signal of RNAPII in a 1 kb window centered at the TSS of Arabidopsis genes (n = 24,862) in NRPB2 (blue) and NRPB2 (red).

plaNET‐seq signal of RNAPII across the whole AT2G19830 gene in NRPB2 (blue) and NRPB2 (red). Arrows indicate the elevated nascent RNAPII signal in the gene body.

Metagene profile of plaNET‐seq mean signal of RNAPII in exons (50–100 bp, scaled to 100 bins, n = 31,202) in NRPB2 nrpb2‐2 (NRPB2 , blue) and NRPB2 nrpb2‐2 (NRPB2 , red).

Metagene profile of plaNET‐seq mean signal of RNAPII in exons (100–200 bp, scaled to 100 bins, n = 33,600) in NRPB2 nrpb2‐2 (NRPB2 , blue) and NRPB2 nrpb2‐2 (NRPB2 , red).

Metagene profile of plaNET‐seq mean signal of RNAPII in exons (200–300 bp, scaled to 100 bins, n = 9,795) in NRPB2 nrpb2‐2 (NRPB2 , blue) and NRPB2 nrpb2‐2 (NRPB2 , red).

Metagene profile of plaNET‐seq mean signal of RNAPII in introns (50–100 bp, scaled to 100 bins, n = 58,050) in NRPB2 nrpb2‐2 (NRPB2 , blue) and NRPB2 nrpb2‐2 (NRPB2 , red).

Metagene profile of plaNET‐seq mean signal of RNAPII in introns (100–200 bp, scaled to 100 bins, n = 34,213) in NRPB2 nrpb2‐2 (NRPB2 , blue) and NRPB2 nrpb2‐2 (NRPB2 , red).

Metagene profile of plaNET‐seq mean signal of RNAPII in introns (200–300 bp, scaled to 100 bins, n = 128) in NRPB2 nrpb2‐2 (NRPB2 , blue) and NRPB2 nrpb2‐2 (NRPB2 , red).

Metagene profile of plaNET‐seq mean signal of RNAPII in constitutive exons (n = 75,136) in NRPB2 nrpb2‐2 (NRPB2 , blue) and NRPB2 nrpb2‐2 (NRPB2 , red).

Metagene profile of plaNET‐seq mean signal of RNAPII in alternative exons (n = 724) in NRPB2 nrpb2‐2 (NRPB2 , blue) and NRPB2 nrpb2‐2 (NRPB2 , red).

Metagene profile of plaNET‐seq mean signal of RNAPII in constitutive exons (n = 97,358) in NRPB2 nrpb2‐2 (NRPB2 , blue) and NRPB2 nrpb2‐2 (NRPB2 , red).

Metagene profile of plaNET‐seq mean signal of RNAPII in alternative exons (n = 5,306) in NRPB2 nrpb2‐2 (NRPB2 , blue) and NRPB2 nrpb2‐2 (NRPB2 , red).

Figure 3

Accelerated RNAPII transcription reduces promoter‐proximal RNAPII stalling and enhances RNAPII activity in gene body

plaNET‐seq signal of RNAPII in the promoter‐proximal region of AT1G70600 in NRPB2 nrpb2‐2 (NRPB2 , blue) and NRPB2 nrpb2‐2 (NRPB2 , red). Arrows indicate the RNAPII signal at the region of promoter‐proximal stalling.

Metagene profile of plaNET‐seq mean signal of RNAPII in a 1 kb window centered at the +1 nucleosome in Arabidopsis genes (n = 25,474) in NRPB2 nrpb2‐2 (NRPB2 , blue) and NRPB2 nrpb2‐2 (NRPB2 , red). The significance of differences of plaNET‐seq signal in the region from −25 bins to +25 bins around +1 nucleosome between NRPB2 and NRPB2 was assessed by a two‐sided Mann–Whitney U‐test, P = 5.20e‐10.

RNAPII stalling index calculated for all the genes with plaNET‐seq FPKM ≥ 10 in NRPB2 nrpb2‐2 (NRPB2 , blue) and NRPB2 nrpb2‐2 (NRPB2 , red; n = 6,596). Medians of the stalling index are 1.891 and 1.222 for NRPB2 and NRPB2 , respectively. *** denotes P‐value < 0.001 by Wilcoxon signed‐rank test. The solid horizontal lines and box limits represent median, lower and upper quartiles of data values in each group. The upper and lower whiskers extend to the largest or smallest value, respectively, no further than 1.5*IQR from the relevant quartile.

Metagene profile of plaNET‐seq mean signal over whole genes (length from 0.5 kb to 5 kb, scaled to 500 bins, n = 27,042) in NRPB2 nrpb2‐2 (NRPB2 , blue) and NRPB2 nrpb2‐2 (NRPB2 , red).

Metagene profile of plaNET‐seq mean signal of RNAPII in exons (length from 50 to 300 bp, scaled to 100 bins, n = 73,925) in NRPB2 nrpb2‐2 (NRPB2 , blue) and NRPB2 nrpb2‐2 (NRPB2 , red). Pink dashed line rectangle illustrates the amplitude of differences between the minimum and the maximum of RNAPII signal across the exons. A two‐sided Mann–Whitney U‐test was used to assess the plaNET‐seq signal of NRPB2 (blue) and NRPB2 (red) in exons, P < 1e‐16.

Metagene profile of plaNET‐seq mean signal of RNAPII in introns (length from 50–300 bp, scaled to 100 bins, n = 102,260) in NRPB2 nrpb2‐2 (NRPB2 , blue) and NRPB2 nrpb2‐2 (NRPB2 , red). A two‐sided Mann–Whitney U‐test was used to assess the plaNET‐seq signal of NRPB2 (blue) and NRPB2 (red) in introns, P < 1e‐16.

Genome‐wide effects of on nascent RNAPII transcription by plaNET‐seq compared to (related to Fig 3)

Scatterplot showing the biological reproducibility of plaNET‐seq experiment in NRPB2 nrpb2‐2 . TPM‐normalized plaNET‐seq signal was summarized within 10‐bp bins genome‐wide. Pearson R = 0.987. Scatterplot showing the biological reproducibility of plaNET‐seq experiment in NRPB2 nrpb2‐2 . TPM‐normalized plaNET‐seq signal was summarized within 10‐bp bins genome‐wide. Pearson R = 0.987. Metagene profile of plaNET‐seq mean signal of RNAPII in a 1 kb window centered at the TSS of Arabidopsis genes (n = 24,862) in NRPB2 (blue) and NRPB2 (red). plaNET‐seq signal of RNAPII across the whole AT2G19830 gene in NRPB2 (blue) and NRPB2 (red). Arrows indicate the elevated nascent RNAPII signal in the gene body. Metagene profile of plaNET‐seq mean signal of RNAPII in exons (50–100 bp, scaled to 100 bins, n = 31,202) in NRPB2 nrpb2‐2 (NRPB2 , blue) and NRPB2 nrpb2‐2 (NRPB2 , red). Metagene profile of plaNET‐seq mean signal of RNAPII in exons (100–200 bp, scaled to 100 bins, n = 33,600) in NRPB2 nrpb2‐2 (NRPB2 , blue) and NRPB2 nrpb2‐2 (NRPB2 , red). Metagene profile of plaNET‐seq mean signal of RNAPII in exons (200–300 bp, scaled to 100 bins, n = 9,795) in NRPB2 nrpb2‐2 (NRPB2 , blue) and NRPB2 nrpb2‐2 (NRPB2 , red). Metagene profile of plaNET‐seq mean signal of RNAPII in introns (50–100 bp, scaled to 100 bins, n = 58,050) in NRPB2 nrpb2‐2 (NRPB2 , blue) and NRPB2 nrpb2‐2 (NRPB2 , red). Metagene profile of plaNET‐seq mean signal of RNAPII in introns (100–200 bp, scaled to 100 bins, n = 34,213) in NRPB2 nrpb2‐2 (NRPB2 , blue) and NRPB2 nrpb2‐2 (NRPB2 , red). Metagene profile of plaNET‐seq mean signal of RNAPII in introns (200–300 bp, scaled to 100 bins, n = 128) in NRPB2 nrpb2‐2 (NRPB2 , blue) and NRPB2 nrpb2‐2 (NRPB2 , red). Metagene profile of plaNET‐seq mean signal of RNAPII in constitutive exons (n = 75,136) in NRPB2 nrpb2‐2 (NRPB2 , blue) and NRPB2 nrpb2‐2 (NRPB2 , red). Metagene profile of plaNET‐seq mean signal of RNAPII in alternative exons (n = 724) in NRPB2 nrpb2‐2 (NRPB2 , blue) and NRPB2 nrpb2‐2 (NRPB2 , red). Metagene profile of plaNET‐seq mean signal of RNAPII in constitutive exons (n = 97,358) in NRPB2 nrpb2‐2 (NRPB2 , blue) and NRPB2 nrpb2‐2 (NRPB2 , red). Metagene profile of plaNET‐seq mean signal of RNAPII in alternative exons (n = 5,306) in NRPB2 nrpb2‐2 (NRPB2 , blue) and NRPB2 nrpb2‐2 (NRPB2 , red).

Accelerated RNAPII transcription reduces promoter‐proximal RNAPII stalling and enhances RNAPII activity in gene body

plaNET‐seq signal of RNAPII in the promoter‐proximal region of AT1G70600 in NRPB2 nrpb2‐2 (NRPB2 , blue) and NRPB2 nrpb2‐2 (NRPB2 , red). Arrows indicate the RNAPII signal at the region of promoter‐proximal stalling. Metagene profile of plaNET‐seq mean signal of RNAPII in a 1 kb window centered at the +1 nucleosome in Arabidopsis genes (n = 25,474) in NRPB2 nrpb2‐2 (NRPB2 , blue) and NRPB2 nrpb2‐2 (NRPB2 , red). The significance of differences of plaNET‐seq signal in the region from −25 bins to +25 bins around +1 nucleosome between NRPB2 and NRPB2 was assessed by a two‐sided Mann–Whitney U‐test, P = 5.20e‐10. RNAPII stalling index calculated for all the genes with plaNET‐seq FPKM ≥ 10 in NRPB2 nrpb2‐2 (NRPB2 , blue) and NRPB2 nrpb2‐2 (NRPB2 , red; n = 6,596). Medians of the stalling index are 1.891 and 1.222 for NRPB2 and NRPB2 , respectively. *** denotes P‐value < 0.001 by Wilcoxon signed‐rank test. The solid horizontal lines and box limits represent median, lower and upper quartiles of data values in each group. The upper and lower whiskers extend to the largest or smallest value, respectively, no further than 1.5*IQR from the relevant quartile. Metagene profile of plaNET‐seq mean signal over whole genes (length from 0.5 kb to 5 kb, scaled to 500 bins, n = 27,042) in NRPB2 nrpb2‐2 (NRPB2 , blue) and NRPB2 nrpb2‐2 (NRPB2 , red). Metagene profile of plaNET‐seq mean signal of RNAPII in exons (length from 50 to 300 bp, scaled to 100 bins, n = 73,925) in NRPB2 nrpb2‐2 (NRPB2 , blue) and NRPB2 nrpb2‐2 (NRPB2 , red). Pink dashed line rectangle illustrates the amplitude of differences between the minimum and the maximum of RNAPII signal across the exons. A two‐sided Mann–Whitney U‐test was used to assess the plaNET‐seq signal of NRPB2 (blue) and NRPB2 (red) in exons, P < 1e‐16. Metagene profile of plaNET‐seq mean signal of RNAPII in introns (length from 50–300 bp, scaled to 100 bins, n = 102,260) in NRPB2 nrpb2‐2 (NRPB2 , blue) and NRPB2 nrpb2‐2 (NRPB2 , red). A two‐sided Mann–Whitney U‐test was used to assess the plaNET‐seq signal of NRPB2 (blue) and NRPB2 (red) in introns, P < 1e‐16.

Accelerated transcription increases nascent RNAPII signal in gene bodies

We observed increased RNAPII signals in NRPB2 compared to NRPB2 at intragenic positions downstream of promoter‐proximal stalling sites (Fig EV3D). A metagene analysis of RNAPII activity across gene bodies confirmed this observation on a genome‐wide scale (Fig 3D). Increased RNAPII signal in gene bodies could be reconciled by less RNAPII at promoter‐proximal stalling regions in NRPB2 compared to NRPB2 . Consistently, increased nascent transcription in gene bodies in NRPB2 correlated with increased plaNET‐seq metagene profiles of exons and introns (Fig 3E and F). Interestingly, we detected an accumulation of exonic plaNET‐seq signal towards the 3′ end of exons in NRPB2 (Fig 3E). This effect was insensitive to the exon length (Fig EV3E–G). Exon–intron boundaries may thus trigger a pile‐up of nascent RNAPII transcription when transcription is accelerated. In introns, accelerated RNAPII transcription amplifies nascent RNAPII signal compared to NRPB2 and resulted in a uniform accumulation profile, which can be observed in metagene plots for introns of variable length genome‐wide (Figs 3F, and EV3H–J). We next tested possible connections between increased intragenic nascent RNAPII signal and splicing regulation. However, the fast mutant showed increased signal over both constitutive and alternative exons and introns (Fig EV3K–N). In conclusion, accelerated RNAPII transcription in NRPB2 resulted in increased nascent RNAPII transcription in gene bodies.

Accelerated transcription enhances intron splicing and exon skipping

plaNET‐seq co‐purifies splicing intermediates due to co‐transcriptional spliceosome association with RNAPII (Fig 4A). The splicing intermediates appear as single‐nucleotide sharp peaks at 5′ splicing site (5′SS) and 3′ splicing site (3′SS) and thus can be distinguished from the nascent reads 10, 42. We detected an increased fraction of splicing intermediate reads corresponding to 5′SS in plaNET‐seq of NRPB2 compared to NRPB2 , while no obvious difference could be detected for 3′ splicing intermediates (Fig 4B). These data suggested an increased association of accelerated RNAPII transcription with splicing intermediates overlapping a 5′SS. Since 5′ splicing intermediates are associated with the spliceosome, we predicted that higher RNAPII coverage in gene bodies could increase spliceosome association and perhaps enhance splicing in NRPB2 . To test this idea, two independent replicates of RNA‐seq were performed for NRPB2 nrpb2‐2 and NRPB2 nrpb2‐2 seedlings (Fig EV4A). RNA‐seq detects predominantly spliced transcripts with a characteristic signal intensity profile matching annotated exons. However, we noticed RNA‐seq signal corresponding to some introns, presumably representing regulatory or poorly spliced introns (i.e., retained introns). Interestingly, initial visual inspection of several retained introns indicated that accelerated RNAPII transcription in NRPB2 appeared to decrease intronic RNA‐seq signal (Fig 4C). Strikingly, this finding is supported by a genome‐wide decrease in the fraction of intronic RNA‐seq signal across all genes (Fig 4D), suggesting a genome‐wide trend of increased splicing efficiency in plants when RNAPII transcription is accelerated. A systematic genome‐wide analysis identified 1,517 differentially expressed (DE) introns from the RNA‐seq data of NRPB2 compared to NRPB2 . The majority (1,334 out of 1,517) of DE introns exhibit decreased fraction of intronic reads (Table EV2). We identified similar numbers of DE exons with increased or decreased expression in NRPB2 compared to NRPB2 , while we detected many more introns with decreased expression (Fig 4E). Quantification of DE exons revealed a small yet significant reduction in expression (Fig 4F) that we visualized for internal exons of the AT1G58060 and AT3G05680 genes (Fig 4G). In contrast, we detected a stronger decrease for DE introns in NRPB2 mutant compared to NRPB2 (Fig 4H). We next tested alternative 5′SS and 3′SS usage (Fig EV4B) in the NRPB2 mutant compared to NRPB2 and found a trend to shift 5′SS upstream and 3′SS downstream (Fig EV4C–E). We note that a downstream shift of 3′SS is consistent with effects observed in the splicing factor mutant ntr1 linked to increased transcription speed in Arabidopsis 16. In summary, our RNA‐seq data revealed multiple effects of accelerated RNAPII transcription on splicing in Arabidopsis. Our analyses highlighted reduced intron retention as the most notable effect of altered RNAPII activity in Arabidopsis on splicing. The data support the idea that inefficient splicing of these introns in wild type may be interpreted through a model where RNAPII transcription speed could be limiting their splicing.
Figure 4

Analysis of alternative splicing in and mutant

A schematic illustration of the co‐transcriptional RNAPII–spliceosome complex. plaNET‐seq mainly detects splicing intermediates corresponding to 5′ splicing site (5′SS) co‐purified with engaged RNAPII complex.

Bar charts showing the fractions of 5′ and 3′ splicing intermediate reads from plaNET‐seq in NRPB2 nrpb2‐2 (NRPB2 , blue) and NRPB2 nrpb2‐2 (NRPB2 , red).

Genome browser snapshots illustrating enhanced intron splicing in NRPB2 nrpb2‐2 (NRPB2 , red) compared to NRPB2 nrpb2‐2 (NRPB2 , blue). Scale bars denote 0.5 kb.

The fraction of RNA‐seq intronic reads calculated for all genes (n = 24,912) in NRPB2 nrpb2‐2 and NRPB2 nrpb2‐2 . Two‐sided Mann–Whitney U‐test: **** denotes P‐value < 2.2e‐16. The solid horizontal lines and box limits represent median, lower and upper quartiles of data values in each group. The upper and lower whiskers extend to the largest or smallest value, respectively, no further than 1.5*IQR from the relevant quartile.

Differentially expressed (DE) exons and introns in NRPB2 nrpb2‐2 compared to NRPB2 nrpb2‐2 based on RNA‐seq results. Numbers of DE exons and introns were shown in plot.

Quantification (log2 fold change of FPKM from RNA‐seq) of differentially expressed (DE) exons and non‐DE exons in NRPB2 nrpb2‐2 compared to NRPB2 nrpb2‐2 . ** denotes P‐value < 0.01 by Wilcoxon signed‐rank test. The solid horizontal lines and box limits represent median, lower and upper quartiles of data values in each group. The upper and lower whiskers extend to the largest or smallest value, respectively, no further than 1.5*IQR from the relevant quartile.

Genome browser snapshots illustrating enhanced exon skipping in NRPB2 nrpb2‐2 (NRPB2 , red) compared to NRPB2 nrpb2‐2 (NRPB2 , blue). Scale bars denote 0.5 kb.

Quantification (log2 fold change of FPKM from RNA‐seq) of differentially expressed (DE) introns and non‐DE exons in NRPB2 nrpb2‐2 compared to NRPB2 nrpb2‐2 . **** denotes P‐value < 0.0001 by Wilcoxon signed‐rank test. The solid horizontal lines and box limits represent median, lower and upper quartiles of data values in each group. The upper and lower whiskers extend to the largest or smallest value, respectively, no further than 1.5*IQR from the relevant quartile.

Source data are available online for this figure.

Figure EV4

Genome‐wide effects of on gene expression by RNA‐seq compared to (related to Fig 4)

Reproducibility of RNA‐seq data demonstrated by clustered heatmap of Euclidean distances between two independent replicates of RNA‐seq in both NRPB2 nrpb2‐2 and NRPB2 nrpb2‐2 . Darker blue stands for higher reproducibility, and lighter blue represents low reproducibility.

Illustration of constitutive splicing site (SS), alternative 5′ splicing site (SS), and alternative 3′ splicing site (SS).

Differentially regulated alternative 5′SS and 3′SS in NRPB2 nrpb2‐2 compared to NRPB2 nrpb2‐2 based on RNA‐seq results. Numbers of up‐ and down‐regulated SS were shown on the plot.

Quantification (the changes of present splicing inclusion, dPSI) of differentially regulated alternative 3′SS exons and non‐DE exons in NRPB2 nrpb2‐2 compared to NRPB2 nrpb2‐2 . dPSI > 0 and dPSI < 0 suggest upstream and downstream shift of alternative 5′SS, respectively. **** denotes P‐value < 0.0001 by Wilcoxon signed‐rank test. The solid horizontal lines and box limits represent median, lower and upper quartiles of data values in each group. The upper and lower whiskers extend to the largest or smallest value, respectively, no further than 1.5*IQR from the relevant quartile.

Quantification (the changes of present splicing inclusion, dPSI) of differentially regulated alternative 5′SS exons and non‐DE exons in NRPB2 nrpb2‐2 compared to NRPB2 nrpb2‐2 . dPSI > 0 and dPSI < 0 suggest downstream and upstream shift of alternative 5′SS, respectively. ** denotes P‐value < 0.01 by Wilcoxon signed‐rank test. The solid horizontal lines and box limits represent median, lower and upper quartiles of data values in each group. The upper and lower whiskers extend to the largest or smallest value, respectively, no further than 1.5*IQR from the relevant quartile.

Analysis of alternative splicing in and mutant

A schematic illustration of the co‐transcriptional RNAPII–spliceosome complex. plaNET‐seq mainly detects splicing intermediates corresponding to 5′ splicing site (5′SS) co‐purified with engaged RNAPII complex. Bar charts showing the fractions of 5′ and 3′ splicing intermediate reads from plaNET‐seq in NRPB2 nrpb2‐2 (NRPB2 , blue) and NRPB2 nrpb2‐2 (NRPB2 , red). Genome browser snapshots illustrating enhanced intron splicing in NRPB2 nrpb2‐2 (NRPB2 , red) compared to NRPB2 nrpb2‐2 (NRPB2 , blue). Scale bars denote 0.5 kb. The fraction of RNA‐seq intronic reads calculated for all genes (n = 24,912) in NRPB2 nrpb2‐2 and NRPB2 nrpb2‐2 . Two‐sided Mann–Whitney U‐test: **** denotes P‐value < 2.2e‐16. The solid horizontal lines and box limits represent median, lower and upper quartiles of data values in each group. The upper and lower whiskers extend to the largest or smallest value, respectively, no further than 1.5*IQR from the relevant quartile. Differentially expressed (DE) exons and introns in NRPB2 nrpb2‐2 compared to NRPB2 nrpb2‐2 based on RNA‐seq results. Numbers of DE exons and introns were shown in plot. Quantification (log2 fold change of FPKM from RNA‐seq) of differentially expressed (DE) exons and non‐DE exons in NRPB2 nrpb2‐2 compared to NRPB2 nrpb2‐2 . ** denotes P‐value < 0.01 by Wilcoxon signed‐rank test. The solid horizontal lines and box limits represent median, lower and upper quartiles of data values in each group. The upper and lower whiskers extend to the largest or smallest value, respectively, no further than 1.5*IQR from the relevant quartile. Genome browser snapshots illustrating enhanced exon skipping in NRPB2 nrpb2‐2 (NRPB2 , red) compared to NRPB2 nrpb2‐2 (NRPB2 , blue). Scale bars denote 0.5 kb. Quantification (log2 fold change of FPKM from RNA‐seq) of differentially expressed (DE) introns and non‐DE exons in NRPB2 nrpb2‐2 compared to NRPB2 nrpb2‐2 . **** denotes P‐value < 0.0001 by Wilcoxon signed‐rank test. The solid horizontal lines and box limits represent median, lower and upper quartiles of data values in each group. The upper and lower whiskers extend to the largest or smallest value, respectively, no further than 1.5*IQR from the relevant quartile. Source data are available online for this figure.

Genome‐wide effects of on gene expression by RNA‐seq compared to (related to Fig 4)

Reproducibility of RNA‐seq data demonstrated by clustered heatmap of Euclidean distances between two independent replicates of RNA‐seq in both NRPB2 nrpb2‐2 and NRPB2 nrpb2‐2 . Darker blue stands for higher reproducibility, and lighter blue represents low reproducibility. Illustration of constitutive splicing site (SS), alternative 5′ splicing site (SS), and alternative 3′ splicing site (SS). Differentially regulated alternative 5′SS and 3′SS in NRPB2 nrpb2‐2 compared to NRPB2 nrpb2‐2 based on RNA‐seq results. Numbers of up‐ and down‐regulated SS were shown on the plot. Quantification (the changes of present splicing inclusion, dPSI) of differentially regulated alternative 3′SS exons and non‐DE exons in NRPB2 nrpb2‐2 compared to NRPB2 nrpb2‐2 . dPSI > 0 and dPSI < 0 suggest upstream and downstream shift of alternative 5′SS, respectively. **** denotes P‐value < 0.0001 by Wilcoxon signed‐rank test. The solid horizontal lines and box limits represent median, lower and upper quartiles of data values in each group. The upper and lower whiskers extend to the largest or smallest value, respectively, no further than 1.5*IQR from the relevant quartile. Quantification (the changes of present splicing inclusion, dPSI) of differentially regulated alternative 5′SS exons and non‐DE exons in NRPB2 nrpb2‐2 compared to NRPB2 nrpb2‐2 . dPSI > 0 and dPSI < 0 suggest downstream and upstream shift of alternative 5′SS, respectively. ** denotes P‐value < 0.01 by Wilcoxon signed‐rank test. The solid horizontal lines and box limits represent median, lower and upper quartiles of data values in each group. The upper and lower whiskers extend to the largest or smallest value, respectively, no further than 1.5*IQR from the relevant quartile.

Accelerated RNAPII transcription reduces RNAPII stalling at gene ends

plaNET‐seq resolves peaks of RNAPII activity at 3′ ends of plant genes. This localized reduction in transcription speed at gene ends may assist RNAPII transcriptional termination. To test this hypothesis, we investigated the RNAPII stalling peaks at 3′ ends of genes by plaNET‐seq in the fast transcription mutant NRPB2 compared to NRPB2 . We detected RNAPII stalling downstream of poly (A) sites (PASs) of Arabidopsis genes in NRPB2 (Fig 5A). In contrast, PAS‐stalling peaks of RNAPII in this region were often undetectable in NRPB2 , as shown for the AT2G21410 gene (Fig 5A). A metagene analysis confirmed RNAPII peaks downstream of PAS at 3′ gene ends in NRPB2 and a strong reduction in NRPB2 genome‐wide (Fig 5B). These data connect increased RNAPII transcription speed and reduced RNAPII stalling at gene ends downstream of PAS. If RNAPII stalling were promoting transcription termination, we would expect termination defects in NRPB2 . Indeed, genome browser screenshots indicated higher RNAPII signal downstream of the PAS‐stalling region in NRPB2 in comparison with NRPB2 (Fig 5A), suggesting transcriptional read‐through as a consequence of increased transcription speed. To quantify this effect genome‐wide, we determined the transcriptional read‐through lengths in NRPB2 and NRPB2 . We used a statistical model which was based on empirical distributions of plaNET‐seq tag counts in both genic and intergenic regions (see Materials and Methods). Strikingly, we observed that NRPB2 extended transcriptional read‐through genome‐wide (FPKM > 5, n = 9,316; Fig 5C). We detected a 115 nt increase of median transcriptional read‐through length in NRPB2 compared to NRPB2 (NRPB2 , 649 nt; NRPB2 , 534 nt; Fig 5D). NRPB2 accelerating transcription speed thus reduces RNAPII termination efficiency and extends transcriptional read‐through. The process of RNAPII termination is sensitive to the RNAPII active site and putative catalysis, consistent with a model where increased RNAPII speed alters kinetic competition between transcriptional stalling, termination, and further elongation.
Figure 5

Accelerated transcription reduces RNAPII stalling at 3′ gene ends and enhances transcriptional read‐through downstream of PAS

plaNET‐seq signal of RNAPII at 3′ end of AT2G21410 in NRPB2 nrpb2‐2 (NRPB2 , blue) and NRPB2 nrpb2‐2 (NRPB2 , red). Arrows indicate the RNAPII signal peaks at PAS‐stalling region.

Metagene profile of plaNET‐seq mean signal of RNAPII in a 1 kb window centered at PAS (n = 24,448) in NRPB2 nrpb2‐2 (NRPB2 , blue) and NRPB2 nrpb2‐2 (NRPB2 , red). The significance of differences of plaNET‐seq signal in the region from PAS to +100 bins between NRPB2 and NRPB2 was assessed by two‐sided Mann–Whitney U‐test, P = 1.53e‐06.

Histogram of transcriptional read‐through length (nt) from PAS of protein‐coding gene (plaNET‐seq FPKM > 5, n = 9,316) in NRPB2 nrpb2‐2 (NRPB2 , blue) and NRPB2 nrpb2‐2 (NRPB2 , red).

Box plot shows the RNAPII transcriptional read‐through length from PAS of protein‐coding genes (plaNET‐seq, FPKM > 5, n = 9,316) called based on statistical model (see Materials and Methods) in NRPB2 nrpb2‐2 (NRPB2 , blue) and NRPB2 nrpb2‐2 (NRPB2 , red). Medians of read‐through length in NRPB2 and NRPB2 mutant are 534 and 649 nt. Two‐sided Mann–Whitney U‐test: *** denotes P = 9.9e‐62. The solid horizontal lines and box limits represent median, lower and upper quartiles of data values in each group. The upper and lower whiskers extend to the largest or smallest value, respectively, no further than 1.5*IQR from the relevant quartile.

Metagene plot of RNAPII signal by plaNET‐seq anchored at both PAS of upstream genes and TSS of downstream genes for tandemly oriented genes (n = 5,753) in NRPB2 nrpb2‐2 (NRPB2 , blue) and NRPB2 nrpb2‐2 (NRPB2 , red). Red arrow denotes the direction of transcriptional read‐through. Pink dashed line rectangle indicates the region corresponding to the second half of PAS‐TSS gaps along 5′–3′ direction.

Metagene plot of RNAPII signal by plaNET‐seq anchored at PASs of both upstream genes and downstream genes for gene pairs located in “tail‐to‐tail” orientation (n = 1,384) in NRPB2 nrpb2‐2 (NRPB2 , blue) and NRPB2 nrpb2‐2 (NRPB2 , red). Red arrows denote the directions of transcriptional read‐through from both PASs. Pink dashed line rectangles indicate the region corresponding to the second half of PAS‐TSS gaps along 5′–3′ direction.

Accelerated transcription reduces RNAPII stalling at 3′ gene ends and enhances transcriptional read‐through downstream of PAS

plaNET‐seq signal of RNAPII at 3′ end of AT2G21410 in NRPB2 nrpb2‐2 (NRPB2 , blue) and NRPB2 nrpb2‐2 (NRPB2 , red). Arrows indicate the RNAPII signal peaks at PAS‐stalling region. Metagene profile of plaNET‐seq mean signal of RNAPII in a 1 kb window centered at PAS (n = 24,448) in NRPB2 nrpb2‐2 (NRPB2 , blue) and NRPB2 nrpb2‐2 (NRPB2 , red). The significance of differences of plaNET‐seq signal in the region from PAS to +100 bins between NRPB2 and NRPB2 was assessed by two‐sided Mann–Whitney U‐test, P = 1.53e‐06. Histogram of transcriptional read‐through length (nt) from PAS of protein‐coding gene (plaNET‐seq FPKM > 5, n = 9,316) in NRPB2 nrpb2‐2 (NRPB2 , blue) and NRPB2 nrpb2‐2 (NRPB2 , red). Box plot shows the RNAPII transcriptional read‐through length from PAS of protein‐coding genes (plaNET‐seq, FPKM > 5, n = 9,316) called based on statistical model (see Materials and Methods) in NRPB2 nrpb2‐2 (NRPB2 , blue) and NRPB2 nrpb2‐2 (NRPB2 , red). Medians of read‐through length in NRPB2 and NRPB2 mutant are 534 and 649 nt. Two‐sided Mann–Whitney U‐test: *** denotes P = 9.9e‐62. The solid horizontal lines and box limits represent median, lower and upper quartiles of data values in each group. The upper and lower whiskers extend to the largest or smallest value, respectively, no further than 1.5*IQR from the relevant quartile. Metagene plot of RNAPII signal by plaNET‐seq anchored at both PAS of upstream genes and TSS of downstream genes for tandemly oriented genes (n = 5,753) in NRPB2 nrpb2‐2 (NRPB2 , blue) and NRPB2 nrpb2‐2 (NRPB2 , red). Red arrow denotes the direction of transcriptional read‐through. Pink dashed line rectangle indicates the region corresponding to the second half of PAS‐TSS gaps along 5′–3′ direction. Metagene plot of RNAPII signal by plaNET‐seq anchored at PASs of both upstream genes and downstream genes for gene pairs located in “tail‐to‐tail” orientation (n = 1,384) in NRPB2 nrpb2‐2 (NRPB2 , blue) and NRPB2 nrpb2‐2 (NRPB2 , red). Red arrows denote the directions of transcriptional read‐through from both PASs. Pink dashed line rectangles indicate the region corresponding to the second half of PAS‐TSS gaps along 5′–3′ direction. Transcriptional read‐through blurs the boundaries of transcription units, which could result in overlapping transcripts and potential gene expression conflicts. To investigate this, we focused on read‐through transcription of tandemly oriented genes, where transcription read‐through from upstream‐located genes may invade downstream genes. RNAPII with accelerated transcription speed is expected to extend transcriptional read‐through into the intergenic space (i.e., gaps) between the PAS of an upstream gene and the TSS of a downstream gene (PAS‐TSS gaps). Indeed, NRPB2 shows higher RNAPII signal than NRPB2 in the second half of PAS‐TSS gaps (n = 5,753) while RNAPII in NRPB2 stalls downstream of PAS in the first half of PAS‐TSS gaps (Figs 5E and EV5A). We further investigated plaNET‐seq RNAPII signal in PAS‐PAS gaps of paired genes facing each other in “tail‐to‐tail” orientation (n = 1,384). Also for this subset of genes, NRPB2 lacked the characteristic RNAPII PAS‐stalling in the first half of PAS‐PAS gaps and showed significantly higher RNAPII signal in the second half of PAS‐PAS gaps (Figs 5F and EV5B). These data suggest that accelerated transcription speed triggers transcriptional read‐through genome‐wide, resulting in overlapping transcripts and potential gene expression conflicts. In conclusion, our data highlight connections between reduced speed of RNAPII transcription at gene ends (i.e., PAS‐stalling) and the termination of RNAPII transcription, linking the speed of transcription to spatial separation of plant transcription units.
Figure EV5

Quantification of read‐through transcription in compared to (related to Fig 5)

Box plot showing the comparison of plaNET‐seq signal of NRPB2 (blue) and NRPB2 (red) in the region corresponding to the second half of PAS‐TSS gaps (n = 5,753) between tandemly oriented genes. Mann–Whitney test, **** denotes P = 1.70e‐43. The solid horizontal lines and box limits represent median, lower and upper quartiles of data values in each group. The upper and lower whiskers extend to the largest or smallest value, respectively, no further than 1.5*IQR from the relevant quartile.

Box plot showing the comparison of plaNET‐seq signal of NRPB2 (blue) and NRPB2 (red) in the region corresponding to the second half of PAS‐PAS gaps (n = 1,384) between genes located in “tail‐to‐tail” orientation. Mann–Whitney test, **** denotes P = 7.10e‐14. The solid horizontal lines and box limits represent median, lower and upper quartiles of data values in each group. The upper and lower whiskers extend to the largest or smallest value, respectively, no further than 1.5*IQR from the relevant quartile.

Quantification of read‐through transcription in compared to (related to Fig 5)

Box plot showing the comparison of plaNET‐seq signal of NRPB2 (blue) and NRPB2 (red) in the region corresponding to the second half of PAS‐TSS gaps (n = 5,753) between tandemly oriented genes. Mann–Whitney test, **** denotes P = 1.70e‐43. The solid horizontal lines and box limits represent median, lower and upper quartiles of data values in each group. The upper and lower whiskers extend to the largest or smallest value, respectively, no further than 1.5*IQR from the relevant quartile. Box plot showing the comparison of plaNET‐seq signal of NRPB2 (blue) and NRPB2 (red) in the region corresponding to the second half of PAS‐PAS gaps (n = 1,384) between genes located in “tail‐to‐tail” orientation. Mann–Whitney test, **** denotes P = 7.10e‐14. The solid horizontal lines and box limits represent median, lower and upper quartiles of data values in each group. The upper and lower whiskers extend to the largest or smallest value, respectively, no further than 1.5*IQR from the relevant quartile. Our findings highlight molecular and organismal consequences of altered RNAPII elongation speed in a multicellular organism. The two main peaks of RNAPII localization in genomes at gene boundaries were depleted when transcription speed was accelerated (Fig 6A and B). Accelerated RNAPII transcription impacted gene expression after transcriptional initiation, through profound effects on splicing and transcriptional termination. Our data support that transcription speed control at gene boundaries is a key step in gene expression of multicellular organisms.
Figure 6

Cartoon summarizing the effect of on RNAPII genomic stalling and transcription read‐through

A schematic illustrating the effect of transcription speed on RNAPII stalling at promoter‐proximal regions. Accelerated RNAPII is prone to move out of stalling regions (centered at the position of the +1 nucleosome) at 5′ end of genes, resulting in reduced promoter‐proximal stalling peaks in RNAPII profile by plaNET‐seq.

A schematic illustration showing that accelerated RNAPII tends to evade from RNAPII stalling near gene poly‐(A) sites (PAS). This leads to less efficient transcription termination and extended transcription read‐through, reflected by the absence of RNAPII signal peaks downstream of PAS and elevated signal downstream of PAS‐stalling region by plaNET‐seq.

Cartoon summarizing the effect of on RNAPII genomic stalling and transcription read‐through

A schematic illustrating the effect of transcription speed on RNAPII stalling at promoter‐proximal regions. Accelerated RNAPII is prone to move out of stalling regions (centered at the position of the +1 nucleosome) at 5′ end of genes, resulting in reduced promoter‐proximal stalling peaks in RNAPII profile by plaNET‐seq. A schematic illustration showing that accelerated RNAPII tends to evade from RNAPII stalling near gene poly‐(A) sites (PAS). This leads to less efficient transcription termination and extended transcription read‐through, reflected by the absence of RNAPII signal peaks downstream of PAS and elevated signal downstream of PAS‐stalling region by plaNET‐seq.

Discussion

RNAPII transcription speed and organismal development

While we succeeded in generation of viable plants carrying a fast RNAPII mutation, we were unable to obtain plants with a mutation in a conserved residue that reduced RNAPII transcription speed in yeast. This observation is reminiscent of embryonic lethality in mice through a point mutation in the largest RNAPII subunit that decreases transcription speed 43. Female germline development in Arabidopsis involves more complex cellular differentiation than male germline development 44. A genetic dissection of factors required for female germline development revealed cell specification by multiple splicing factors 44, 45. Accelerated RNAPII transcription in NRPB2 was associated with increased splicing efficiency, perhaps offering an explanation for enhanced viability of the female gametophyte compared to nrpb2‐2.

Accelerated RNAPII transcription and RNAPII stalling

Promoter‐proximal stalling represents a common feature of transcription throughout eukaryotic genomes 46. The purpose of promoter‐proximal RNAPII stalling is debated actively, yet a reduction in RNAPII transcription speed during stalling could be part of a checkpoint regulating gene expression. In organisms without NELF, for example, plants, promoter‐proximal RNAPII stalling correlates with the position of the +1 nucleosome encountered by the transcription machinery 8. Accelerated transcription in NRPB2 decreased promoter‐proximal stalling and resulted in increased intragenic RNAPII transcription. Our data thus suggest that a reduction in RNAPII elongation speed near promoters facilitates the accumulation of promoter‐proximal RNAPII peaks. In Arabidopsis, these peaks form independently of NELF and perhaps form through contributions by nucleosome barriers that correlate well with the peak position 8, 47. Possible advantages of promoter‐proximal RNAPII stalling include a reduced response time to adjust gene expression to new environmental conditions. Instead of initiating the process of RNAPII transcription from recruitment and complex assembly, stalled RNAPII may represent pre‐assembled and elongation competent RNAPII complexes waiting for signals to transcribe the full gene 1. Defense signaling is crucial for plant fitness and regulated with fast temporal dynamics, perhaps achieved by a release of RNAPII into elongation from promoter‐proximal stalling sites. If true, this would predict constitutive defense signaling when transcription is accelerated. Interestingly, NRPB2 nrpb2‐2 plants resembled mutants with constitutively active defense signaling 34. PR gene induction represents a diagnostic molecular hallmark of elevated defense signaling 48. While alternative molecular explanations for stunted growth in NRPB2 nrpb2‐2 may exist, for example, indirect effects, the induction of PR gene expression is consistent with an auto‐immunity phenotype triggered by accelerated transcription. Our data thus provide a potential connection between plant defense signaling, promoter‐proximal RNAPII stalling, and the speed of RNAPII transcription. In conclusion, these data imply that transcription speed limits at gene boundaries may benefit plants by avoiding constitutive defense signaling that triggers auto‐immunity.

Accelerated RNAPII transcription and RNA processing

Our targeted introduction of candidate point mutations represents a direct approach to address mechanistic links between the speed of RNAPII transcription and RNA processing. Nevertheless, some molecular effects we reported could represent indirect effects caused by differences in growth and development between NRPB2 nrpb2‐2 and NRPB2 nrpb2‐2 . RNA‐seq revealed that intron retention is reduced when RNAPII is accelerated; in other words, splicing efficiency of poorly spliced introns is increased. plaNET‐seq data indicate that increased splicing efficiency is associated with the capture of splicing intermediates with 3′ terminal bases overlapping 5′SS, perhaps indicating that splicing of retained introns could be increased by promoting RNAPII binding to 5′SS. In conclusion, the speed of RNAPII transcription contributes to plant gene expression by modulating splicing efficiency, particularly at retained introns. plaNET‐seq data informed on transcriptional termination of RNAPII. Strikingly, we found a reduction in RNAPII peaks associated with termination when transcription is accelerated, and an increased distance of read‐through transcription downstream of the PAS (Fig 5D). Read‐through transcription triggered by elevated temperature has been reported in budding yeast and mammalian cell culture 37, 49. Extended read‐through as observed in an accelerated RNAPII transcription mutant may have functional consequences on gene expression. The increased transcriptional read‐through may result in gene expression defects for neighboring genes, for example, by transcriptional interference 50, 51. In summary, our data support the idea that a reduction in RNAPII transcription speed promotes RNAPII density peaks in genomes with functional consequences for the process of transcriptional termination.

Materials and Methods

Reagents and Tools Table

Methods and Protocols

Plant material and growth conditions

The Arabidopsis mutant lines generated in this study were based on A. thaliana Columbia ecotype (Col‐0) background. Generation of transgenic Arabidopsis plants was performed by Agrobacterium‐mediated transformation as described 52. NRPB2 and NRPB2 transgenes were first introduced to Col‐0 and then crossed with nrpb2‐2 mutant. NRPB2 nrpb2‐2 and NRPB2 nrpb2‐2 double mutants were screened by genotyping from F3 generation (Fig EV1A). For in vitro growth, Arabidopsis seeds were surface‐sterilized and placed on ½ MS media agar plates (1% sucrose). The seeds were stratified in 4°C for 3 days before transferred into 22°C with 16/8‐h light/dark. For flagellin 22 treatment experiments, Arabidopsis seeds were grown on ½ MS media agar plate (1% sucrose) for 12 days, and seedlings were transferred into ½ MS liquid media (1% sucrose) and grew in flasks under 22°C with 16/8‐h light/dark and 150 rpm shaking for 2 days. For growth on soil, Arabidopsis seeds were sowed on soil directly and undergo stratification in 4°C for 3 days before growth under 22°C with 16/8‐h light/dark on soil.

Plasmid construction

The construction of vectors for Agrobacterium‐mediated stable transformation was based on pEarleyGate 302 vector (pEG302). pEG302‐AtNRPB2WT‐FLAG construct was kindly provided by Craig Pikaard 33. To generate pEG302‐AtNRPB2Y732F‐FLAG and pEG302‐AtNRPB2Y732F‐FLAG, pEG302‐AtNRPB2WT‐FLAG construct was linearized by DraIII digestion and used as backbone in isothermal assembly reactions; the inserts in isothermal assembly are partial genomic NRPB2 sequences containing Y732F (TAT to TTT) point mutation and P979S (CCG to TCG), respectively. Fragment containing DNA mutation for Y732F mutant was generated by overlapping PCR (primer pair 3089/3082) fusing two fragments generated by primer pair 3089/3467 and 3082/3466. By using similar strategy, fragment containing DNA mutation for P979S mutant was also generated by overlapping PCR (primer pair 3089/3082) fusing two fragments generated by primer pair 3089/3084 and 3082/3083. Isothermal assembly was performed subsequently to generate pEG302‐AtNRPB2Y732F‐FLAG and pEG302‐AtNRPB2Y732F‐FLAG. All constructs were verified by extensive restriction enzyme digestions, and the fragment with DNA mutations for NRPB2 and NRPB2 was confirmed by DNA sequencing analysis. The primers used in plasmid construction are listed in Table EV1.

Flagellin treatment

Flagellin 22 (N‐terminus acetylated) was synthesized by Schafer‐N (https://schafer-n.com). For each replicate, flagellin 22 treatment was carried out by adding the 0.75 ml flagellin 22 solution (1 mg/ml in DMSO) to Arabidopsis seedlings from 50 μl seeds growing in 100 ml liquid MS media in a flask (3.3 μM as final concentration of flagellin 22). The treatment was set in 0‐min (before treatment), 2‐, 3‐, and 4‐min time course. Each experiment was performed in three independent replicates. After flagellin 22 treatment, the seedlings were flash‐frozen in liquid nitrogen.

Protein extraction and western blotting

NRPB2WT‐FLAG, NRPB2Y732F‐FLAG, and NRPB2P979S‐FLAG proteins were extracted from 2‐week‐old Arabidopsis seedlings of NRPB2 Col‐0, NRPB2 Col‐0, and NRPB2 Col‐0, respectively. Equal amounts of plant material were ground into a fine powder, and proteins were extracted in 2.5× extraction buffer (150 mM Tris–HCl pH 6.8; 5% SDS; 25% Glycerol; 0.025% Bromophenol blue; 0.1 mM DTT). Total proteins were separated by SDS–PAGE on precast 4–15% Criterion TGX stain‐free protein gels (Bio‐Rad) and transferred to PVDF membrane by Trans‐Blot Turbo transfer system (Bio‐Rad). 5% non‐fat milk in PBS was used to block blotted membrane (30 min at room temperature). Anti‐FLAG (Sigma F1804 or F3165) antibodies and anti‐mouse HRP‐conjugated secondary antibody (Dako P0161) were used as primary and secondary antibodies for the detection of FLAG‐tagged NRPB2 proteins. Anti‐H3 (abcam ab1791) antibody and anti‐rabbit HRP‐conjugated secondary antibody (Dako P0217) were used as primary and secondary antibodies for the detection of histone H3. The membrane was incubated with primary antibody overnight at 4°C with gentle rotation (final concentration 0.25 μg/ml in PBS). Membranes were washed with PBS and then incubated with secondary antibody (1:10,000 dilution in PBS) for 1 h at room temperature, followed by two‐time washes with PBS (5 min each) and one‐time wash with PBST (10 min). Chemiluminescent signals were detected using Super‐Signal West Pico Chemiluminescent (Thermo Fisher Scientific) according to manufacturer's instructions.

Yeast strains, media, and primer extension analysis

Yeast media are prepared as described 27. For MPA and Mn2+ growth assay, MPA (final concentration 20 mg/ml) and MnCl2 (15 mM) were supplemented to minimal SC‐Leucine medium. The yeast RNAPII mutant strains were generated by site‐directed mutagenesis as previously described 20. TSS selection of ADH1 gene was assayed by primer extension analysis. In brief, corresponding yeast strains were grown in YPD media to mid‐log phase; 30 μg of isolated yeast total RNA from each indicated strains was used in primer extension analysis exactly as previously described 20, 27.

Nascent RNA isolation

Nascent RNA was isolated according to previously described protocol with minor changes 38. NRPB2 Col‐0, NRPB2 Col‐0 seedlings from flagellin 22 treatment were ground into a fine powder. Nuclei were isolated and washed with HONDA buffer [0.44 M sucrose, 1.25% Ficoll, 2.5% Dextran T40, 20 mM Tris–HCl pH 7.5, 10 mM MgCl2, 0.5% Triton X‐100, 5 mM DTT, 1× EDTA‐free Complete protease inhibitor (Roche)]. The nuclear fraction was digested by 600 U DNase I in 0.5 ml Lysis buffer [0.3 M NaCl, 20 mM Tris–HCl pH 7.5, 5 mM MgCl2, 5 mM β‐mercaptoethanol, 1× EDTA‐free Complete protease inhibitor (Roche), 0.5% Tween‐20, 10 μl RNase inhibitor (moloX GmbH, http://www.molox.de)] at 4°C for 20 min with shaking at 2,000 rpm on an Eppendorf ThermoMixer. The supernatant of a centrifugation (10,000 g for 10 min at 4°C) was recovered into a new tube and combined with Dynabeads M‐270 (Invitrogen) coupled with anti‐FLAG antibody (Sigma) for 2 h at 4°C with gentle rotation. Anti‐FLAG antibody was coupled with Dynabeads according to the manufacturer's instructions. After FLAG‐IP, beads were washed 6 times using 0.5 ml Wash buffer [0.3 M NaCl, 20 mM Tris–HCl pH 7.5, 5 mM MgCl2, 5 mM β‐mercaptoethanol, 1× EDTA‐free Complete protease inhibitor (Roche), and RNase inhibitor]. Bead‐bound protein was eluted with 0.5 mg/ml 3xFLAG peptide (ApexBio) for 30 min twice at 4°C. RNA attached to immunoprecipitated proteins was isolated using QIAGEN miRNeasy Mini Kit according to manufacturer's instructions. Western blot has been done as previously described 38 for input, unbound, and eluted fractions to monitor IP efficiency.

Nascent RNA analysis

Isolated nascent RNA was treated with Turbo DNase to remove DNA contamination following the manufacturer's instruction (Ambion). Hundred nanograms of DNase‐treated RNA was used for reverse transcription into cDNA by gene‐specific primers following the manufacturer's instruction of Superscript IV (Invitrogen) kit. Quantitative analysis of the generated cDNA was carried out by qPCR using the GoTaq qPCR Master Mix (Promega) and CFX384 Touch Real‐Time PCR Detection System (Bio‐Rad). Negative controls lacking the reverse transcriptase enzyme (−RT) were performed alongside all RT–qPCR experiments. qPCR expression level of each primer pair was calculated relative to the level of reference gene ACT2. All the primers used in RT and qPCR were summarized in Table EV1.

PlaNET‐seq library construction and sequencing

Libraries for plaNET‐seq were prepared as previously described 8. Nascent RNA from NRPB2 nrpb2‐2 and NRPB2 nrpb2‐2 seedlings was used in plaNET‐seq. In specific, the plaNET‐seq libraries were constructed according to Bioo Scientific's NEXTflex Small RNA‐Seq Kit v3 using a customized protocol. Two independent replicate libraries were constructed for each plant genotype. Approximately 100 ng RNA was used for each replicate. The isolated RNA was ligated with 3′‐linker and fragmented in alkaline solution (100 mM NaCO3 pH 9.2, 2 mM EDTA). The fragmented RNA was cleaned up and subjected to T4 PNK treatment (20 U PNK, NEB) for 20 min at 37°C followed by re‐annealing of RT primer (5′‐GCCTTGGCACCCGAGAATTCCA‐3′; 70°C, 5 min; 37°C, 30 min; and 25°C, 15 min). The RNA was then re‐introduced to the manufacturer's protocol at the adapter inactivation step. The final libraries were quantified with DNA High Sensitivity Kit on Agilent Bioanalyzer 2100 and then sequenced on the Illumina HiSeq 4000 platform in PE150 mode at Novogene (https://en.novogene.com).

Total RNA extraction and RNA‐seq

Total RNA was extracted from 2‐week‐old NRPB2 nrpb2‐2 and NRPB2 nrpb2‐2 Arabidopsis seedlings using Plant RNeasy Mini Kit following manufactural instructions (QIAGEN). Turbo DNase (Ambion) was used to treat extracted RNA using oligo‐dT primers and Superscript IV (Invitrogen) as per manufacturer's instructions. The poly(A)‐enriched libraries for RNA‐seq were constructed using Illumina TruSeq Sample Prep Kit v2 following the manufacturer's protocol and quantified on Agilent 2100 Bioanalyzer. The sequencing was performed on Illumina HiSeq 4000 platform in PE100 mode.

Bioinformatics

All the supporting code for bioinformatics analysis is available at https://github.com/Maxim-Ivanov/Leng_et_al_2019. Alignment and post‐processing of plaNET‐seq reads were done as previously described 8. The first 4 bases of both R1 and R2 reads in plaNET‐seq are unique molecular identifiers (UMIs). They were trimmed from read sequences and appended to read names using UMI‐Tools v0.5.3. After UMI trimming, the 5′‐terminal base of R2 corresponds to the 3′‐end of original RNA molecule and thus denoted the genomic position of RNAPII active center. R2 reads were aligned to TAIR10 genome assembly using STAR v2.5.2b in transcriptome‐guided mode with the following settings: –outSAMmultNmax 1 –alignEndsType Extend5pOfRead1 –clip3pAdapterSeq GATCGTCGGACT. Ensembl Plants release 28 was used as the source of transcript annotation for alignment. The BAM files were sorted using Samtools v1.3.1. The following categories of reads were filtered out: (i) PCR duplicates (UMI‐Tools); (ii) reads aligned within 100 bp from any rRNA, tRNA, snRNA, or snoRNA gene from Araport11 on either strand (BEDtools v2.17.0); and (iii) reads aligned with MAPQ < 10 (Samtools). The filtered BAM files were imported into R environment v3.5.1 using GenomicAlignments_1.18.1 library. The strand orientation of reads was flipped to restore strandedness of the original RNA molecules. 3′‐terminal bases of flipped reads were found to overlap with donor or acceptor splice sites much more frequently than could be expected by chance. Such reads most likely represent splicing intermediates due to co‐immunoprecipitation of the spliceosome together with FLAG‐tagged RNAPII complexes. These reads were filtered out by overlap with the union of splice sites obtained from both Ensembl Plants 28 (TxDb.Athaliana.BioMart.plantsmart28 package) and Araport11 annotations. In addition, all split reads were removed as possible mature RNA contaminations. The remaining reads are expected to represent the nascent RNA population. Their genomic coverage was exported as strand‐specific BigWig and bedgraph files using rtracklayer_1.42.2. For details on the alignment procedure, see 01‐Alignment_of_plaNET‐Seq_data.sh and 02‐Postprocessing_of_plaNET‐Seq_data.R. Araport11 annotation was used throughout all further steps of data analysis because it is more comprehensive in terms of non‐coding transcripts than both TAIR10 and Ensembl Plants 28 annotations. We adjusted gene borders from Araport11 using coordinates of TSS and PAS tag clusters which were called using CAGEfightR package 53 from the available TSS‐seq 50 and direct RNA‐seq datasets 54, 55, respectively. If multiple TSS or PAS tag clusters were connected to the same gene, the strongest of them was chosen as the new border. For details, see 03_Adjustment_of_Araport11_gene_boundaries. To draw metagene plots of plaNET‐seq, we merged Bedgraph tracks of the two biological replicates of each genotype. The merged tracks were then normalized to 1 million reads in nuclear protein‐coding genes. The X axes of metagene plots represent the genomic intervals of choice (whole genes, exons, introns, etc.) which were scaled to the defined number of bins. Intervals overlapping multiple annotated transcription units were excluded from consideration. Both introns and exons were trimmed by 5 bp each side prior to scaling to avoid possible artifacts. The Y axes show the sequencing coverage averaged between the genomic intervals. The shaded area in metagene plots represents normal‐based standard error of mean of normalized plaNET‐seq signal at each genomic bin. For details, see 08‐Metagene_plots.R. The positions of nucleosomes in Arabidopsis were obtained from the PlantDHS database 56. To calculate the read‐through (RT) length, we considered transcribed genes (plaNET‐seq FPKM in WT samples above 5). Genomic intervals for RT length estimation were defined from PAS of the analyzed gene to the nearest downstream TSS. Coordinates of TSS and PAS clusters were called from TSS‐seq and direct RNA‐seq datasets as described above. For each gene of interest, the empirical distribution of plaNET‐seq tag counts in 100‐bp sliding window was obtained (the “transcription” model). The “random” model corresponding to the un‐transcribed state was represented by Poisson distribution where the rate parameter was estimated from plaNET‐seq tag counts in intergenic regions. Then, plaNET‐seq tags were counted in every 100‐bp window moving in 10‐bp steps along the candidate RT genomic interval. For each window, the probability to observe at most this tag count under the gene‐specific “transcription” model was divided by the probability to observe at least this tag count under the alternative “random” model. The start position of the first window where the probability ratio dropped below 1 was considered as the end of the read‐through region. For details, see 04‐Read‐through_distance.R. To calculate promoter‐proximal RNAPII stalling index for each gene, we first found 100‐bp windows with the highest plaNET‐seq coverage within the interval (TSS − 100 bp, TSS + 300 bp). Center of this window was considered as the summit of promoter‐proximal RNAPII peak. The stalling index was then calculated as the ratio of plaNET‐seq coverage in this window vs the whole gene (normalized by gene width). To avoid statistical artifacts, genes shorter than 1 kb or having plaNET‐seq FPKM below 1 were skipped from consideration. For details, see 05‐Promoter‐proximal_stalling_index.R. RNA‐seq reads were adapter‐ and quality‐trimmed by TrimGalore v0.4.3 in paired‐end mode and then aligned to TAIR10 by STAR v2.5.2b in local mode. Aligned reads with MAPQ below 10 were removed by Samtools v1.3.1. BAM files were converted to unstranded Bedgraph and BigWig files using BEDtools genomecov v2.26.0 and kentUtils bedGraphToBigWig v4, respectively. The code was detailed in the section 06‐Alignment_of_RNA‐Seq_data.sh in the mentioned GitHub page. Differentially expressed genes were called from RNA‐seq data using DESeq2 57. Differentially expressed exons and introns were detected independently from the changes in gene expression level by DEXSeq 58. Exons and introns were defined as disjoint exonic or intronic intervals, respectively, in Araport11. For details, see 07‐Differential_expression.R. To detect the differential usage of alternative 5′ and 3′ splice sites, transcript isoforms were first quantified by Cufflinks 59. Then, the Cufflinks output was used to quantify the different AS events extracted from an Arabidopsis reference transcript dataset AtRTD2 60 with SUPPA2 61. For details, see 10‐SUPPA2_pipeline.sh and 11‐Differential_AS.R scripts.

Author contributions

Conceptualization: XL, CDK, SM; Methodology: MI, PK, SM; Investigation: XL, PK, IM, MI, SM; Formal analysis: MI, AT, AS; Data curation: MI; Writing—original draft: XL, SM; Writing—review and editing: XL, MI, CDK, PK, PB, AS, SM; Visualization, XL, MI; Resources: CDK, SM; Supervision: CDK, SM; Funding acquisition: SM.

Conflict of interest

The authors declare that they have no conflict of interest. Expanded View Figures PDF Click here for additional data file. Table EV1 Click here for additional data file. Table EV2 Click here for additional data file. Source Data for Expanded View Click here for additional data file. Review Process File Click here for additional data file. Source Data for Figure 1 Click here for additional data file. Source Data for Figure 2 Click here for additional data file. Source Data for Figure 4 Click here for additional data file.
Reagent or resourceSourceIdentifier
Antibodies
Rabbit Anti‐Mouse Immunoglobulins/HRPDakoP0161; RRID:AB_2687969
Swine anti‐rabbit Ig HRP antibodyDakoP0217; RRID:AB_2728719
Anti‐Histone H3 antibodyAbcamAb1791; RRID:AB_302613
Monoclonal ANTI‐FLAG® M2 antibody produced in mouseSigma‐AldrichF3165; RRID: AB_259529
Monoclonal ANTI‐FLAG® M2 antibody produced in mouseSigma‐AldrichF1804; RRID:AB_262044
Bacterial and virus strains
Escherichia coli DH5α™ competent cellsThermo FisherCat no. 18265017
Agrobacterium tumefaciens GV3101(PMP90)N/AN/A
Peptides
Flagellin 22 peptidesSchafer‐NPeptide 40007
Critical commercial assays
miRNeasy Mini KitQIAGENID: 217004
SuperScript™ IV First‐Strand Synthesis SystemInvitrogenCat. no.18091050
NEXTflex Small RNA‐Seq Kit v3Bioo ScientificN/A
DNA High Sensitivity KitAgilent5067‐4626
4–15% Criterion™ TGX Stain‐Free™ Protein GelBio‐RadCat. no. 5678084
TURBO DNA‐free KitThermo FisherCat. no. AM1907
Dynabeads™ Antibody Coupling KitThermo FisherCat. no. 114311D
TruSeq RNA Library Prep Kit v2IlluminiaRS‐122‐2001
GoTaq® qPCR Master MixPromegaCat no. A6002
RNeasy Plant Mini KitQIAGENID: 74904
Deposited data
GSE133143NCBI GEO https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE133143
Experimental models: organisms/strains
Arabidopsis thaliana Col‐0N/AN/A
NRPB2/nrpb2‐2 mutantDr. Craig PikaardN/A
Software and algorithms
Software for bioinformatic analysis: see Materials and Methods GitHub https://github.com/Maxim-Ivanov/Leng_et_al_2019
  61 in total

1.  Widespread Backtracking by RNA Pol II Is a Major Effector of Gene Activation, 5' Pause Release, Termination, and Transcription Elongation Rate.

Authors:  Ryan M Sheridan; Nova Fong; Angelo D'Alessandro; David L Bentley
Journal:  Mol Cell       Date:  2018-11-29       Impact factor: 17.970

Review 2.  Plant germline formation: common concepts and developmental flexibility in sexual and asexual reproduction.

Authors:  Anja Schmidt; Marc W Schmid; Ueli Grossniklaus
Journal:  Development       Date:  2015-01-15       Impact factor: 6.868

3.  Transcriptional interference and termination between duplicated alpha-globin gene constructs suggests a novel mechanism for gene regulation.

Authors:  N J Proudfoot
Journal:  Nature       Date:  1986 Aug 7-13       Impact factor: 49.962

4.  CDK9 inhibitors define elongation checkpoints at both ends of RNA polymerase II-transcribed genes.

Authors:  Clélia Laitem; Justyna Zaborowska; Nur F Isa; Johann Kufs; Martin Dienstbier; Shona Murphy
Journal:  Nat Struct Mol Biol       Date:  2015-04-06       Impact factor: 15.369

5.  Mutations in the Saccharomyces cerevisiae RPB1 gene conferring hypersensitivity to 6-azauracil.

Authors:  Francisco Malagon; Maria L Kireeva; Brenda K Shafer; Lucyna Lubkowska; Mikhail Kashlev; Jeffrey N Strathern
Journal:  Genetics       Date:  2006-03-01       Impact factor: 4.562

6.  From structure to systems: high-resolution, quantitative genetic analysis of RNA polymerase II.

Authors:  Hannes Braberg; Huiyan Jin; Erica A Moehle; Yujia A Chan; Shuyi Wang; Michael Shales; Joris J Benschop; John H Morris; Chenxi Qiu; Fuqu Hu; Leung K Tang; James S Fraser; Frank C P Holstege; Philip Hieter; Christine Guthrie; Craig D Kaplan; Nevan J Krogan
Journal:  Cell       Date:  2013-08-08       Impact factor: 41.582

7.  Pre-mRNA splicing is facilitated by an optimal RNA polymerase II elongation rate.

Authors:  Nova Fong; Hyunmin Kim; Yu Zhou; Xiong Ji; Jinsong Qiu; Tassa Saldi; Katrina Diener; Ken Jones; Xiang-Dong Fu; David L Bentley
Journal:  Genes Dev       Date:  2014-12-01       Impact factor: 11.361

8.  Autoimmune response as a mechanism for a Dobzhansky-Muller-type incompatibility syndrome in plants.

Authors:  Kirsten Bomblies; Janne Lempe; Petra Epple; Norman Warthmann; Christa Lanz; Jeffery L Dangl; Detlef Weigel
Journal:  PLoS Biol       Date:  2007-09       Impact factor: 8.029

9.  Improved annotation of 3' untranslated regions and complex loci by combination of strand-specific direct RNA sequencing, RNA-Seq and ESTs.

Authors:  Nicholas J Schurch; Christian Cole; Alexander Sherstnev; Junfang Song; Céline Duc; Kate G Storey; W H Irwin McLean; Sara J Brown; Gordon G Simpson; Geoffrey J Barton
Journal:  PLoS One       Date:  2014-04-10       Impact factor: 3.240

10.  PlantDHS: a database for DNase I hypersensitive sites in plants.

Authors:  Tao Zhang; Alexandre P Marand; Jiming Jiang
Journal:  Nucleic Acids Res       Date:  2015-09-22       Impact factor: 16.971

View more
  7 in total

1.  Chromatin-based mechanisms to coordinate convergent overlapping transcription.

Authors:  Soichi Inagaki; Mayumi Takahashi; Kazuya Takashima; Satoyo Oya; Tetsuji Kakutani
Journal:  Nat Plants       Date:  2021-03-01       Impact factor: 15.793

2.  Unique and contrasting effects of light and temperature cues on plant transcriptional programs.

Authors:  Mai Jarad; Rea Antoniou-Kourounioti; Jo Hepworth; Julia I Qüesta
Journal:  Transcription       Date:  2020-10-04

3.  Light in the transcription landscape: chromatin, RNA polymerase II and splicing throughout Arabidopsis thaliana's life cycle.

Authors:  Rocío S Tognacca; M Guillermina Kubaczka; Lucas Servi; Florencia S Rodríguez; Micaela A Godoy Herz; Ezequiel Petrillo
Journal:  Transcription       Date:  2020-08-04

4.  TrancriptomeReconstructoR: data-driven annotation of complex transcriptomes.

Authors:  Maxim Ivanov; Albin Sandelin; Sebastian Marquardt
Journal:  BMC Bioinformatics       Date:  2021-05-31       Impact factor: 3.169

5.  R-loops at microRNA encoding loci promote co-transcriptional processing of pri-miRNAs in plants.

Authors:  Lucia Gonzalo; Ileana Tossolini; Tomasz Gulanicz; Damian A Cambiagno; Anna Kasprowicz-Maluski; Dariusz Jan Smolinski; María Florencia Mammarella; Federico D Ariel; Sebastian Marquardt; Zofia Szweykowska-Kulinska; Artur Jarmolowski; Pablo A Manavella
Journal:  Nat Plants       Date:  2022-04-21       Impact factor: 17.352

6.  Unique features of transcription termination and initiation at closely spaced tandem human genes.

Authors:  Noa Nissani; Igor Ulitsky
Journal:  Mol Syst Biol       Date:  2022-04       Impact factor: 13.068

7.  Enhancer transcription detected in the nascent transcriptomic landscape of bread wheat.

Authors:  Yilin Xie; Yan Chen; Zijuan Li; Jiafu Zhu; Min Liu; Yijing Zhang; Zhicheng Dong
Journal:  Genome Biol       Date:  2022-04-30       Impact factor: 17.906

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.