Literature DB >> 34863780

Real-Time Single-Molecule Studies of RNA Polymerase-Promoter Open Complex Formation Reveal Substantial Heterogeneity Along the Promoter-Opening Pathway.

Anssi M Malinen¹, Jacob Bakermans², Emil Aalto-Setälä³, Martin Blessing⁴, David L V Bauer⁵, Olena Parilova³, Georgiy A Belogurov³, David Dulin⁶, Achillefs N Kapanidis⁷.

Abstract

The expression of most bacterial genes commences with the binding of RNA polymerase (RNAP)-σ70 holoenzyme to the promoter DNA. This initial RNAP-promoter closed complex undergoes a series of conformational changes, including the formation of a transcription bubble on the promoter and the loading of template DNA strand into the RNAP active site; these changes lead to the catalytically active open complex (RPO) state. Recent cryo-electron microscopy studies have provided detailed structural insight on the RPO and putative intermediates on its formation pathway. Here, we employ single-molecule fluorescence microscopy to interrogate the conformational dynamics and reaction kinetics during real-time RPO formation on a consensus lac promoter. We find that the promoter opening may proceed rapidly from the closed to open conformation in a single apparent step, or may instead involve a significant intermediate between these states. The formed RPO complexes are also different with respect to their transcription bubble stability. The RNAP cleft loops, and especially the β' rudder, stabilise the transcription bubble. The RNAP interactions with the promoter upstream sequence (beyond -35) stimulate transcription bubble nucleation and tune the reaction path towards stable forms of the RPO.

Entities: Chemical

Keywords: molecular mechanism; reaction pathway; total internal reflection fluorescence microscopy; transcription initiation; σ factor

Mesh：

Substances：

Year: 2021 PMID： 34863780 PMCID： PMC8783055 DOI： 10.1016/j.jmb.2021.167383

Source DB: PubMed Journal: J Mol Biol ISSN： 0022-2836 Impact factor: 5.469

Introduction

Transcription initiation is the first and most regulated step in gene expression in all organisms. The expression of most bacterial genes commences with the binding of RNA polymerase (RNAP)–σ70 holoenzyme to the promoter DNA. The initial RNAP–promoter closed complex (RPC) undergoes large conformational changes leading to a RNAP–promoter open complex (RPO), which is capable of RNA synthesis. These conformational changes are of paramount importance, since their modulation by promoter DNA sequence, protein transcription factors, and small-molecule ligands strongly affects the number of active open complex, and thus the transcription efficiency. Further, several antimicrobials, including clinically used drugs rifampicin3, 4 and fidaxomicin, exert their effect by blocking RNAP from proceeding during a specific step of transcription initiation. However, despite substantial progress in defining the structural basis of transcription initiation mechanism,7, 8, 9, 10 the identity, sequence, and kinetics of conformational changes leading to RPO formation remain elusive. At the initial step of the RPO formation pathway, the RNAP–σ70 holoenzyme recognises the promoter by forming sequence-specific contacts with the −35 element, and sequence-independent contacts upstream from the −35 (“upstream sequence”) as well as around the −10 element [reviewed in 2, 11] In this RPC state, the promoter remains fully double-stranded, but is bent by ∼17° at the −10 element, thus positioning the downstream promoter DNA above the DNA-binding cleft of the RNAP. Studies using footprinting,12, 13, 14, 15 atomic force microscopy and ensemble FRET have indicated additional extensive bending and wrapping of the promoter upstream sequence (between the −35 element and −82); this bending, which is driven by the C-terminal domains of the two RNAP α-subunits (αCTDs) interacting with the promoter upstream, brings the upstream DNA to the RNAP surface, and strongly facilitates RPO complex formation.14, 18, 19 The isomerisation of the RPC towards RPO complex begins with the flipping of non-template DNA (ntDNA) −11 conserved adenine base from the duplex DNA to a specific pocket in σ70.20, 21 The promoter melting then somehow propagates downstream until the full transcription bubble in the RPO complex covers positions −11 to +2. The bubble melting is coupled with the loading of downstream DNA duplex into the RNAP cleft, and the loading of single-stranded template DNA (tDNA) into the RNAP active site. Structural9, 10 and biochemical studies have identified several putative intermediates on the path from the RPC to RPO; however, the number and structural properties of the intermediates detected appear to heavily depend on the promoter sequence, transcription factors, and experimental conditions. The mechanism discussed above describes the formation of a uniform RPO complex on a standard linear reaction pathway. A more complete description of the transcription initiation, however, needs to consider several studies that suggested that individual RPO molecules are not identical, and they instead differ in functional properties.22, 23, 24, 25 One of the most notable variation among RPO complexes is their tendency to perform abortive initiation, i.e., the premature release of short RNAs synthesised by promoter-bound RNAP (reviewed in 26). In fact, it has been estimated that >50% of the RPO complexes are permanently locked into the abortive initiation mode and cannot produce full-length RNA.22, 23, 24, 25 The presence of at least two different RPO classes – one productive and one non-productive (abortive) – raises the possibility that the RPO pathway is also not linear, but instead branches to allow the formation of structurally and functionally different RPO molecules. It has been further suggested that the ratio of productive and non-productive RPO complexes can be modulated by transcription factors and thus offers a layer for gene regulation in the cell. On the other hand, recent single-molecule studies revealed long-lived pausing, backtracking and arrest of initially transcribing bacterial and mitochondrial RNAPs that could potentially explain the productive and abortive RNA synthesis by a single type of RPO complexes.28, 29, 30 The RPO formation pathway branching – its occurrence and mechanism – thus warrants further study. Here, we utilise single-molecule techniques to resolve asynchronous, multi-step and branched reaction mechanisms during σ70-dependent RPO formation on a well characterised consensus lac promoter. Our results strongly suggest that the RPO formation pathway is indeed branched both at the step of initial promoter melting and the step of open transcription bubble stabilisation. Furthermore, αCTD interactions with the promoter upstream sequence strongly stimulate bubble initiation and tune the reaction pathway towards more stable RPO complexes. The RNAP cleft loops (and especially the β′ rudder one), play a key role in stabilising the open transcription bubble.

Results

Direct formation of surface-immobilised catalytically active open complexes

To be able to monitor RNAP–promoter open complex (RPO) formation in real-time at the single-molecule level, we used FRET to look at the changes in distances between two points, i.e., positions −15 and +15 relative to the transcription start site (position +1) on a promoter DNA fragment. A fluorophore pair incorporated in positions −15 (donor) and +15 (acceptor) produces FRET signatures that vary depending on the transcription bubble conformation; this pair has been employed before to monitor conformational changes in populations of single transcription complexes,31, 32 conformational dynamics of RPO complexes and conformational changes after the formation of RPO complex on a consensus lac promoter (lacCONS) (Figure S1(A, B)). Here, we modified our previous protocols to detect the nascent RNAP–promoter complex (RPC) and its subsequent maturation to RPO (Figure 1). To this end, we attached molecules of the Escherichia coli RNA polymerase–σ70 holoenzyme to the surface of a coverslip and started imaging the surface using TIRF microscopy (Figure 1(A, B)). Subsequent addition of the dual-labelled promoter DNA to the reaction buffer was expected to lead to the appearance of co-localised fluorescent spots on the donor (Cy3B label) and acceptor (ATTO647N label) detection channels of the microscope upon binding to the surface-attached holoenzyme (Figure 1(B)). The timing of the appearance of the fluorescent spots on the surface (due to DNA binding and formation of RPC complexes) is precisely defined in the single-molecule trajectories by the simultaneous appearance of Cy3B and ATTO647N fluorescence signals (“DNA binds” time point, Figure 1(C)). The −15/+15 ruler reports low FRET for the RPC complex, and intermediate FRET for the RPO complex, since the formation of the transcription bubble decreases the distance between the −15 and +15 positions in the DNA. The RPC → RPO transition in the trajectories is thus indicated by a sharp FRET increase (“DNA transcription bubble opens” time point, Figure 1(C)), which may occur in one or several steps, depending on the intermediates on the reaction pathway.

Figure 1

Single-molecule FRET method to monitor the RNAP–promoter open complex formation in real-time. (A) E. coli RNAP–σ70 holoenzyme is immobilised on the PEGylated microscope coverslip using biotinylated anti-His-tag-antibody. lacCONS promoter, which is labelled with a donor fluorophore (D, Cy3B) at non-template DNA position −15 and an acceptor fluorophore (A, ATTO647N) at template DNA position + 15, is added to the reaction buffer. The promoter binds to the RNAP and becomes visible on the coverslip surface. The initial RNAP–promoter closed complex isomerises to the open complex, which decreases the distance between the −15 and +15 dyes and increases the FRET. (B) Schematic microscope field-of-view before and after promoter addition to the reaction buffer. Data on the DD (donor excitation–donor emission) and AA (acceptor excitation–acceptor emission) channels is used to identify the RNAP–promoter complexes containing both the Cy3B and ATTO647N dyes. These molecules are highlighted with yellow circles. (C) Schematic single-molecule trajectory showing promoter binding to the RNAP and subsequent isomerisation to the open complex state. Abrupt increase in the Cy3B and ATTO647N fluorescence intensities defines the moment of promoter binding. The increase in the FRET from the low level to the intermediate level defines the moment of open complex formation. (D) Two experimental trajectories show promoter binding and open complex formation. DA indicates the signal on the donor excitation–acceptor emission channel. Our experimental single-molecule trajectories indeed show the expected fluorescence intensity and FRET signatures of RPC complex formation and isomerisation to the RPO state (Figure 1(D)). The moment of RNAP-promoter complex formation was precisely defined by the simultaneous appearance of Cy3B and ATTO647N fluorescence in the single-molecule trajectories (e.g., at 12.25 s and 13.5 s in the left and right panels, respectively, of Figure 1(D)). The apparent FRET efficiency (E*) of the first stable complexes was E* ∼ 0.2 (Figure 1(D)), a value identical to that we obtained previously for the closed transcription bubble state. After a short time, the FRET increased to E* ∼0.45 (at ∼12.6 s and ∼15.7 s in the traces of Figure 1(D)), a value identical to that we obtained previously for the open transcription bubble state. DNA binding to the coverslip surface was strictly mediated by the RNAP, since the number of non-specific DNA binding events was negligible in the absence of RNAP on the surface (cf. Figure S2(A, B)). On the population level, the newly formed RNAP–promoter complexes displayed a bimodal FRET distribution, with mean FRET values ∼0.2 and ∼0.45 (Figure S2(C)) contrasting with the unimodal FRET distribution (mean ∼ 0.18) of the protein-free immobilised promoter DNA (Figure S2(D)). To test whether the ∼0.45 FRET state is indeed a catalytically competent RPO complex, we added NTPs to the sample buffer; this addition almost eliminated the ∼0.45 FRET state, as expected if RPO complexes engage RNA synthesis and escape the promoter (Figure S2(C)). To provide further proof for the formation of catalytically active RPO complexes in situ on the coverslip surface, we performed experiments using a promoter with a different labelling scheme, which is very effective in monitoring the progress of initial transcription (dyes at positions −15 and +20; Figure S1(C)). The scrunching of the downstream DNA towards the RNAP during initial RNA synthesis leads to a stepwise increase in FRET until RNAP escape from the promoter returns the FRET to a low level (Figure S3(A, B)). Example trajectories in Figure S3(C) demonstrate abortive initiation and promoter-escape events occurring shortly after the formation of the RNAP–promoter complexes. However, we note that some RNAP–promoter complexes (typically 20–50% of all complexes) on the surface neither form RPO nor engage RNA synthesis; these molecules remain in stable low FRET state (∼0.2) and may thus represent unproductive complexes resulting, e.g., from RNAP binding to the ends of the promoter DNA fragment (Figure S3(D)). HMM analysis of these complexes (on both −15/+15 and −15/+20 labelled promoters) did not produce Viterbi changes, further corroborating our interpretation that the transition from low FRET state (E* ∼ 0.2) to a long-lived higher FRET state (E* ∼ 0.45) on −15/+15 promoter indeed indicates a RPO formation event instead of spurious fluorescence fluctuation. However, because the FRET sensitivity is not sufficient to confidently distinguish RPC complex from non-specific RNAP–DNA complexes, we decided to analyse further only the RNAP–promoter complexes which directly show the appearance of the FRET signature of the RPO complex (i.e., the ∼0.45 FRET state) on the −15/+15 labelled promoter.

Extended promoter upstream sequence stimulates RPO formation

To study the kinetics of RPO complex formation in real-time and its modulation by the αCTD–promoter upstream interactions, we performed experiments using a long (DNA extending from position −89 to position +25) and short (DNA extending from position −39 to position + 25) version of the lacCONS promoter (Figure 2(A)). We also examined the kinetics of open complex formation by using additional versions of the promoter DNAs, which were either fully double-stranded (dsLC2 promoter; Figure S1(A)) or contained a mismatch in the promoter region from −10 to −4 (a.k.a. pre-melted promoter, pmLC2; Figure S1(B)).

Figure 2

Rate of RP (A) Schematic representation of the real-time RPO formation experiment. The promoter has donor and acceptor labels at positions −15 and +15, respectively. Promoters were employed as short (promoter span −39/+25), long (−89/+25), fully double-stranded and pre-melted (mismatch region −10/−4) versions. (B) Example trajectory on the left demonstrates promoter binding to the surface-immobilised RNAP and the formation of RPC complex at 16.6 s. The RPC isomerises to RPO complex at 17.2 s. Another example trajectory of real-time RPO complex formation is shown on the right. Dwell-times in the RPC state were fit to mono-exponential function to obtain the rate constant of RPO complex formation (C) on the long dsLC2 promoter, (D) long pre-melted LC2 promoter and (E) short pre-melted LC2 promoter. RPO formation was inefficient in the case of short dsLC2; in fact, we could identify only 5 real-time promoter-binding events indicating RPO complex formation (3% of all promoter-binding events, N = 167); even after prolonged incubation of the RNAP–promoter complexes (∼5 min) on the surface, the RPO complex (i.e., the FRET species with E* ∼ 0.45) was nearly absent from the population histogram (Figure S4(A)). In contrast, the RPO complex formed efficiently on the long promoters, as well as on the short pre-melted promoter, as seen in the E* histograms (Figure S4(B–D)) and individual trajectories (Figure 2(B)). We then performed Hidden Markov modelling (HMM) of the trajectories to extract the dwell times in the RPC state (E* ∼ 0.2) before transcription bubble opening and RPO complex formation (Figure 2(B)). The observed distribution of dwell times in the RPC state for the long dsLC2 promoter (Figure 2(C)) was fitted to a mono-exponential decay function to determine a mean lifetime for the RPC complex of 1.43 ± 0.09 s (±SE; amplitude parameter was 35.2 ± 1.5). We also tried to fit the dwell-time distribution using a bi-exponential equation, but rejected this more complex kinetic model because the fit parameters were poorly defined as evident from large SE (27% for lifetimes and 13–45% for amplitudes). Using a similar analysis, we estimated the RPC complex lifetime as 0.35 ± 0.04 s on the long pre-melted LC2 (Figure 2(D)) and 0.39 ± 0.03 s on the short pre-melted LC2 (Figure 2(E)), respectively. These values indicate that the αCTD interactions with the upstream sequence (−89 to −40) significantly enhance the isomerisation rate of the RPC to RPO complex; however, this happens only on a fully double-stranded promoter. Because the introduction of the pre-melted region (−10/−4) to the promoter nearly equalised the rate of RPC isomerisation to the RPO complex on the short and long promoters, the αCTD–promoter interactions appear to predominantly contribute to the lowering of the activation energy of initial transcription bubble nucleation.

A subpopulation of RPO complexes form via a kinetically significant intermediate

Close inspection of the HMM fit to the RPO formation FRET trajectories revealed that, even though most bubble-opening events were described by a two-state model, (i.e., the promoter conformation in the initial complex changed to the RPO state in a single step; Figure 2(B)), a subpopulation included an intermediate state (hereafter, RPi complex) between the initial RPC and final RPO states (Figure 3(A)).

Figure 3

Intermediate on the RP (A) Example trajectories demonstrate the presence of an intermediate, RPi complex, on the pathway from RPC to RPO complex. The trajectory was fit to 3-state HMM. (B) E* histograms for the RPC, RPi and RPO states were extracted from the HMM segmented trajectories. The E* distributions were fit Eq. (2). Data from different promoter versions was pooled. (C) E* histogram of the RNAP–promoter complexes formed in the presence of 100 µM Myx inhibitor. The complexes were imaged ∼5 min after their initial formation on the coverslip surface. E* distribution was fit to Eq. (2). Specifically, the RPi complex was identified in 20% (exact 95% binomial confidence interval: 11–30%), in 14% (8–23%) and in 14% (8–23%) of all trajectories in the case of long dsLC2, long pmLC2 and short pmLC2 promoter, respectively. The arithmetic mean lifetime of the RPi state was 0.32 s (95% CI by bootstrapping (10,000 iterations): 0.18–0.49 s, N = 15), 0.15 s (0.07–0.26 s, N = 13) and 0.17 s (0.10–0.23 s, N = 13) on the long dsLC2, long pmLC2 and short pmLC2 promoter, respectively. Similarity of the estimates suggests that the RPi lifetime does not strongly depend on the used promoter type. The sporadic appearance of the RPi in the trajectories could indicate either heterogeneous reaction step (bubble opening with or without RPi intermediate), failure to detect most of the RPi states (if RPi was an obligatory intermediate) or false positive HMM state assignment (if RPi does not really exist). To evaluate these scenarios, we used simulated smFRET trajectories and estimated the efficiency of RPi detection by the HMM routine. To obtain a conservative estimate, the trajectories were simulated using a short mean RPC lifetime (0.16 s) and FRET signal noise that was 15–50% higher than in experimental data (Figure S5(A–F), example trajectories; Figure S5(G–L), trajectory noise). As expected, the detection efficiency increased with the length of the RPi dwell: 1, 2–4 and ≥5 frame dwells (1 frame = 20 ms) were detected with ∼15%, ∼60% and ∼80–90% efficiency, respectively (Figure S5(M)). The false positive rate of RPi on trajectories simulated without this state was only 1% (4 events in 360 trajectories) and does not thus affect conclusions. To obtain a rough estimate of how much the detection efficiency contributes to the estimated RPi lifetime, we pooled and binned all RPi dwells on different promoters and corrected the bins for the missed RPi dwells. The fit of corrected and uncorrected dwell time histograms to mono-exponential function estimated the RPi lifetime as 0.15 ± 0.02 s (±SE) and 0.22 ± 0.4 s, respectively (Figure S5(N)). Taking into account the dwell length distribution and detection efficiency, we would expect to identify the RPi state in ∼60% of trajectories if the RPC → RPO transition would always proceed via this intermediate. The fact that this number is significantly larger than the measured 14–20% occurrence of RPi raises the intriguing possibility that the RPO formation pathway on lacCONS promoter is branched, i.e., one path is a direct RPC → RPO transition while the other path involves the RPi as an intermediate between these states. To compare the mean FRET of the RPi intermediate to that of the RPC and RPO complexes, we extracted FRET efficiency histograms from HMM-segmented trajectories for each of the three states (Figure 3(B)). The mean FRET values, obtained as the centres of the fit Gaussian distribution (Eq. (2)), were found as 0.196 ± 0.001 (±SE), 0.318 ± 0.002 and 0.448 ± 0.001 for the RPC, RPi and RPO complex, respectively. The mean FRET values suggest that the average distance between the −15 and +15 labels in the RPi state has become shorter than in the RPC complex but remains still longer than in the mature RPO complex. To probe the structural origin of the RPi state, we included in the reaction buffer the RNAP inhibitor myxopyronin B (Myx). Biochemical and structural studies using Myx and structural studies using corallopyronin A (Cor), a Myx analogue, have suggested that this class of RNAP inhibitors block the formation of RPO complex by preventing the loading of template DNA strand into the active site cleft. We observed that the FRET distribution of the RNAP–promoter complexes preformed in the presence of Myx was described by two Gaussians with mean FRET values 0.231 ± 0.001 and 0.307 ± 0.010 (Figure 3(C), long dsLC2 promoter). The inspection of individual trajectories revealed three classes of molecules: the first and most abundant class involved RNAP–promoter–Myx complexes characterised by an E* ∼ 0.3 state (N = 58, 60% of all molecules, Figure S6(A)). Interestingly, a sub-fraction (N = 19/58) of these molecules stochastically sampled a very short-lived, i.e., typically 20–40 ms (1–2 frames), higher E* state (Figure S6(B)). The second class of molecules involved potential non-productive complexes as indicated by a stable E* ∼ 0.2 value (N = 37, 38%, Figure S6(C)) and the third class involved RPO complexes characterized by a long-lived E* ∼ 0.45 state (N = 2, 2%, Figure S6(D)). Preformed complexes on the long pre-melted LC2 promoter confirmed the bimodal FRET distribution with two sub-populations having mean E* values of 0.224 ± 0.002 and 0.290 ± 0.045 (Figure S6(E)). Consistent with the above equilibrium FRET values, 32% (N = 21) of real-time promoter binding trajectories in the presence of Myx inhibitor demonstrated the formation of initial RPC complex (E* ∼ 0.2) and its subsequent isomerisation to E* ∼ 0.3 state (Figure S6(F)) while the remaining 68% (N = 45) of the nascent RNAP–promoter complexes maintained the E* ∼ 0.2 state for the entire duration of the trajectory (Figure S6(G)). The increasing trend in observed FRET values as the RNAP–promoter complexes react towards RPO is consistent with structural modelling data; the distance between the −15 and +15 labels decreases from 98 Å in the RPC complex, to 87 Å in the Cor-stabilised RNAP-promoter intermediate, and further to 66 Å in the RPO complex (Figure S6(H)).

Transcription bubble opening leads to static and dynamic RPO complexes

We next analysed the transcription bubble behaviour immediately after initial RPO complex formation; our observation span for these measurements was 1.3–22 s (median = 4 s, N = 119). HMM analysis of the trajectories revealed two RPO complex sub-populations: a “static” (or “stable”) sub-population (73% of the nascent RPO complexes; exact 95% binomial CI: 64–81%), where the transcription bubble remained open for the entire duration of the trajectory (reflected by an E* ∼ 0.45 state; Figure 4(A)); and a dynamic sub-population (27%, CI: 19–36%), where the transcription bubble fluctuates rapidly between the open state (E* ∼ 0.45) and state(s) characterised by lower FRET (Figure 4(B)). The dynamic RPO complexes do not appear to convert to a static RPO within our observation span, suggesting that the dynamic RPO–like complex is not an on-pathway intermediate which eventually isomerises to form the stable RPO complex. This conclusion, which invokes the formation of static and dynamic RPO complexes on parallel pathways, is also supported by the presence of a similar distribution of static and dynamic RPO complexes in samples of preformed, heparin-challenged RNAP–promoter complexes (see next section of the manuscript and Ref. 33). Notably, both the 2-state (i.e., showing no RPi intermediate; Figure 2(B)) and 3-state (Figure 3(A)) bubble-opening mechanisms produced both static and dynamic RPO complexes (Figure 4(C)), suggesting that the presence of the RPi intermediate does not dictate the subsequent stability of the bubble.

Figure 4

Parallel formation of static and dynamic RP (A) Example trajectories demonstrating the formation of static RPO. (B) Example trajectories demonstrating the formation of dynamic RPO. The static population represents 73% (N = 107) of the nascent RPO complexes, whereas the dynamic population represents 27% of the complexes (N = 40). (C) The abundance of static and dynamic RPO’s is shown separately following the initial bubble opening either via 2-state (grey bars) or 3-state (pink) mechanisms. Exact 95% binomial confidence intervals are shown. (D) E* histogram of dynamic RPO’s. The complexes were imaged ∼5 min after their initial formation on the coverslip surface (N = 211 molecules). The E* distributions were fit using Eq. (2). To evaluate the bubble conformations accessed by the dynamic RPO complexes, we analysed the E* distribution of these complexes, which was fit well by two Gaussian functions with mean E* values of 0.265 ± 0.004 and 0.467 ± 0.001 (Figure 4(D)). In contrast, the E* histogram of the static RPO showed only a single distribution centred at E* of 0.448 ± 0.001 (Figure 3(B)). These values suggest that the dynamic RPO complexes do not sample either the RPC (E* ∼ 0.20) or the on-pathway intermediate RPi (E* ∼ 0.32) states. Instead, it is more likely that the dynamic RPO complexes sample an off-pathway state, which is characterised by E* ∼ 0.27 and which we coin as RPISO.

The role of RNAP–promoter upstream interactions in the RPO pathway selection

We next evaluated how RNAP αCTD–promoter upstream sequence interactions contribute to the relative formation of stable and dynamic RPO complexes and the kinetic parameters of the transcription bubble dynamics (Figure S7(A)). To this end, we prepared RPO complexes at 37 °C using either a short or a long dsLC2, challenged them with heparin, i.e., a DNA competitor, and immobilised them on the coverslip surface for smFRET analysis. In this protocol, RPO complexes also form on the short dsLC2 promoter fragments, allowing direct comparison to the long dsLC2 promoter. HMM-based classification of the trajectories indicated that the dynamic RPO complexes were 1.7-fold more prevalent (25 ± 3.6% vs 15 ± 2.7% of all complexes; mean and SD of three independent experiments) on the short promoter compared to the long promoter (Figure S7(B)). A two-sample T-test (p = 0.035) confirmed that the observed difference in the relative number of dynamic complexes on the two promoters is statistically significant. Further, the observation span for the measurements was 2.0–25 s (median 8.0 s, N = 348) on the long promoter and 1.4–25 s (median 6.9 s, N = 435) on the short promoter, suggesting that the higher prevalence of dynamic RPO complexes on the short promoter is not explained simply by longer trajectories that are expected to accumulate more state transitions. As an additional control, we estimated the identification accuracy of dynamic/static RPO complexes in simulated trajectories, which had state lifetimes and FRET noise levels similar to the experimental data. HMM analysis results indicate that the detection efficiency of dynamic RPO complexes is ∼95% when the trajectory length is ≥4 s (a length observed for 84% of experimental trajectories) and remains at 84% when the trajectory length is ≥3 s (a length observed for 92% of experimental trajectories) (Figure S7(C)). These numbers indicate that the trajectory length variation in the experiment does not significantly bias the static/dynamic RPO estimations. Noteworthy, the analysis of simulated trajectories did not produce any false positive dynamic RPO’s. Kinetic analysis indicated that the lifetimes the RPISO (85–90 ms, Figure S7(D, F)) and RPO (560–660 ms, Figure S7(E, G)) states were similar on both promoters (Figure S7(H)). Collectively, our results suggest that the αCTD–promoter interactions steer the RPO pathway selection towards the static RPO complex; however, the effect is moderate, and significant number of dynamic RPO complexes form also on the long promoter. The similarity in the timescales of transcription bubble dynamics on the short and long promoters indicates that the bubble isomerisation rates are independent of the status of the αCTD–promoter upstream sequence interactions.

The role of RNAP cleft loops in the RPO stabilisation

To interrogate protein structural elements contributing to the RPO stability, we deleted the β gate loop (ΔGL), β‘ rudder loop (ΔRL) or β‘ lid loop (ΔLL) from the RNAP and determined the effects of these deletions on the transcription bubble dynamics using preformed RPO complexes. Structurally, GL mediates the RNAP β-pincer interaction with the RNAP β‘-clamp, thus forming a barrier for the DNA entry and exit from the RNAP cleft (Figure 5(A))9, 10; RL locates between the tDNA and ntDNA strands in the RPO; and LL locates adjacent to the RL and is able to interact with the tDNA around base −6 in the RPO.

Figure 5

Effect of RNAP cleft loop deletions on the reaction pathway branching and transcription bubble kinetics. (A) The position of deleted lid loop (LL), rudder loop (RL) and gate loop (GL) are shown in the cryo-EM model of E. coli RPO (PDB: 6psw, (10)). α, β and ω RNAP subunits and TraR transcription factor are omitted for clarity. ntDNA and tDNA are shown in dark and light grey, respectively. Blue sphere is the active site Mg2+ ion. (B) The RPO complexes were classified as static or dynamic based on the 2-state HMM fit of the FRET trajectories. (C) The relative amounts of static and dynamic RPO. Error bar: exact 95% binomial CI. WT, N = 212 molecules; ΔRL, N = 206; ΔGL, N = 136; ΔLL, N = 115. (D) The E* histogram of ΔRL RNAP–promoter complexes on the coverslip surface before (grey, N = 126 molecules) and 2–5 min after the addition of 1 mM NTPs (orange, N = 127). (E) Dwell time distributions of the RPO state within the dynamic RPO population were fit to mono-exponential equation. (F) The lifetime of RPO state was obtained from the dwell distributions in panel E. Error bars are 1 SE extracted from the fit. (G) Dwell time distributions of the RPISO state within the dynamic RPO population were fit to mono-exponential decay equation. (H) The lifetime of RPISO state was obtained from the dwell distributions in panel G. Error bars are 1 SE extracted from the fit. We used 2-state HMM of FRET trajectories to classify RPO complexes into static (i.e., no bubble dynamics) and dynamic (Figure 5(B)) and found that all deletions shifted the balance of RPO formation towards the direction of dynamic RPO complexes. The effects of ΔGL and ΔLL were moderate, as they increased the fraction of dynamic RPO complexes from the wild-type (WT) RNAP level by 1.3–1.4-fold, i.e., from 26% (exact 95% binomial CI: 22–31%) in WT to 34% (CI: 29–40%) in ΔLL and 36% (CI: 30–43%) in ΔGL (Figure 5(C)). However, the ΔRL effect was much stronger, 2.9–fold, making most RPO complexes, i.e., 77% (CI: 72–83%), dynamic. The addition of NTPs to ΔRL-RNAP–promoter complexes depopulated the RPO state as indicated by the substantial decrease in E* ∼ 0.45 state probability and corollary increase in ∼0.2 E* state (Figure 5(D)). Because the NTP-response was similar in the case of WT-RNAP (Figure S2(C)), which in contrast to ΔRL-RNAP forms mostly static RPO’s, both static and dynamic RPO’s appear capable to initiate RNA synthesis. Kinetic analysis of the dynamic RPO’s indicated that ΔRL and ΔLL shortened the lifetime of the open bubble state by 2.8–fold, whereas ΔGL had no effect (Figure 5(E, F)). In contrast, all three mutations increased the lifetime of the RPiso state. Specifically, the RPiso lifetime increased by 2.5-, 1.8- and 1.5-fold in the case of ΔLL, ΔRL and ΔGL mutant RNAPs, respectively (Figure 5(G, H)). Notably, the median observation span for these measurements was 4.26 s, 4.84 s, 5.20 s and 6.00 s in the case of WT, ΔLL, ΔGL and ΔRL, respectively. The variation in the observation span, however, does not significantly contribute to the classification of the molecules to the static and dynamic RPO classes or kinetic parameters because, even in the combination of shortest trajectories (median 4.26 s) and the most stable state, i.e., wild-type RPO complex (lifetime 0.5 s), the probability that RPO → RPISO transition does not take place within the trajectory is extremely small (0.0002). Collectively, our data indicate that the GL, RL and LL domains in the RNAP contribute to the RPO pathway branching (by changing the balance between static and dynamic RPO complexes) and affect the rates of the transcription bubble conformation changes.

Discussion

In this work, we establish the ability to look at the earliest stages of transcription initiation in real-time and at the single-molecule level. This unique capability bypasses the need to synchronise complexes and offers unprecedented access to co-existing reaction pathways and transient intermediates; as a result, we gained new mechanistic insight about the paths and intermediates used by RNA polymerase to form RPO complexes on a lac promoter derivative. Our work also provides further insight on the dynamics and heterogeneity of open complexes and their determinants. Open complex formation may proceed via more than one path. Our data indicate that the RPC → RPO isomerisation step involves mechanistic branching. In most cases (∼60% of molecules), the isomerisation occurs in one step without observable intermediate(s) within our 20-ms temporal resolution, suggesting that bubble initiation was followed by rapid bubble progression, downstream DNA loading to the RNAP cleft, and template DNA loading to the active site cleft. However, the remaining molecular trajectories (∼40%) have an intermediate state (RP) between the RPC and RPO. We employed simulated trajectories to exclude the possibility that all experimental trajectories without an identified RPi are simply due to the failure to detect this state. Our data support the presence of parallel paths for RPO formation, or at least, the presence of RNAP populations with different propensity to form an open complex. Possible structure of RP. We first considered that possibility that the intermediate may have structural similarities to an RNAP-promoter complex stabilised by an antibiotic targeting RNAP. Recent cryo-EM data of Mycobacterium tuberculosis RNAP showed that corallopyronin A (a Myx analogue) stabilises a partially melted transcription bubble (region −11/−4). The same cryo-EM work showed that a similar conformation was present in the absence of Myx, raising the possibility that the observed conformation may represent an intermediate on the RPO pathway. However, our real-time trajectories using E. coli RNAP show an intermediate (RPi) with a structural signature with significantly different FRET efficiency (E* ∼ 0.3) than that expected by the partially melted intermediate (Ei2 ∼ 0.2). Further, the presence of Myx does not prevent full opening of the promoter DNA, as sensed by fluorescence enhancement of a Cy3 probe introduced at position +2, both at the ensemble and at the single-molecule level; we note that such enhancement is expected only when the bubble opening has been complete. Instead, the possibility we favour for the structure of the RPi is an intermediate further down the promoter opening pathway (hence, more consistent with the E* value of ∼0.3 we observe for RPi), where all the melting has been completed, but the template DNA has not been fully loaded to the active site cleft; such a structure is supported by results showing that Myx does not prevent full opening of DNA on two prototypical promoters (λPR and lacCONS). Regardless, at least for the lac promoter derivative used in this work, the intermediate is kinetically significant only in a subset of RPO formation events. Possible sources for heterogeneity in open complex formation pathways. Since the RP is detectable only in a subset of trajectories, it is conceivable that, for those trajectories, a structural module of the RNAP delays downstream tDNA loading to the active site cleft. One candidate for such a structural module is the RNAP β′ switch-2 segment. Based on mutational analysis, structural studies, and observed Myx effects, it has been established that the switch-2 region can adopt two different conformations. The conformation dominant in the absence of Myx is compatible with template loading to the active site; in contrast, the alternative conformation (which is stabilised by Myx and specific mutations in the switch-2) blocks template loading to the active site. If the switch-2 was at the moment of DNA bubble initiation in the blocking conformation in a fraction of RNAPs, the loading of template DNA to the active site cleft would be delayed by the necessary switch-2 refolding. The RPi heterogeneity may also result from alternative promoter discriminator (region −6/+1) conformations in different RNAP molecules. This hypothesis is supported by the previous finding that G−6G−5G−4 and C−6C–5C/T−4 motifs in the ntDNA stabilised in crystallo two distinct discriminator conformations and imposed in solution one base-pair difference in the predominant transcription start site. GTG motif, which is found in our promoter, had transcription start site statistics halfway between the GGG and CCC/T motifs, consistent with the assumption that a promoter with this motif can readily adopt either of the two discriminator conformations. RP A longstanding question in the transcription field is whether all RPO on a given promoter are the same or differ in their structural and functional properties.22, 23, 24, 25 Our results show that indeed there is another layer of heterogeneity as indicated by the differing stability of the transcription bubble, even immediately after RPO formation (as judged by the appearance of the E* ∼ 0.45 state). Most RPO complexes can keep the bubble open for at least several seconds; however, a more dynamic RPO subpopulation samples different bubble states in the millisecond timescale. The stable or dynamic RPO mode was set before or during the first-time opening of the bubble and the modes did not interconvert within our observation span (∼8 seconds); this observation rules out the possibility that the dynamic RPO were indeed intermediates on the linear pathway leading to the stable RPO complexes. This new insight aligns well with our previous observation of stable and dynamic RPO molecules within the population of pre-formed RPO complexes, while allowing further mechanistic insight by linking the formation of stable and dynamic complexes on the existence of a branched RPO pathway and the sampling of a short-lived off-pathway state (RPISO) by the dynamic RPO’s. The analysis of mutant RNAPs suggest that the main difference between the dynamic and stable RPO complexes arises from the RNAP interaction with the single-stranded template DNA in the active site cleft. The deletion of rudder loop, which presses against the template DNA positions −7 to −9, tripled the amount of dynamic RPO (from 26% to 77%; Figure 5(C)) and decreased 3-fold the open bubble lifetime in the dynamic RPO complexes (Figure 5(F)). The deletion of lid loop, which interacts with the template DNA base −6, had a similar effect on the open bubble lifetime. We previously found that deletion of the σ70 3.2 region (the σ “finger”, which interacts with the template DNA strand from bases −3 to −6), destabilised the RPO. These interactions with the template DNA form late in the RPO mechanism, i.e., when the bubble forms fully and the template DNA strand loads to the active site cleft. Our data suggest that these final interactions form by two alternative ways generating “tight” and “loose” template DNA binding modes: the tight binding mode gives rise to the stable RPO complexes, and the loose binding mode gives rise to the dynamic RPO complexes. It will be interesting to investigate in future single-molecule studies whether such significant differences in RPO stability have functional consequences, and whether they are related to reports of non-uniform RPO function. Specifically, a subset of RPO complexes (on many promoters) appear to be locked in an abortive initiation mode, in which they repetitively synthesise short RNA products (<12-mer, with the exact sequence depending on the specific promoter), whereas another RPO subset escape the promoter efficiently and synthesise full-length RNA products.22, 23, 24, 25 The failure of promoter escape may also result from long-lived backtracking and arrest of initially transcribing RNAPs.28, 29 The RPO’s apparently locked in the abortive mode are also referred as “moribund” complexes, and they apparently have a role in transcription regulation in the cell. Mechanistically, the dynamic RPO could be candidates to form such moribund complexes, since unstable template DNA binding to the active site is likely to enhance the dissociation probability of short RNAs, leading to abortive initiation. Consistent with this reasoning, the Δ3.2 σ70 mutant (which increase RPO dynamics substantially) released 4–7-mer RNAs more efficiently compared to the WT. However, it has also recently been suggested that the intermediate RPi3 and not the stable RPO is the productive initiation complex on the λPR promoter.39, 40 Our observation of NTP-dependent depopulation of RPO’s formed either by wild-type RNAP (mostly static RPO’s) or ΔRL RNAP (mostly dynamic RPO’s) indicates that both forms of RPO‘s can at least initiate RNA synthesis (Figure S2(C), Figure 5(D)). Role of the promoter upstream interactions on the RP. The RNAP αCTDs interact with the promoter upstream sequences either by specifically recognising the promoter UP element or via sequence-independent interactions14, 19; both interactions are important for RPO formation. We found that the upstream part of the lacCONS promoter (from −40 to −89; Figure S2(A, B)), which does not contain a full UP element but is partially similar to the distal UP subsite, facilitates transcription bubble melting in the context of fully double-stranded promoters. In fact, the short double-stranded promoter (which lacks αCTDs interactions) failed to form RPO under our experimental conditions, which involve measurements at room temperature. On the other hand, if the requirement for the DNA melting nucleation was bypassed (e.g., by using a pre-melted promoter), the αCTD interactions with upstream sequence no longer increased the rate of RPO formation (Figure 2(D, E)). This finding is consistent with previous biochemical studies showing that the αCTD interactions with the UP element enhance both the initial promoter binding and subsequent isomerisation to competitor-resistant conformation.14, 19 We also found that the presence of upstream sequence interactions did not substantially change the ratio of initial bubble opening events that occur in one step or in two steps (i.e., via the RPi), and did not significantly change the rates of transcription bubble dynamics in the pre-formed RPO. However, the dynamic RPO complexes formed more often on the short promoter in comparison to the long promoter (25% vs. 16%), suggesting that the αCTD–promoter interactions, instead of being fully decoupled from mechanistic steps occurring after the bubble nucleation, have a role in the late steps of RPO pathway branching; the exact mechanism of such modulation is unclear, but it may involve the bending of the upstream sequence on the RNAP surface, as observed by Record and collegues, and subsequent interactions that affect RNAP conformation dynamics in a way that it influences bubble dynamics. Our promoter sequence has only a partial similarity to the distal UP element subsite, predicting a non-ideal (if any) sequence-specific interaction with αCTDs. It has been shown that promoters with a full UP element drive stronger transcription activity than promoters with partial UP element or non-UP element sequence. To generalise our findings, further studies on many different promoters are needed to establish the UP element and general sequence-dependence of the promoter upstream control over the formation of static vs dynamic RPO’s. A working model for the RP We summarise our key findings in the context of the RPO formation mechanism on the lacCONS promoter in Figure 6. The process begins with the RPC complex formation as the RNAP holoenzyme binds to the promoter and establishes interactions with the −35 element, −10 element and upstream sequences. Interaction of αCTDs with upstream sequences stimulate RPO formation by bending the upstream DNA around the RNAP12, 13, 14, 15, 16, 17 and coupling it energetically with bubble formation.

Figure 6

A working model for the RP The reaction pathway from the promoter binding to the RPO complex has heterogeneity in two separate steps. The first step is hypothesised to depend on a mobile RNAP element, which can be either in an active or inactive conformation (green and red flaps, respectively). The inactive conformation blocks the loading of the tDNA strand into the active site cleft, resulting the formation of intermediate RPi. The isomerisation of the mobile element to the active conformation clears the block and allows progress from RPi to RPO. The second branching is related to the stability of the RNAP–template DNA interaction in the active site cleft. In ∼15% of the RPO complexes, the interaction is weak, allowing continuous dynamic movement of the template DNA and thus the downstream DNA. Because stable and dynamics RPO complexes formed both from RPC1 and RPi, we assume that these pathways merge before the next branching step leading to the stable and dynamics RPO’s. Green vs red pins depict tight and loose interactions between the tDNA and the RNAP, respectively. The numeration (1, 2 and 3) indicates the key steps in the mechanism that may define the rate and efficiency of RPO complex formation. The initial nucleated bubble expands via two different mechanistic paths: in the first path (most common for our lac promoter derivative), the RNAP melts the entire bubble and loads the template DNA strand to the active site cleft in one apparent step without detectable intermediates; the second path, however, involves a short-lived intermediate, RPi, which features incomplete template loading to the active site cleft. We hypothesise that the intermediate appears when a mobile element of the RNAP, e.g., the switch-2 module, is initially in a conformation incompatible with template loading to the active site cleft. Template DNA loading to the active site cleft leads to the tight-binding and loose-binding states, which do not readily interconvert. Because stable and dynamics RPO complexes formed with similar probability both directly from RPC and via RPi, we assume that these pathways merge before the branching to the stable and dynamics RPO’s takes place at the template DNA loading step (Figure 6). The tight template DNA binding mode keeps transcription bubble open whereas the loose-binding features dynamic movement of the template DNA. Template DNA interactions with the RNAP rudder loop and σ finger are part of the key determinants of tight binding mode. Ongoing studies in our group aim to decipher the promoter-sequence dependence of the RPO formation mechanism and functional significance of the RPO heterogeneity.

Materials and methods

Promoter preparation

Labelled and unlabelled DNA oligos to make lacCONS promoter, also known as lacCONS+2, constructs were purchased from IBA Lifesciences (Germany) (Figure S1(A–C)). Short lacCONS promoters (−39/+25) were reconstituted by annealing PAGE purified labelled template and non-template DNA oligos at 1 µM in annealing buffer [10 mM Tris-HCl (pH 8.0), 50 mM NaCl, 0.1 mM EDTA]. The annealing program consisted of initial denaturation (93 °C, 3 min) followed by step-wise cooling to 4 °C (each step: 1.2 °C, 30 s). DNA strands for the long lacCONS promoters (−89/+25) were constructed using a previously described protocol.

Protein preparation

Escherichia coli core RNAPs were expressed in E. coli and purified as previously described. The wild-type RNAP was expressed from plasmid pVS10 (T7p-α-β-β′_His6-T7p-ω). ΔRudder loop RNAP (T7p-α-β-β′[ΔN309-K325]_TEV_His10-T7p-ω), Δlid loop RNAP (T7p-α-β-β′[ΔP251-S263 → GG]_TEV_His10-T7p-ω) and Δgate loop RNAP (T7p-α-His6_β[ΔR368-P376 → GG]-β′-ω) were expressed from pMT041, pHM001 and pTG011, respectively. Wild-type E. coli σ70 was purified as previously described. Holoenzymes were assembled by incubating 0.5 µM RNAP with 1.5 µM σ70 for 15 min at 30 °C in RNAP storage buffer [20 mM Tris-HCl (pH 8.0), 150 mM NaCl, 50% (vol/vol) glycerol, 0.1 mM EDTA, 0.1 mM dithiothreitol (DTT)].

Microscope coverslip preparation

Borosilicate glass coverslips (1.5 MenzelGläzer, Germany) were heated to 500 °C in oven for 1 h to reduce background fluorescence. The coverslips were then rinsed with HPLC-grade acetone and immerged into 1% (v/v) Vectabond (product code #SP-1800, Vector Labs, CA, USA) in acetone for 10 min to functionalise the glass surface with amino groups. Coverslips were then rinsed with acetone followed by deionized water before drying them under a stream of nitrogen gas. A silicone gasket (103280, Grace Bio-Labs, OR, USA) with four reaction wells was placed in the middle of the coverslip. The coverslip surface was then simultaneously passivated by pegylation against unspecific protein/DNA binding and biotinylated to provide attachment points for specific protein immobilisation. Each well on the coverslip was thus filled with 20 µl of 180 mg/ml methoxy-PEG (5 kDa)-SVA (Laysan Bio, AL, USA) and 4.4 mg/ml biotin-PEG (5 kDa)-SC (Laysan Bio, AL, USA) in 50 mM MOPS-KOH buffer (pH 7.5), incubated for ∼ 3 h at room temperature and finally the wells were thoroughly rinsed with phosphate-buffered saline (PBS; Sigma Aldrich, UK). The coverslips remained functional for at least two weeks when stored at 4 °C in plastic pipette tip box containing a layer of deionised water at the bottom. During the storage the coverslip wells were filled with PBS.

Single-molecule experiments

On the day of microscopy experiment, the pegylated coverslips were incubated for ∼10 min with 0.5 mg/ml of Neutravidin (31050, ThermoFisher Scientific, UK) in 0.5 × PBS and subsequently rinsed with 1 × PBS. The coverslips were then incubated for ∼10 min with 4 µg/ml of Penta·His biotin conjugate antibody (34440, Qiagen, UK) in reaction buffer [40 mM HEPES buffer (pH 7.3, BP299100, Fisher Scientific, UK), 100 mM potassium glutamate, 10 mM MgCl2, 1 mM DTT, 1 mM cysteamine hydrochloride, 5% glycerol (vol/vol), 0.2 mg/ml bovine serum albumin] and subsequently rinsed with the reaction buffer. To analyse the RPO complex formation in real-time at 22 °C the anti-His-tag-antibody coated coverslip was incubated ∼10 min with 1 nM label-free holoenzyme in the reaction buffer, rinsed thoroughly with the reaction buffer and mounted on the microscope. 25 µl of imaging buffer [i.e., reaction buffer supplemented with 2 mM UV-treated Trolox, 1% (w/v) glucose, 0.4 µg/ml catalase (10106810001, Roche Diagnostics, Germany), 1 µg/ml glucose oxidase (G2133, Sigma Aldrich, UK)] was replaced to the imaged well. Data recorder was started to take an 80 s movie. 1 µl of 4 nM promoter in the reaction buffer was gently pipetted to the well at ∼8 s time-point. The formation of RNAP–promoter complexes was evident by the appearance of bright co-localised spots on the Cy3B and ATTO647N fluorescence channels. In some experiments these surface-formed RNAP–promoter complexes were further imaged after exchanging fresh imaging buffer to the well and finding non-bleached field-of-view. The age of RNAP–promoter complexes at the moment of recording these 20 s post-binding movies was ∼3–7 min. In some control experiments, we monitored the initial RNA synthesis activity of RNAP by including 1 mM NTPs (ATP, GTP, CTP and UTP) in the imaging buffer. To analyse transcription bubble dynamics in pre-assembled RPO complexes, 2 nM holoenzyme was incubated with 5 nM promoter in reaction buffer for 15 min at 37 °C. 100 µg/ml sodium heparin (H4784, Sigma, UK) was added to disrupt non-specific RNAP–promoter complexes and ∼1.3 µl of the mixture was transferred to anti-His-tag-antibody coated coverslip well containing 25 µl reaction buffer. The RPO complex immobilisation at 22 °C was let to continue until ∼50 molecules were detected on the field-of-view. The well was then rinsed with reaction buffer and finally filled with 25 µl imaging buffer. Data was recorded as 20 s movies from about ten field-of-view per well at 22 °C. Single RNAP–promoter complexes were imaged using objective-based single-molecule TIRF microscope previously described. The donor (Cy3B) and acceptor (ATTO647N) fluorophores in the promoter were excited using 532 nm and the 642 nm lasers in alternating laser excitation (ALEX) mode, respectively. The emission of donor and acceptor fluorophores was separated from each other and from the excitation light, using dichroic mirrors and optical filters, and recorded side-by-side on an electron multiplying charge-coupled device camera (iXon 897, Andor Technologies, Northern Ireland). The frame time of the recordings was 20 ms with 10 ms ALEX excitation by each laser. The measured laser power before the dichroic mirror was set to ∼4 mW and ∼1 mW for the 532 nm and 642 nm laser, respectively.

Single-molecule data analysis

To extract the intensities of co-localised donor and acceptor fluorophores, the recorded movies were processed with custom-made TwoTone TIRF-FRET analysis software (46; see also https://groups.physics.ox.ac.uk/genemachines/group/Main.Software.html). If the processed movies had fluorescent complexes on the surface already at the beginning of the movie, i.e., post-binding and pre-formed RPO complex movies (see above), the following Twotone-ALEX parameters were applied to select only complexes containing a single ATTO647N acceptor dye and a single Cy3B donor dye: channel filter as DexDem&&AexAem (colocalisation of the donor dye signal upon donor laser excitation, the acceptor dye signal upon acceptor laser excitation), a width limit between the donor and the acceptor between 1 and 2 pixel, a nearest-neighbour limit of 6 pixels, and signal averaging from the frames 2–40. In the case of real-time RPO complex formation movies, the nearest-neighbour limit was turned off and the time-window for the search of the surface-bound fluorescent molecules was set with the signal averaging setting (i.e. typically frames ∼1000–3000) to the part of the movie in which most promoter binding events took place. The trajectories selected by the TwoTone-ALEX analysis were manually sorted by eliminating all traces that displayed extensive fluorophore blinking, multi-step photobleaching indicating more than one donor or acceptor dye in the same diffraction limited intensity spot, or other aberrant behaviour. The apparent FRET efficiency (E*) was calculated using Eq. (1) where IDD and IDA are the emission intensities of the donor (Cy3B) and acceptor (ATTO647N) dyes upon donor excitation (532 nm), respectively. The trajectories were analysed using a modified version of the hidden Markov model ebFRET software.49, 29 The trajectories from the pre-formed RPO or post-binding movies were fit to 2-state HMM model followed by noise filtering by requiring an accepted dwell time to satisfy the criteria that the step (i.e., change in E*) is separated from the subsequent step by more than 3-fold the Allan deviation.50, 29 The trajectories were then classified into dynamic or static populations depending whether they displayed >2 or ≤2 accepted E* transitions, respectively. The dwell times were extracted from the dynamic trajectories to compile a dwell time distribution. The dwells with undefined length, i.e. the first and last dwell, were discarded at this point. The trajectories from the real-time RPO formation movies were analysed separately for the first transcription bubble opening event, i.e., the RPC → RPO transition, and transcription bubble dynamics after the RPO formation. The latter analysis was identical to the case of pre-formed RPO complexes with the exception that the RPC → RPO transition at the beginning of the trajectory was trimmed away before HMM. In contrast, the analysis of the RPC → RPO transition in the trajectories was performed after trimming away possible bubble dynamics subsequent to the first transcription bubble opening event. We fit the first bubble opening trajectories using 2-state HMM, i.e., RPC → RPO mechanism, and 3-state HMM, i.e., RPC → RPi → RPO mechanism. The initial fits were filtered by requiring true state transitions to be at least 2-fold the Allan deviation.50, 29 Selection of the more complex 3-state model for the trajectory also required that both the HMM lower bound value and Aikake information criteria, calculated as previously described, favoured this model. The dwell times in the RPC and RPi states were compiled to separate dwell time distributions. The lifetime of the RPC state was determined by fitting the dwell time distribution to the mono-exponential decay function using Origin software (OriginLab Corporation, MA, USA). We validated the above analysis procedure for its accuracy to detect the RPi state and dynamics RPO’s. To this end, we used DeepFRET software to simulate FRET trajectories for each state RPC, RPi and RPO using FRET efficiency and FRET noise levels extracted from the experimental trajectories. Specifically, the FRET efficiency was 0.196 (noise 0.05), 0.318 (0.06) and 0.448 (0.05) for the simulated RPC, RPi and RPO state, respectively. The noise of the corresponding experimental FRET data was 20–28% smaller, i.e., 0.036, 0.047 and 0.040 for the RPC, RPi and RPO state, respectively. The complete trajectories for a RPC → RPi → RPO mechanism, were then assembled from the state-specific simulation by using a custom Python script and mono-exponential distribution of state dwell lengths (as in experimental data). The trajectories to determine the detection efficiency of dynamic RPO complexes, i.e., complexes showing dynamics, were simulated by DeepFRET using FRET efficiency setting 0.279 (noise 0.06) and 0.462 (noise 0.06) for the RPISO and RPO state, respectively. The mean lifetime of the RPISO and RPO state complex was set as 0.085 s and 0.56 s, respectively. The trajectory length was 2–7 s (100–350 frames). The histograms of E* values were fit in Origin software to one or two Gaussian distributions using Eq. (2) with n fixed as 1 or 2, respectively. The fit parameters E, w and A are the centre, width and area of the Gaussian distributions, respectively.

CRediT authorship contribution statement

Anssi M. Malinen: Conceptualization, Methodology, Formal analysis, Investigation, Writing – original draft, Writing – review & editing, Funding acquisition. Jacob Bakermans: Software, Investigation, Writing – review & editing. Emil Aalto-Setälä: Software, Investigation. Martin Blessing: Resources, Writing – review & editing. David L.V. Bauer: Resources, Writing – review & editing, Funding acquisition. Olena Parilova: Investigation, Funding acquisition. Georgiy A. Belogurov: Resources, Writing – review & editing. David Dulin: Conceptualization, Formal analysis, Writing – review & editing. Achillefs N. Kapanidis: Conceptualization, Methodology, Writing – review & editing, Funding acquisition, Supervision.

50 in total

1. Bacterial promoter architecture: subsite structure of UP elements and interactions with the carboxy-terminal domain of the RNA polymerase alpha subunit.

Authors: S T Estrem; W Ross; T Gaal; Z W Chen; W Niu; R H Ebright; R L Gourse
Journal: Genes Dev Date: 1999-08-15 Impact factor: 11.361

2. Structural basis for promoter-10 element recognition by the bacterial RNA polymerase σ subunit.

Authors: Andrey Feklistov; Seth A Darst
Journal: Cell Date: 2011-12-01 Impact factor: 41.582

3. High Spatiotemporal-Resolution Magnetic Tweezers: Calibration and Applications for DNA Dynamics.

Authors: David Dulin; Tao Ju Cui; Jelmer Cnossen; Margreet W Docter; Jan Lipfert; Nynke H Dekker
Journal: Biophys J Date: 2015-11-17 Impact factor: 4.033

4. Strand opening-deficient Escherichia coli RNA polymerase facilitates investigation of closed complexes with promoter DNA: effects of DNA sequence and temperature.

Authors: Victoria M Cook; Pieter L Dehaseth
Journal: J Biol Chem Date: 2007-05-16 Impact factor: 5.157

5. Stepwise Promoter Melting by Bacterial RNA Polymerase.

Authors: James Chen; Courtney Chiu; Saumya Gopalkrishnan; Albert Y Chen; Paul Dominic B Olinares; Ruth M Saecker; Jared T Winkelman; Michael F Maloney; Brian T Chait; Wilma Ross; Richard L Gourse; Elizabeth A Campbell; Seth A Darst
Journal: Mol Cell Date: 2020-03-10 Impact factor: 17.970

6. A branched pathway in the early stage of transcription by Escherichia coli RNA polymerase.

Authors: T Kubori; N Shimamoto
Journal: J Mol Biol Date: 1996-03-01 Impact factor: 5.469

7. Multiplexed protein-DNA cross-linking: Scrunching in transcription start site selection.

Authors: Jared T Winkelman; Irina O Vvedenskaya; Yuanchao Zhang; Yu Zhang; Jeremy G Bird; Deanne M Taylor; Richard L Gourse; Richard H Ebright; Bryce E Nickels
Journal: Science Date: 2016-03-04 Impact factor: 47.728

8. Structures of an RNA polymerase promoter melting intermediate elucidate DNA unwinding.

Authors: Hande Boyaci; James Chen; Rolf Jansen; Seth A Darst; Elizabeth A Campbell
Journal: Nature Date: 2019-01-09 Impact factor: 49.962

9. Transcription initiation at a consensus bacterial promoter proceeds via a 'bind-unwind-load-and-lock' mechanism.

Authors: Abhishek Mazumder; Richard H Ebright; Achillefs N Kapanidis
Journal: Elife Date: 2021-10-11 Impact factor: 8.140

Review 10. Structures of RNA polymerase-antibiotic complexes.

Authors: Mary X Ho; Brian P Hudson; Kalyan Das; Eddy Arnold; Richard H Ebright
Journal: Curr Opin Struct Biol Date: 2009-11-18 Impact factor: 6.809

2 in total

1. Quantitative parameters of bacterial RNA polymerase open-complex formation, stabilization and disruption on a consensus promoter.

Authors: Subhas C Bera; Pim P B America; Santeri Maatsola; Mona Seifert; Eugeniu Ostrofet; Jelmer Cnossen; Monika Spermann; Flávia S Papini; Martin Depken; Anssi M Malinen; David Dulin
Journal: Nucleic Acids Res Date: 2022-07-22 Impact factor: 19.160

2. Alteration of DNA supercoiling serves as a trigger of short-term cold shock repressed genes of E. coli.

Authors: Suchintak Dash; Cristina S D Palma; Ines S C Baptista; Bilena L B Almeida; Mohamed N M Bahrudeen; Vatsala Chauhan; Rahul Jagadeesan; Andre S Ribeiro
Journal: Nucleic Acids Res Date: 2022-08-26 Impact factor: 19.160

2 in total