Literature DB >> 24502198

Molecular mechanisms of transcription through single-molecule experiments.

Manchuta Dangkulwanich1, Toyotaka Ishibashi, Lacramioara Bintu, Carlos Bustamante.   

Abstract

Entities:  

Mesh:

Substances:

Year:  2014        PMID: 24502198      PMCID: PMC3983126          DOI: 10.1021/cr400730x

Source DB:  PubMed          Journal:  Chem Rev        ISSN: 0009-2665            Impact factor:   60.622


× No keyword cloud information.

Introduction

Transcription represents the first step in gene expression. It is therefore not surprising that transcription is a highly regulated process and its control is essential to understand the flow and processing of information required by the cell to maintain its homeostasis. During transcription, a DNA molecule is copied into RNA molecules that are then used to translate the genetic information into proteins; this logical pattern has been conserved throughout all three kingdoms of life, from Archaea to Eukarya, making it an essential and fundamental cellular process. Even though some viruses that encode their genome in an RNA molecule use it as a template to make mRNA, others synthesize an intermediate DNA molecule from the RNA, a process known as reverse transcription, from which regular transcription of viral genes can then proceed in the host cells.[1] Why has transcription evolved into such an essential cellular process? Why not directly express the information encoded in the DNA genome into proteins? There are several reasons to justify the evolution of transcription as an intermediate step for the synthesis of proteins. First, transcription expands the variety of gene products by allowing for splicing. Second, copying the information within DNA into many RNA molecules increases the rate of total protein synthesis in the cell and avoids the bottleneck that would result from expression of a gene directly from the DNA. Third, the number of RNA molecules available at any given time to synthesize proteins can be precisely regulated to give a burst of products. The signal amplification implicit in the transcription process increases the dynamic range of the expression, allowing the cell to control its RNA throughput with higher precision and in a gene-specific manner. This amplification also gives rise to stochasticity in gene expression, making it possible to produce various outcomes from genetically identical cells.[2] RNA synthesis in the cell is a complex process that requires a finite time for completion. Having the capability to follow the time course of transcription and its progression in real time is, therefore, essential to understand its regulation. In bulk, one can hope to follow, at most, the progression of transcription as an average of unsynchronized contributions from individual molecules within a population. This averaging obscures crucial information contained in the time-dependent behavior of individual molecules. Single-molecule methods overcome the limitations inherent to the ensemble averaging of bulk methods by allowing one to follow the trajectories of individual molecules in real time. The picture that emerges from single-molecule studies of transcription is that of a rich and complex process that provides many checkpoints for regulation throughout transcription. Over the past two decades, various methods of single-molecule manipulation and detection have been employed to characterize all three stages of transcription. In the first stage of transcription initiation, RNA polymerase (RNAP) must locate specific promoter sites on the genome in the densely packed cellular environment. Single-molecule methods, such as atomic force microscopy (AFM) and fluorescence-based approaches, have provided insights into how RNAP locates its promoter and unwinds the DNA duplex. Because of the DNA helical structure, unwinding of the duplex is accompanied by changes in its twist. Through the use of magnetic tweezers, it has been possible to both apply torque and follow the torsional states of individual initiating RNAP complexes. During the second stage of elongation, RNAP operates as a molecular motor, converting difference between high-energy phosphoanhydride bonds and lower energy phosphodiester bonds into mechanical work, through the generation of force (in piconewton range) and displacement (in subnanometer scale). Methods of single-molecule manipulation, such as optical tweezers, are ideally suited to precisely measure forces and displacements on this scale; thus, optical tweezers are capable of providing unique insight on the mechanochemical conversion in the transcription process as well as the mechanisms by which transcription factors regulate the dynamics and the progress of the enzyme. When the RNAP finishes synthesizing the full-length transcript, it must stop at a specific location and release the transcript in a controlled manner. Single-molecule techniques make it possible to selectively apply loads on either the DNA template or the RNA transcript, and to dissect regulatory elements in the final stage of transcription, termination. Here, we present a review of the various aspects of transcription that have been addressed using methods of single-molecule detection and manipulation. We have organized this Review along the three stages of transcription. In the initiation and termination stages, where the factors involved differ substantially between the prokaryotic and eukaryotic systems, we will describe first the results established in prokaryotes prior to detailing those obtained in eukaryotes.

Transcription Initiation

Whereas single-subunit viral polymerases such as T7 and SP6 RNAP can start transcription at a promoter region without additional cofactors, multisubunit bacterial and eukaryotic RNAPs require transcription factors that aid the enzyme to recognize and bind to the promoter. Together they form an initiation complex that unwinds the DNA at the promoter and produces a nascent RNA transcript that stabilizes the complex and primes the enzyme for the processive synthesis of a full-length RNA message. Although similar events occur in prokaryotes and eukaryotes, different factors participate in the initiation process. Next, we address them in order.

Prokaryotic Transcription Initiation

In prokaryotes, core RNAP (consisting of five subunits, ββ′α2ω) can nonspecifically bind to DNA. In vitro, it can initiate transcription from DNA ends. However, to initiate transcription from a promoter, the core polymerase must assemble into the holoenzyme by binding to a σ factor, which recognizes specific sequences of promoter DNA, facilitates DNA unwinding, and influences the early phase of transcription elongation.[3] To begin specific transcription initiation, RNAP must first locate its promoter, unwind the DNA to form an open promoter complex, and begin transcribing the DNA until it releases the promoter and transitions into the elongation phase (Figure 1). As we will describe in detail in this section, various single-molecule approaches, such as single-molecule fluorescence-based methods, atomic force microscopy, and magnetic tweezers assays, have provided a detailed picture of transcription initiation.
Figure 1

Summary of steps in transcription initiation for prokaryotic RNAP. The clamp of free RNAP exists in three different conformations: open (red), closed (green), and collapsed (blue) states. The RNA holoenzyme interacts with the promoter via the σ factor to form the closed promoter complex; only the open clamp state was observed in this state. In the open promoter complex, the RNAP wraps the upstream DNA around itself, closes the clamp, and unwinds the promoter. Next, the RNAP synthesizes and releases short transcripts in a process called abortive initiation before transiting into the elongation phase. The schematic of RNAP is inspired by Murakami and Darst.[117]

Summary of steps in transcription initiation for prokaryotic RNAP. The clamp of free RNAP exists in three different conformations: open (red), closed (green), and collapsed (blue) states. The RNA holoenzyme interacts with the promoter via the σ factor to form the closed promoter complex; only the open clamp state was observed in this state. In the open promoter complex, the RNAP wraps the upstream DNA around itself, closes the clamp, and unwinds the promoter. Next, the RNAP synthesizes and releases short transcripts in a process called abortive initiation before transiting into the elongation phase. The schematic of RNAP is inspired by Murakami and Darst.[117]

Promoter Search

The investigation of how DNA-binding proteins search and find their targets was pioneered in the studies of lac repressor, which was shown to bind to the target operator much faster than the 3D diffusion limit.[4] Subsequent investigations have shown that the target search process of DNA-binding proteins can be accelerated by reducing the dimensionality of the search.[5,6] According to these thoughts, the protein finds the target much faster by binding nonspecifically on the DNA genome and subsequently sliding one-dimensionally along the double helix, hopping within a DNA segment, or transferring between contacting segments of the DNA molecule (Figure 2).
Figure 2

Schematic of various promoter search mechanisms by RNA polymerase. The blue clamp-like shape represents RNA polymerase, and the gray line represents the DNA. 1D sliding is the one-dimensional diffusion of the RNAP along the DNA segment. 1D hopping is a transient association and disassociation of the RNAP along the DNA. Intersegment transfer is the translocation of the RNAP from one point on the DNA to a distant point via a loop intermediate.

Schematic of various promoter search mechanisms by RNA polymerase. The blue clamp-like shape represents RNA polymerase, and the gray line represents the DNA. 1D sliding is the one-dimensional diffusion of the RNAP along the DNA segment. 1D hopping is a transient association and disassociation of the RNAP along the DNA. Intersegment transfer is the translocation of the RNAP from one point on the DNA to a distant point via a loop intermediate. Single-molecule studies have directly observed the trajectories of RNAP molecules during the search process. Earlier experiments observed one-dimensional sliding of fluorescently labeled E. coli RNAP holoenzyme along the DNA; it was estimated that the 1D diffusion coefficient for the RNAP was 1 × 104 nm2/s over a short distance of ∼90 nm,[7] although a longer diffusion distance of over 10 μm was also observed in an independent study.[8] During this sliding process, RNAP tracks along the DNA groove, as was observed by rotation of fluorescent-bead labeled DNA when it was being dragged over immobilized RNAP holoenzyme.[9] Atomic force microscopy (AFM) has also been used to observe the 1D sliding of the σ70 RNAP on the DNA and reported the 1D diffusion coefficient of 1.1 × 101 nm2/s.[10] The smaller diffusion coefficient reported in the AFM study may be due to the surface–DNA interactions that are expected to hinder the diffusion process. Events of hopping and intersegment transfer were also observed in AFM imaging.[11] A recently developed fast-scanning AFM, which has a temporal resolution of 1–2 frames per second, was used to re-examine the promoter search mechanism of holoenzyme RNAP and found that RNAP utilizes all of the mechanisms described for facilitated targeting, that is, 1D sliding, hopping, and intersegment transfer of RNAP to locate the promoter sequence.[12] Together, these studies showed that RNAP uses various mechanisms of facilitated targeting; however, they did not address the relative contribution of these mechanisms to the binding of RNAP to the promoter. Two recent fluorescence-based in vitro studies have investigated the relative contributions of 1D sliding, hopping, and intersegment transfer to promoter search by RNAP. Friedman et al. have used colocalization single-molecule fluorescent spectroscopy (CoSMos) to assess the contribution of 1D sliding in target finding using the σ54 RNAP holoenzyme.[13] The σ54 RNAP holoenzyme is responsible for transcription initiation of genes required for survival under conditions of stress, such as heat shock and nitrogen depletion.[14] The widely studied σ70-bound holoenzyme, on the other hand, is involved in the expression of housekeeping genes needed during the exponential growth phase. In the CoSMos study by Friedman et al., two DNA templates, with or without a promoter sequence, were labeled with different fluorescent dyes and immobilized on glass surface (Figure 3A). The positions of these templates were identified using fluorescence imaging of the DNA-specific dyes. The holoenzyme was labeled with a different fluorescent dye and introduced into the chamber. Binding events were scored by an appearance of the enzyme fluorescence in the same spot with the immobilized DNA template for 0.2 s or longer.[13]
Figure 3

A CoSMos experiment to study the influence of 1D sliding to promoter search. (A) The DNA templates were labeled with different fluorescent dyes (red and blue), and immobilized on the glass surface. The σ54 holoenzyme was labeled with another fluorescent dye. Binding events were scored from colocalization of fluorescence signals. (B) Frequency of binding events with lifetimes greater than or equal to the indicated dwell time on DNAs with 7 bp downstream from the promoter (blue), or a control (red). Inset shows a magnified view of the short dwell time plotted in a linear scale. (C) Frequency of binding events with lifetimes greater than or equal to the indicated dwell time on DNAs with 2993 bp downstream from the promoter (blue), or a control (red). Inset shows a magnified view of the short dwell time plotted on a linear scale. Reprinted with permission from ref (13). Copyright 2013 National Academy of Sciences.

A CoSMos experiment to study the influence of 1D sliding to promoter search. (A) The DNA templates were labeled with different fluorescent dyes (red and blue), and immobilized on the glass surface. The σ54 holoenzyme was labeled with another fluorescent dye. Binding events were scored from colocalization of fluorescence signals. (B) Frequency of binding events with lifetimes greater than or equal to the indicated dwell time on DNAs with 7 bp downstream from the promoter (blue), or a control (red). Inset shows a magnified view of the short dwell time plotted in a linear scale. (C) Frequency of binding events with lifetimes greater than or equal to the indicated dwell time on DNAs with 2993 bp downstream from the promoter (blue), or a control (red). Inset shows a magnified view of the short dwell time plotted on a linear scale. Reprinted with permission from ref (13). Copyright 2013 National Academy of Sciences. To determine if short-lived events such as sliding or hopping play a role in promoter search, the authors varied the length of the DNA segment flanking a promoter site. The assumption here was that if nonspecific binding contributes to the promoter search, shortening or eliminating the flanking DNA should decrease the observed rates of promoter binding. However, shortening the flanking DNA to 7 base pair (bp) or increasing it to 3000 bp had no effect on the promoter binding rate of RNAP (Figure 3B and C). This observation is inconsistent with the prediction from the facilitated 1D diffusion model. Therefore, this study excluded the facilitated diffusion mechanism over distances between 7 and 3000 bp as a significant pathway for σ54 RNAP to reach the promoter, favoring instead 3D diffusion as the major promoter search mechanism. Wang et al. followed quantum dot-labeled E. coli σ70 RNAPs as they searched for promoters on a nanofabricated array of stretched λ phage DNA, called DNA curtains (Figure 4A), and, in contrast to prior studies, observed no 1D sliding of RNAP.[15] However, given the temporal (10 ms) and spatial (∼40 nm) resolutions of the experiments, fast sliding events or events occurring on shorter length scales may not have been detected. The authors, therefore, derived a theoretical framework to estimate the contributions of hopping and 1D diffusion for promoter binding by RNAP that occur below the temporal and spatial resolutions of the experiments. They concluded that the most important contribution to the observed rate in these calculations (for stretched DNA) is the 3D diffusion term, which is dependent on the concentration of the RNAP. The authors then measured the promoter association rate to DNA curtains at various concentrations of RNAP; they observed that the rates of target association at concentrations below 500 pM of RNAP exceeded the expected rates from 3D diffusion solely (Figure 4B). The probability of finding the promoter target by direct collision with the curtains in 3D increases with protein abundance and will eventually surpass the probability of success target location through facilitated diffusion process in one dimension (Figure 4B). On the basis of these curtain experiments, these authors concluded that the higher RNAP concentrations present in vivo render 3D diffusion an efficient promoter search mechanism.
Figure 4

DNA curtain experiment for RNAP-promoter binding. (A) Quantum dot-labeled RNAP (magenta) bound to tethered DNA curtain (green). (B) Observed rate promoter association rates (ka) as a function of RNAP concentration. The gray region shows the regime that the theoretical target association rate from 3D diffusion (magenta) is lower than the observed rate, which may reflect promoter-finding acceleration due to facilitated diffusion. Reprinted with permission from ref (15). Copyright 2013 Nature Publishing Group.

DNA curtain experiment for RNAP-promoter binding. (A) Quantum dot-labeled RNAP (magenta) bound to tethered DNA curtain (green). (B) Observed rate promoter association rates (ka) as a function of RNAP concentration. The gray region shows the regime that the theoretical target association rate from 3D diffusion (magenta) is lower than the observed rate, which may reflect promoter-finding acceleration due to facilitated diffusion. Reprinted with permission from ref (15). Copyright 2013 Nature Publishing Group. Thus, while the AFM studies have shown that RNAP can use all three target-search mechanisms, 1D sliding, hopping, and intersegment transfer,[11,12] the two recent fluorescence-based studies favored 3D diffusion and concluded that long-range 1D sliding does not play a significant role in promoter search.[13,15] However, these in vitro studies differ from the in vivo environment in two main aspects. First, a bacterial nucleoid exists in a coiled structure; therefore, the contributions from intersegment transfer to promoter search in vivo may be more significant than assumed. By design, DNA curtain experiments eliminate the possible contributions of intersegment transfer to promoter search. Second, the interior of the cell is crowded with other macromolecules; therefore, it is unrealistic that promoter search occurs strictly through 3D diffusion. Rather, it is possible that intersegment transfer events followed by short (few nanometers) 1D diffusion runs play a significant role during promoter search in vivo. Notwithstanding these considerations, the importance of intersegment transfer for promoter search in vivo remains to be established.

Closed-to-Open Complex Transition

The binding of RNAP to the promoter leads to the formation of the closed promoter complex (RPc, Figure 1). In this complex, the DNA is still in double-stranded form and has not been inserted into the RNAP active center cleft. Subsequently, the enzyme unwinds ∼13 bp of the DNA duplex and isomerizes into the RNAP open promoter complex (RPo, Figure 1). Biochemical studies have characterized the sequence of events during the RPo complex formation at the λPR promoter, which consists of at least three intermediate steps: DNA loading, DNA unwinding, and assembling of the polymerase clamp on the downstream DNA (see Saecker et al.[16] for a review). In one experiment, DNA footprinting assays revealed an increased protection of 10 bp on the upstream side of the RPc relative to the RPo (∼100 bp in the RPc and ∼90 bp in the RPo are protected).[17] These extended protections suggest that the DNA bends and wraps around the RNAP in these complexes. Although footprinting assays are not able to directly detect DNA wrapping, RPo has been observed to wrap DNA by AFM imaging. Using the λPR promoter, Rivetti et al. found that ∼90 bp of DNA were unaccounted for in the images of RPo complexes (Figure 5A). These authors proposed that the missing DNA resulted from the wrapping of DNA around the enzyme by ∼300° (Figure 5B).[18] This wrapping generates mechanical stress in the DNA; assuming a persistent length of 53 nm for DNA,[19] the bending energy stored in the 90 bp segment of DNA wrapped 300° around the enzyme is ∼14 kcal/mol. This energy must be paid by the binding energy between the enzyme and the DNA. It has been estimated that the binding free energy of RNAP to the promoter to form the RPo at the λPR promoter is −9 kcal/mol at 37 °C, corresponding to a dissociation constant of ∼3.7 × 10–8 M.[20] Thus, without the energy cost of DNA bending, the free energy for RPo formation would be around −23 kcal/mol, which is equivalent to the free energy from hydrolysis of 2.5 mol of ATP, and would render the bound complex too stable, making promoter clearance more difficult. Subsequent studies have shown that the decrease in DNA contour length observed in the AFM images (from which wrapping can be inferred) varies among different promoter sequences ranging from ∼6 to ∼90 bp. Moreover, it was found that the extent of DNA wrapping in RPo decreases by 2–3-fold when the carboxy terminal domains of α-subunits are removed, indicating that this domain may mediate the interactions between RNAP and DNA in this process.[21]
Figure 5

Wrapping of DNA around RNAP in the open promoter complex. (A) AFM images of DNA in the absence (top) and presence of RNAP (bottom). The contour length of DNA in RPo is shorter than that in the free DNA. (B) A schematic of σ70E. coli RNAP wraps DNA around itself in the RPo. (A) Reprinted with permission from ref (21). Copyright 2007 Wiley-VCH Verlag GmbH & Co. KGaA. (B) Reprinted with permission from ref (18). Copyright 1999 Wiley-VCH Verlag GmbH & Co. KGaA.

Wrapping of DNA around RNAP in the open promoter complex. (A) AFM images of DNA in the absence (top) and presence of RNAP (bottom). The contour length of DNA in RPo is shorter than that in the free DNA. (B) A schematic of σ70E. coli RNAP wraps DNA around itself in the RPo. (A) Reprinted with permission from ref (21). Copyright 2007 Wiley-VCH Verlag GmbH & Co. KGaA. (B) Reprinted with permission from ref (18). Copyright 1999 Wiley-VCH Verlag GmbH & Co. KGaA. During RPo formation, the polymerase unwinds the promoter, and the DNA outside the enzyme positively supercoils to compensate for the promoter unwinding. This process has been observed in real time at the single-molecule level using magnetic tweezers, which can apply torque to twist the DNA and allow it to form plectonemes (Figure 6A).[22] Using the lacCONS promoter, Revyakin et al. observed an extension increase of 50 ± 5 nm of negatively supercoiled DNA.[22] When the experiment was done with positively supercoiled DNA, its extension decreased by 80 ± 5 nm during RPo formation. These observations allowed the authors to separate the contributions from the unwinding and wrapping of the DNA around the enzyme during RPo formation to the DNA extensions. The authors determined that DNA unwinds 1.2 ± 0.1 turns or 13 ± 1 bp and wraps by 15 ± 5 nm or ∼44 bp,[22] in good agreement with the wrapping observed in an AFM study.[21] No DNA unwinding intermediates were observed in this study.
Figure 6

Promoter unwinding and DNA scrunching from single-molecule magnetic tweezers experiments. (A) A promoter DNA is tethered between a glass slide and a magnetic bead, and then positively (or negatively, not shown) supercoiled by rotating the bead, which results in shortening the end-to-end distance. Unwinding of n turns of DNA by RNAP results in the compensatory gain of n positive supercoils, which is detected by the movement of the bead. (B) An example trajectory of DNA extension over time showing transitions between the four states detected. Numbered arrows indicate unwinding and rewinding events. Numbers 2 and 3 mark events of scrunching and reversal of scrunching, respectively. The histogram shows the end-to-end distance change of the RPo, the initial transcribing complex, and the elongation complex from the initial state.[25] Reprinted with permission from ref (25). Copyright 2006 American Association for the Advancement of Science.

Promoter unwinding and DNA scrunching from single-molecule magnetic tweezers experiments. (A) A promoter DNA is tethered between a glass slide and a magnetic bead, and then positively (or negatively, not shown) supercoiled by rotating the bead, which results in shortening the end-to-end distance. Unwinding of n turns of DNA by RNAP results in the compensatory gain of n positive supercoils, which is detected by the movement of the bead. (B) An example trajectory of DNA extension over time showing transitions between the four states detected. Numbered arrows indicate unwinding and rewinding events. Numbers 2 and 3 mark events of scrunching and reversal of scrunching, respectively. The histogram shows the end-to-end distance change of the RPo, the initial transcribing complex, and the elongation complex from the initial state.[25] Reprinted with permission from ref (25). Copyright 2006 American Association for the Advancement of Science. In addition to inducing bending and wrapping of the DNA, RNAP also closes its clamp upon RPo formation (Figure 1). This conformational change was detected by single-molecule FRET between the tips of the β′ and β pincers labeled with fluorophores (Figure 1). Chakraborty et al. observed that the free σ70 holoenzyme and the core enzyme could exist in three noninterconverting FRET states, which they denoted open, closed, and collapsed clamp states, in a ratio of 2:1:1 approximately.[23] The authors estimated that the width between the pincers is ∼20 Å in the open state, sufficient to accommodate dsDNA. This width reduces to ∼12 Å in the closed state, capable of accommodating only ssDNA. In the collapsed state, the width further reduces to ∼8 Å, which is too small for ssDNA. Upon the addition of DNA, the σ70 RNAP quickly transits through the RPc and readily forms the RPo complex, and only the closed clamp conformation was observed. This closed clamp conformation was retained from this point onward throughout the initiation and elongation phases (Figure 1).[23] Because σ70 RNAP has a short lifetime in the RPc state, σ54 RNAP, which transits from the RPc to the RPo state only in the presence of AAA+ ATPase activator NtrC, was studied.[23] In the RPc and various intermediates to the RPo of the σ54 RNAP, the majority of the molecules are found in the open clamp state, with a small population of the closed clamp state. Only when both ATP and NtrC1 are present, the RPo is formed and only the closed clamp state is observed.[23] This result indicates that the RNAP clamp only closes in the RPo state upon loading of DNA into, and unwinding of DNA in the active-center cleft of the enzyme. The σ54 RNAP has also been employed to study the kinetics of RPo formation using the previously discussed CoSMos study by Friedman and Gelles.[24] In this experiment, the DNA template was labeled with a fluorescent dye and immobilized on the surface. The σ54 RNAP was labeled with a different dye, and binding of the enzyme to the DNA was detected by colocalization of the two fluorescent dyes. From the lifetimes of the individual RNAP–DNA complexes, two distinct types of RPc were discovered: a short-lived RPc with a lifetime of 2.3 ± 0.5 s that is a precursor of a longer lived RPc, which has a lifetime of 79 ± 13 s. After adding the NtrC and ATP, the longer-lived RPc isomerizes into RPo with the rate of 1.9 × 10–3 s–1, which is the rate-limiting step of the initiation process.[24] The mechanism by which NtrC uses the energy from ATP to catalyze the open complex formation is currently unknown.

Abortive Initiation and Promoter Clearance

After unwinding the DNA duplex, RNAP synthesizes and often releases small transcripts up to 11 nucleotides in a series of prematurely terminated initiation events termed “abortive initiation”. This process represents the early stages of transcript synthesis before the enzyme commits itself to processive elongation. Both a single-molecule magnetic tweezers assay and a single-molecule FRET experiment have shown that abortive initiation occurs via a “scrunching” mechanism, wherein the polymerase remains stationary and reels the downstream DNA into the active site.[25,26] Kapanidis et al. used single-molecule FRET to determine the location of RNAP relative to the template.[26] These authors found that the leading and trailing edges of the enzyme remain stationary, while the downstream DNA moves closer to the active site during abortive initiation.[26] In a parallel study, Revyakin et al. used a magnetic tweezers to observe the DNA unwinding at different stages of transcription initiation at the lacCONS promoter (Figure 6A).[25] The authors observed four different DNA extensions, which were assigned to (1) the initial state, (2) RPo, (3) RNAP–promoter initial transcribing complex, and (4) RNAP–DNA elongation complex.[25] DNA scrunching was seen as an overshoot in DNA extension that followed RPo formation and preceded the formation of the elongation complex (Figure 6B). This scrunching signature was observed in 80% of the transcription traces. When the RNAP was allowed to synthesize only a 2-nt RNA transcript, scrunching was not observed. Scrunching occurred when the enzyme was allowed to synthesize a 4- or 8-nt RNA transcript, and the duplex in the RPo was seen to unwind by 2 or 6 additional bp, respectively.[25] These observations suggest that the polymerase can accommodate 2 nt of RNA transcript before it starts to scrunch the DNA template.[25] Scrunching should lead to unwinding and compaction of the DNA that would result in the generation of a stressed intermediate as previously hypothesized.[27] The enzyme can relieve this stress either by releasing the DNA to the downstream side, aborting transcript synthesis, and returning to the RPo, or by releasing the DNA to the upstream side, breaking the interactions of its trailing edge with the DNA, and escaping from the promoter. Indeed, each base pair of DNA scrunched stores on average a free energy of ∼2 kcal/mol from breakage of the base pair.[28] At a typical promoter, RNAP synthesizes ∼9–11 nt, corresponding to ∼7–9 bp of scrunched DNA, before it can proceed to elongation. It is possible that part of the ∼14–18 kcal/mol stored in this process is used to overcome the RNAP–promoter interactions, estimated to be ∼7–9 kcal/mol at 37 °C.[20]

The Fate of the σ Factor

The fate of the σ factor, once the polymerase transits from the initiation to the elongation phase, is still unclear. It may remain bound to the promoter, be released from the promoter, or be retained in the transcribing elongation complex. Recent studies observed retention of the σ70 factor in the elongation complex.[29−31] A single-molecule FRET study showed that ∼20–90% of early elongation complexes harboring 11- or 14-nt transcripts retain the σ70 factor with a half-life between 20 and 90 min.[29] The initial extent of σ70 retention may increase to ∼70–100%, depending on the DNA sequence.[29] In addition, about 50% of the late elongation-phase RNAPs harboring 50-nt RNA transcripts still retain the σ70 with a half-life of over 50 min.[29] These observations raise the possibility that σ70 may remain associated with the polymerase throughout most of the elongation phase; however, this hypothesis is yet to be tested for polymerases approaching the termination phase. The role of σ70 during the elongation phase is unknown. Nonetheless, the fate of σ70 may not be shared by other σ factors. The σ54 factor was shown to be released from >90–95% of the complexes within 10 s after RNAP binding to the promoter, while RNA transcript was detected ∼20 s after σ54 release.[24] This result raises interesting possibilities that different σ factors might play different roles in transcription regulation. While the σ70 may remain bound to the core enzyme and participate in transcription elongation, σ54 may only function in transcription initiation, and its release could trigger the transition of RNAP into the elongation phase.[24]

Eukaryotic Transcription Initiation

To date, most of our understanding of transcription initiation has come from studies in the prokaryotic system. However, biophysical methods have recently begun to shed light on the structure and dynamics of transcription initiation in eukaryotes. As a result, we are beginning to understand the mechanism of open complex formation, of abortive initiation, as well as the rates for these steps in the eukaryotic system. While the core RNAP bound to the σ factor is sufficient to locate the promoter and to initiate transcription in prokaryotes, an additional set of general transcription factors that include TFIIA, TFIIB, TFIID, TFIIE, TFIIF, and TFIIH are involved in eukaryotic transcription initiation. In vitro biochemical reconstitution studies have revealed the order of assembly of these factors to form a functional preinitiation complex (PIC).[32] The first factor to locate and bind the promoter is TFIID, one of whose subunits is the TATA-box binding protein (TBP). TFIIA and TFIIB are recruited next and help stabilize the interactions of TBP with the complex. The bound TFIIB recruits RNA polymerase II (Pol II) and TFIIF. TFIIE binds next to the complex followed by TFIIH, whose ATPase and helicase activities are required to unwind the DNA duplex to form the transcription bubble. The binding of TFIIH completes the formation of the PIC. Recently, two different cryo-electron microscopy (cryo-EM) structures of eukaryotic PIC have been communicated and both structures are supported by cross-linking experiments.[33−37] Murakami et al. published a yeast PIC structure consisting of two lobes.[38] The authors assigned one lobe to Pol II and the other lobe to the general transcription factors and the DNA (Figure 7A). In this structure, the DNA only interacts with the transcription factors, TBP, TFIIF, TFIIE, and TFIIH, but does not interact with the polymerase.
Figure 7

Eukaryotic preinitiation complex. (A) Cryo-EM reconstruction of purified yeast PIC with available crystal structures docked, from Murakami et al.[38] (B) Negative stain reconstruction of stepwise assembled human PIC with available crystal structures docked, from He et al.[39] (C) Cryo-EM reconstruction of a partial human PIC that lacks TFIIH with available crystal structures docked, from He et al.[39] Images courtesy of E. Nogales.

Eukaryotic preinitiation complex. (A) Cryo-EM reconstruction of purified yeast PIC with available crystal structures docked, from Murakami et al.[38] (B) Negative stain reconstruction of stepwise assembled human PIC with available crystal structures docked, from He et al.[39] (C) Cryo-EM reconstruction of a partial human PIC that lacks TFIIH with available crystal structures docked, from He et al.[39] Images courtesy of E. Nogales. He et al. characterized the human PIC and studied the structural changes that accompanied the sequential addition of the various transcription factors.[39] In this structure, the DNA makes interactions with most of the transcription factors, as seen by Murakami et al., but also interacts directly with the polymerase in the presence of TFIIF (Figure 7B).[39] Binding of TFIIF to the TBP-TFIIA-TFIIB-Pol II bound promoter leads to an engagement of the DNA along the cleft of the polymerase. Binding of TFIIE further stabilizes the PIC and provides a platform for binding of TFIIH, which positions its XPB helicase domain directly on the downstream DNA. The authors hypothesized that the XPB domain could translocate and twist the DNA to generate stress, which would be relieved by duplex unwinding, favoring the formation of the open promoter complex.[39] One other significant difference is seen in the fitting quality resulting from the docking of the crystal structures of the various transcription factors to the two cryo-EM densities (Figure 7A and B). The crystal structures fit very well to He et al.’s cryo-EM densities, whereas, in the study by Murakami et al., parts of the crystal structures of Pol II and the transcription factors did not fit well to their cryo-EM densities, and parts of the density assigned to Pol II were not accounted for in the crystal structure (Figure 7). Even though these structures are derived from different organisms, the differences observed between them are somewhat surprising. To explain these differences, Murakami et al. have suggested that the structure from He et al. represents an incomplete PIC, which lacks TFIIE and TFIIH. These authors also claimed that their cryo-EM densities cover a larger fraction of the transcription factors than those presented in the structure by He et al. However, it should be pointed out that He et al.’s cryo-EM densities fully accounted for the structured sections of these proteins observed in crystal structures. Thus, at the point of writing this Review, the controversy persists as to which structure more accurately represents the eukaryotic PIC. Perhaps, the two structures represent different stages of the PIC. The structure proposed by Murakami et al. could correspond to the promoter containing all of the transcription factors that has just recruited an additional polymerase after the first round of transcription, thus explaining the lack of interactions between the DNA and the enzyme. The structure proposed by He et al. could represent the sequential binding of transcription factors and Pol II to the promoter to initiate the first round of transcription. Future experiments should shed light on the origin of these differences. Superresolution imaging is now being used to characterize the spatial distribution of initiation complexes of RNAP in live eukaryotic cells.[40] In parallel, single-molecule methods have recently begun to examine eukaryotic transcription initiation in vitro both structurally and dynamically. By fluorescently labeling various positions on the template DNA, nontemplate DNA, TBP, TFIIB, and the Rpb7 subunit of yeast Pol II, Treutlein et al. derived a model for a minimal open promoter complex structure.[41] Their model of the open complex suggests that the downstream DNA can adopt two different conformations: lying between the Pol II clamps, as appears in crystal structures of the elongation complex,[42] and on top of this cleft, closer to the Rpb7. The rates at which the DNA transits between the two conformations were determined to be kin clamp = 1.5 ± 0.4 s–1 and kout of lamp = 0.8 ± 0.1 s–1, respectively. However, it should be pointed out that this model was determined from a minimal open promoter complex that lacks TFIIA, TFIIE, TFIIH, and other subunits of TFIID, and an 11-nt mismatched DNA was employed to mimic a transcription bubble instead of a complementary nontemplate DNA. To study the dynamics of eukaryotic transcription initiation, Revyakin et al. used fluorescence imaging to count the rounds of transcription that initiated from the same promoter by determining the number of photobleaching events of fluorescently labeled RNA probes at a given time. They found that the distribution of photobleaching events follows a Poisson distribution.[43] This observation suggests that transcription initiation and reinitiation are noncooperative and independent, which is somewhat inconsistent with a previous notion that the scaffold of transcription initiation factors remains bound to the promoter and reinitiates multiple rounds of transcription at the end of each abortive event.[44]

Transcription Elongation

The elongation phase of transcription starts once the polymerase has produced a long enough RNA chain and has cleared the promoter region. In this phase, the polymerase uses the energy of NTP incorporation into the growing RNA chain to advance on DNA. Single-molecule studies have provided important insights into the molecular mechanism of transcription by analyzing the temporal and spatial dynamics of elongating RNA polymerase molecules. These experiments are usually performed with optical tweezers, an instrument that allows the manipulation of micrometer-sized beads with a focused laser beam. The elongating polymerase is attached to the surface of a bead held in a trap, while one of the ends of the DNA being transcribed is attached to another bead held in another trap (Figure 8). As elongation proceeds, one can infer the precise position of the polymerase on DNA by monitoring the movement of the two beads. In addition, this setup permits the application of force to the polymerase either in the same direction as transcription (assisting force, Figure 8A) or in the opposite direction (opposing force, Figure 8B). Studying the effect of force on the movement of molecular motors is an ideal tool to unravel how these motors couple chemical energy to mechanical motion.[45]
Figure 8

Dual-trap optical tweezers setup for transcription elongation assays. (A) Assisting force. (B) Opposing force. The arrow indicates the direction of RNA polymerase elongation. (C) Single-molecule elongation traces of Pol II during assisting force (left) with detail (right) showing pauses (red) and active elongation (black). Adapted with permission from ref (53). Copyright 2009 American Association for the Advancement of Science.

Dual-trap optical tweezers setup for transcription elongation assays. (A) Assisting force. (B) Opposing force. The arrow indicates the direction of RNA polymerase elongation. (C) Single-molecule elongation traces of Pol II during assisting force (left) with detail (right) showing pauses (red) and active elongation (black). Adapted with permission from ref (53). Copyright 2009 American Association for the Advancement of Science.

The Kinetic Cycle of Transcription Elongation

Initially, single-molecule transcription elongation experiments were performed with the E. coli RNAP.[46−49] These studies have shown that elongation has two phases: active translocation and pausing (Figure 8C). The same phases were also observed for the eukaryotic RNA polymerase II (Pol II),[50] and for the mitochondrial Rpo41.[51] During on-pathway active translocation, the polymerase incorporates nucleotides into the nascent RNA chain and advances along the DNA template. The paused states were shown to be off-pathway from the NTP incorporation cycle.[46] In the paused state, the polymerase can be stationary at one position on DNA, or it can diffuse backward along the DNA and then recover from these backtracks. Both on- and off-pathway phases of elongation are of interest, as different transcription factors can interact with the polymerase to modify either its active cycle or the probability of entering or remaining in the paused state. As we will describe in detail in this section, single-molecule experiments demonstrated that RNAP translocation occurs through a Brownian ratchet mechanism rectified into a forward movement by NTP binding, and that long pauses correspond to periods in which the enzyme backtracks on DNA, diffusing back and forth on the template until its active site re-engages with the 3′ end of the RNA transcript. There still remains some controversy about the exact details of the nucleotide incorporation cycle, and about the origin of short pauses.

The Nucleotide Incorporation Cycle

Single-molecule studies have confirmed structural and biochemical data indicating that RNA polymerase advances one base pair at a time as it incorporates one ribonucleotide during the synthesis of an RNA chain.[52] When pauses that are not considered part of the nucleotide incorporation cycle are removed, the resulting mean elongation velocities, pause-free velocities, at saturating concentrations of NTPs are: 15–23 bp/s for Pol II,[50,51,53] 10–25 bp/s for the bacterial RNAP,[52,54,55] and 20–24 bp/s for the mitochondrial Rpo41.[51] We provide a range of mean values here, as the pause-free velocities vary slightly depending on the DNA template, forces that aid or oppose the enzyme, and the algorithms that detect and remove pauses. One early question that single-molecule experiments have addressed is whether the transcriptional translocation event is a power stroke, where the enzyme’s translocation is directly coupled to a chemical step, or a Brownian ratchet, where thermally driven movement of the polymerase is rectified to one direction by its substrate. The answer to this question has implications about the possible modes of action of various inhibitors and transcription factors. The pause-free velocity is given by the distance the polymerase has to translocate during one cycle (d), divided by the total time it takes to complete the elongation cycle. This time is the sum of the times necessary to bind the incoming NTPNTP), complete the condensation reaction that incorporates the NTP to the RNA chain (τcond), and release the pyrophosphate before starting a new cycle (τPPi):If translocation is driven by thermal noise and biased forward by NTP binding followed by the irreversible condensation reaction (Brownian ratchet), τNTP should be sensitive to force, because that is the step associated with net movement on DNA. Instead, if pyrophosphate release induces, or coincides with, a change in conformation of the elongation complex, triggering translocation (power stroke), τPPi would be sensitive to force instead. At limiting NTP concentrations, the time it takes to bind NTPsNTP) becomes dominant over the time of pyrophosphate release (τPPi). In these conditions, if elongation follows a Brownian ratchet mechanism, the velocity should be sensitive to force, while if it follows a power stroke, the velocity should not depend on force. Single-molecule data have shown that at low NTPs, the pause-free velocity does depend on force.[52] This result is thus inconsistent with the power stroke mechanism, and supports the Brownian ratchet mechanism for transcription elongation, originally determined from bulk studies (Figure 9).[56] To date, no motor has been shown to change its mechanism of operation as a function of its substrate concentrations; thus, the ratchet is likely to be valid for RNAP at all NTP concentrations.
Figure 9

Pause-free velocity as a function of force at low NTPs concentrations. The data fit a Brownian ratchet model, but do not fit a power stroke model.[52] Reprinted with permission from ref (52). Copyright 2005 Nature Publishing Group.

Pause-free velocity as a function of force at low NTPs concentrations. The data fit a Brownian ratchet model, but do not fit a power stroke model.[52] Reprinted with permission from ref (52). Copyright 2005 Nature Publishing Group. While it is accepted that RNA polymerases act as Brownian ratchets, some aspects of the nucleotide addition cycle are still under debate.[57] The nucleotide addition cycle has multiple steps: the movement of the polymerase from a pre-translocated to a post-translocated state, binding of the incoming NTP, and catalysis of a phosphodiester bond between the NTP and the nascent RNA chain. The simplest model includes these steps sequentially in a linear fashion (Figure 10A). Because it is difficult to directly measure individual rates involved in nucleotide incorporation, one measures the elongation velocity of the polymerase in different conditions and fits the rates of the model to the data. The data obtained from single-molecule experiments offer the advantage to define and focus selectively on pause-free velocity, which in principle reflects only the time the enzyme spent in the on-pathways reactions. Note, however, that the temporal and spatial resolution of the experiments and algorithms used to detect pauses typically limit their extraction to those longer than 1 s. Additionally, the force applied to the polymerase during transcription allows one to make quantitative predictions about the dependence of the pause-free velocity on this applied force, and, as such, it can be used to put any elongation model to the test.
Figure 10

Two different kinetic models for the on-pathway nucleotide addition cycle. (A) Linear Brownian ratchet model of transcription elongation, which only allows NTP binding after translocation. (B) Branched Brownian ratchet model of transcription elongation that allows NTP binding either before or after translocation. (C) Fits of the linear model with slow translocation and the branched model to the force–velocity relationship of the wild-type Pol II.[58,54] (D) Fits of the linear model with slow translocation and the branched model to the force–velocity relationship of the fast mutant Pol II.[58,54]

Two different kinetic models for the on-pathway nucleotide addition cycle. (A) Linear Brownian ratchet model of transcription elongation, which only allows NTP binding after translocation. (B) Branched Brownian ratchet model of transcription elongation that allows NTP binding either before or after translocation. (C) Fits of the linear model with slow translocation and the branched model to the force–velocity relationship of the wild-type Pol II.[58,54] (D) Fits of the linear model with slow translocation and the branched model to the force–velocity relationship of the fast mutant Pol II.[58,54] When fitting the single-molecule data, most studies up to date have made the simplifying assumption that the rates of translocation and nucleotide binding are much faster than the rate of NTP incorporation. When using this simplified assumption and the linear model (Figure 10A), Abbondanzieri et al. and Larson et al. found that the velocity versus force data could not be fit well. This observation prompted them to propose a more complex, branched model of elongation that involves the existence of a secondary NTP binding site (Figure 10B).[52,54] While in the linear model the incoming NTP can only bind after the polymerase has translocated, in the branched model the NTP can bind to both the pre- and the post-translocated states. However, Dangkulwanich et al. recently questioned the validity of the common assumption that the transition rates from the pre- to post-translocated states are much larger than those of other kinetic steps.[58] Whereas the application of external force affects both the forward and the reverse translocation rates of the enzyme, Dangkulwanich et al. recognized that a mechanical barrier such as a nucleosome only affects the forward rate, making it possible to separate it from the reverse rate. Using this approach, the authors found that the forward translocation rate is indeed comparable to the subsequent catalytic rates.[58] Thus, the fast translocation assumption previously used to fit the linear model to the data is not appropriate. Accordingly, the pause-free velocities as a function of force can be fit with the exact solution of the linear Brownian ratchet model just as well as to the branched model (Figure 10C). Therefore, it is not necessary to invoke a second NTP binding site to explain the force–velocity data. This finding makes the simpler, linear Brownian ratchet model (Figure 10A) more appealing to explain the nucleotide incorporation cycle of the polymerase.

Pausing

Polymerase pauses can significantly reduce the overall transcription rate during elongation. For instance, at saturating concentrations of NTPs, the velocity of the eukaryotic Pol II including pauses is about one-half of its pause-free counterpart: 11.6 ± 2.5 bp/s[59] as compared to 22.9 ± 5.0 bp/s (same data as Bintu et al.,[59] unpublished result). This observation raises the possibility that various transcription factors can potentially regulate the length and the frequency of pauses to control the rate of transcription. Characterizing the nature of these pauses and how they are regulated is, therefore, essential for a comprehensive understanding of the regulation of gene expression. It is accepted that a pause corresponds to a state that deviates from the main nucleotide incorporation pathway, as pausing has been shown to compete kinetically with elongation: the higher is the elongation rate, the fewer are the pauses.[46] However, the nature of these pauses is actively being studied. The duration of transcriptional pauses varies from under a second to minutes, and cannot be fit with a single exponential distribution.[50,52,55,59] This observation suggests that there is not a single paused state. An initial analysis divided bacterial RNAP pauses into two categories: short pauses, under 20 s, and long pauses, above 20 s.[55,60] Short pause durations can be fit with two exponentials with time constants of 1.2 and 6 s. By contrast, long pauses are infrequent with durations that are broadly distributed. The long pauses were clearly shown to be associated with the backward movement of the polymerase on DNA averaging about 5 bp,[60] followed by the return of the polymerase to the original position. This phenomenon, initially described by ensemble footprinting studies, was termed “backtracking” (Figure 11C).[61,62] However, pauses defined as short in the single-molecule studies of the bacterial RNAP did not show backtracking (Figure 11D).[55,60] Moreover, the density (number of pauses per bp transcribed) and the durations of short pauses were reported to be insensitive to force.[55] Because backtracking involves movement on DNA, one would expect an assisting force to decrease pausing. Therefore, the force insensitivity of short pauses was interpreted as another piece of evidence that the short pauses are not associated with backtracking. These short pauses were called “ubiquitous” or “elemental”, and were speculated to come from molecular rearrangements of the elongation complex that would render it elongation incompetent. In fact, a recent crystal structure of the elemental pause state of bacterial RNAP showed that the clamp is open, and the bridge helix is kinked and blocks the NTP binding site, whereas the RNA–DNA hybrid binding site along with the RNA exit channel are widened.[63] In one view of the mechanisms of transcriptional pausing, RNAP must first enter this elemental paused state.[64−66] These elemental pauses can be subsequently stabilized into longer-lived pauses by RNAP backtracking or by the formation of a hairpin structure in the nascent RNA transcript.[67,68]
Figure 11

Mechanisms of transcriptional pauses. (A) Model of pausing including elemental/ubiquitous pauses that can lead to backtracking or hairpin stabilized pauses. (B) Backtracking model of pausing, which suggests that the elemental pause state is not obligatory. (C) Example single-molecule trace showing long pauses and backtracking. (D) Example trace showing a short pause. (E) Predictions of average trajectories for long and short pauses based on the backtracking model. (C,D) Reprinted with permission from ref (60). Copyright 2003 Nature Publishing Group. (E) Adapted with permission from ref (69). Copyright 2009 Elsevier.

Mechanisms of transcriptional pauses. (A) Model of pausing including elemental/ubiquitous pauses that can lead to backtracking or hairpin stabilized pauses. (B) Backtracking model of pausing, which suggests that the elemental pause state is not obligatory. (C) Example single-molecule trace showing long pauses and backtracking. (D) Example trace showing a short pause. (E) Predictions of average trajectories for long and short pauses based on the backtracking model. (C,D) Reprinted with permission from ref (60). Copyright 2003 Nature Publishing Group. (E) Adapted with permission from ref (69). Copyright 2009 Elsevier. While some of the short pauses may not be associated with backtracking (ubiquitous or elemental pauses), it is entirely possible that some of the pauses classified as ubiquitous are short backtracked pauses. An alternative view poses that the elemental pause state is not obligatory and attributes most pauses to backtracking. One expects short backtracking pauses to be associated with short backward excursions (under the resolution of these experiments ∼3 bp) and therefore to display an apparent force insensitivity.[69] In addition, the rate of entering the 1-bp backtracked state is faster than entering further backtracked states,[58] which predicts that short backtracks are on the same time scale (<1 s) as what has been called ubiquitous pauses. In an analysis of eukaryotic Pol II, pauses longer than 1 s can be aptly described by a backtracking model in which a pause begins with the backward movement of Pol II by one base pair (Figure 11B).[50] As Pol II backtracks, the entire elongation bubble shifts, and the 3′ end of RNA loses its register with the active center of the polymerase and inhibits NTP incorporation. The polymerase performs a random walk back and forth on the DNA until the 3′ end of the transcript realigns with the active center to allow Pol II to resume elongation. According to this model, the diffusion of the enzyme along the DNA explains a wide distribution of pause durations: in some cases, the random walk finishes in a few steps, while in others the polymerase randomly backtracks many base pairs, so it takes much longer to recover. In fact, the distribution of pause durations can be derived exactly from this model by calculating the probability of observing a random trajectory with n steps, multiplying it by the distribution of times that it takes to perform those steps (which is given by a Gamma distribution), and summing over all allowed values of n.[53] The resulting pause duration distribution is given by:where I1 is the modified Bessel function of the first kind, and kf and kb are the forward and backward stepping rates during backtracking (Figure 11B).[53,69] These rates depend on the applied force (F), the distance to the transition state for a step (d), and the back and forth stepping rate of the backtracked polymerase on DNA in the absence of applied force (k0) as follows: kf = k0e and kb = k0e–. Note that for short durations, on the order of the rate of backtracking, the distribution ψ(t) behaves as an exponential, while for long durations and small forces, the probability distribution follows a t–3/2 power law. In fact, the experimental distribution of pauses for Pol II was found to follow this power law.[50] The fact that the backtracking model fits all pause durations very well, with just one parameter (k0), makes it quite appealing.[50,53,69,70] However, the pausing mechanism in the eukaryotic Pol II could be different from that of the prokaryotic RNAP. In fact, Kireeva et al. showed the E. coli RNA polymerase paused at certain sequences without backtracking, while yeast Pol II does not recognize the same pause signal.[71] Therefore, the possibility that some of the short pauses are not associated with backtracks is likely.

DNA Sequence and Nascent RNA Effects on Elongation Dynamics

Ensemble biochemical studies revealed that RNA polymerases pause or arrest on certain DNA templates.[71] An important question is then how the transcribed sequence modulates pausing. Possible candidates are the local stability of the DNA–RNA hybrid, the DNA–DNA interactions upstream and downstream of the hybrid, the extent and stability of RNA secondary structures behind the polymerase, and allosteric interactions of the RNA with the enzyme (Figure 12A). Although some of these mechanisms have been shown to affect the pausing probability at a given sequence, the process by which the enzyme pauses in a sequence-dependent manner remains an area of active research.
Figure 12

Regulations of transcriptional pauses. (A) Different elements that influence sequence-dependent pausing. (B) Dwell times from single-molecule data showing sequence-dependent pausing of RNAP.[64] (C) For Pol II, mean pause durations decrease as the GC content of the template increases.[51] The effect disappears as the nascent RNA is digested, suggesting the structure of the RNA aids pause recovery. (B) Adapted with permission from ref (64). Copyright 2006 Elsevier. (C) Adapted with permission from ref (51). Copyright 2012 National Academy of Sciences.

Regulations of transcriptional pauses. (A) Different elements that influence sequence-dependent pausing. (B) Dwell times from single-molecule data showing sequence-dependent pausing of RNAP.[64] (C) For Pol II, mean pause durations decrease as the GC content of the template increases.[51] The effect disappears as the nascent RNA is digested, suggesting the structure of the RNA aids pause recovery. (B) Adapted with permission from ref (64). Copyright 2006 Elsevier. (C) Adapted with permission from ref (51). Copyright 2012 National Academy of Sciences. Two kinetic models have explicitly calculated the energy differences between the pre-translocated state, the post-translocated state, and backtracked states at each position on DNA.[72,73] One of the models[72] posits that there is a high activation barrier to enter the backtracking state (on the order of 40 kBT), and therefore predicts that most of the sequence specific pauses are trapped in the pre-translocated state. The alternative model[73] assumes that entering a backtrack is relatively easy, but that backtracks are on average limited to 9 bp by the folding of the nascent RNA. These models can correctly predict 60–80% of pauses observed experimentally by RNAP. However, in addition to missing pauses, both models predict pauses at positions that are not experimentally associated with pausing: 20–40% of predicted pauses are false positives. Single-molecule data can test the predictions of these models by analyzing the pause dynamics and thus inferring the mechanism of pausing at various sequences. However, single-molecule studies on the sequence dependence of pausing are difficult, because even if changes in polymerase positions can be determined with base-pair resolution, the accuracy with which the absolute position of the polymerase on DNA can be determined is much lower. One study overcame this shortcoming by rescaling all traces so that they are perfectly aligned at the point where RNAP runs off the template.[74] Using this technique, the authors were able to achieve an accuracy of about 5 bp, and they showed that for a particular sequence (ΔtR2), the pause durations were sensitive to the applied force, suggesting that backtracking takes place at this pause site. A different single-molecule study of pausing at multiple sites (Figure 12B) did not detect backtracking at sequences that induced RNAP pausing.[64] Note that the alignment of RNAP position was performed using the peak pause sites at the well-studied his sequence as a reference point. For the alignment to work correctly, pausing at the his site has to always take place without backtracking, which is believed to be the case. The lifetimes of the paused states at each pausing site could be fit with one exponential, and the time scales of these pauses varied between 1 and 6 s. By correlating the length of the RNA transcript from bulk transcription to the pause peaks in single-molecule traces, the authors proposed that these sequence-specific pauses, which occur with the polymerase stuck in the pre-translocated state,[75,76] are in fact the same as the ubiquitous pauses identified before, and that they are prerequisites for longer pauses stabilized by backtracking or RNA hairpins. By analyzing the DNA sequence at the pause sites, the authors noted that in all seven sequences, G or C bases were present immediately upstream of the RNA–DNA hybrid (at position −10 and −11). The mechanism by which these sequences induced RNAP pausing is still not clear. These bases could extend the RNA–DNA hybrid to the upstream side and generate stress in the enzyme, stabilize a state backtracked by 1 or 2 base pairs, or delay the elongation bubble from moving from the pre- to the post-translocated state. Because the authors did not observe backtracking at these pause sites, the mechanisms involving backtracked state stabilizing are not likely. From these two experimental results, it seems that different sequences can induce different types of pausing. Moreover, it appears that the eukaryotic Pol II and the prokaryotic RNAP respond to the same pause sequences differently.[71] This result raises the question of whether interactions of the polymerase with the surrounding DNA or RNA are different in the prokaryotic and eukaryotic enzymes. The crystal structures of the prokaryotic and eukaryotic polymerases show them to be quite similar in the region that interacts with the elongation bubble. However, this might not be the case for interactions of the nucleic acids with the outer part of the enzymes. To address the question, one has to map the location of the DNA and RNA once they exit the polymerase. This type of experiment was performed in Pol II by mapping the RNA exiting Pol II using single-molecule fluorescent techniques.[77] Their results showed that after leaving the exit channel, the 5′ end of the RNA is initially free, but then the RNA reassociates with the base of the Pol II dock domain when it reaches about 26–29 nt in length. This type of interaction could prevent backtracking or otherwise allosterically modulate pausing. In addition, most of the sequence-dependent data analysis has focused on the region immediately surrounding the transcription bubble. However, sequences already transcribed can significantly alter the dynamics of pausing via the secondary structure of the nascent RNA. Recent results show that templates that are richer in GC base pairs lead to fewer and shorter pauses than AT-rich templates for both the nuclear yeast Pol II as well as the mitochondrial Rpo41 (Figure 12C).[51] The difference disappears when the RNA transcript harbored by the enzyme is digested with RNase A, suggesting that GC-rich RNAs form stronger secondary structures that prevent pausing, most likely by limiting backtracking.

Transcription through the Nucleosome

In eukaryotes, the DNA is wrapped in nucleosomes, which act as mechanical barriers to the advancing polymerase. It has been shown that chromatin structure and organization constitute an important mechanism of control of gene expression.[78,79] As such, it is of great interest to establish what happens when a transcribing enzyme encounters a nucleosome. Conversely, it is of interest to understand the fate of histones during and after transcription. We can propose two ways Pol II can overcome the nucleosomal barrier. In one scenario, the enzyme actively unwraps the nucleosomal DNA from the surface of the histone octamer to advance, and, in the other, the polymerase simply stops and waits for a spontaneous unwrapping of the nucleosomal DNA before it can advance. These different modes of interactions between the polymerase and the nucleosome have different implications for the regulation of gene expression. Single-molecule experiments in which a nucleosome was placed downstream of the transcribing Pol II have shown that advancement of the enzyme through the barrier depends on a fine interplay between the enzyme dynamics (including pausing and translocation), and the fast dynamics of nucleosomal fluctuations.[53] Upon encountering a nucleosome, the pause durations and pause densities of Pol II increased, while the pause-free velocity was seen to decrease considerably (Figure 13A). Increased pause densities at the nucleosome are consistent with a model where Pol II cannot actively unwrap the nucleosomal DNA even when it is in an elongation competent state.[53] Instead, Pol II has to wait for nucleosomal fluctuations to allow its access to the downstream DNA. These interpretations are consistent with the observation that Pol II can only generate a maximum of ∼7.5 pN of force (stall force) before entering irrecoverable backtracks on bare DNA.[50] While mechanical pulling from both ends of the DNA requires a force of ∼8 pN to peel the DNA off the octamer surface (at 300 mM KCl),[70] wedging of a transcribing polymerase through a nucleosomal barrier is a different process as the enzyme only exerts force through one end of the nucleosomal DNA. Even E. coli RNAP, which has a stall force 3 times higher than yeast Pol II,[48] cannot peel the DNA from the octamer surface.[80] Thus, disrupting the histones–DNA interactions via this wedging mechanism requires an enzyme that can generate much higher force. In the presence of the nucleosome, Pol II pauses, and the distribution of pause durations was similar to that on bare DNA, except that it shifted toward longer pauses. Accordingly, this distribution was well described by the same backtracking model previously used for bare DNA with one modification: recovery from backtracks is only possible when the DNA downstream of the enzyme is unwrapped (Figure 13B).[50,53] Because nucleosome fluctuations are fast relative to the rate of elongation and the rate of diffusion of the enzyme during backtracking, the rate of recovery from backtracks (kf) is reduced by the probability of finding the nucleosome locally unwrapped. This modification explains the observed increase in the extent and duration of backtracks when the nucleosome is present.
Figure 13

Dynamics of nucleosomal transcription. (A) Single-molecule traces showing increased pausing at the nucleosome as compared to bare DNA.[53] (B) Comparison of 3′ end of the RNA position to the RNAP position on the template reveals backtracking at the nucleosome.[80] (C) Kinetic model for transcription at the nucleosome.[53] (D) Pausing at the nucleosome for tailless and Sin-modified nucleosomes.[70] (E) Model of histone transfer through looping.[59] (B) Adapted with permission from ref (80). Copyright 2010 Nature Publishing Group. (C) Adapted with permission from ref (53). Copyright 2009 American Association for the Advancement of Science. (D) Adapted with permission from ref (70). Copyright 2012 Elsevier.

Dynamics of nucleosomal transcription. (A) Single-molecule traces showing increased pausing at the nucleosome as compared to bare DNA.[53] (B) Comparison of 3′ end of the RNA position to the RNAP position on the template reveals backtracking at the nucleosome.[80] (C) Kinetic model for transcription at the nucleosome.[53] (D) Pausing at the nucleosome for tailless and Sin-modified nucleosomes.[70] (E) Model of histone transfer through looping.[59] (B) Adapted with permission from ref (80). Copyright 2010 Nature Publishing Group. (C) Adapted with permission from ref (53). Copyright 2009 American Association for the Advancement of Science. (D) Adapted with permission from ref (70). Copyright 2012 Elsevier. The increase in backtracking of a polymerase upon encountering a nucleosome was directly shown in another single-molecule study,[80] although using the prokaryotic RNAP as a substitute for Pol II. In this case, the authors unzipped the two strands of DNA to find the position of the polymerase after allowing it to transcribe through the nucleosome for various amounts of time. By comparing the positions of polymerases on DNA from these single-molecule experiments with the distribution of RNA lengths obtained by running the labeled product of transcription on a gel, the authors conclude that the polymerase backtracks by about 15 bp on average when it encounters the nucleosome (Figure 13C). Consistent with this finding, it has been shown that the presence of a second trailing enzyme helps the leading polymerase overcome the nucleosome by preventing it from backtracking.[80] The leading polymerase also helps the trailing one transcribe through the nucleosome by preventing the latter from rewrapping, and thus providing a clear path for the second enzyme. A theoretical analysis of the mechanism through which a pair of polymerases can influence each other activities has appeared recently.[81] This mechanism could be at work in genes with high rates of transcription, where multiple polymerases would cooperate to transcribe through the nucleosome in a synergistic manner. In vivo, transcription elongation is regulated by specialized machineries that modify or remodel nucleosomes. However, in many cases, it is unclear whether these modifications directly affect polymerase elongation, or indirectly regulate transcription through recruitment of other factors. A recent single-molecule study of transcription through modified nucleosomes showed that removal or mock acetylation of the histone tails has a modest effect on overall transcription elongation.[70] The number and durations of pauses for transcription through tailless and acetylated nucleosomes decreased only in the entry region of the nucleosome (Figure 13D), but remained unchanged in the central region, which constitutes the main barrier to Pol II. This finding suggests that the histone tails control the gate into the nucleosomal region for Pol II and possibly for other remodeling factors. In contrast, single amino-acid mutations of residues in histones H3 or H4 that make direct contact with the DNA near the nucleosome dyad dramatically decreased Pol II, pausing in the central region of the nucleosome (Figure 13D). These mutations, while not natural, highlight the contribution of histone–DNA contacts to the magnitude of the barrier in the central region, and suggest that a nucleosome binding- or remodeling-factor that can disrupt even a single one of these contacts can dramatically enhance the efficiency of elongation through the nucleosome. What is the fate of histones during and after transcription? To answer this question, Hodges et al. applied force between the upstream DNA and the polymerase (Figure 8B) after the latter had transcribed through a nucleosome. These authors found that ∼60% of the histone cores remained associated to the DNA after the passage of the enzyme. Interestingly, it was found that this fraction reduced to less than 10% when a force of only 3–5 pN was applied during the passage of the enzyme through the histone core.[53] This observation suggests that the partially unwrapped histones contact the upstream DNA forming a (tension sensitive) loop that allows their transfer upstream of the elongation complexes (Figure 13E). A subsequent study using AFM to image elongation complexes at different points during transcription has provided additional support to this looping model.[59] Significantly, this AFM study showed that the fate of the transcribed histones (detachment, partial dissociation, or upstream transfer) depends on the elongation rates of the transcribing enzyme, making it possible to rationalize the observations made by different laboratories when the polymerase encounters the nucleosome. In some studies, a histone octamer has been reported to move upstream of its initial nucleosome positioning sequence;[82−84] in other studies, a mixture of octamers and hexamers was observed,[85,86] and, in yet other studies, complete detachment of the nucleosome from the template was reported.[84] Bintu et al. suggest that the outcome of the process depends on a kinetic competition among the elongation rate of the transcribing polymerase, the rate of octamer transfer behind the polymerase, and the rate of H2A–H2B dissociation from the octamer.[59] According to their interpretations, as Pol II advances through the nucleosome, the DNA is being detached from the histones. The octamer being a collection of positively charged proteins is unstable at salt concentrations under 1 M. Unless the histones contact another piece of DNA that neutralizes their charges and stabilizes their association, the octamer may dissociate with partial loss of its components. Initially, as the nucleosome partially unwraps during Pol II advancement, enough of the histone core is exposed to allow contact with the upstream DNA through a temporary DNA loop, but not so much as to cause H2A/H2B dissociation. During slow transcription (100 μM NTPs), this partially exposed histone intermediate lasts long enough to allow transfer of the intact octamer onto the upstream DNA. However, if the rate of transcription is increased slightly, more of the nucleosome will unwrap, and, as enough of the histone core becomes exposed, dimer dissociation starts competing with octamer transfer to the upstream DNA. Under these conditions, representative for transcription at 200 and 1000 μM NTPs, both octamers and hexamers can be found as a result of transcription. Finally, when the rates of transcription are even higher, enough DNA is unwrapped from the surface of the histone core that the complete histone detachment from DNA greatly outcompetes the rates of histone transfer and histone–histone dissociation, thus leading to bare DNA formation.

Transcription Factors That Modulate Elongation

In addition to the effects of DNA sequence, nascent RNA, and nucleosomes, many transcription factors regulate the elongation phase in the cell. Certain factors stimulate elongation, while other factors hinder it. Some also regulate the fidelity of transcription. Optical tweezers-based single-molecule studies have investigated the underlying mechanisms of a few of these elongation factors, GreA,[60] GreB,[60] NusG,[87] and NusA from E. coli,[88] and TFIIS from S. cerevisiae.[50] Prokaryotic transcription factors GreA, GreB, and their eukaryotic functional homologue, TFIIS, have been identified to enhance transcriptional fidelity by stimulating cleavage of misincorporated nucleotides.[89−91] At the single-molecule level, both GreA and GreB were observed to decrease the frequency and duration of long transcriptional pauses. However, these two factors affect pauses differently. While GreB was found to decrease both the frequency and the duration of long pauses, GreA mainly decreased the frequency. In terms of backtracked distance, GreB prevents the rearward movement during a pause, whereas GreA does not.[60] These observations are consistent with the known activities of these two factors. GreA only stimulates the cleavage of 2 or 3 nt of backtracked RNA, thus rescuing the polymerase at the pause entry, but it is unable to completely eliminate pauses associated with longer backtracks.[92] In contrast, GreB stimulates the cleavage of larger fragments, thus completely eliminating the need of recovery from backtracks.[92] The presence of a nucleotide analogue ITP increases both the density and the durations of transcriptional pauses. GreA and GreB decrease the durations of the pauses substantially, while they slightly decrease the pause density.[60] These observations indicate that these factors aid the polymerase to exit from misincorporation-induced pauses without affecting pause entering, consistent with their known function in proofreading through the stimulation of the intrinsic cleavage activity of RNAP. In eukaryotes, TFIIS like GreB stimulates the cleavage of the backtracked RNA transcript, and assists Pol II to recover faster from backtracks. In the absence of TFIIS, Pol II was only able to transcribe against an opposing load of 7.5 ± 2.0 pN, because its propensity to backtrack stalled the enzyme at high opposing loads. By rescuing the backtracked Pol II, TFIIS allowed the enzyme to transcribe up to a force of 16.9 ± 3.4 pN (Figure 14A).[50] Note that the stall force of the enzyme is highly dependent on the DNA template. This result is consistent with ensemble experiments that have shown that TFIIS can help Pol II transcribe against the nucleosome,[93,94] which, like an opposing force, also induces long backtracks of the enzyme.
Figure 14

Effects of transcription factors on the elongation dynamics. (A) An example of cycles of backtracking and TFIIS rescue at 18 pN.[50] (B,C) Representative traces of transcription by RNAP along the ops repeat template in the absence of any transcription factors (red), in the presence of 2 μM NusG (blue in (B)),[87] or in the presence of 0.5 μM NusA (blue in (C)).[88] (D) Energy diagram illustrating the transcriptional modulation by NusA.[88] (A) Adapted with permission from ref (50). Copyright 2007 Nature Publishing Group. (B) Adapted with permission from ref (87). Copyright 2010 Elsevier. (C,D) Adapted with permission from ref (88). Copyright 2011 Elsevier.

Effects of transcription factors on the elongation dynamics. (A) An example of cycles of backtracking and TFIIS rescue at 18 pN.[50] (B,C) Representative traces of transcription by RNAP along the ops repeat template in the absence of any transcription factors (red), in the presence of 2 μM NusG (blue in (B)),[87] or in the presence of 0.5 μM NusA (blue in (C)).[88] (D) Energy diagram illustrating the transcriptional modulation by NusA.[88] (A) Adapted with permission from ref (50). Copyright 2007 Nature Publishing Group. (B) Adapted with permission from ref (87). Copyright 2010 Elsevier. (C,D) Adapted with permission from ref (88). Copyright 2011 Elsevier. Another E. coli transcription factor that is known to assist elongation is NusG, which associates with RNAP and increases the overall velocity of the process. Single-molecule assays of E. coli RNAP transcription elongation in the presence of NusG have shown that it increased the pause-free velocity of RNAP by 10–20% and simultaneously decreased its pause frequency (Figure 14B). As pausing kinetically competes with translocation, the observation that NusG decreased the pause frequency suggests that it shifts the equilibrium between the pre- and post-translocated state of the enzyme toward the latter. Interestingly, this result implies that translocation of the E. coli polymerase must be one of the rate-limiting steps of the kinetic cycle, just as has been recently shown for the eukaryotic enzyme.[58] In contrast to the factors discussed above, optical tweezers experiment demonstrated that E. coli NusA slowed the pause-free velocity of RNAP and dramatically decreased the force where the pause-free velocity is half-maximal (Figure 14C) (without NusA 18 ± 2 pN of assisting load, with NusA 1 ± 4 pN of opposing load).[88] The effects of NusA are equivalent to exerting a hindering load of 19 ± 6 pN, which corresponds to an average energy barrier to forward translocation of 2.0 ± 0.4 kBT, a magnitude equivalent to that of the intrinsic translocation barrier of Pol II.[58] NusA increases the pause frequency without affecting the duration of pauses. These observations are consistent with a model where NusA shifts the equilibrium of the translocation step toward the pre-translocated state (Figure 14D). In addition, the effects of NusA and NusG on pause frequency vary for different engineered pause sequences. The mechanisms responsible for the sequence-dependent roles of NusA and NusG remain to be determined.

Transcription under Torsion

Another important regulator of transcription elongation in cells is DNA supercoiling. As RNAP transcribes the DNA template, it generates positively supercoiled DNA ahead of itself and negatively supercoiled DNA in its wake, which in turn affects the dynamics of the enzyme.[95] A recent study used a nanofabricated quartz cylinder held in an angular optical trap,[96] which can simultaneously control and measure rotation, torque, displacement, and force of the trapped cylinder to address the effect of DNA supercoiling on transcription elongation.[97] The experiments show that downstream torque (positive supercoil) decreases the pause-free velocity of the E. coli RNAP, while it increases the pause density and the duration of pauses. Elongation is halted when the enzyme transcribed against a downstream torque load of 11.0 ± 3.7 pN nm.[96] When the torsional stress is relaxed, the RNAP resumes transcription. Similarly, the enzyme could work against an average upstream torque (negative supercoil) of 10.6 ± 4.1 pN nm. The amount of torque required to melt DNA is ∼10 pN nm.[96,98] These results indicate that the E. coli RNAP can generate sufficient torque in its wake to melt the DNA. The torque so generated could regulate other processes, such as displacement of histones or other DNA-binding proteins that regulate transcription, or could influence transcription of other RNA polymerases on the same or opposite strand. In vivo, the interplay between topoisomerases and torque-generating processes, such as transcription, should ultimately regulate the torsional state of the DNA, which in turn should affect all DNA transactions.[99]

Transcription Termination

Termination is the last stage of transcription wherein the elongation complex (EC) reaches the termination sequence, releases the nascent RNA, and the RNA polymerase disengages from the DNA template. It is known that bacterial and eukaryotic transcription termination mechanisms are quite different. Bacteria employ two termination pathways: intrinsic and Rho-dependent termination. During intrinsic termination (also called Rho-independent termination), a stem-loop hairpin encoded in the termination sequence causes the EC to dissociate. Formation of these RNA hairpin structures can be modulated by the binding of ligands such as adenine, thiamine pyrophosphate, or antiterminator RNA binding proteins. The other transcription termination mechanism involves Rho, a ring-shaped helicase that translocates along the mRNA and presumably pulls it out of the RNA–DNA hybrid inside the EC or induces a conformational change in the EC. Recently, both mechanisms of bacterial transcription termination have been studied using single-molecule techniques. Eukaryotic transcription termination mechanisms are more complex and less understood as compared to the bacterial counterpart. Therefore, we will mainly discuss bacterial transcription termination mechanisms in this section.

Rho-Dependent Termination

The Rho-dependent mechanism constitutes 20–50% of all bacterial RNA synthesis termination. Rho is a ∼50 kDa ring-shaped hexameric ATPase motor protein that translocates along the RNA and subsequently disrupts the RNA–DNA hybrid inside the EC to release the transcript. Rho contains two domains: the N-terminal RNA binding domain and the C-terminal RecA-like ATPase domain.[100] Rho-dependent termination starts with the binding of Rho to the RNA at a Rho utilization (rut) site, which is a C-rich region of 85–97 nt. Rho binds to the rut site with high affinity: Ka ≈ 1010 M–1.[101,102] Once the RNA associates with the primary binding site of Rho (RNA binding domain), the motor translocates RNA through its center. The affinity of RNA to the secondary site (RecA-like ATPase domain) is relatively weak: Ka ≈ 4 × 106 M–1.[101,103] After Rho reaches a paused RNAP, other transcription factors such as NusA and NusG also participate in the transcript release process.[104] NusA decreases the efficiency of Rho-dependent transcription termination at the tiZ1 and tiZ2 intragenic terminators, while NusG increases it.[104] Surprisingly, the roles of these factors in transcription termination are opposite from what one would expect from their effects in the elongation phase. Because NusA decreases the overall elongation velocity, we would expect it to enhance termination. Conversely, NusG increases efficiency of elongation; hence, we would expect it to reduce termination efficiency. It is unclear what mechanisms allow these factors to affect termination and elongation in such different manners. Recently, the binding of Rho was examined using optical tweezers.[105] These experiments revealed that Rho binds 57 ± 2 nt of RNA in the primary site (9–10 nt per monomeric protein) and 28 ± 2 nt of RNA in the secondary site (Figure 15A). In addition, the data suggest that Rho translocation occurs by the “tethered tracking mechanism”, in which Rho remains bound to the rut site while the motor threads the downstream RNA sequence in the 5′ to 3′ direction through the secondary binding site (Figure 15B).[102]
Figure 15

A model for Rho-dependent termination. (A) A general model of Rho binding to mRNA and translocation through the pore.[105] The N-terminal domain of Rho (cyan) associates with the RNA transcript, which is then passed into the center of the hexamer, allowing Rho to translocate downstream toward RNAP with a loop. (B) Proposed model for Rho-dependent termination. (A) Adapted with permission from ref (105). Copyright 2012 Elsevier.

A model for Rho-dependent termination. (A) A general model of Rho binding to mRNA and translocation through the pore.[105] The N-terminal domain of Rho (cyan) associates with the RNA transcript, which is then passed into the center of the hexamer, allowing Rho to translocate downstream toward RNAP with a loop. (B) Proposed model for Rho-dependent termination. (A) Adapted with permission from ref (105). Copyright 2012 Elsevier. However, many mechanistic details about Rho-dependent termination remain unknown. For instance, how fast does Rho translocate? How many bases does Rho translocate per ATP-hydrolysis? What are the roles of NusA and NusG during transcription termination? What is the role of Rho during EC dissociation? Future single-molecule studies of Rho will help elucidate these mechanistic details.

Intrinsic Termination

Intrinsic termination involves a nascent RNA transcript, which forms a stem loop structure followed by 7–9 nt of a U-rich RNA–DNA hybrid (Figure 16).[101,102] Larson et al. employed a single-molecule optical tweezers assay to study the mechanism of intrinsic transcription termination.[106] By quantifying the termination efficiency at different applied forces on either the DNA or the RNA, the authors characterized mechanisms of transcription in three representative bacterial intrinsic terminators (his, t500, and tR2). Transcription termination efficiencies at these sequences vary from 30% to 98% in bulk.[106] The termination efficiency for his and tR2 terminators did not change with assisting or hindering loads applied on the DNA. This observation indicates that entering termination at these locations does not require translocation of the RNAP along the DNA. On the other hand, the t500 terminator, which contains a U–A pair instead of a G–C pair at the base of the hairpin and two interruptions in the U tract, shows a force dependency in both the kinetics and the efficiency of termination in a manner corresponding to a forward translocation of the RNAP by ∼2.9 bp (Figure 16, hypertranslocation).
Figure 16

Intrinsic transcription termination. RNAP synthesizes the GC-rich RNA hairpin (blue loop), followed by U-rich segment, which forms a weak hybrid with DNA. RNAP can translocate forward along the DNA template without NTP incorporation in a process called hypertranslocation. Alternatively, the hairpin can destabilize the DNA–RNA hybrid either by shearing or by allosteric interactions. These destabilized structures induce the dissociation of the elongation complex.

Intrinsic transcription termination. RNAP synthesizes the GC-rich RNA hairpin (blue loop), followed by U-rich segment, which forms a weak hybrid with DNA. RNAP can translocate forward along the DNA template without NTP incorporation in a process called hypertranslocation. Alternatively, the hairpin can destabilize the DNA–RNA hybrid either by shearing or by allosteric interactions. These destabilized structures induce the dissociation of the elongation complex. The integrity of the terminator RNA hairpin is required to induce the changes in the EC involved in termination. Results from force applied to the nascent RNA are consistent with this conclusion. The authors proposed a model in which the terminator hairpin closure induces shearing of the RNA–DNA hybrid to dissociate the elongation complex (Figure 16, shearing), but alternative mechanisms involving dissociation through an allosteric mechanism cannot be ruled out.

Effects of RNA Structure Dynamics on Intrinsic Transcription Termination

As we described above, the secondary structure of the nascent RNA can determine the fate of transcription termination. The control of transcription termination by RNA-binding proteins that modulate RNA structure is an important regulatory mechanism of gene expression in bacteria. Several antitermination proteins, including HutP, GlcT, and LicT, directly bind single-stranded RNA, and stabilize a competitive alternative secondary structure.[107−109] Using single-molecule FRET, the RNA hairpin structure fluctuations induced by binding of LicT and SacY antiterminator proteins from Bacillus subtillis were monitored.[110] These proteins prevent transcription termination by forming a shorter RNA hairpin (antiterminator RNA hairpin) that precludes the folding of the intrinsic RNA hairpin terminator (Figure 16). The strength of the antiterminator depends on the stability of the protein–RNA interactions. The length of the stem is the main determinant of the stability of these RNA hairpins.[110] These experiments were done on purified RNA strands; it will be interesting to study how these antiterminator proteins affect termination as the nascent RNA emerges from the polymerase one nucleotide at a time and folds cotranscriptionally. The effect of cotranscriptional folding of nascent RNA in termination was addressed in an independent study of adenine-sensitive riboswitch. Riboswitches are regulatory segments of mRNA that bind a small molecule, and are involved in transcription termination, inhibition of translation, and splicing.[110,111] A riboswitch consists of two parts: an aptamer that binds a small molecule and an expression platform that regulates gene expression (Figure 17). A recent optical tweezers study monitored the cotranscriptional folding of nascent RNA of an adenine-sensitive riboswitch.[112] In the absence of adenine, when the nascent transcript is held at a constant force below 7 pN, a small hairpin is seen to fold first, sequestering 21 ± 1 nt. This event is followed by the folding of the terminator hairpin, which sequesters 50 ± 1 nt, and releases the transcript. In the presence of adenine, a higher fraction of the polymerase transcribes through the termination sequence due to the formation of an alternative aptamer structure instead of the terminator hairpin. At forces higher than ∼13 pN, the unfolding force of this terminator hairpin, transcription termination efficiency decreases.[112] These experiments highlight the importance of competing, alternative secondary structures in the nascent RNA and their folding dynamics, as a basic mechanism through which small molecule effectors and transcription factors can control gene expression.
Figure 17

Regulation mechanisms of transcription termination by (A) antiterminator or (B) riboswitch. In these cases, the binding of either antiterminator protein or adenine inhibits the formation of the terminator hairpin and prevents transcription termination.

Regulation mechanisms of transcription termination by (A) antiterminator or (B) riboswitch. In these cases, the binding of either antiterminator protein or adenine inhibits the formation of the terminator hairpin and prevents transcription termination.

Eukaryotic Transcription Termination

As compared to transcription initiation and elongation, little is known about the mechanism of transcription termination in eukaryotes. Termination mechanisms differ among various RNA polymerases. RNA polymerase I (Pol I) terminates at a poly (T) stretch by binding of the NTS1 family of silencing proteins, Nsi 1[113] (see more details in the review by Németh et al.[114]). Pol III termination is induced by a poly (T) stretch alone. More details about the mechanism of Pol III termination can be found in work by Arimbasseri et al.[115] The transcription termination mechanism in Pol II is different from that of other polymerases. It involves phosphorylation of the C-terminal domain of Pol II and association of several transcription factors (more detailed information can be found in work by Mischo and Proudfoot[116]). Future studies, including single-molecule approaches, should shed light on the detailed mechanisms of transcription termination in eukaryotes.

Concluding Remarks

Transcription is a highly regulated process made up of distinct, strictly regulated steps. The emergence of single-molecule studies over the last two decades has provided many mechanistic insights into this process from the mechanisms by which polymerases locate and bind to promoters to molecular events that lead to the release of the RNA transcript from the complex. These studies have also begun to provide a coherent picture of the energy flow during the transcription cycle. In particular, the processes through which the energy released in the binding and hydrolysis of NTPs is converted into mechanical movement of the enzyme along the template and the generation of force and torque are starting to appear in sharper focus. As a result, scientists are now adding rich dynamic information derived from carefully designed single-molecule experiments to the increasing number of crystal structures depicting snapshots of RNA polymerases in different states of their kinetic cycle. Although initially much information was derived from studies on bacterial transcription, more recent investigations have paid increasing attention to the behavior of the more complex eukaryotic process. Comparison between these two systems will surely provide important additional insights on the molecular mechanisms underlying the regulation of the three phases of transcription, and their evolution. Single-molecule methods are maturing at a fast pace, making it possible to design and execute ever more complex experiments involving many different molecular actors. These advances should help scientists to obtain a more realistic picture of the complex dynamics and control of transcription resembling more closely those operating in vivo, without sacrificing the precision and quantitative description afforded by in vitro studies. Likewise, through the combination of single-molecule manipulation and single-molecule fluorescence methods in the same experiment, it should be possible to follow, for example, the internal dynamics of the polymerase or the binding of a regulatory factor and simultaneously monitor the mechanical variables of position, force, and torque. The result of these efforts will be a multidimensional picture of transcription that will provide crucial information about the relative timing of various molecular events and therefore reveal their causal connection. At the end of the last century, a large gap existed between our knowledge of the structures responsible for the readout of the genetic information and our understanding of the functional and structural transitions of those structures. The difficulty to synchronize the individual trajectories of large numbers of molecules undergoing complex transitions in an attempt to arrive at a timed-ordered sequence of events was largely responsible for that gap. The advent of single-molecule methods has begun to fill this knowledge gap, and scientists can now confidently expect that future research using these methods will eventually yield a detailed “moving picture” of the molecular processes underlying transcription.
  117 in total

1.  The kinetic pathway of RNA binding to the Escherichia coli transcription termination factor Rho.

Authors:  D E Kim; S S Patel
Journal:  J Biol Chem       Date:  2001-01-23       Impact factor: 5.157

2.  Pausing by bacterial RNA polymerase is mediated by mechanistically distinct classes of signals.

Authors:  I Artsimovitch; R Landick
Journal:  Proc Natl Acad Sci U S A       Date:  2000-06-20       Impact factor: 11.205

3.  A transcription reinitiation intermediate that is stabilized by activator.

Authors:  N Yudkovsky; J A Ranish; S Hahn
Journal:  Nature       Date:  2000-11-09       Impact factor: 49.962

4.  Real-time dynamics of RNA polymerase II clustering in live human cells.

Authors:  Ibrahim I Cisse; Ignacio Izeddin; Sebastien Z Causse; Lydia Boudarene; Adrien Senecal; Leila Muresan; Claire Dugast-Darzacq; Bassam Hajj; Maxime Dahan; Xavier Darzacq
Journal:  Science       Date:  2013-07-04       Impact factor: 47.728

5.  RNA polymerase approaches its promoter without long-range sliding along DNA.

Authors:  Larry J Friedman; Jeffrey P Mumm; Jeff Gelles
Journal:  Proc Natl Acad Sci U S A       Date:  2013-05-29       Impact factor: 11.205

6.  Single-molecule study of transcriptional pausing and arrest by E. coli RNA polymerase.

Authors:  R J Davenport; G J Wuite; R Landick; C Bustamante
Journal:  Science       Date:  2000-03-31       Impact factor: 47.728

7.  Specific interaction of the RNA-binding domain of the bacillus subtilis transcriptional antiterminator GlcT with its RNA target, RAT.

Authors:  I Langbein; S Bachem; J Stülke
Journal:  J Mol Biol       Date:  1999-11-05       Impact factor: 5.469

8.  Architecture of an RNA polymerase II transcription pre-initiation complex.

Authors:  Kenji Murakami; Hans Elmlund; Nir Kalisman; David A Bushnell; Christopher M Adams; Maia Azubel; Dominika Elmlund; Yael Levi-Kalisman; Xin Liu; Brian J Gibbons; Michael Levitt; Roger D Kornberg
Journal:  Science       Date:  2013-09-26       Impact factor: 47.728

9.  Transcription under torsion.

Authors:  Jie Ma; Lu Bai; Michelle D Wang
Journal:  Science       Date:  2013-06-28       Impact factor: 47.728

10.  Complete dissection of transcription elongation reveals slow translocation of RNA polymerase II in a linear ratchet mechanism.

Authors:  Manchuta Dangkulwanich; Toyotaka Ishibashi; Shixin Liu; Maria L Kireeva; Lucyna Lubkowska; Mikhail Kashlev; Carlos J Bustamante
Journal:  Elife       Date:  2013-09-24       Impact factor: 8.140

View more
  33 in total

1.  Riboswitch structure and dynamics by smFRET microscopy.

Authors:  Krishna C Suddala; Nils G Walter
Journal:  Methods Enzymol       Date:  2014       Impact factor: 1.600

2.  Piconewton-Scale Analysis of Ras-BRaf Signal Transduction with Single-Molecule Force Spectroscopy.

Authors:  Chae-Seok Lim; Cheng Wen; Yanghui Sheng; Guangfu Wang; Zhuan Zhou; Shiqiang Wang; Huaye Zhang; Anpei Ye; J Julius Zhu
Journal:  Small       Date:  2017-08-15       Impact factor: 13.281

3.  Kinetics coming into focus: single-molecule microscopy of riboswitch dynamics.

Authors:  Sujay Ray; Adrien Chauvier; Nils G Walter
Journal:  RNA Biol       Date:  2018-10-29       Impact factor: 4.652

4.  Multi-wavelength single-molecule fluorescence analysis of transcription mechanisms.

Authors:  Larry J Friedman; Jeff Gelles
Journal:  Methods       Date:  2015-05-30       Impact factor: 3.608

5.  Metalloregulator CueR biases RNA polymerase's kinetic sampling of dead-end or open complex to repress or activate transcription.

Authors:  Danya J Martell; Chandra P Joshi; Ahmed Gaballa; Ace George Santiago; Tai-Yen Chen; Won Jung; John D Helmann; Peng Chen
Journal:  Proc Natl Acad Sci U S A       Date:  2015-10-19       Impact factor: 11.205

6.  Kinetics of transcription initiation directed by multiple cis-regulatory elements on the glnAp2 promoter.

Authors:  Yaolai Wang; Feng Liu; Wei Wang
Journal:  Nucleic Acids Res       Date:  2016-11-29       Impact factor: 16.971

7.  A code for transcription elongation speed.

Authors:  Eyal Cohen; Zohar Zafrir; Tamir Tuller
Journal:  RNA Biol       Date:  2017-11-22       Impact factor: 4.652

8.  Mechanically switching single-molecule fluorescence of GFP by unfolding and refolding.

Authors:  Ziad Ganim; Matthias Rief
Journal:  Proc Natl Acad Sci U S A       Date:  2017-10-03       Impact factor: 11.205

9.  Interactions between RNA polymerase and the "core recognition element" counteract pausing.

Authors:  Irina O Vvedenskaya; Hanif Vahedian-Movahed; Jeremy G Bird; Jared G Knoblauch; Seth R Goldman; Yu Zhang; Richard H Ebright; Bryce E Nickels
Journal:  Science       Date:  2014-06-13       Impact factor: 47.728

10.  Massively Systematic Transcript End Readout, "MASTER": Transcription Start Site Selection, Transcriptional Slippage, and Transcript Yields.

Authors:  Irina O Vvedenskaya; Yuanchao Zhang; Seth R Goldman; Anna Valenti; Valeria Visone; Deanne M Taylor; Richard H Ebright; Bryce E Nickels
Journal:  Mol Cell       Date:  2015-11-25       Impact factor: 17.970

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.