The sigma(54) factor associates with core RNA polymerase (RNAP) to form a holoenzyme that is unable to initiate transcription unless acted on by an activator protein. sigma(54) is closely involved in many steps of activator-dependent transcription, such as core RNAP binding, promoter recognition, activator interaction and open complex formation. To systematically define sigma(54) residues that contribute to each of these functions and to generate a resource for site specific protein labeling, a complete mutant library of sigma(54) was constructed by alanine-cysteine scanning mutagenesis. Amino acid residues from 3 to 476 of Cys(-)sigma(54) were systematically mutated to alanine and cysteine in groups of two adjacent residues at a time. The influences of each substitution pair upon the functions of sigma(54) were analyzed in vivo and in vitro and the functions of many residues were revealed for the first time. Increased sigma(54) isomerization activity seldom corresponded with an increased transcription activity of the holoenzyme, suggesting the steps after sigma(54) isomerization, likely to be changes in core RNAP structure, are also strictly regulated or rate limiting to open complex formation. A linkage between core RNAP-binding activity and activator responsiveness indicates that the sigma(54)-core RNAP interface changes upon activation.
The sigma(54) factor associates with core RNA polymerase (RNAP) to form a holoenzyme that is unable to initiate transcription unless acted on by an activator protein. sigma(54) is closely involved in many steps of activator-dependent transcription, such as core RNAP binding, promoter recognition, activator interaction and open complex formation. To systematically define sigma(54) residues that contribute to each of these functions and to generate a resource for site specific protein labeling, a complete mutant library of sigma(54) was constructed by alanine-cysteine scanning mutagenesis. Amino acid residues from 3 to 476 of Cys(-)sigma(54) were systematically mutated to alanine and cysteine in groups of two adjacent residues at a time. The influences of each substitution pair upon the functions of sigma(54) were analyzed in vivo and in vitro and the functions of many residues were revealed for the first time. Increased sigma(54) isomerization activity seldom corresponded with an increased transcription activity of the holoenzyme, suggesting the steps after sigma(54) isomerization, likely to be changes in core RNAP structure, are also strictly regulated or rate limiting to open complex formation. A linkage between core RNAP-binding activity and activator responsiveness indicates that the sigma(54)-core RNAP interface changes upon activation.
Major regulation of gene expression mainly depends upon the control of transcription initiation. In particular, the activity of RNA polymerase is regulated to control the level of gene expression. In bacteria, RNA polymerase holoenzyme is composed of core RNAP and a σ factor, which is responsible for the specific recognition and melting of promoter sequence (1). The σ factors in bacteria can be divided into two classes: the σ70 and σ54 class (2). Although they bind to the same core RNAP, these two classes of σ factors have no similarity in primary sequence and the modes of their transcription regulation are very different.Unlike σ70-holoenzyme, the initiation rates of σ54-holoenzyme are mainly controlled via regulation of the DNA melting step (3,4). The σ54 -holoenzyme first binds promoter DNA in a transcriptionally silent closed complex (5–7). Conversion of the closed complex to a transcriptionally proficient open complex requires a specialized activator protein which interacts with σ54-holoenzyme through DNA looping and brings about a conformational change in the closed promoter complex in a nucleotide hydrolysis dependent way, leading to σ54-holoenzyme isomerization and promoter melting (7–11).σ54 plays an important role in the process of open complex formation and interacts extensively with activator protein, promoter DNA and core RNAP to regulate the proper formation of the open complex. Open complex formation by σ54-holoenzyme appears to involve the activator dependent relocation of three structurally distinct σ54 domains within the holoenzyme to (i) allow positioning of promoter DNA for entry into the active site of RNAP and (ii) removal of a σ54 associated protein density that would otherwise prevent DNA delivery (12). Based on biochemical and genetic characterization and fragmentation analyses, σ54 can be divided into three regions (2,13–16) (Figure 1). Region I comprises the amino terminal 56 residues and is important for activator responsiveness (10,17,18). Region II comprises residues 57–106 and is characterized by a predominance of acidic residues. Region II is not essential for the function of σ54. In the carboxyl terminal (residues 107–477) lies Region III which contains a primary core RNAP-binding domain located between residues 120 and 215 (16), and a DNA-binding domain located between residues 329 and 477 (13,19,20). A modulation domain lies between these two domains and influences DNA-binding activity indirectly (14).
Figure 1.
Diagram of the functional regions of σ54 and summary of the results. Grey bar below the diagram of σ54 indicates the predicted helical region in Region I. Black bars below the diagram of σ54 indicate the helices in the carboxyl terminal DNA binding domain based on the partial structure of σ54 from Aquifex aeolicus (20,21). Darker color is used to represent higher activity. The function each color represents is indicated in the left of the figure. The squares at the left of the colored bars indicate the color corresponding to the activity of Cys(–)σ54. In the bar representing the activity of σ54-activator interaction, the dark blue color indicates the variants that form additional complexes in the presence of PspF1−275 and ADP·AlFx. The data are from Table 1.
Diagram of the functional regions of σ54 and summary of the results. Grey bar below the diagram of σ54 indicates the predicted helical region in Region I. Black bars below the diagram of σ54 indicate the helices in the carboxyl terminal DNA binding domain based on the partial structure of σ54 from Aquifex aeolicus (20,21). Darker color is used to represent higher activity. The function each color represents is indicated in the left of the figure. The squares at the left of the colored bars indicate the color corresponding to the activity of Cys(–)σ54. In the bar representing the activity of σ54-activator interaction, the dark blue color indicates the variants that form additional complexes in the presence of PspF1−275 and ADP·AlFx. The data are from Table 1.
aActivities in all the assays are measured relative to Cys(–)σ54. Isomerization ability of σ54 is calculated as a percentage of initially bound DNA converted to ssσ–DNA in an activator and dGTP dependent way.
bTested using β-galactosidase assay.
cNew complexes were formed when PspF1–275 was trapped with σ54 in the presence of ADP·AlFx.
dLow level of in vivo expression.
Due to the lack of a high-resolution structure of σ54, it is still not clear which residues in σ54 are responsible for the interactions with itself or other elements such as core RNAP and promoter DNA in the process of transcription initiation. Nor are the primary σ54 sequences of densities evident in electron microscopy studies precisely delineated (12). To systematically define the functions of each residue in σ54, a site-directed mutant library of σ54 was constructed using alanine–cysteine scanning mutagenesis. Mutation of residues other than glycine to alanine can be viewed as side-chain deletion and alanine scanning has been widely used in the research of protein functions (21,22), while mutation to cysteine can facilitate site specific protein modification (23–25). Alanine–cysteine scanning mutagenesis used in this article combines the advantages of these two kinds of mutations and facilitates both structural and functional analyses of a protein.Here, residues 3–476 of a cysteine free form of Klebsiella pneumoniae σ54 were changed to alanine and cysteine in groups of two consecutive residues, producing a total of 237 variants of σ54. This single cysteine mutant library is very helpful in both the structural and functional analyses of σ54, and this paper focuses on the latter.The results indicate that: (i) two parts of the Region I are the main targets for activator interaction; (ii) the carboxyl terminal of modulation domain can affect σ54 isomerization, probably through it influencing the −12 promoter sequence interaction; (iii) the carboxyl terminal part of DNA-binding domain is important for maintain the in vivo integrity of σ54, but less important in the regulation of σ54 activity. The linkage between different functions of σ54 and possible mechanisms are discussed. The single cysteine mutant library of σ54 provides a powerful tool for further structural analyses using ensemble and single molecule Foster Resonant Energy Transfer (FRET) approaches (26).
MATERIALS AND METHODS
Site-directed mutagenesis
The pET28 based plasmid pSRWCys(−) (25), which directs the synthesis of the amino-terminal 6-His-tagged Cys(−)σ54, was used as the template to create the single cysteine variants of σ54 using the Quickchange mutagenesis kit (Stratagene). The whole rpoN gene directing the synthesis of σ54 protein in each mutated plasmid was sequenced to ensure that only the desired substitutions were present.
β-Galactosidase assays
Plasmids carrying mutated rpoN genes were transformed to Escherichia coli rpoN knockout strain TH1/pMB221 (27). pMB221 contains both the reporter gene lacZ and the activator gene nifA. The reporter gene lacZ is under the control of K. pneumoniae nifH promoter, and the activator NifA is expressed from bla promoter. Transformants were grown at 37°C and then 100 μl of the overnight culture was used to inoculate 10 ml of LB with 50 μg/ml kanamycin and 34 μg/ml chloromycetin. Cells were grown at 37°C until A600 reaches 0.8. σ54-dependent gene expression in this system depend on a basal T7-RNAP independent expression of rpoN from pET28. The average β-galactosidase results from three independent colonies are presented.
Immunoblotting
Mutated plasmids were transformed to TH1/pMB221 and were cultured under the same condition used in β-galactosidase assays. Cells (1 ml) were then collected by centrifugation and resuspended in 60 μl of 10 mM Tris, 0.1 mM EDTA, pH 7.9. Ten microliters of the concentrated cell solution was lysed with 10 μl of 2× SDS sample buffer, heated at 95°C and 10 μl of each were loaded. Proteins were separated on denaturing 8% SDS–PAGE gels and blotted onto PVDF membranes. Anti-σ54 (prepared by Institute of Genetics and Developmental Biology, Chinese Academy of Sciences) and alkaline phosphatase conjugated anti-rabbit IgG (Pierce) were used for detection.
Protein purification
Mutated σ54 proteins were overexpressed in E. coli B834 (DE3). Two hundred microliters of overnight cultures were used to inoculate 20 ml of LB with 50 μg/ml kanamycin and cells were grown at 37°C until A600 reached 0.6. The cultures were then shifted to 25°C and IPTG was added to 1 mM. Incubation continued for 2 h at 25°C. Cells were collected by centrifugation, and resuspended in 500 μl of Buffer A [25 mM sodium phosphate (pH 7.0), 50 mM NaCl, 5% (v/v) glycerol]. The concentrated cells were lysed by adding 55 μl of FastBreak™ Reagent (Promega) into the buffer and were shaken for 15 min at room temperature. The lysates were then incubated for 3 min with 450 μl of Ni–NTA resin (Qiagen) in the columns, which were then washed with 500 μl of Buffer B (25 mM sodium phosphate (pH 7.0), 50 mM NaCl, 30 mM imidazole, 5% (v/v) glycerol). Mutated σ54 proteins were eluted in 400 μl of Buffer C [25 mM sodium phosphate (pH 7.0), 50 mM NaCl, 1 M imidazole, 5% (v/v) glycerol]. Purified σ54 proteins were dialysed against 10 mM Tris (pH 7.5), 0.1 mM EDTA, 100 mM NaCl, 50% glycerol, and stored at −20°C.
Core RNAP-binding assays
E. coli core RNAP (Epicentre, 250 nM) and different amounts of σ54 variants were mixed together in Tris–NaCl buffer (40 mM Tris–HCl (pH 8.0), 10% (v/v) glycerol, 0.1 mM EDTA, 1 mM DTT, 100 mM NaCl) and incubated for 10 min at 30°C, followed by the addition of glycerol bromophenol blue loading dye. Samples were loaded onto native 4.5% PAGE gels and run at 50 V for 2 h at room temperature in TG buffer (25 mM Tris, 200 mM glycine, pH 8.6). Proteins were visualized by Coomasie Blue staining.
Gel mobility shift assay
32P-labeled 88-mer top strand −12/−11 mismatched heteroduplex DNA fragment consisting of the −60 to +28 Sinorhizobium meliloti nifH promoter sequence was used as a template for gel mobility shift assays. The reactions contained 20 nM DNA, 1 μM σ54 in STA buffer [25 mM Tris–acetate (pH 8.0), 8 mM magnesium acetate, 10 mM KCl, 3.5% (w/v) PEG 8000] and were incubated for 10 min at 30°C. Where indicated, PspF1−275 (5 μM), dGTP (4 mM) were added for 5 min prior to gel loading. Free DNA and σ54 bound DNA were separated on 4.5% native PAGE gels and run at 80 V for 70 min at room temperature in TG buffer (25 mM Tris, 200 mM glycine, pH 8.6). Quantitative data were from phosphorimager analyses of the gels.
σ54-activator interaction
Twenty micromolar PspF1−275 and 1 μM of σ54 were incubated at 30°C for 5 min in the above STA buffer with ADP (0.2 mM) and NaF (5.0 mM). After addition of AlCl3 (0.2 mM) the samples were incubated for a further 10 min and loaded onto 4.5% native PAGE gels and run at 120 V for 50 min at room temperature in the above TG buffer. Proteins were visualized by Coomassie Blue staining.
Abortive transcription assay
Open complexes were formed on plasmid pMKC28 (28) (10 nM final concentration) in the presence of 100 nM σ54-holoenzyme (constituted using 2.5:1 molar ratio of σ54 over core RNAP), 20 U RNase inhibitor, 4 mM dATP and 5 μM of PspF1−275. A mix containing 100 μg/ml heparin, 0.5 mM UpG, 4 μCi [α-32P] GTP was added to the reaction for the abortive synthesis of a four nucleotide transcript (UGGG). The reactions were separated on a 20% denaturing gel and quantified as in gel mobility shift assay. Replicate assays were conducted to confirm the low activities of the variants defective in abortive transcription.
RESULTS
Mutagenesis
σ54 from K. pneumoniae contains two cysteine residues at positions 198 and 346 (29). Substitution of these two cysteine residues with alanine does not affect the function of σ54 significantly (25). Therefore, in order to construct variants carrying a single cysteine, the pET28b based plasmid pSRWCys(–), which directs the synthesis of the Cys(–)σ54, was used as the template (25). Residues of Cys(–)σ54 from 3 to 476 were systematically substituted with alanine–cysteine in groups of two adjacent residues at a time, for a total of 237 variants of σ54. Substitution with alanine can be viewed as a side-chain deletion, whereas substitution with cysteine facilitates side-chain modification at specific positions. Each mutated rpoN gene was sequenced in its entirety to establish the presence of only the expected substitutions.
Region II is not essential for the in vivo activity of σ54
β-Galactosidase assays were used to test the in vivo activities of the variants. The plasmid carrying the mutated rpoN gene and another plasmid, pMB221 carrying the reporter and activator genes (27), were transformed to E. coli TH1, which lacks functional σ54. The expression of mutated σ54 depends on the low level of basal expression of rpoN gene from the pET28 based vector. The β-galactosidase gene is under the control of K. pneumoniae nifH promoter and the activator NifA is constitutively expressed from the bla promoter. NifA acts with σ54 to activate the expression of β-galactosidase gene from σ54 dependent nifH promoter. Therefore, the activity of β-galactosidase can be used to indicate the in vivo activity of σ54 variants.The mutations that affected the in vivo activity of σ54 are distributed extensively in Region I and III, and are relatively concentrated in the carboxyl terminal of σ54 that includes the −24 DNA-binding bihelical structure (30) (Table 1). Consistent with the fact that Region II is not well conserved in sequence, none of the Region II variants exhibited poor activity.Summary of the activities of σ54 variants+: σ54 variants bind PspF1–275 normally. +/−: σ54 variants bind PspF1–275 weakly. −: σ54 variants cannot bind PspF1–275.aActivities in all the assays are measured relative to Cys(–)σ54. Isomerization ability of σ54 is calculated as a percentage of initially bound DNA converted to ssσ–DNA in an activator and dGTP dependent way.bTested using β-galactosidase assay.cNew complexes were formed when PspF1–275 was trapped with σ54 in the presence of ADP·AlFx.dLow level of in vivo expression.Low activities of the variants could be caused by protein instability. Immunoblotting was therefore carried out to measure the expression levels of the variants which retained <20% activity compared to Cys(–)σ54. σ54 variants were expressed under the same condition used in the in vivo activity assay. Variants found at lower than wild-type expression levels have substitutions mainly located in the DNA-binding domain [AAs 329–477, defined previously by proteolysis experiments and including the −24 promoter recognition structure (30,31)], especially in the carboxyl terminal, between residues 431 and 476 (Figure 2), indicating that many residues in this region are important for maintaining the structural integrity of σ54
in vivo, and that the instabilities of these variants may be one of the reasons for their low in vivo activities. However, the series of purified σ54 variants changed in residues 457–476 migrated as discrete bands in native gels, each with a mobility similar to the Cys(–)σ54 (when loaded at higher concentrations for detection), suggesting that their gross structural integrity is not seriously affected in vitro (Figure S1A). These variants were detected in native gels at concentrations where the Cys(–)σ54 was not easily detected, indicating a less diffuse behavior, and by inference some differences in conformation compared to the control protein Cys(–)σ54.
Figure 2.
In vivo stability of σ54 variants which retained <20% in vivo activity compared to Cys(-)σ54. The functional regions in which these variants are located are also shown. The asterisks indicate the variants with very low expression levels. Immunoblots were of lysates of TH1 cells used in in vivo activity assays, containing pMB221 and mutant rpoN genes. Equal amounts of each cell extract were loaded. Immunoblots of purified σ54 samples are also shown presented at 4, 8 and 12 ng.
In vivo stability of σ54 variants which retained <20% in vivo activity compared to Cys(-)σ54. The functional regions in which these variants are located are also shown. The asterisks indicate the variants with very low expression levels. Immunoblots were of lysates of TH1 cells used in in vivo activity assays, containing pMB221 and mutant rpoN genes. Equal amounts of each cell extract were loaded. Immunoblots of purified σ54 samples are also shown presented at 4, 8 and 12 ng.
Residues contributing to core RNAP binding are extensively distributed throughout Region I and III
Next, all the variants as well as Cys(–)σ54 were purified by Ni affinity chromatography for further functional analyses in vitro. An essential and first step en route to open complex formation is holoenzyme forming between σ and core RNAP. Native gel mobility shift assay was used to measure the binding of σ54 variants to core RNAP. Holoenzyme can be detected in Coomassie blue stained gels based on its different mobility versus core RNAP.Consistent with the view that the core RNAP interface of σ54 comprises at least two functionally important but distinct sequences: 1–56 and 120–215 (16), many mutations located in these two sequences altered the interaction of σ54 with core RNAP: Variants AC 5/6, AC 19/20, AC 121/123, AC 139/140, AC 151/152, AC 175/176 and AC 179/180 shifted all core RNAP at a molar ratio of 4:1 (σ54: core RNAP), whereas Cys(–)σ54 can shift all core RNAP at a molar ratio of 1:1 (Figure 3). In Region I, mutations that affected binding of σ54 to core RNAP mainly locate in its amino terminus, while mutations between residues 39–56 had no effect upon holoenzyme formation (Figure S1A). Furthermore, in this assay, many residues in modulation domain or DNA-binding domain were found to be important for core RNAP binding (Figure 3), indicating residues affecting interaction of σ54 with core RNAP are widely distributed across Regions I and III. Although Region II is reported to play a role in the binding of σ54 to core RNAP (32), none of the mutations in Region II affected the core RNAP-binding activities in our assays (Figure S1B).
Figure 3.
Formation of holoenzyme by σ54 variants which bind core RNAP with low affinity. Holoenzyme were formed at 1:1, 1:2 and 1:4 ratios of core RNAP (250 nM) to σ. Migration positions of core RNAP, holoenzyme and free σ are indicated.
Formation of holoenzyme by σ54 variants which bind core RNAP with low affinity. Holoenzyme were formed at 1:1, 1:2 and 1:4 ratios of core RNAP (250 nM) to σ. Migration positions of core RNAP, holoenzyme and free σ are indicated.
Regions I and III cooperate for activator interaction
σ54-Holoenzyme initiates transcription in response to an interaction with an activator protein, and σ54 is the major and direct target for this interaction (7,33,34). To identify the residues in σ54 responsible for the stable interaction with activator protein, an assay to measure the interaction between the σ54 and the activator protein by native-PAGE of the reaction was used. A DNA-binding domain deletion form of the E. coli phage shock protein F (PspF1−275) served as a model activator in these assays (35). Since the interaction between σ54 and its activators is normally very transient, stable complex formation between σ54 and PspF1−275 depends on ADP·AlFx—an ATP analog that mimics the state of ATP at the point of hydrolysis, which ‘traps’ the transient complex between PspF1−275 and σ54 so that it can be resolved and detected on native gels (34).Results show that four Region I variants (AC17/18, AC29/30, AC37/38 and AC47/48.) failed to form the ADP·AlFx dependent stable complex with PspF1−275, indicating the substituted residues contribute to part of the interface σ54 makes with activator protein (Figure 4). Mutations of these residues may alter this interface and prevent the variants from making stable interaction with activator protein. Low in vivo activities of these four variants may also be attributed to their altered interaction with activator protein.
Figure 4.
σ54 variants which cannot bind PspF1–275 normally in the presence of ADP·AlFx. ADP·AlFx dependent complexes formed between σ54 and PspF1–275 were detected by Coomassie staining. σ54 and PspF1–275 were presented at 1 and 20 μM separately. Arrow (a) indicates the complex formed between σ54 and PspF1–275 in the presence of ADP·AlFx. Arrow (b) indicates the changed position of PspF1–275 complex in the presence of ADP·AlFx. Arrow (c) and (d) indicates the new bands formed in this assay.
σ54 variants which cannot bind PspF1–275 normally in the presence of ADP·AlFx. ADP·AlFx dependent complexes formed between σ54 and PspF1–275 were detected by Coomassie staining. σ54 and PspF1–275 were presented at 1 and 20 μM separately. Arrow (a) indicates the complex formed between σ54 and PspF1–275 in the presence of ADP·AlFx. Arrow (b) indicates the changed position of PspF1–275 complex in the presence of ADP·AlFx. Arrow (c) and (d) indicates the new bands formed in this assay.Many Region III variants also cannot form a detectable stable complex with PspF1−275, implying that determinants contributing to the interface between σ54 and activator protein are widely distributed. Considering that Region I constitute the primary target for activator interaction (34), it is probable that substitution of these Region III residues indirectly affect the stable σ54-activator complex formation through the interaction with Region I.Interestingly, some σ54 variants in the trapping reactions showed formation of a new complex which has not been observed before. This new complex has a slower mobility compared to the normal σ54-activator complex (Figure 4). We speculate that this new complex might be a σ54-activator complex with a new conformation, caused by the altered interaction of σ54 with PspF1−275. The new complexes were detected under conditions supporting PspF–ADP·AlFx formation, but we did not determine if all the reaction components (AlCl, NaF, ADP) were required. However, the presence of ADP alone was not sufficient for formation of new complexes (data not shown). Among the variants with this phenotype, most have substitutions located in Region III and only one variant was in Region II, residues 83–84.
Promoter DNA binding
σ54 is a DNA-binding protein that can in vitro occupy some of its cognate promoters in the absence of core RNAP (36). Interaction of σ54 with repressive fork junction promoter sequences around −12 plays an important role in the regulation of open complex formation (12,37–39). To determine the σ54 sequences that contribute to promoter DNA binding, we conducted DNA-binding assay using heteroduplex promoter DNA (opened at −12 and −11, termed early melted DNA) to mimic the state of DNA in the closed complex (Figure 5A). Mutations that affected early melted DNA binding mainly located in two regions: the carboxyl terminal of Region I (residues from 25 to 48) and the DNA-binding domain (residues from 329 to 477). In Region I (residues from 25 to 38), it is interesting to note the periodicity of the DNA-binding defective phenotype (Figure 5B). Some other mutations located in modulation domain (residues 207–210, 241–242 and 295–300) also affect early melted DNA binding, consistent with the view that modulation domain functions to assist DNA binding (14).
Figure 5.
(A) S. meliloti nifH heteroduplex promoter probe used for DNA binding and σ54 isomerization assay. The consensus GG and GC elements are underlined. The mismatched region and transcription start sequence are boxed in black. (B) The amounts of σ–DNA and ssσ–DNA complex resulting from the action of PspF1–275 and dGTP are plotted. Only the examples of variants that fail to bind DNA, form ssσ–DNA complex, or variants that more readily form the ssσ–DNA complex are shown. I. Sigma binding to early melted DNA. II. ssσ–DNA complex formation in the presence of PspF1–275 and dGTP.
(A) S. meliloti nifH heteroduplex promoter probe used for DNA binding and σ54 isomerization assay. The consensus GG and GC elements are underlined. The mismatched region and transcription start sequence are boxed in black. (B) The amounts of σ–DNA and ssσ–DNA complex resulting from the action of PspF1–275 and dGTP are plotted. Only the examples of variants that fail to bind DNA, form ssσ–DNA complex, or variants that more readily form the ssσ–DNA complex are shown. I. Sigma binding to early melted DNA. II. ssσ–DNA complex formation in the presence of PspF1–275 and dGTP.
Variants defective in early melted DNA-binding function abnormally in isomerization assay
σ54 bound to early melted DNA responds to activator protein in the presence of an hydrolysable NTP source and isomerizes independently of core RNAP to form a new slower migrating ‘supershifted (ss)’ DNA complex (33). To characterize the sequences in σ54 that contribute to activator responsiveness, we attempted to form the isomerized supershifted σ–DNA complex (ssσ–DNA) using activator PspF1−275 and hydrolysable nucleotide dGTP. For each variant, a comparison of the fraction of initially bound DNA converted, in an activator and hydrolysable nucleotide dependent way, to the new isomerized ssσ–DNA species was made. Reactions without hydrolysable nucleotide were also conducted to potentially identify any variant that can response to activator protein in a hydrolysable nucleotide independent way. However, we failed to detect any variant with this phenotype, indicating the importance of hydrolysable nucleotide to the normal function of activator protein (data not shown).The σ54 isomerization assay revealed two classes of variants that behave differently from Cys(–)σ54. The first class of variants is represented by some Region I variants which failed to form the isomerized ssσ–DNA species. This phenotype is also shared with some other variants that have substitutions located in core RNAP-binding domain (153–154), carboxyl terminal of modulation domain (295–296, 299–300), and DNA-binding domain (323–324, 335–336 and 455–456) (Figure 5B). However, residues that contribute to σ54 isomerization mainly located in two parts of Region I (11–18 and 25–36), indicating that Region I comprises the major elements influencing activator responsiveness, but likely interacts with other parts of σ54. The second class of variants is represented by some Region III variants. In striking contrast to the first class, this class of variants can form isomerized ssσ–DNA species much more efficiently than Cys(–)σ54 (Figure 5B). As expected, early melted DNA binding seems to be important to the normal formation of ssσ–DNA, since most variants defective in early melted DNA binding either failed to form ssσ–DNA (represented by Region I variants), or formed ssσ–DNA much more efficiently (represented by Region III variants) (Figure 5B and Table 1). The latter group of variants seem to have a reduced barrier to being isomerized, implying Region III residues keep isomerization in check and that σ54 could function through concerted changes in the organization of its domains.
Transcription activities of the variants in carboxyl terminal of DNA-binding domain are restored in vitro
To further define the elements in σ54 that contribute to open complex formation and transcription initiation, abortive transcription assays were conducted from a supercoiled S. meliloti nifH promoter template (pMKC28) (28), whose sequence from −1 to +3 is TGGG (Figure 5A). If the activator PspF1−275 and dATP (needed for the ATPase activity of the activator protein) were provided, the tetranucleotide UGGG can be synthesized by σ54-holoenzyme in the presence of the priming dinucleotide UpG, and transcription substrate GTP. Heparin was also added together with GTP and after dATP plus PspF to measure transcription from a single round of activation.The in vitro transcription activities of σ54 variants were largely in accordance with their in vivo activity, but in the carboxyl terminal of DNA-binding domain, many variants with low in vivo activity functioned normally in the abortive transcription assay (Table 1 and Figure S3). Considering the low in vivo expression levels of these variants detected by immunoblotting, the low in vivo activities of these variants could be attributed to their in vivo instability or reduced abilities to compete with other σ factors in vivo, and these defects are not operational in vitro. Notably for many variants that had improved activities in forming the isomerized ssσ–DNA species, the abilities of the holoenzyme they formed to initiate transcription were seldom improved, sometimes even impaired (Figure 6 and Table 1), demonstrating the influence of steps after σ54 isomerization to open complex formation and suggesting coordinated changes in σ54 and core RNAP structure are required for making activator dependent open complexes.
Figure 6.
Mutations in σ54 that affect transcription initiation. Autoradiograph of the denaturing gel showing the synthesis of abortive transcripts from holoenzyme formed with σ54 variants which affect transcription initiation.
Mutations in σ54 that affect transcription initiation. Autoradiograph of the denaturing gel showing the synthesis of abortive transcripts from holoenzyme formed with σ54 variants which affect transcription initiation.
DISCUSSION
Cryo-EM reconstruction of σ54–holoenzyme reveals four density regions (D1, Db, D2 and D3) attributable to σ54. Region I sequence, including the −12 recognition region, is within Db density which is located at the position of DNA loading into the RNAP active centre, and is suggested to prevent the active centre from accessing the template DNA strand, therefore inhibiting the spontaneously formation of the open complex from double stranded DNA. Interaction of activator protein and σ54 likely causes a conformational change of Region I which removes the steric obstruction and modifies the interaction with −12 promoter to allow the loading of the template strand into the active centre, leading to open complex formation (Figure 7) (12). Our results fully support this model, indicating that Region I is the major element interacting with activator protein and plays an important role in coordinating activator responsiveness. Although several distinct activities reside in discrete parts of σ54, residues of σ54 contributing to core RNAP binding or activator contacting are widely distributed across Regions I and III of σ54, consistent with the observation that multiple interactions with core RNAP occur with several domains of σ54 and that the main connecting density between PspF1−275 and σ54–holoenzyme occurs not only at the interface of Db, but also at D1 and D2 which contain major core RNAP-binding element of σ54 and part of Region III of σ54 (12). These residues probably constitute the surfaces of the density regions through which σ54 exerts its influence upon the reorganization of RNAP.
Figure 7.
The left panel shows the cryo-EM reconstruction of σ54-holoenzyme bound to the activator protein in the absence of promoter DNA (12). Shown in green and yellow are the positions of the σ54 densities before and after binding to the activator protein, respectively. The σ54-holoenzyme is shown alone (light blue) and in complex with the activator PspF1–275 (red). The middle and right panels show cartoons depicting the structural changes inferred from cryo-EM reconstruction of the σ54-holoenzyme and σ54-holoenzyme bound by the activator protein. The promoter DNA is shown as an orange line: In the ‘CC conformation’ σ54 domains D2 and D3 contact the −24 and −12 consensus promoter sequences upstream of the +1 site, respectively, thereby distorting DNA next to −12. The interaction with the activator protein (red/pink) in a mixed nucleotide state is thought to enable σ54-holoenzyme to adopt a conformation similar to that in the intermediate complex (stabilized experimentally with ADP·AlFx).
The left panel shows the cryo-EM reconstruction of σ54-holoenzyme bound to the activator protein in the absence of promoter DNA (12). Shown in green and yellow are the positions of the σ54 densities before and after binding to the activator protein, respectively. The σ54-holoenzyme is shown alone (light blue) and in complex with the activator PspF1–275 (red). The middle and right panels show cartoons depicting the structural changes inferred from cryo-EM reconstruction of the σ54-holoenzyme and σ54-holoenzyme bound by the activator protein. The promoter DNA is shown as an orange line: In the ‘CC conformation’ σ54 domains D2 and D3 contact the −24 and −12 consensus promoter sequences upstream of the +1 site, respectively, thereby distorting DNA next to −12. The interaction with the activator protein (red/pink) in a mixed nucleotide state is thought to enable σ54-holoenzyme to adopt a conformation similar to that in the intermediate complex (stabilized experimentally with ADP·AlFx).
Roles of Region I sequences
The amino terminal Region I plays a central role in activator responsiveness and may act as an organizing centre that brings the key σ54, core RNAP and DNA component together (10,40,41). Mutations affecting σ54 function located between residues 15 and 47 have been described previously (18,21), and we identified one additional sequence affecting activator responsiveness of σ54, residues 11–14 (Figure 1). Most mutations affecting σ54 isomerization were located between residues 11–18 and 25–36, suggesting these two sequences are the primary sequences for activator responsiveness. Since the −12 promoter sequence was shown to have a role in contributing to supershifted ssσ–DNA complex formation (33), for some variants defective in early melted DNA binding (AC25/26, 29/30, 33/34, 47/48), the failure to form isomerized ssσ–DNA species may be attributed to the altered interaction of σ54 with −12 promoter sequence. Although some variants in DNA-binding domain also bound DNA poorly and less ssσ–DNA complexes were formed, the percentage of the initial bound DNA converted to ssσ–DNA is much higher compared to Cys(–)σ54, suggesting that for these Region I variants, poor DNA binding per se does not limit the isomerization of sigma–DNA complex. Indeed tight DNA binding at −12 may limit isomerization. It has been shown before that interaction of σ54 and −12 promoter sequence influences the regulation of open complex formation (37,39,42). Consistent with this, our results show a very clear correlation between Region I variants that have a defect in early melted DNA binding and their poor activities in activated transcription, contrasting the result that many other variants function normally in activated transcription, even if they have the same overall defect in early melted DNA binding (Figure 1). It has been suggested that an α-helix in Region I could be required for normal σ54 function (21), and in our assay, it is interesting to note that mutations impairing DNA binding distribute periodically between residues 25–38 (Figures 1 and 5B).Region I sequence binds core RNAP weakly and can be footprinted by core RNAP (16,31), and our results identified the sequences in Region I contributing to this function, associated with inhibiting the RNA polymerase from making open complexes (12). It has been demonstrated previously that residue 36 is proximal to core RNAP (25). Consistent with this, our results show that mutations around residue 36 (residues 33–34 and 37–38) affect core RNAP binding slightly (Figure S2). The variants with reduced core RNAP-binding activity are also impaired in σ54 isomerization and/or activator responsiveness, suggesting that the substituted residues contribute to the interface σ54 makes with core RNAP through which Region I exerts its influence upon holoenzyme to regulate function.
Roles of Region II sequences
Region II is very variable in sequence. Groups of acidic residues in Region II are involved in promoter melting and deletion of Region II affects holoenzyme formation and activator responsiveness of σ54 (32,43). In our activator interaction assays, one Region II variant AC83/84, behaves differently from Cys(–)σ54 (Figure 4), suggesting that an altered interaction with activator protein may occur. The activity of this variant in isomerized ss σ–DNA formation is also improved compared to Cys(–)σ54, implying the substituted residues may influence activator responsiveness (Table 1). However, we failed to identify any Region II variants significantly affecting the transcription activity of the assembled σ54-holoenzyme both in vivo and in vitro.
Roles of Region III sequences
Region III includes a major determinant for core RNAP binding at its amino terminal and sequences for DNA binding located at the carboxyl terminal (13,15,16). In the cryo-EM reconstruction of σ54–holoenzyme, the primary core RNAP-binding domain and carboxyl terminal DNA-binding domain are assigned to D1 and D3, respectively, while some other Region III residues located in D2 with Region I contribute to maintaining the stable closed complex (Figure 7) (28,41,44,45).Our data show that although the primary core RNAP-binding sequence is located between residues 120 and 215, many other residues outside this region also affect core RNAP binding (Figure 3). This is consistent with the fact that sequences footprinted by core RNAP lie outside the primary core RNAP-binding domain, and are near or within the DNA-binding domain (31). On the basis of the model of the complex formed between the carboxyl terminal DNA-binding domain of σ54 (AAs 396–465) and promoter −24 sequence, it is proposed that the first loop of this domain may interact with the β-flap region of core RNAP (20,30). Interestingly, we find that mutations between residues 423–426, which locate in the first loop of the carboxyl terminal DNA-binding domain based on the structure of σ54 from A. aeolicus (Figure 1), indeed affect the core RNAP-binding activity (Figure 3).Residues affecting activator interaction are also distributed widely in Region III (Figure 4). Since Region I is the major determinant for activator interaction (33), these Region III variants may influence activator interaction through the interaction between Regions III and I (28,41). Region III variants affecting activator interaction can be divided into two classes. Class one variants are somewhat defective in activator interaction, but unlike Region I variants with this same phenotype, many of these Region III variants can still respond to activator protein. Class two variants can interact with activator protein, but an additional complex with slower mobility was formed (Figure 4). This new complex has not been observed before and we suggest that this new complex might reflect a conformational change in the σ54-activator complex, implying an altered interaction of σ54 with activator protein. Residues around 307 have been proposed to be a site for activator interaction (46). Interestingly, our data show that mutations between residues 297 and 318 either lead to the failure of σ54 variants to form stable complex with activator protein, or the formation of an additional complex (Table 1), both cases reflecting the altered interaction with activator protein. Interactions that activator protein mediates between Regions III and I can be used to achieve the apparently concerted domain movements of σ54 seen in structural reconstructions of the PspF1−275 bound σ54-holoenzyme (Figure 7) (12).In contrast to the results of core RNAP-binding activity and σ54-activator interaction, residues affecting DNA binding are mainly located in the previously delineated DNA-binding domain, residues 329–477, fully consistent with the reconstruction model of σ54-holoenzyme–DNA complex where D2 and D3 densities cover −12 and −24 promoter sequences, with D1 containing the primary core RNAP-binding domain (residues 120–215). Based on the structure of the carboxyl terminal DNA-binding domain of σ54 from A. aeolicus, the residues D439, T457 and N471 of σ54 from K. pneumoniae can interact with −24 promoter sequence through hydrogen bonding (30). In accordance with this, our results show that substitutions of these three residues all affect the DNA-binding ability significantly (Table 1). Specific residues in modulation domain affecting DNA binding were also identified, confirming the view that modulation domain functions to assist DNA binding (14).Unlike Region I variants, there is no close linkage between the defect in early melted DNA binding and ssσ–DNA formation. Among the DNA-binding defective variants in Region III, only three of them (AC295/296, 299/300 and 355/356) are defective for early melted DNA binding (Table 1). It has been suggested that residue 336 is proximal to −12 promoter sequence and interacts with sequences downstream to −12 GC element to regulate RNAP function together with Region I (28,41). Thus, like the Region I variants, the defect of variant AC335/336 in ssσ–DNA formation could also result from the altered interaction of σ54 and the −12 promoter sequence. In contrast to the properties of these variants, in the carboxyl terminal DNA-binding domain variants defective in DNA binding (AC439/440, AC441/442, AC457/458 etc.) are not impaired in ssσ–DNA formation, indicating that it is an interaction with −12 promoter sequence that is crucial for σ54 isomerization, with the rpoN box providing the −24 promoter sequence contact that is not directly involved in the isomerization process. Variants AC295/296 and AC299/300 share the phenotype of AC335/336 and some Region I variants, implying that residues 295–300 located in the carboxyl terminal of modulation domain may also have a role in interactions with −12 promoter sequence, which may have a basis in communication with Region I.Although most Region III variants are not impaired in ssσ–DNA formation, a distinctive class of variants is revealed on the basis of their greatly improved activities in ssσ–DNA formation. This class of variants can be roughly divided into two groups based on the linkage with other functional defects. Group I isomerization changed variants are represented by the variants in core RNAP-binding domain whose core RNAP-binding activities are impaired. Mutations leading to this phenotype are mainly located in two patches around residues 120 and 180 (Figure 1). It has been suggested that some interactions between σ54 and core RNAP are changed upon isomerization of the σ54–DNA complex and recent structural analyses of the activator bound σ54-holoenzyme are in agreement with the activator causing a reorganization of σ54 domains within the holoenzyme (12,47). The impaired core RNAP-binding activities of these variants described here may reflect an altered core RNAP-binding surface which is closer to the conformation of σ54 in the isomerized state, hence leading to the improved activity in isomerized ss σ–DNA formation. The activities of these σ54 variants in transcription assay are somehow impaired, suggesting that the substituted residues may contribute to the communication with core RNAP and that when they are substituted, σ54 isomerization is uncoupled from the changes that must occur in the core RNAP, required for the later stages of the DNA opening process and stable open complex formation (48,49).Group II isomerization changed variants are in the DNA-binding domain. For these variants there is a clear linkage between the defect in DNA binding and an improved activity in ssσ–DNA formation (Figure 1). The weak contact these variants make with DNA may facilitate the changes in σ–DNA interactions which are important for ssσ–DNA complex formation (33,47). However, increased formation of the ssσ–DNA complex has little overall effect on the normal function of holoenzyme in the context of in vitro transcription assays, arguing for the existence of other steps limiting open complex formation. For most of the variants in carboxyl terminal of σ54 (residues 431–476), the normal functions of the holoenzyme in in vitro transcription assay, together with their low expression levels in vivo, strongly indicates the carboxyl terminal of σ54 plays an important role in maintaining the structural integrity perhaps related to turnover of σ54
in vivo.In conclusion, by constructing the single cysteine mutant library of σ54 and analyzing the functions of σ54 variants both in vivo and in vitro, σ54 sequences interacting with core RNAP, DNA and activator protein are systematically identified. Linkages between different functions are revealed and the possible mechanisms for those linkages are also discussed. Furthermore, the characterized σ54 library provides a powerful resource for functional analyses using site specific probes, and for interpretations of molecular structures.
SUPPLEMETARY DATA
Supplementary Data are available at NAR Online.
FUNDING
Human Frontier Science Program [RGP22/2007 to Y.P.W.]; Natural Science Foundation of China [No. 30830005 to Y.P.W.]; the Program of Introducing Talents of Discipline to Universities [No. B06001 to Y.P.W.]; 973 program on nitrogen fixation to Y.P.W.; BBSRC project grant to M.B. and BBSRC David Phillips Fellowship [BB/E023703 to S.R.W.].Conflict of interest statement. None declared.
Authors: Christof Francke; Tom Groot Kormelink; Yanick Hagemeijer; Lex Overmars; Vincent Sluijter; Roy Moezelaar; Roland J Siezen Journal: BMC Genomics Date: 2011-08-01 Impact factor: 3.969
Authors: David J Samuels; Jonathan G Frye; Steffen Porwollik; Michael McClelland; Jan Mrázek; Timothy R Hoover; Anna C Karls Journal: BMC Genomics Date: 2013-09-05 Impact factor: 3.969