Literature DB >> 21398637

In vitro transcription profiling of the σS subunit of bacterial RNA polymerase: re-definition of the σS regulon and identification of σS-specific promoter sequence elements.

Anna Maciag1, Clelia Peano, Alessandro Pietrelli, Thomas Egli, Gianluca De Bellis, Paolo Landini.   

Abstract

Specific promoter recognition by bacterial RNA polymerase is mediated by σ subunits, which assemble with RNA polymerase core enzyme (E) during transcription initiation. However, σ(70) (the housekeeping σ subunit) and σ(S) (an alternative σ subunit mostly active during slow growth) recognize almost identical promoter sequences, thus raising the question of how promoter selectivity is achieved in the bacterial cell. To identify novel sequence determinants for selective promoter recognition, we performed run-off/microarray (ROMA) experiments with RNA polymerase saturated either with σ(70) (Eσ(70)) or with σ(S) (Eσ(S)) using the whole Escherichia coli genome as DNA template. We found that Eσ(70), in the absence of any additional transcription factor, preferentially transcribes genes associated with fast growth (e.g. ribosomal operons). In contrast, Eσ(S) efficiently transcribes genes involved in stress responses, secondary metabolism as well as RNAs from intergenic regions with yet-unknown function. Promoter sequence comparison suggests that, in addition to different conservation of the -35 sequence and of the UP element, selective promoter recognition by either form of RNA polymerase can be affected by the A/T content in the -10/+1 region. Indeed, site-directed mutagenesis experiments confirmed that an A/T bias in the -10/+1 region could improve promoter recognition by Eσ(S).

Entities:  

Mesh:

Substances:

Year:  2011        PMID: 21398637      PMCID: PMC3141248          DOI: 10.1093/nar/gkr129

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


INTRODUCTION

Bacteria must cope with drastic changes in their environment, such as nutritional up- and downshifts, and variations in pH, osmolarity and temperature. Bacterial cells can quickly adapt to such environmental changes by modulating gene expression, at both transcriptional and post-transcriptional levels. At the transcription initiation level, gene expression can be regulated either through accessory transcription factors (activators and repressors), or via assembly of different forms of RNA polymerase. The latter mechanism of gene regulation involves the assembly of RNA polymerase core enzyme (indicated as E) with one of several σ factors that can direct RNA polymerase to specific promoter sequences (1). Typically, in the bacterial cell, one σ factor is devoted to transcription of a large part of the genome, including the essential cellular functions (housekeeping σ factor), while the so-called ‘alternative σ factors’ direct transcription of smaller sets of genes, often linked to specific functions (e.g. response to cellular stresses). In Escherichia coli, seven σ factors have been identified: σ70 or σD (the housekeeping σ) and six alternativeσ factors: σE, σF, σH, σI, σN and σS (2). Most alternative σ factors recognize promoter sequences that strongly diverge from the consensus sequence for σ70; in contrast, genes under the control of σS are characterized by promoter sequences very similar to σ70-dependent genes (3,4). In line with this observation, in vitro selection of DNA sequences bound with high affinity by RNA polymerase associated with σS (EσS) led to the identification of a consensus sequence very similar to the one recognized by σ70 (5). Some level of overlapping in promoter recognition by σ70 and σS might be consistent with σS function: indeed, in conditions leading to slow metabolic activity, such as nutrient starvation or oxidative stress, σS might take over in transcription of genes important for cell survival that are under σ70 control during faster growth (6). However, in order to switch from the fully active to the slow metabolic state, specific gene expression, and thus specific recognition of σ70- versus σS-dependent promoters, must take place in the bacterial cell. Some promoter sequence determinants can favour recognition by either σ70 or σS (3): for instance, a C nucleotide upstream of the −10 promoter element (−13C) enhances transcription by EσS (7). However, some sequence features favouring promoter recognition by σS seem to be dependent on specific promoter contexts: for instance, at the EσS-dependent aidB promoter, EσS, but not Eσ70, can recognize with equal efficiency either C or T as the first nucleotide in the −10 promoter element (8). However, the percentage of σS-dependent promoters carrying a −12C element is not significantly higher than in σ70-dependent promoters (9), suggesting that the presence of a −12C might only contribute to specific promoter recognition by σS at selected promoters. In addition to sequence determinants, it has been proposed that transcription factors such as CRP, IHF and Lrp can selectively block (or promote) promoter recognition by either Eσ70 or EσS (10). A transcription regulator important for the modulation of promoter accessibility to different RNA polymerase holoenzymes is the H-NS protein, which can repress transcription by Eσ70, but not by EσS, at various promoters (11), a phenomenon known as transcriptional silencing (12). Specific promoter recognition by EσS is also affected by the degree of DNA supercoiling (13). In addition, σS activity and intracellular concentrations are affected by various factors, such as the presence of an anti-sigma factor for σ70 (14), and by the accumulation of the signal molecules ppGpp (15) and polyphosphate (3). Work aimed to the identification of σS-specific promoter elements has mostly been carried out in vivo, comparing relative gene expression in a wild type versus an rpoS mutant derivative unable to produce the σS protein (9,16–20). Although this approach has proven very useful for the identification of rpoS-dependent genes and in the characterization of σS-specific promoter elements such as the −13C, it cannot distinguish between promoters directly recognized by the EσS form of RNA polymerase and promoters under the indirect control of the rpoS gene. In contrast, dependence on EσS, as determined by biochemical experiments with purified RNA polymerase, has only been determined for a limited number of promoters [e.g. fic (21), csiD (22) and aidB (23)]. In this work, we have performed in vitro transcription experiments with either Eσ70 or EσS, using the whole E. coli genome as template, to identify promoter regions selectively recognized by two forms of RNA polymerase. Our results support previous observations that Eσ70- and EσS-dependent promoters differ in conservation of the UP element and of the −35 sequence, and in the sequence immediately upstream of the −10 promoter element. In addition, we show that differences in the A/T content in the −10/+1 promoter region can favour transcription by either form of RNA polymerase. Finally, our work has led to the identification of novel σS-dependent genes, thus providing further insight on the physiological role of σS.

MATERIALS AND METHODS

Protein purification and RNA polymerase reconstitution

Escherichia coli RNA polymerase core enzyme was purchased from Epicentre (Madison, WI, USA); histidine-tagged σ factors were produced and purified as described (8,24). The σS protein appeared totally pure from contaminants as determined by denaturing protein gel electrophoresis; in contrast, in the σ70 preparations the presence of faint additional bands, corresponding to the molecular weight of the core RNA polymerase subunits α, β and β′, could be detected (data not shown). Weak contamination of σ70 preparations by core RNA polymerase subunits is consistent with the high affinity of σ70 for the core enzyme. For reconstitution of RNA polymerase holoenzymes, the core enzyme was incubated for 10 min at 37°C with either σS or σ70 at a 1:10 ratio. For calculation of RNA polymerase concentrations in transcription assays, it was assumed that, after reconstitution, core enzyme would be 100% active and fully saturated by either σ factor.

In vitro transcription on supercoiled plasmids

Promoter regions of interest were amplified from the genome of E. coli MG1655 (25) and cloned into the pJCD01 plasmid (22) using the BamHI and EcoRI sites, with the exception of the ilvY promoter, which was cloned using the BamHI and SphI sites, due to the presence of an EcoRI site in the ilvY promoter region. Single-round in vitro transcription experiments were carried out on supercoiled templates (3 nM) in the presence of 10 nM reconstituted RNA polymerase holoenzyme. Plasmid DNA and reconstituted holoenzyme were incubated for 10 min at 37°C in 18 µl of transcription buffer (40 mM HEPES pH 8.0, 10 mM magnesium chloride, 150 mM potassium glutamate, 2 mM dithiothreitol, 100 µg/ml bovine serum albumin) prior to the addition of 2 µl of a mixture of ribonucleotide triphosphates and heparin to a final concentration of 500 µM each for ATP, CTP, GTP and UTP and 250 µg/ml heparin. Transcription reactions were allowed to proceed for 10 min at 37°C and were stopped by addition of NaCl to a final concentration of 0.5 M followed by incubation at 70°C for 5 min. Samples were extracted with a 1:1 phenolchlorophorm mixture, precipitated with ethanol and resuspended in 12 µl TE buffer. A volume of 2.5 µl of resuspended samples were treated with DNaseI (1 U in 10 mM TrisHCl pH 7.6, 2.5 mM MgCl2, 0.5 mM CaCl2, in a final volume of 20 µl) for 1 h at 37°C, and DNaseI was heat-inactivated at 65°C for 10 min. Transcript amounts were determined by quantitative real-time polymerase chain reaction (PCR): 1 µl of DNaseI-treated samples was retrotranscribed and the cDNAs were amplified in quantitative real-time PCR using the RNA-I transcript as reference gene. The sequences of the primers used in real-time PCR are available upon request. DNaseI-treated transcription reactions not incubated with reverse transcriptase were added as negative controls for the presence of undigested plasmid DNA. To verify that transcripts would indeed originate from the promoters of interest, and not from non-specific transcription initiation events, in vitro transcription start sites were determined by Rapid Amplification of cDNA Ends (RACE) analysis as described (26): start sites observed in the in vitro transcription experiments corresponded to those described in the literature (data not shown). Site-directed mutagenesis of the ssrS P1 promoter was performed using the three-step PCR method (27).

Run-off microarray experiments on E. coli genome

Genomic DNA from E. coli MG1655 was isolated with GenElute™ Bacterial Genomic DNA Kit (Sigma) and digested with EcoRI overnight. The size of digested genomic DNA ranged from 0.6 to 30 kb as judged from visualization on agarose gels. After purification with Wizard SV Gel and PCR Clean-Up System (Promega), 1 µg of genomic DNA was used in single transcription assay for run-off micro array (ROMA) experiments. RNA polymerase holoenzyme concentrations were 100 nM in 50-µl reaction mixture. Digested genomic DNA and reconstituted holoenzyme were incubated in transcription buffer (40 mM HEPES pH 8.0, 10 mM magnesium chloride, 150 mM potassium glutamate, 2 mM dithiothreitol, 100 µg/ml bovine serum albumin) for 15 min at 37°C to allow binding of RNA polymerase to DNA. Transcription reactions were started with the addition of 2.5 µl of ATP, CTP, GTP and UTP to a final concentration of 500 µM each, were allowed to proceed for 30 min at 37°C and were stopped by incubation at 65°C for 5 min. The reaction mixture was incubated with 10 U of DNaseI at 37°C for 30 min to degrade genomic DNA. RNA was purified from reaction with miRNeasy Mini Kit (Qiagen), which allows recovering of total RNA, including small RNA molecules (<200 nt). The quantity of RNA produced by in vitro transcription was determined using Nanodrop Spectrophotometer, while the average length of the transcripts, as well as of the corresponding cDNA from reverse transcription (see below), was evaluated by capillary electrophoresis on an Agilent Bioanalyzer 2100 using an RNA pico and nano assay, respectively. Capillary electrophoresis analysis of transcription reactions performed in the presence of EσS, Eσ70 or core RNA polymerase clearly showed that, while the average transcript size generated by any form of RNA polymerase was comparable, a significantly lower amount of RNA was produced in transcription reactions performed with core RNA polymerase alone compared to either form of RNA polymerase holoenzyme (Supplementary Figure S1). Indeed, in vitro transcription reactions performed using RNA polymerase core enzyme generated 4 ng/µl of transcripts, versus 31 ng/µl and 23 ng/µl total RNA produced in the presence of Eσ70 and EσS, respectively. Dependence on the presence of a σ factor for efficient in vitro transcription strongly suggests that, in our experimental conditions, transcription initiation does not originate from non-specific initiation sites such as 5′ DNA termini, but requires interaction of RNA polymerase holoenzyme with promoter sequences. Ten independent run-off transcription assays with either EσS or Eσ70 were performed, and transcripts were pooled together (∼1 µg total RNA). From the pool of the 10 independent multi-round transcription reactions two distinct hybridizations on microarrays were performed. For hybridization onto microarrays, we used the Affymetrix Genechip E. coli Genome 2.0 array, which includes 10 144 probe sets covering all the predicted transcripts from four strains of E. coli: the laboratory strain MG1655, the uropathogenic CFT073 and the enteropathogenic strain OH157:N7, subtypes EDL933 and SAKAI. Due to the high degree of similarity between the E. coli strains, typically a single probe set is tiled to represent the equivalent orthologue in all four strains, and strain-specific probes are only used for genes displaying low levels of conservation, or present solely in one strain. Probe sets match every open reading frame (ORF) in E. coli; in addition, 1427 probe sets targeting 714 E. coli MG1655 intergenic regions, probes for various antibiotic resistance markers, and additional control and reporter genes from the previous generation E. coli arrays are also represented in the Affymetrix Genechip E. coli Genome 2.0 array. The RNA samples were processed for microarray hybridization, following the instructions of the GeneChip® Expression Analysis Technical Manual (Chapter 5, 6 and 7, Prokaryotic target Preparation, Hybridization, Washing, Staining and Scanning), except that cDNA synthesis by reverse transcription was performed with 1 μg RNA. cDNA was fragmented with DNase I treatment and labelled with biotin using Terminal Deoxynucleotidyl Transferase. The fragmented and labelled cDNAs were then hybridized for 16 h at 45°C on individual E. coli Genome 2.0 arrays. After hybridization, GeneChips were washed and stained with streptavidin-conjugated phycoerythrin by using the Fluidic Station FS450 (Affymetrix) according to the FS450_0006 Protocol. Fluorescent images of the arrays were acquired using a GeneChip Scanner 3000 7G (Affymetrix). All Chip images and files have been deposited in the GEO (Gene Expression Omnibus) (http://www.ncbi.nlm.nih.gov/geo/) repository (accession number:GSE22207). After quality control of data distribution, the raw data (CEL files) were used to perform normalization and probe set summarization through Robust Multiarray Analysis (RMA) algorithm (by using the Affymetrix Gene Expression Console Software (www.affymetrix.com). Normalized data were also analyzed with OneChannelGUI Software (http://www.bioinformatica.unito.it/oneChannelGUI/) in order to perform an additional expression analysis based on different parameters. OneChannelGUI is an add-on Bioconductor package extending the capability of the affylmGUI package (28); it is a library providing a graphical interface (GUI) for Bioconductor libraries to be used for the complete single-channel microarray analysis. To perform statistic analysis on the two hybridizations, we decided to apply a simple non-parametric statistical method based on ranks of fold changes to perform a two-class paired differential analysis. To select genes with significantly different transcription levels, we set the following parameters for the Rank Product analysis: 100 permutations and 0.1 cut-off percentage of false positives (pfp) which corresponds to a P-value <0.01. Fold differences higher than 1.5-fold were considered indicative of significantly different expression, similar to previous ROMA experiments (29): genes more efficiently transcribed by either EσS or Eσ70 are listed in Tables 1 and 2, respectively. Determination of start sites on transcripts generated in the in vitro transcription assays was carried out by RACE analysis (26).
Table 1.

Genes transcribed more efficiently in the presence of EσS

Genes and promoter regionsaGene productbb numbercTranscription levels (EσS/Eσ70 ratio)dKnown regulatory factorse
Stress response
yieFChromate reductaseb37131.50σE (59); σS (19)
IG1341353_1341620-rIntergenic region including osmB promoterN.A.1.51σS, RcsBA (60)
osmBOsmolarity-inducible lipoproteinb12831.51σS, RcsBA (60)
gshAγ-glutamate–cysteine ligase (glutathione biosynthesis)b26882.21
ahpFAlkyl-peroxidase reductaseb06061.52OxyR (61)
pphAProtein phosphataseb18381.54σH (62,63)
macAMacrolide resistance efflux pumpb08781.54
DNA and RNA metabolism and modification
ppkPolyphosphate kinase, part of RNA degradosome,b25011.50Regulator of intracellular σS concentrations (3)
IG2522899_2523146-rIntergenic region including xapA promoterN.A.1.62XapR (64)
xapAXanthosine phosphorylase, nucleotide synthesis/degradationb24071.56XapR (64)
gyrBB subunit of DNA gyraseb36991.65Fis (65)
recTRac prophage, recombinase, in recET-lar-ydaCQ operonb13491.93σS-Dependent in biofilm-growing cells (18)
larRac prophage,restriction alleviation and modification enhancement, in recET-lar-ydaCQ operonb13481.93
ydaCRac prophage, in recET-lar-ydaCQ operonb13471.74as recT
ydaQRac prophage, possible recombinase, in recET-lar-ydaCQ operonb13461.66as recT
Polyamine metabolism
puuBγ-glutamyl–putrescine oxidaseb13011.61σS (19)
gabDSuccinate semialdehyde dehydrogenaseb26611.65σS, Nac (9,66)
speBAgmatinase (in arginine/putrescine degradation pathway)b29371.66
Transcription regulation
IG582284_582903-fIntergenic region including appY promoterN.A.1.69H-NS (67)
appYRegulator of hyaABCDEF operonb05641.51H-NS (67)
yciTPutative deoR-type transcription regulatorb12841.56
ilvYRegulator of ilvC operonb37731.75
rhaSRegulator of rhamnose transportb39051.81CRP (29); RhaS (68)
Sugar metabolism
treFTrehalaseb35191.53σS (9,19)
ytfTPutative galactose ABC transporterb42301.55σS (19)
otsBTrehalose-6-phosphate phosphatiseb18971.56σS (7)
araFArabinose transporter, component of an ABC transport systemb19011.57CRP, AraC (69); σS (19)
glgPGlycogen phosphorylase, part of glgCAP operonb34281.60CRP (70); σS controls glgCAP operon (19)
gpmMPutative 2,3-bisphosphoglycerate mutaseb36121.61
glvCSugar transport phosphotransferaseb36831.79
Other metabolic functions
IG1030936_1031361-fIntergenic region including hyaABCDEF promoterN.A.2.02
hyaAHydrogenase small subunit; in hyaABCDEF operonb09721.68σS, AppY, anaerobic regulation by ArcA and NarP/NarL (20,71,72)
hyaBHydrogenase large subunit; in hyaABCDEF operonb09731.66as hyaA
hyaFHydrogenase subunit (nickel-binding protein); in hyaABCDEF operonb09771.50as hyaA
sydSecY-interacting proteinb27931.59
murPAcetyl-muramic acid permeaseb24291.59CRP, MurR (73)
murRTranscriptional repressor of murQPb24271.62
tamTrans-aconitate methylatransferaseb15191.60σS (74)
ydhYPredicted oxidoreductase, Fe-S protein; in ydhYVWXUT operonb16741.80FNR, NarL (75)
ydhVPredicted oxidoreductase, Fe-S protein; in ydhYVWXUT operonb16731.62FNR, NarL (75). σS-dependent in biofilm-growing cells (18)
Unknown and miscellaneous functions
yccTUnknownb09641.51Induced in stationary phase (76)
yffPPredicted protein, prophageb24471.52
yidKPutative membrane transporterb36791.53
ynfDPredicted proteinb15861.54
yfiLPutative lipoproteinb26021.55
yi91aUnknown, in CP4-6 prophage sequenceb02551.56
yfjLUnknown, possible prophage geneb26251.57σS (9)
yagLUnknown, prophage proteinb02781.59σS-Dependent in biofilm-growing cells (18)
eutAReactivating factor for ethanolamine ammonia lyaseb24511.59eutH, in eutHA operon, is σS-dependent (9)
yhjGPredicted outer membrane proteinb35241.61σS (9)
ybeHHypothetical proteinb06251.66
yedSUnknownb19641.69
yfgJUnknown, mutant affecting swarming motilityb25101.74
G7353Phantom geneb25961.79Upstream of a ribosome modulation factor induced in stationary phase (yfiA)
yqiGUnknown, interrupted by IS elementb30461.81
IG1006824_1007066-rIntergenic region including ycbX promoterN.A.2.26
ycbXUnknownb09472.05
yfjHUnknownb26232.30
IG2755422_2755664-rIntergenic region downstream of yfjHN.A.2.01
ychSUnknownb12283.21
Non-coding RNAs
ryeE (cyaR)Small RNA, promotes degradation of ompX and nadE RNAb44381.50CRP, σE (77,78)
sgrS/sgrTSmall RNA, inhibits ptsG translation/SgrT proteinb45771.51SgrR protein
ECs3934sibD/sibE non coding RNA- ibsD/ibsE toxic peptidesb44471.62Complex locus including two non coding RNAs overlapped by two small ORFs encoding a putative toxin/antitoxin system
b4664
micAmicA small RNA (downregulates ompA expression)b44422.00
Intergenic regions, genes not annotated in MG1655
c3878Unknown, annotated in CFT073N.A.1.50
IG2922538_2922756-fIntergenic region between syd (predicted protein) and csrB (ncRNA), antisenseN.A.1.50
c5008Unknown, annotated in CFT073, downstream of malMN.A.1.50
IG2438141_2438404-fBetween fabB and ycfJ, antisenseN.A.1.50
ECs5537Unknown, annotated in O157:H7 SAKAIN.A.1.51
IG2885243_2885600_rIntergenic region, upstream of predicted helicase ygcBN.A.1.52
c4656Unknown, annotated in CFT073 (antisense of atpC)N.A.1.52
IG3665211_3665420-fIntergenic region upstream of gadA, antisenseN.A.1.52
c1010Unknown, annotated in CFT073, upstream of aqpZN.A.1.52aqpZ (aquaporin) is σS-dependent (18,79)
c0723Unknown, annotated in CFT073, upstream of dacA, transcribed in opposite directionN.A.1.54
IG2481360_2481774-rIntergenic region upstream emrKY (TolC-multidrug efflux pump)N.A.1.54
IG2228406_2228643-fIntergenic region upstream of yohJK (inner membrane proteins)N.A.1.54
IG2201932_2202549-rIntergenic region between yehI-yehK, antisenseN.A.1.54
c2806Unknown, annotated in CFT073 (antisense of menB)N.A.1.55
c3010Unknown, annotated in CFT073, upstream of perM gene, transcribed in opposite directionN.A.1.55
IG3669525_3669971-rIntergenic region, upstream of yhjBN.A.1.58
IG2898371_2898613-fIntergenic region upstream of yqcEN.A.1.60
c5221Unknown, annotated in CFT073 (antisense of aspA)N.A.1.63
IG223409_223770-rIntergenic region upstream of rrsH ribosomal operon (transcribed in opposite direction)N.A.1.64
IG3420831_3421058-rIntergenic region, downstream of rrfF ribosomal operonN.A.1.66
c4942Unknown, annotated in CFT073 (antisense of rplL)N.A.1.67
IG2428784_2429041-fIntergenic region, upstream of cvpA gene, transcribed in opposite directionN.A.1.71
c1908Unknown, annotated in CFT073 (antisense of yddM)N.A.1.71
c0703Unknown, annotated in CFT073 (antisense of citF)N.A.1.74
ECs5165yjfO (biofilm-related protein) in O157:H7 SAKAIN.A.1.86
IG1903284_1903567-rIntergenic region downstream of yobD, transcribed in opposite directionN.A.1.87
IG3358642_3358810-fIntergenic region downstream of gltDN.A.1.89
IG2190243_2190534-rIntergenic region downstream of yehEN.A.1.94
c3113Unknown, annotated in CFT073 (upstream of rrsG ribosomal operon, transcribed in opposite direction)N.A.1.95
c2317Unknown, annotated in CFT073 (antisense of azuC)N.A.1.99Complex locus including small RNA isrB
IG2755422_2755664-rIntergenic region downstream yfjHN.A.2.01
IG2519349_2519612-fIntergenic region downstream of xapRN.A.2.05
IG330721_331594-rIntergenic region upstream of yahA, transcribed in opposite directionN.A.2.07

aFor known genes, we used the nomenclature reported in the NCBI database (http://www.ncbi.nlm.nih.gov/genomes/lproks.cgi); intergenic region are indicated using the nomenclature by Affymetrix, which indicates the start and the end of the intergenic region (IG) covered by the various probe sets; –f or –r indicate if the probe sets are in forward or reverse orientation relative to the (+) strand of the E. coli chromosome; genes already described as rpoS-dependent in vivo are underlined.

bGene product (or predicted product).

cRelative location of the various ORFs on the E. coli MG1655 chromosome.

dDetermined by microarray analysis as described in ‘Materials and Methods’ section.

eWhen not otherwise stated, the information is taken from http://ecocyc.org/.

Table 2.

Genes transcribed more efficiently in the presence of Eσ70

Genes and promoter regionsaGene productbb numbercTranscription levels (Es70/Ess ratio)dKnown regulatory factorse
Ribosomal genes and protein synthesis
IG1286552_1286760-rIntergenic region including tyrT promoterN.A.2.97FIS (80)
rttRNon-coding RNA, part of tyrT transcriptb44251.97FIS (80)
tprSmall protamine-like protein part of tyrT transcriptb12292.03
serTSerine tRNAs geneb09711.89FIS (79); upregulated in rpoS mutant of MG1655 (17)
rpmIRibosomal protein L35b17171.91
prfBRelease Factor RF2b28911.91Upregulated in a biofilm-growing rpoS mutant derivative of MG1655 (18)
rimPRibosomal maturation proteinb31702.01
rpsFRibosomal protein S6b42002.04
rpmGRibosomal protein L33b36362.10
relAppGpp alarmone biosynthetic enzymeb27842.35
rplURibosomal protein L21, in rplU-rpmA operonb31862.50
rpmARibosomal protein L27, in rplU-rpmA operonb31852.58
rpsURibosomal protein S21b30652.78Upregulated in a biofilm-growing rpoS mutant derivative of MG1655 (18)
IG3426400_3426656-rIntergenic region including rrnH promoterN.A.2.87
DNA repair
sulASOS response inhibitor of cell divisionb09582.04LexA (81)
dinIAP endonuclease, SOS responseb10612.18LexA (82); upregulated in an rpoS mutant derivative of OH157:H7 EDL 933 (19)
Multifunctional operons
cvpAColicin V production; in cvpA-purF-ubiX operonb23131.85PurR (83); upregulated in an rpoS mutant derivative of OH157:H7 EDL 933 (19)
purFAmidophosphoribosyl transferase (ribonucleotide metabolism); in cvpA-purF-ubiX operonb23122.12as cvpA
ubiX3-octaprenyl-4-hydroxybenzoate decarboxylase (ubiquinone biosynthesis); in cvpA-purF-ubiX operonb23111.90as cvpA
c2854Unknown, annotated in CFT073 as part of the cvpA-purF-ubiX operonN.A.2.00
yjeFPutative carbohydrate kinase; in yjeFE-amiB-mutL operonb41671.88
yjeEEssential protein with weak ATPase activity; in yjeEF-amiB-mutL operonb41681.95
amiBN-acetylmuramyl-l-alanine amidase needed for septum formation during cell division; in yjeEF-amiB-mutL operon.b41692.02
mutLMethyl-directed mismatch repair, subunit in MutHLS complex; in yjeEF-amiB-mutL operon.b41702.06
yhbEInner membrane protein; in yhbE-obgE operonb31842.36
obgEGTP-binding protein, involved in ppGpp turnover; in yhbE-obgE operonb31831.91
Transcription regulation
IG3717398_3717677-fIntergenic region including cspA promoterN.A.1.85
cspACold shock protein Ab35562.34Upregulated in an rpoS mutant derivative of OH157:H7 EDL 933 (19)
alpACP4-57 prophage gene, regulator of tmRNAsb26242.31
Metabolic functions
fhuFIron reductaseb43671.83Fur, OxyR (84)
ydhRPutative monooxygenaseb16671.91
artJArginine transporterb08602.13Upregulated in a biofilm-growing rpoS mutant derivative of MG1655 (18)
Unknown and miscellaneous functions
ycgYUnknownb11961.83
yefMAntitoxin in yefM-yoeB toxin-antitoxin systemb20171.85
yehLUnknown, possible component of ABC transport systemb21191.92
yebNUnknown, putative membrane proteinb18212.06
ydiEPutative lipoproteinb17052.10
Non-coding RNAs
isrBSmall RNAb44342.25
spfSmall RNA, regulates DNA polymerase I activityb38642.72
ssrS6S RNA, modulates σ70 activityb29113.74Upregulated in a rpoS mutant derivative of MG1655 (20)
Intergenic regions, genes not annotated in MG1655
c1714Unknown, annotated in CFT073, upstream of cls, transcribed in opposite directionN.A.1.83
IG330721_331594-fIntergenic region including yahA promoter regionN.A.1.85
ECs1613Unknown, annotated in O157:H7 SAKAI, possible prophage gene (renD)N.A.1.86
IG2424809_2425028-rIntergenic region between argT and hisJN.A.1.92
IG1120179_1120464-rIntergenic region upstream bssS (regulator of biofilm formation)N.A.1.92
c2481Unknown, annotated in CFT073, upstream of cobUN.A.1.94
IG127588_127911-fIntergenic region upstream of lpdN.A.1.96
IG2404662_2405580-rIntergenic region upstream of lrhAN.A.1.98
Z0043Annotated in O157:H7 OH157:H7 EDL 933, caiCN.A.2.05
c2568Annotated in CFT073, wcaMN.A.2.13
Z5055Annotated in O157:H7 OH157:H7 EDL 933, rfaGN.A.2.16
c4052Unknown, annotated in CFT073N.A.2.27
IG2815526_2815805-rIntergenic region upstream yqaBAN.A.2.31
IG583654_583902-rIntergenic region downstream of ompTN.A.2.35
c2230Unknown, annotated in CFT073 (antisense of cspC)N.A.2.42
IG1905616_1906284-fIntergenic region, between yobF and yebON.A.2.44
Z3239Annotated in O157:H7 OH157:H7 EDL 933 (antisense of yegI)N.A.2.49
c4719Annotated in CFT073 homologous to aslAN.A.2.54
c4352Annotated in CFT073 as dppCN.A.2.99
IG2428784_2429041-fIntergenic region, between yobD and yebNN.A.3.10
c1434Unknown, annotated in CFT073, ydfRN.A.3.45
Z5945Annotated in O157:H7 OH157:H7 EDL 933 (toxic peptide)N.A.4.02
Z5868Annotated in O157:H7 OH157:H7 EDL 933; yjgM, putative acetyltransferaseN.A.4.44

aFor known genes, we used the nomenclature reported in the NCBI database (http://www.ncbi.nlm.nih.gov/genomes/lproks.cgi); intergenic region are indicated using the nomenclature by Affymetrix, which indicates the start and the end of the intergenic region (IG) covered by the various probe sets; –f or –r indicate if the probe sets are in forward or reverse orientation relative to the (+) strand of the E. coli chromosome; genes previously described as being upregulated in an rpoS mutant strain are double underlined.

bGene product (or predicted product).

cRelative location of the various ORFs on the E. coli MG1655 chromosome.

dDetermined by microarray analysis as described in ‘Materials and Methods’ section.

eWhen not otherwise stated, the information is taken from http://ecocyc.org/.

Genes transcribed more efficiently in the presence of EσS aFor known genes, we used the nomenclature reported in the NCBI database (http://www.ncbi.nlm.nih.gov/genomes/lproks.cgi); intergenic region are indicated using the nomenclature by Affymetrix, which indicates the start and the end of the intergenic region (IG) covered by the various probe sets; –f or –r indicate if the probe sets are in forward or reverse orientation relative to the (+) strand of the E. coli chromosome; genes already described as rpoS-dependent in vivo are underlined. bGene product (or predicted product). cRelative location of the various ORFs on the E. coli MG1655 chromosome. dDetermined by microarray analysis as described in ‘Materials and Methods’ section. eWhen not otherwise stated, the information is taken from http://ecocyc.org/. Genes transcribed more efficiently in the presence of Eσ70 aFor known genes, we used the nomenclature reported in the NCBI database (http://www.ncbi.nlm.nih.gov/genomes/lproks.cgi); intergenic region are indicated using the nomenclature by Affymetrix, which indicates the start and the end of the intergenic region (IG) covered by the various probe sets; –f or –r indicate if the probe sets are in forward or reverse orientation relative to the (+) strand of the E. coli chromosome; genes previously described as being upregulated in an rpoS mutant strain are double underlined. bGene product (or predicted product). cRelative location of the various ORFs on the E. coli MG1655 chromosome. dDetermined by microarray analysis as described in ‘Materials and Methods’ section. eWhen not otherwise stated, the information is taken from http://ecocyc.org/. In order to manage and retrieve microarray data, a genome browser was set up (http://155.253.6.64/cgi-bin/gbrowse/provakappa/#search). The genome browser is based on the Generic Genome Browser (Gbrowse) which is a combination of database and interactive webpage for manipulating and displaying annotations on genomes. This bioinformatic tool allows users to view and navigate through the MG1655 genome (GenBank Accession Number:U00096) with information about the gene annotation, coming from the website http://www.genome.wisc.edu/tools/asap.htm (downloaded as GFF3 file), and all the probe sets contained in the Affymetrix E. coli Genome 2.0 Array manually remapped on the genome.

Determination of rpoS-dependent gene expression in bacterial cells

Bacterial strains used were MG1655 (wild type) and its rpoS mutant derivative EB1.3 (30). Strains were grown at 30°C in three different media: the complex Luria Bertani (LB) broth, the glucose-based M9Glu/sup medium (31) and LB medium diluted 1:4 (LB1/4). The LB1/4 medium was utilized since it was shown to stimulate expression of rpoS-dependent genes such as the csg operons (32). Thus, growth in LB1/4 medium might positively affect either σS concentration or σS activity. Samples for RNA extraction were taken both in late logarithmic (OD600 = 0.6–0.7) and in late stationary phase (overnight cultures, OD600≥1.5). RNA was extracted using the small RNA miRNeasy Mini Kit (Qiagen), and further reverse transcription and cDNA amplification in quantitative real-time-PCR were performed as described (33). Primer sequences are available upon request. All reactions were performed twice, each time in duplicate, and always showed very similar results. The relative amounts of the transcripts were determined using 16S rRNA as the reference gene ([CtGene of interest – Ct16S] = ΔCt).

RESULTS

ROMA analysis of σS- and σ70-dependent promoters

To identify sequence determinants that can direct selective promoter recognition by either the σ70- or the σS- associated form of RNA polymerase, we compared EσS and Eσ70 in ROMA experiments (29,34). Run-off transcription assays were performed using as DNA template the whole genome of E. coli MG1655 after digestion with EcoRI; cDNAs generated from the transcripts were hybridized on microarrays to determine their relative amounts. Unlike genome expression studies performed in living cells, which do not distinguish between direct and indirect effects, such as transcription factor-dependent promoter recognition, the ROMA analysis solely detects genes whose promoter are recognized by either EσS or Eσ70 (or both) in the absence of any additional factor. Out of ∼10 000 probe sets on the microarray, only 173 (∼1.7%) showed differences in transcription levels higher than 1.5-fold, considered to be significant (pfp<0.1, P-value <0.01; see ‘Materials and Methods’ section). For several operons, only a portion of the transcriptional unit (or even a single gene) showed significant difference in transcription levels by either EσS or Eσ70: in the case of reduced transcription of distal genes within a given operon, this effect might depend either on premature arrest of transcription by RNA polymerase or on the presence of an EcoRI restriction site within the operon, since the EcoRI restriction nuclease was used to digest chromosomal DNA used in ROMA experiments (see ‘Materials and Methods’ section). However, the effects of EcoRI digestion were not so severe as it could be expected, possibly due to the fact that chromosomal DNA was only partially digested by the enzyme in our conditions. For instance, despite the presence of an EcoRI site in the ydhY gene, transcription extending into the downstream genes was still detectable (Table 1 and Supplementary Table S1). In the case of distal genes of an operon showing more significant differences in EσS- versus Eσ70-dependent transcription than genes proximal to the promoter, this might be due to either lack of detection of proximal genes in the microarray experiment or the presence of unidentified promoters internal to the operons. We verified that genes within the same operon would not show dependence on different forms of RNA polymerase: out of the several hundred operons in the E. coli genome, only in the weakly Eσ70-dependent atpBEFHAGDC and flgBCDEFGHIJ operons could we observe the presence of a single gene showing dependence on EσS in our ROMA experiments (data not shown). This result might suggest the presence of EσS-dependent promoters, or promoter-like sequences, within these operons; however, we decided to focus our investigation on operons consistently showing preferential recognition by either form of RNA polymerase, and the atpBEFHAGDC and flgBCDEFGHIJ operons were not considered further in our study. In Supplementary Table S1, we show the ratios of EσS- versus Eσ70-dependent transcription for operons featuring genes preferentially transcribed in the presence of EσS. Complete results of ROMA experiments are summarized in Tables 1 and 2, listing genes and intergenic regions transcribed more efficiently either by EσS (Table 1) or by Eσ70 (Table 2). Out of the 54 genes preferentially transcribed by EσS in ROMA experiments that are annotated in E. coli MG1655, 21 (39%) had already been described as rpoS-dependent genes, either from genetic characterization or from microarray experiments comparing wild-type and rpoS mutant strains (underlined in Table 1). In contrast, no known rpoS-dependent gene was found to be preferentially transcribed by Eσ70 (Table 2). However, several genes clearly identified by previous works as rpoS-dependent (e.g. katE, dps and gadA) only showed slight (<1.5-fold) preferential recognition by EσS in our experimental conditions (Supplementary Table S2). This would suggest that, for these rpoS-dependent genes, promoter recognition by EσS might be mediated by regulatory proteins, or facilitated by additional factors such as DNA supercoiling or effector molecules (e.g. ppGpp), missing in ROMA experiments. Alternatively, in ROMA experiments, Eσ70 might recognize promoters which might not be accessible to this form of RNA polymerase in vivo, due to selective negative recognition by regulatory proteins such as H-NS (11,12). A significant fraction of genes more efficiently transcribed by EσS is involved in sugar and in polyamine metabolism, in response to cellular stresses and in nucleic acid synthesis, modification and turnover (Table 1), consistent with the role of σS as a starvation- and stress-related protein; no essential gene was found. Interestingly, several EσS-dependent transcripts correspond to non-coding RNAs, intergenic regions and ORFs of unknown function only annotated in pathogenic E. coli strains. This last result was surprising, since the DNA template used in the in vitro transcription experiments came from E. coli MG1655; however, probe set analysis and sequence comparison performed using BLAST (http://blast.ncbi.nlm.nih.gov/Blast.cgi) revealed that these ORFs are also present in MG1655, although they are not annotated in this strain. Several of these unlisted ORFs overlap known genes in an antisense direction, as indicated in Table 1. Interestingly, one of such ORFs (c3113, annotated in the uropathogenic strain CFT073), as well as one intergenic region (IG223409_223770-r), is located in the promoter regions of ribosomal operons rrsG, and rrsH in an antisense direction, possibly suggesting that they may play a role in EσS-dependent control of ribosomal operon transcription. In contrast, Eσ70 appeared to transcribe with higher-efficiency genes encoding ribosomal proteins and other protein synthesis-related genes, such as rRNA- and tRNA-encoding genes and prfB, encoding for release factor 2 (Table 2). Several of the protein synthesis-related genes listed in Table 2 are essential, as are the yjeE and the amiB genes, part of a multifunctional operon also preferentially transcribed by Eσ70. Interestingly, regulatory genes directly affecting σ70 activity, such as relA, responsible for biosynthesis of the ppGpp alarmone in response to amino acid starvation (35), and ssrS, encoding a 6S RNA able to modulate Eσ70-dependent at several promoters (36), showed dependence on Eσ70 in ROMA experiments (Table 2). For ssrS, our data confirm literature data showing that the main ssrS promoter (ssrS P1) is strictly Eσ70-dependent in vitro (37). Interestingly, several genes preferentially transcribed by Eσ70 in ROMA experiments are upregulated in rpoS mutant strains (underlined in Table 2); it has been proposed that negative regulation by σS can depend on competition with σ70 for a limiting amount of RNA polymerase core enzyme, which results in lower intracellular Eσ70 concentrations and, in turn, in impaired transcription initiation at strictly σ70-dependent promoters (38).

In vivo validation of EσS-dependent genes identified in ROMA experiments

In order to validate the results of ROMA experiments, we tested in vivo expression of 18 genes preferentially transcribed by EσS, comparing their transcript levels in MG1655 and in its rpoS mutant derivative EB1.3. Transcript levels were determined by quantitative real-time PCR. Different genetic backgrounds (MG1655, MC4100 and OH157:H7 EDL933 strains) and different growth conditions (LB medium, glucose-based medium and biofilm growth) can strongly affect rpoS-dependent gene expression (9,17–20), suggesting that most genes belonging to the σS regulon are also subject to additional forms of gene expression regulation. Thus, in vivo gene expression was determined in three different growth media: the peptone-based LB medium, either full strength or diluted 1:4, and the glucose-based M9Glu/sup medium (Table 3). In addition, we took samples from cultures both in late exponential phase and in late stationary phase of growth; indeed, although σS-mediated gene expression is typically associated with stationary phase, several rpoS-dependent genes are maximally expressed at the transition between exponential and stationary phase or even in mid-exponential phase (19,39; Landini,P., unpublished data).
Table 3.

Gene expression in bacterial cells

GeneLB
M9Glu/sup
LB 1/4
ExpStatExpStatExpStat
ychS2.40.11.16.71.72.0
gshA1.01.51.01.11.93.3
ycbX1.10.50.72.01.52.6
recT0.90.40.72.51.82.0
rhaS1.80.31.52.71.52.7
cyaR1.01.71.510.00.80.6
ilvY1.70.31.010.21.92.2
hyaA1.25.62.12.92.317.5
speB1.51.31.42.71.286.3
gyrB1.40.80.92.81.03.1
gabD4.80.712.11.25.169.0
ydhY1.50.30.919.21.12.1
appY1.20.70.93.42.011.4
puuB1.70.20.63.53.01.8
treF1.53.07.20.62.152.6
bsmA1.11.02.30.91.02.1
xapA2.40.20.84.31.12.7
c31131.01.41.38.12.71.3
dps0.910.710.70.314.230.3
cspA1.30.21.10.80.70.2

Relative expression is indicated as WT/rpoS ratio.

Values higher than 2.5-fold were considered significant and are shown in boldface type.

Values are the average of two independent experiments performed in duplicate.

Gene expression in bacterial cells Relative expression is indicated as WT/rpoS ratio. Values higher than 2.5-fold were considered significant and are shown in boldface type. Values are the average of two independent experiments performed in duplicate. For five genes (recT, hyaA, gabD, puuB and treF), dependence on a functional rpoS gene has been described (Table 1); however, for the recT gene, part of the recET-lar-ydaCQ operon, both positive and negative control by rpoS was reported (17,18). As a positive control, we tested the expression of dps that, although showing only weak dependence on EσS in our ROMA experiments (Supplementary Table S2), has consistently been described as rpoS-dependent in vivo in several reports (9,16,17); in agreement with the literature data, dps expression showed strong rpoS-dependence in all conditions tested (Table 3). We also tested the expression of cspA, which was transcribed with higher efficiency by Eσ70 in ROMA experiments (Table 2); consistent with this result, in vivo gene expression studies show that a functional rpoS allele is not required for cspA expression, which is in fact upregulated in the rpoS mutant when grown in peptone-based media (Table 3), in agreement with previous observations (20). For in vivo gene expression experiments, we considered as significant a fold difference ≥2.5 in WT versus rpoS mutant relative expression ratio. As shown in Table 3, growth conditions strongly affected gene expression: in LB medium, expression of only three genes (hyaA, gabD and treF) was dependent on a functional rpoS allele, while seven genes were in fact upregulated in the rpoS mutant, suggesting negative control by σS. In contrast, 12 genes were expressed in an rpoS-dependent fashion when bacteria were grown in 1:4 diluted LB, while 15 genes showed dependence on rpoS in M9Glu/sup medium (Table 3). Of all genes tested, only the bsmA gene did not show relative expression values ≥2.5 in any growth condition, while recT and ycbX only displayed weak dependence on a functional rpoS allele (2.5- and 2.6-fold increase in one growth condition). All other genes showed either strong dependence on the rpoS gene (up to 86.3-fold for speB in LB1/4 medium) or were affected by the rpoS mutation in more than one growth condition (e.g. rhaS, induced 2.7-fold in the wild-type strain both in M9Glu/sup and in LB1/4 growth media). Finally, in vivo gene expression experiments demonstrated that the c3113 gene, tested as a representative of ORFs only annotated in the uropathogenic E. coli CFT073, is indeed expressed in MG1655, and it shows dependence on the rpoS gene when bacteria are grown in M9Glu/sup (Table 3).

In vitro transcription assays on single promoters

To further confirm the results of ROMA experiments, we performed in vitro transcription assays comparing EσS and Eσ70 on single promoters. To this aim, promoter regions of the ilvY, speB, xapA, ydhY and ssrS genes were cloned into the pJCD01 vector (22) and the obtained plasmids were used as DNA template for in vitro transcription. The ilvY, speB, xapA and ydhY genes were selected since their promoter region has been experimentally identified and they show rpoS-dependence in vivo (Table 3). In contrast, ssrS shows the highest dependence on Eσ70 in ROMA experiments among genes whose promoter has been characterized (Table 2). Strong dependence on Eσ70 is in agreement with previous results showing that ssrS P1, the main ssrS promoter, is Eσ70-dependent in vitro (37), and consistent with upregulation of ssrS in an rpoS mutant of MG1655 (20). Transcription of the different genes of interest was normalized to the RNA-I transcript, as previously described (22), and transcript quantitation was performed by real-time PCR as described in ‘Materials and Methods’ section. As shown in Figure 1, in vitro transcription experiments performed on plasmids showed that the ilvY, speB, xapA and ydhY are transcribed more efficiently in the presence of EσS. Extent of dependence on EσS was higher than in ROMA experiments, ranging from a 3.4-fold difference for speB to a 10.2-fold difference for xapA, thus suggesting that, at least at these promoters, DNA supercoiling does not negatively affect promoter recognition by EσS. In contrast, the ssrS P1 promoter showed clear dependence on Eσ70, in agreement with previous observations (37). Thus, results of in vitro transcription experiments performed on single promoters were fully consistent with ROMA experiments.
Figure 1.

In vitro transcription experiments on supercoiled plasmids. Transcription from the ilvY, speB, xapA, ydhY and ssrS P1 promoter regions cloned into pJCD01 plasmid was performed in the presence of either EσS or Eσ70; transcript amounts were determined by quantitative real-time PCR as described in ‘Materials and Methods’ section, using the RNA-I transcript as reference as previously described (22). Relative transcript levels are shown as EσS/Eσ70 ratio. Experiments were performed three times in duplicate, and standard errors are shown.

In vitro transcription experiments on supercoiled plasmids. Transcription from the ilvY, speB, xapA, ydhY and ssrS P1 promoter regions cloned into pJCD01 plasmid was performed in the presence of either EσS or Eσ70; transcript amounts were determined by quantitative real-time PCR as described in ‘Materials and Methods’ section, using the RNA-I transcript as reference as previously described (22). Relative transcript levels are shown as EσS/Eσ70 ratio. Experiments were performed three times in duplicate, and standard errors are shown.

Sequence elements involved in specific EσS- versus Eσ70 recognition of promoter regions

The genes identified in ROMA experiments define, at least partially, what can be considered as the ‘core σS regulon’, i.e. a set of genes whose transcription is directly controlled by EσS. The core σS regulon can be opposed to the ‘expanded σS regulon’, i.e. genes dependent on a functional rpoS allele in vivo, whose promoters are, however, not necessarily recognized by EσS. In order to identify sequence features important for specific promoter recognition by EσS, it can be very informative to compare the sequences of EσS-dependent promoters; however, it is important to limit this comparison to the promoters that are exclusively, or at least preferentially, under EσS control. Thus, we performed a sequence alignment on the promoter sequences of genes preferentially transcribed by either EσS or Eσ70. Known promoter sequences were retrieved from the Ecocyc database (http://ecocyc.org/): only promoters whose transcription initiation start site had been experimentally determined were considered for sequence alignment. However, since many rpoS-dependent genes are controlled by multiple promoters in vivo, suggesting complex regulation that might involve different sigma factors, we verified that their transcription start sites in the in vitro transcription reactions did indeed correspond to the transcription sites reported in the literature. Thus, using RACE analysis, we determined transcription start sites on in vitro transcription assays carried out with EσS and performed as in the ROMA experiment (data not shown): we were able to identify precisely the in vitro transcription start sites at 31 promoter regions controlling 29 different genes, as listed in Supplementary File S1. For most genes with already known promoter regions, we could confirm the transcription start observed in vivo, although for several genes reported to be controlled by multiple promoters, only one promoter was found to be recognized by EσS in vitro (e.g. osmB, otsB, glgC and murQ). For the gabD gene, two of the three promoters described as functional in the bacterial cell were also recognized by EσS in the in vitro transcription assays. In contrast, for the araF gene, we identified a second promoter additional to the one already described in the literature. Finally, we determined the transcription start sites for eight genes (or operons) preferentially transcribed by EσS in ROMA experiments with yet-unknown transcription start sites, namely puuCBE, tam, treF, yciT, yffOP, yhjG, yi91a and yqiG: their transcription start site and putative promoter elements are listed in Supplementary File S1. For Eσ70-dependent promoters, we verified the transcription start sites for the ssrS transcript obtained in ROMA experiments performed with Eσ70: the transcription start site matched the known transcription site for the ssrS P1 promoter previously identified (37). For promoter sequence comparison, we considered DNA sequences extending from −100 to +2 bp relative to the transcription start site, i.e. an area that includes all promoter elements described for Eσ70. For sequence analysis, we divided the −100 to +2 promoter sequence in three parts: the −17 to +2 region, carrying the −10 element and the transcription start site; the −60 to −18 region, containing the UP element and the −35 sequence; and the −100 to −61 region. Alignment of the −17 to +2 regions was centred on the first nucleotide of the −10 element (conventionally referred to as the −12 position, Figure 2B). For the −60 to −18 regions, the alignment was centred either on the first nucleotide of the −35 element (when present) or on the nucleotide located 22 bp upstream of the −10 element, which would correspond to the first nucleotide of a hypothetical −35 element placed at the optimal 17 bp distance from the −10 sequence (Figure 2A). In total, 31 promoters showing preferential recognition by EσS (listed in Supplementary File S1) and 30 promoters preferentially recognized by Eσ70 (Supplementary File S2) were selected for sequence analysis. Consensus sequence conservation within promoters preferentially recognized by either EσS or Eσ70 was displayed as sequence logos using the Weblogo 2.8.2 application (40) (http://weblogo.berkeley.edu/). Results of sequence analysis showed that, as expected, the most conserved sequences corresponded to the known promoter elements and were comprised between −60 and +2 (Figure 2). Comparison of the sequences located further upstream (−100 to −61) is shown in Supplementary Figure S2.
Figure 2.

Conserved sequence features in promoters of genes showing preferential transcription by either Eσ70 or EσS, shown as a sequence logo derived from multiple sequence alignments. (A) Comparison of alignments in the −60 to −18 promoter regions. (B) Comparison of alignments in the −17 to +2 promoter regions. Note that different y-axis scales were used in the two panels to account for the different levels of sequence conservation. Multiple alignment included 31 promoters controlling genes preferentially transcribed by EσS in ROMA experiments (listed in Supplementary File S1) and 29 promoters preferentially recognized by Eσ70 (listed in Supplementary File S2).

Conserved sequence features in promoters of genes showing preferential transcription by either Eσ70 or EσS, shown as a sequence logo derived from multiple sequence alignments. (A) Comparison of alignments in the −60 to −18 promoter regions. (B) Comparison of alignments in the −17 to +2 promoter regions. Note that different y-axis scales were used in the two panels to account for the different levels of sequence conservation. Multiple alignment included 31 promoters controlling genes preferentially transcribed by EσS in ROMA experiments (listed in Supplementary File S1) and 29 promoters preferentially recognized by Eσ70 (listed in Supplementary File S2). For promoters more efficiently recognized by Eσ70, the −35 and −10 sequences, i.e. the most important Eσ70-dependent promoter elements, are clearly recognizable in the sequence logos (Figure 2). In addition, the −60/−40 region is characterized by a high occurrence of A and T residues (Figure 2A), consistent with strong conservation of the UP element, i.e. a binding site for the α subunit of RNA polymerase (41). Other than the already known promoter features for σ70, only the transcription start site (−3/+1) showed a moderately conserved sequence (CCCG, Figure 2B). Several differences in conserved regions were detectable in the promoter set for genes more efficiently transcribed by EσS in ROMA experiments. The −10 sequence is clearly the main conserved promoter element in this set (Figure 2B). No conserved sequences were detectable in the −35 region (Figure 2A), in agreement with previous works reporting that the −35 element does not play an important role in promoter recognition by EσS (42,43), while a weakly conserved sequence similar to a −35 element seems to be located at around −30 (Figure 2A). Another difference between the two promoter sets resides in the lack of an A/T rich region between −60 and −40 (Figure 2A). In contrast, however, promoters more efficiently transcribed by EσS seem to possess an increased occurrence of T residues in the −90 to −70 region (Supplementary Figure S2). Finally, the discriminator, i.e. the sequence located between the −10 element and the transcription start site, appears to be biased towards a high A/T content in promoters transcribed more efficiently by EσS. In particular, T residues appear to be conserved at positions −6, −2 and −1. Rather than being associated to specific nucleotide position, however, the bias towards an A/T-rich discriminator seems to be a common feature in promoters of genes preferentially transcribed by EσS. Indeed, while the average length of the discriminator is identical for both Eσ70- and EσS-dependent promoters (6.1 bp, Supplementary Table S3), the GC content is significantly higher in Eσ70-dependent promoters (0.58 versus 0.41, Supplementary Table S3). Finally, an additional deviation between Eσ70- and EσS-specific promoter elements could be observed immediately upstream of the −10 hexamer: the Eσ70-dependent promoters showed some conservation of a TGG motif upstream of the −10 [the ‘extended −10′ (44)], which was replaced by a TNGC motif, which includes the characteristic −13C element, in EσS-dependent promoters.

Mutations in the discriminator region affect selective promoter recognition at the Eσ70-dependent promoter ssrS P1

It might be inferred that differences in the specific elements of the promoter regions might account for different transcription efficiency by either EσS or Eσ70 in ROMA experiments. In particular, we investigated whether the presence of an A/T rich region immediately downstream of the −10 element, conserved among promoters preferentially recognized by EσS in ROMA experiment (Figure 2B), could indeed be a determinant for selective promoter recognition by EσS. To test this possibility, we targeted for mutagenesis the ssrS P1 promoter, which shows preferential recognition by Eσ70 in our in vitro assays (Table 1 and Figure 1) and whose dependence on Eσ70 had already been reported in the literature (37). We changed the CG nucleotides at positions −3/−2 of the ssrS P1 promoter to TA (Figure 3), thus making the −6/+1 region of the promoter A/T-rich. In vitro transcription experiments on supercoiled templates in the presence of either EσS or Eσ70 showed that CG to TA substitutions at positions −3/−2 of the ssrS P1 promoter resulted in loss of specific recognition by Eσ70 from 3.5- to 1.6-fold (Figure 3), suggesting that the A/T content in the −6/+1 region can indeed play a role in modulating transcription efficiency by either EσS or Eσ70.
Figure 3.

In vitro transcription experiments on pJCD01 plasmid derivatives in which either the ssrS P1 promoter (ssrS P1 WT) or a mutated derivative (ssrS P1 mut) had been cloned. The ssrS P1 mut carries a double substitution (CG to TA) at positions −3/−2 (shown in the figure). In vitro transcription was performed in the presence of either EσS or Eσ70; transcript amounts were determined by quantitative real-time PCR as described in ‘Materials and Methods’ section, using the RNA-I transcript as reference as previously described (22). Experiments were performed three times in duplicate, and standard errors are shown.

In vitro transcription experiments on pJCD01 plasmid derivatives in which either the ssrS P1 promoter (ssrS P1 WT) or a mutated derivative (ssrS P1 mut) had been cloned. The ssrS P1 mut carries a double substitution (CG to TA) at positions −3/−2 (shown in the figure). In vitro transcription was performed in the presence of either EσS or Eσ70; transcript amounts were determined by quantitative real-time PCR as described in ‘Materials and Methods’ section, using the RNA-I transcript as reference as previously described (22). Experiments were performed three times in duplicate, and standard errors are shown.

DISCUSSION

In this work, we have attempted to identify bona fide EσS-dependent promoters through in vitro transcription experiments using the whole E. coli genome as template, followed by identification of EσS-dependent transcripts by microarray analysis (ROMA experiments). The in vitro transcription experiments have been performed on linear DNA, in the absence of any additional factor or molecule able to affect transcription initiation by either EσS or Eσ70 (i.e. DNA supercoiling, transcription regulators, histone-like proteins, ppGpp), and thus they represent a direct measurement of sequence-specific interactions between promoters and RNA polymerase. Our results indicate that selective promoter utilization by Eσ70 and EσS is mediated by specific sequence features, and confirm the role of σS-specific promoter elements previously identified. Promoters more efficiently recognized by Eσ70 in ROMA experiments are characterized by a high occurrence of the UP-like element, i.e. an A/T-rich region located immediately upstream of the −35 sequence and acting as a binding site for the α subunit of RNA polymerase (41), suggesting that, at least in vitro, the UP element might favour promoter recognition by Eσ70. It is worth mentioning that, in Bacillus subtilis, UP elements are highly conserved among σA-dependent promoters (45), thus suggesting that UP elements might favour promoter recognition by the housekeeping σ factor in different bacteria. In contrast, UP-like elements flanking the −35 sequence are less conserved in promoters better recognized by EσS, where, however, A/T-rich elements seem to be scattered in the region spanning 70−90 nt upstream of the transcription start (Supplementary Figure S2). It might be speculated that, similar to the UP element for Eσ70-dependent promoters, AT-rich sequences located in the −70 to −90 region might also be involved in interaction with RNA polymerase α subunit at σS-dependent promoters. It is conceivable that the α subunit might contact alternative upstream promoter elements when assembled in different forms of RNA polymerase holoenzyme; indeed, differential ability to interact with UP elements has already been described for Eσ70 and EσS (46). A sequence element showing strong differences between the two promoter sets is the discriminator, i.e. the region spanning between −10 and +1. This region shows a high T/A content for promoters more efficiently recognized by EσS; in contrast, Eσ70-dependent promoters identified in ROMA experiments are biased towards a CG-rich region in the −3/+1 residues (CCCG, Figure 2). Mutations in the discriminator region of the Eσ70-dependent ssrS P1 promoter increasing its A/T content result in partial loss of preferential recognition by Eσ70 (Figure 3), providing further confirmation for a role of the −10/+1 region in specific recognition by either form of RNA polymerase. GC versus AT content in the discriminator region can directly affect promoter melting, and GC-rich discriminators are a common feature among promoters subject to negative regulation by ppGpp (47,48), which, indeed, inhibits Eσ70-dependent transcription while promoting transcription by alternative forms of RNA polymerase (49). The sequence features indicated by our promoter analysis are consistent with previous observations based on sequence analysis of rpoS-dependent promoters in the bacterial cell (4); this also includes the fairly strong conservation of a C nucleotide immediately upstream of the −10 hexamer (the −13 C element) among σS-dependent promoters. Indeed, the −13 C element occurs in almost half of the promoters transcribed more efficiently by EσS in ROMA experiments (14 out of 31 promoters; Supplementary File S1), opposed to a much lower frequency in Eσ70-dependent promoters (4 out of 29, Supplementary File S2). This result is not particularly surprising: indeed, a large amount of data has clearly shown that the presence of a C residue immediately upstream of the −10 nt favours promoter recognition by EσS (7,8,50). The −13C element might play a similar role as the TG motif at Eσ70-dependent promoters lacking a −35 region (44). It is noteworthy that an in vivo analysis suggests that the −13C element occurs in more than 70% of putative rpoS-dependent promoters (4): this observation would suggest that the −13C element might be needed to improve EσS–promoter interaction in the bacterial cell, possibly to overcome the negative effects of DNA-binding proteins, such as H-NS, that can modulate RNA polymerase–promoter interaction. The results of the ROMA approach have also expanded our knowledge of the σS regulon: the σS protein is considered a central element of the so-called ‘general stress response’ (51). Intracellular σS concentration and expression of σS-dependent genes respond to reduction in growth rate (52); thus, any cellular stress affecting growth rate is likely to induce σS accumulation, which in turn plays a direct role in oxidative, acid and osmotic stress through activation of specific genes. In addition, σS activates metabolic genes associated to stationary-phase metabolism, in particular carbon storage genes involved in glycogen (19,20) and trehalose metabolism (9,20); consistent with these observations, stress response and carbon metabolism genes are highly represented among genes preferentially transcribed by EσS in ROMA experiments (Table 1). In addition, our results underline the importance of σS for the expression of genes involved in polyamine metabolism. Polyamines, in particular putrescine, play an important role in various cellular processes and can affect intracellular concentrations of the regulatory proteins σN, Cra and H-NS, thus impacting global gene expression (53). As shown in Figure 4, putrescine, the most abundant polyamine in bacterial cells (54), is a product of arginine degradation. Putrescine accumulates at the transition between exponential and stationary phase (55), and it can be subsequently converted into other polyamines or degraded to succinate, which can be shunted into the tricarboxylic acid (TCA) cycle, in a NADPH-generating process (Figure 4). Interestingly, mutants unable to synthesize polyamines are more sensitive to oxidative stress, since they cannot induce ahpC, katE and katG genes, encoding three different peroxidase, suggesting that polyamine accumulation might control expression of rpoS-dependent genes involved in the response to oxidative stress (54). In turn, our results indicate that genes encoding enzymes involved in both accumulation and degradation of putrescine, one of the main polyamines found in the bacterial cell, are rpoS-dependent. Indeed, the speB gene, encoding the putrescine-biosynthetic enzyme agmatinase, is preferentially transcribed by EσS in vitro (Table 1 and Figure 1) and is rpoS-dependent in vivo, in particular in LB1/4 medium (Table 3). Likewise, the genes involved in putrescine degradation, belonging to the puuCBE and gabDTP operons, are efficiently transcribed by EσS in ROMA experiments (Table 1 and Supplementary Table S1) and show dependence on a functional rpoS gene in vivo (Table 3). The gabDTP operon, as well as the puuA and puuD genes, also involved in putrescine degradation, have already been reported to be rpoS-dependent in vivo (20,55,56). Thus, σS could control every step in putrescine metabolism by regulating genes responsible for both its biosynthesis and its degradation, as shown in Figure 4. Activation of putrescine biosynthesis rather than degradation might respond to different growth conditions and environmental cues, as also suggested by the very different levels of rpoS-dependent regulation of putrescine-related genes (i.e. speB, gabD and puuB) in different growth media (Table 3).
Figure 4.

Schematic representation of a biochemical pathway under the control of the rpoS gene: arginine degradation to succinate via putrescine (adapted from EcoCyc; http://ecocyc.org/). Genes belonging to this pathway and found to be preferentially transcribed by EσS in ROMA experiments are boxed with solid lines (speB, puuB and gabD); genes co-transcribed with either puuB or gabD (i.e. belonging to the puuCBE and gabDTP operons; see also Supplementary Table 1) are boxed with dashed lines. Other genes of the pathway known to be dependent on the rpoS gene in vivo (puuA and puuD) (20,55) are underlined.

Schematic representation of a biochemical pathway under the control of the rpoS gene: arginine degradation to succinate via putrescine (adapted from EcoCyc; http://ecocyc.org/). Genes belonging to this pathway and found to be preferentially transcribed by EσS in ROMA experiments are boxed with solid lines (speB, puuB and gabD); genes co-transcribed with either puuB or gabD (i.e. belonging to the puuCBE and gabDTP operons; see also Supplementary Table 1) are boxed with dashed lines. Other genes of the pathway known to be dependent on the rpoS gene in vivo (puuA and puuD) (20,55) are underlined. rpoS-dependent control of intracellular polyamines concentrations represents an important mechanism of indirect gene regulation by the σS protein, since polyamines act as signal molecules able to impact global transcription pattern in the bacterial cell (53). Indirect control of gene expression by σS can also occur through activation of regulatory proteins and of regulatory RNAs. Indeed, ROMA experiments would suggest that EσS directly controls at least four genes encoding regulatory proteins (appY, ilvY, rhaS and yciT) and four genetic loci encoding regulatory RNAs (cyaR, micA, sgrS/sgrT and ECs3934) (Table 1). Interestingly, the sgrS/sgrT locus, encoding both a non-coding RNA (SgrS) and a small regulatory protein (SgrT), negatively affects expression of the ptsG glucose uptake system (57), which is overexpressed in rpoS mutant strains of E. coli (20,58). However, the extent of σS-dependent indirect regulation of gene expression through non-coding RNA might not be limited to these four loci. Indeed, a significant number of intergenic regions, often located immediately upstream of known ORFs in antisense direction, was efficiently transcribed by EσS (Table 1). It is possible that at least a part of these intergenic regions might indeed be transcribed in the bacterial cell and function as cis-acting regulatory RNAs. Similarly, several ORFs only annotated in pathogenic E. coli strains were detectable as transcripts in the ROMA experiment, although MG1655 genomic DNA had been used in the experiments. Sequence comparison allowed us to determine that these ORFs are indeed present in MG1655, but they are not accounted for in the available databases. As observed for intergenic regions efficiently transcribed by EσS, several of these ORFs also overlap known genes in an antisense direction, suggesting that at least some of them might be involved in modulating gene expression. One ORF, c3113, partially overlaps the promoter region of the rrsG ribosomal operon, and quantitative real-time PCR experiments suggest that it is transcribed in a stationary-phase-dependent and rpoS-dependent manner in bacterial cells grown in glucose-based medium (Table 3). Similar to c3113, an intergenic region showing preferential transcription by EσS (IG223409_223770-r) also partially overlaps the ribosomal operon rrsH, but in an antisense direction. Thus, our observations would suggest that small proteins or non-coding RNAs might play an important role in rpoS-dependent negative regulation of genes associated with fast growth, such as ribosomal operons and the ptsG glucose uptake system.

SUPPLEMENTARY DATA

Supplementary Data are available at NAR Online.

FUNDING

The Swiss National Fund for Scientific Research (Project 3100A0-109433 to T.E. and P.L.); Italian Ministry for University and Research (CHEM-PROFARMA-NET Research Program, Project RBPR05NWWC_004 to P.L., and FIRB-MIUR NG-LAB Research Program, Project RBLA03ER38_004 to G.D.B.). Funding for open access charge: Italian Ministry for University and Research (MIUR). Conflict of interest statement. None declared.
  82 in total

1.  A DNA architectural protein couples cellular physiology and DNA topology in Escherichia coli.

Authors:  R Schneider; A Travers; T Kutateladze; G Muskhelishvili
Journal:  Mol Microbiol       Date:  1999-12       Impact factor: 3.501

2.  What makes an Escherichia coli promoter sigma(S) dependent? Role of the -13/-14 nucleotide promoter positions and region 2.5 of sigma(S).

Authors:  G Becker; R Hengge-Aronis
Journal:  Mol Microbiol       Date:  2001-03       Impact factor: 3.501

3.  Involvement of differential efficiency of transcription by esigmas and esigma70 RNA polymerase holoenzymes in growth phase regulation of the Escherichia coli osmE promoter.

Authors:  P Bordes; F Repoila; A Kolb; C Gutierrez
Journal:  Mol Microbiol       Date:  2000-02       Impact factor: 3.501

Review 4.  A role for the sigma S subunit of RNA polymerase in the regulation of bacterial virulence.

Authors:  R Hengge-Aronis
Journal:  Adv Exp Med Biol       Date:  2000       Impact factor: 2.622

5.  6S RNA regulates E. coli RNA polymerase activity.

Authors:  K M Wassarman; G Storz
Journal:  Cell       Date:  2000-06-09       Impact factor: 41.582

6.  Involvement of a novel transcriptional activator and small RNA in post-transcriptional regulation of the glucose phosphoenolpyruvate phosphotransferase system.

Authors:  Carin K Vanderpool; Susan Gottesman
Journal:  Mol Microbiol       Date:  2004-11       Impact factor: 3.501

7.  sigma factor selectivity of Escherichia coli RNA polymerase: role for CRP, IHF and lrp transcription factors.

Authors:  F Colland; M Barth; R Hengge-Aronis; A Kolb
Journal:  EMBO J       Date:  2000-06-15       Impact factor: 11.598

8.  Transcriptional regulation in response to oxygen and nitrate of the operons encoding the [NiFe] hydrogenases 1 and 2 of Escherichia coli.

Authors:  D J Richard; G Sawers; F Sargent; L McWalter; D H Boxer
Journal:  Microbiology       Date:  1999-10       Impact factor: 2.777

9.  Identification of the CRP regulon using in vitro and in vivo transcriptional profiling.

Authors:  Dongling Zheng; Chrystala Constantinidou; Jon L Hobman; Stephen D Minchin
Journal:  Nucleic Acids Res       Date:  2004-11-01       Impact factor: 16.971

10.  Regulation of 6S RNA biogenesis by switching utilization of both sigma factors and endoribonucleases.

Authors:  Kwang-sun Kim; Younghoon Lee
Journal:  Nucleic Acids Res       Date:  2004-11-18       Impact factor: 16.971

View more
  45 in total

1.  Mutations in Escherichia coli Polyphosphate Kinase That Lead to Dramatically Increased In Vivo Polyphosphate Levels.

Authors:  Amanda K Rudat; Arya Pokhrel; Todd J Green; Michael J Gray
Journal:  J Bacteriol       Date:  2018-02-23       Impact factor: 3.490

2.  A small RNA activates CFA synthase by isoform-specific mRNA stabilization.

Authors:  Kathrin Sophie Fröhlich; Kai Papenfort; Agnes Fekete; Jörg Vogel
Journal:  EMBO J       Date:  2013-10-18       Impact factor: 11.598

3.  A regulatory feedback loop between RpoS and SpoT supports the survival of Legionella pneumophila in water.

Authors:  Hana Trigui; Paulina Dudyk; Jinrok Oh; Jong-In Hong; Sebastien P Faucher
Journal:  Appl Environ Microbiol       Date:  2014-11-21       Impact factor: 4.792

Review 4.  Oxidative stress protection by polyphosphate--new roles for an old player.

Authors:  Michael J Gray; Ursula Jakob
Journal:  Curr Opin Microbiol       Date:  2015-01-10       Impact factor: 7.934

5.  Characterization of a Minimal Type of Promoter Containing the -10 Element and a Guanine at the -14 or -13 Position in Mycobacteria.

Authors:  Yan Zhu; Chunyou Mao; Xingyi Ge; Zhongwei Wang; Pei Lu; Yong Zhang; Shiyun Chen; Yangbo Hu
Journal:  J Bacteriol       Date:  2017-10-03       Impact factor: 3.490

Review 6.  Inorganic polyphosphate, a multifunctional polyanionic protein scaffold.

Authors:  Lihan Xie; Ursula Jakob
Journal:  J Biol Chem       Date:  2018-11-13       Impact factor: 5.157

7.  TusA Is a Versatile Protein That Links Translation Efficiency to Cell Division in Escherichia coli.

Authors:  Tugba Yildiz; Silke Leimkühler
Journal:  J Bacteriol       Date:  2021-03-08       Impact factor: 3.490

8.  Structural basis of bacterial σ28 -mediated transcription reveals roles of the RNA polymerase zinc-binding domain.

Authors:  Wei Shi; Wei Zhou; Baoyue Zhang; Shaojia Huang; Yanan Jiang; Abigail Schammel; Yangbo Hu; Bin Liu
Journal:  EMBO J       Date:  2020-06-02       Impact factor: 11.598

9.  Tail-Anchored Inner Membrane Protein ElaB Increases Resistance to Stress While Reducing Persistence in Escherichia coli.

Authors:  Yunxue Guo; Xiaoxiao Liu; Baiyuan Li; Jianyun Yao; Thomas K Wood; Xiaoxue Wang
Journal:  J Bacteriol       Date:  2017-04-11       Impact factor: 3.490

Review 10.  Inorganic polyphosphate in the microbial world. Emerging roles for a multifaceted biopolymer.

Authors:  Tomás Albi; Aurelio Serrano
Journal:  World J Microbiol Biotechnol       Date:  2016-01-09       Impact factor: 3.312

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.