Literature DB >> 32297955

A non-canonical promoter element drives spurious transcription of horizontally acquired bacterial genes.

Emily A Warman1, Shivani S Singh1, Alicia G Gubieda1, David C Grainger1.   

Abstract

RNA polymerases initiate transcription at DNA sequences called promoters. In bacteria, the best conserved promoter feature is the AT-rich -10 element; a sequence essential for DNA unwinding. Further elements, and gene regulatory proteins, are needed to recruit RNA polymerase to the -10 sequence. Hence, -10 elements cannot function in isolation. Many horizontally acquired genes also have a high AT-content. Consequently, sequences that resemble the -10 element occur frequently. As a result, foreign genes are predisposed to spurious transcription. However, it is not clear how RNA polymerase initially recognizes such sequences. Here, we identify a non-canonical promoter element that plays a key role. The sequence, itself a short AT-tract, resides 5 base pairs upstream of otherwise cryptic -10 elements. The AT-tract alters DNA conformation and enhances contacts between the DNA backbone and RNA polymerase.
© The Author(s) 2020. Published by Oxford University Press on behalf of Nucleic Acids Research.

Entities:  

Year:  2020        PMID: 32297955      PMCID: PMC7229825          DOI: 10.1093/nar/gkaa244

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


INTRODUCTION

All living organisms transcribe their genomes using the enzyme RNA polymerase (1). The process initiates at defined DNA sequences called promoters (1). In Escherichia coli, a multisubunit core RNA polymerase (α2ββ') binds one of seven dissociable σ factors to recognise promoter DNA (2). The housekeeping σ70 factor is best studied and targets two promoter regions; the -10 (5′-TATAAT-3′) and -35 elements (5′-TTGACA-3′) (2). The -10 sequence facilitates promoter DNA unwinding and is usually indispensable (3). Conversely, the -35 element aids initial RNA polymerase recruitment and can be replaced by transcription factors fulfilling the same role (3). In isolation, core promoter elements are ineffective (4). For instance, specific interactions between σ70 and the -10 element occur only in the context of single stranded DNA (5). Hence, -10 elements support DNA melting but not sequence specific RNA polymerase recruitment (4,5). Sections of the E. coli genome acquired by horizontal gene transfer have an unusually high AT-content (6). Consequently, sequences resembling promoter -10 elements occur frequently (7). These can participate in spurious transcription initiation (8–10). The histone-like nucleoid structuring (H-NS) protein counteracts this by coating large AT-rich DNA islands (8,11,12). The resulting nucleoprotein complexes hinder transcription (13,14). Hence, H-NS acts as a xenogeneic silencer (15,16). Importantly, silencing prevents titration of RNA polymerase that otherwise perturbs housekeeping transcription (8). Given the lack of ancillary promoter elements, and binding sites for transcription factors, it is not clear how RNA polymerase initially recognizes promoters within horizontally acquired genes. In this paper, we sought to understand recognition of spurious promoters by RNA polymerase. Our study reveals the importance of a short AT-tract, positioned upstream of the -10 element, at happenstance but not conventional promoters. This unusual sequence element permits transcription from otherwise cryptic -10 hexamers. Mechanistically, the AT-tract facilitates an interaction with σ70 residue R451, which recognizes the DNA backbone (17). We show that uncontrolled transcription of AT-rich genes is not restricted to σ70 dependence; the alternative σ38 factor also plays a role.

MATERIALS AND METHODS

Strains, plasmids and oligonucleotides

Strains, plasmids and oligonucleotides are described in Table 1. Standard procedures for cloning and DNA manipulation were used throughout. Promoter DNA fragments were made either using olignonucleotides or synthetic DNA fragments described in Table 1. All promoter DNA fragments were flanked by EcoRI and HindIII restriction sites to allow cloning in plasmid pRW50 or pSR. To construct DNA fragments with random sequence, but defined AT-content, we used the oligonucleotide ‘Random R’ in combination with variants of the ‘Random F’ primer (Table 1). Each variant of the latter was synthesised using a different mixture of nucleotides to generate ‘any base’ (N). Whilst the A:T and G:C ratios were always the same the overall AT-content varied as indicated. Primers used to generate other synthetic promoter sequences, with or without AT-tracts, are also listed in Table 1. These promoters were made using pairs of oligonucleotides with short regions of complementarity at the 3′ end with the remainder of the sequence serving as a template for DNA polymerase. The ATR fragment introduced a random string of A or T bases (denoted W in Table 1). RPB104 Δhns was constructed by Gene Doctoring as described previously (8,18).
Table 1.

Strains, plasmids and oligonucleotides

NameDescriptiona,b,cSource
Strains
JCB387Δnir Δlac(33)
RPB104 ΔhnsRPB104MG1655 with C-terminal SPA-tagged rpoS(23)
RPB104 hns::kanThis work
Plasmids
pRW50low copy number lac fusion vector with EcoRI/HindIII cloning site and TetR(34)
pSRpBR322-derived vector with EcoRI/HindII cloning site upstream of λoop terminator.(20)
Encodes AmpR.
pSR Δ45-9A-10TpSR carrying an optimised derivative of the cbpA promoter(17)
Primers for generating random DNA sequences
Random FGGCTGCGAATTCNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNThis work
NNNNNAGGAGGATGCGGACTATG
Random RCGCCCGAAGCTTcatagtccgcatcctcctThis work
Primers for generating synthetic promoter sequences
-10 FGGCTGCGAATTCgaccggcgagcttcgcagtcagctgactataattgccgcgcgcaThis work
-10 RCGCCCGAAGCTTcatagtccgcatcctcctgcgcgcggcaattatagtcagctgacThis work
-10/-35TT FGGCTGCGAATTCgaccttcgagcttcgcagtcagctgactataattgccgcgcgcaThis work
-10/ATi FGGCTGCGAATTCgaccggcgagcttcgctatttattgactataattgccgcgcgcaThis work
-10/ATi RCGCCCGAAGCTTcatagtccgcatcctcctgcgcgcggcaattatagtcaataaatThis work
-10/ATi/-35TT FGGCTGCGAATTCgaccttcgagcttcgctatttattgactataattgccgcgcgcaThis work
-10/ATii FGGCTGCGAATTCgaccggcgagcttcgcagaattttgactataattgccgcgcgcaThis work
-10/ATii RCGCCCGAAGCTTcatagtccgcatcctcctgcgcgcggcaattatagtcagctgacThis work
-10/ATii/-35TT FGGCTGCGAATTCgaccttcgagcttcgcagaattttgactataattgccgcgcgcaThis work
-10 general RGCCCGAAGCTTCatagtccgcatcctcctgcgcgcggcaattatagtcThis work
Primers for introducing an AT-rich spacer of random sequence
-10/ATR FGGCTGCGAATTCgaccggcgagcttcgcwwwwwwwtgactataattgccgcgcgcThis work
-10/ATR/-35TT FGGCTGCGAATTCgaccttcgagcttcgcwwwwwwwtgactataattgccgcgcgcThis work
Primers for amplifying intragenic promoters
wzxB 1.1 FGGCTGCGAATTCacgttactttatctttactatctgcThis work
wzxB 1.1 RGCCCGAAGCTTCCTCCTttgtaagaacacttggtcctgaaaaThis work
yigG 1.2 FGGCTGCGAATTCtactccattatctcgtcatcaacatgThis work
yigG 1.2 RGCCCGAAGCTTCCTCCTcattgcctgaacaggcaaaatcttcThis work
yqiI 2.2 FGGCTGCGAATTCataagttacaccgaaagtataagagThis work
yqiI 2.2 RGCCCGAAGCTTCCTCCTgaatattttatgaatgttttctgThis work
ygaQ 1.1 FGGCTGCGAATTCcggttacacaatactaacttatttaacThis work
ygaQ 1.1 RGCCCGAAGCTTCCTCCTtgaaaaatcaatggcgcttaaatcatcThis work
wcaD FGGCTGCGAATTCTcaaacagtttggtatcaaaacgThis work
wcaD RGCCCGAAGCTTCATAGTCCGCATCCTCCTcccctgaaaacgatccggThis work
lpxD FGGCTGCGAATTCAccagtgccagattgcacataacgThis work
lpxD RGCCCGAAGCTTCATAGTCCGCATCCTCCTtcaggctgcccgccataatgacgThis work
Overlapping primers replacing AT-rich spacers with a GC-rich spacer
wcaD GC fwdGGCTGCGAATTCtcaaacagtttggtatcaaacttcgcagtcagcttgctatgatThis work
wcaD GC revAGCCCGAAGCTTcctcctcccctgaaaacgatccggataatattatccctgcgagaatThis work
catagcaagctgactgcgaagtttgatThis work
lpxD GC fwdGGCTGCGAATTCaccagtgccagattgcaccttcgcagtcagctgacgacaatThis work
lpxD GC revAGCCCGAAGCTTcctccttcaggctgcccgccataatgacgccaccggcaaccgccgtThis work
attgtcgtcagctgactgcgaaggtgcaaThis work
Synthetic DNA fragments to replace AT-rich spacers with GC-rich spacer
wzxB1.1 gc_SpacerCTTGAGTCCACGCTAGATCTGGCTGCGAATTCAcgttactttatctttactatctgctgThis work
ctttggcaatactctgagttgctgtgagattgaaacttcgcagtcagctgactatcatatatagcatagtcg
cttggcaaaaaccgaatataccgaaattttcaggaccaagtgttcttacaaaggaggAAGCTTCGG
GCTTGTCAGTGCGCAAAAAGAT
yigG1.2 gc_spacerCTTGAGTCCACGCTAGATCTGGCTGCGAATTctactccattatctcgtcatcaacatgaThis work
attgccagcgactccgtgatagtggtttcatctatatacttcgcagtcagcttggtacattagcagtatatatc
atctctatcatcacaatgatagccgaagattttgcctgttcaggcaatgaggaggAAGCTTCGGGCTT
GTCAGTGCGCAAAAAGAT
yqiI2.2 gc_spacerCTTGAGTCCACGCTAGATCTGGCTGCGAATTCataagttacaccgaaagtataagagttThis work
ttgattataaaagtcttgacctcttcgcagtcagctgactatatttgcccatgcagatgggtattcttctcctggag
atgggcctggtagtgcattattacagaaaacattcataaaatattcaggaggAAGCTTCGGGCTTGT
CAGTGCGCAAAAAGAT
ygaQ1.1 gc_spacerCTTGAGTCCACGCTAGATCTGGCTGCGAATTCcggttacacaatactaacttatttaacThis work
ccaaaatatcataaaaaagccgttatgaatttcgcagtcagcttggtaacttgtcagttggatgaacaacaa
atgtcatcactgctttatgaaagagatgatttaagcgccattgatttttcaaggaggAAGCTTCGGGCTT
GTCAGTGCGCAAAAAGAT

aN is either A, C, G or T incorporated into the oligonucleotide at random but supplied at a defined % of each nucleotide. Used to generate the DNA fragment library described in Figure 1.

bW is A or T, with an equal likelihood of either base being incorporated.

cSynthetic promoter -10 elements are underlined and key base changes introduced by oligonucleotides are in bold.

Strains, plasmids and oligonucleotides aN is either A, C, G or T incorporated into the oligonucleotide at random but supplied at a defined % of each nucleotide. Used to generate the DNA fragment library described in Figure 1.
Figure 1.

Promoters are more prevalent and active in random DNA sequences with a higher AT-content. (A) Experimental strategy for generation and selection of promoters from random DNA sequences of defined AT-content. Briefly, 43 bp DNA fragments of random sequence, but a defined AT-content between 40% and 75%, were fused to the lacZ gene in plasmid pRW50. The resulting plasmid libraries were used to transform the E. coli Δlac strain JCB387. Transformants expressing LacZ were identified as red or pink colonies on MacConkey agar. Such lac+ colonies were selected and their random 43 bp DNA insertion was sequenced. (B) The number of lac+ colonies increases as AT-content increases. A total of 10,735 colonies were examined and the percentage of lac+ colonies is plotted against the % AT-content of the corresponding insertion library. (C) Average promoter activity increases in AT-rich DNA sequences. The LacZ activity, quantified for lac+ colonies by β-galactosidase assay, is plotted against library % AT-content. (D) DNA sequence motifs associated with different classes of promoter

bW is A or T, with an equal likelihood of either base being incorporated. cSynthetic promoter -10 elements are underlined and key base changes introduced by oligonucleotides are in bold.

β-Galactosidase assays

Assays were done following the Miller protocol using E. coli strain JCB387 (19). Cells were grown to mid-log phase in LB media, supplemented with 35 μg/ml tetracycline, at 37°C. All experiments, except for the ATR assays, were done in triplicate and mean values are shown. For the ATR assays we present the overall distribution of activities obtained from single experiments. Error bars represent standard deviation.

Proteins

RNA polymerase core enzyme was purchased from NEB. WT and R451A σ factors were purified as previously described (17).

In vitro transcription

In vitro transcription assays were done using the system of Kolb et al. (20) and the protocol of Savery et al. (21). Briefly, pSR carrying promoters of interest was isolated using a QIAGEN maxiprep kit. Plasmid DNA was mixed to a final concentration of 16 μg/ml with transcription buffer (20 mM Tris pH 7.9, 200 mM GTP/ATP/CTP, 10 mM UTP, 5 μCi (α32P) UTP, 5 mM MgCl2 and 100 μg/ml BSA). RNA polymerase was mixed with either WT or R451A σ70 then added to reactions for 10 min at 37°C. RNA products were visualized on a 7% denaturing polyacrylamide gel. RNAI transcript was used as a loading control. Full gel images are shown in Supplementary Figure S1.

Promoter DNA bending assays

To compare differences in DNA bending, double stranded promoter fragments generated by PCR were separated on a 7.5% non-denaturing polyacrylamide gel. Electrophoresis was done in TBE buffer at 4°C. DNA was stained with ethidium bromide and viewed on a UV transilluminator. Full gel images are shown in Supplementary Figure S1.

ChIP-seq

Experiments were done according to the protocol of Haycocks et al. (22) using strain RPB104Δhns that encodes SPA-tagged rpoS. Duplicate cultures were grown to an OD600 of 3.0 in LB then crosslinked, lysed and sonicated. Next, σ38-DNA complexes were immunoprecipitated with anti-FLAG antibody and Protein A sepharose beads. Fragments were blunted and poly(A) tailed with 5′-3′ exo- Klenow (NEB). NEXTflex barcoded adaptors (Bioo Scientific) were attached by ligation. Following elution, complexes were de-crosslinked by boiling. The resulting DNA libraries were amplified by PCR and quantified by Qubit analysis before pooling and sequencing using an Illumina MiSeq instrument. The raw data are available from ArrayExpress (E-MTAB-8778).

Bioinformatics

FastQ files were converted to Sanger format using FastqGroomer and aligned to the MG1655 genome using Bowtie for Illumina. The resulting SAM files were converted to BAM files and read depth per base was calculated using MultiBam summary. Data were normalised to the same average read count to allow comparison. Peaks for σ38 binding were called if the average read depth was 4 or above. The same analysis was applied to results of a ChIP-seq assay of σ38 binding in the parent strain RPB104 (23). To identify motifs in collections of putative promoter DNA sequences we used MEME (24).

RESULTS

Happenstance promoters share a conserved AT-rich sequence element

The starting point for this work was our previous analysis of promoters within horizontally acquired genes (8,10). We speculated that such promoters were chance occurrences resulting from the high AT-content of foreign DNA (10). To quantify the relationship between DNA AT-content and promoter occurrence we generated 8 separate DNA fragment libraries. The fragments in each library were 43 bp in length and had random sequences. However, the overall AT-content of libraries was different and set between 40% and 75%. Fragments were fused to lacZ in plasmid pRW50 and used to transform E. coli strain JCB387. A total of 10,735 transformants were selected on MacConkey agar. This allowed 1,039 red lac+ colonies to be identified, corresponding to active promoters. Our experimental strategy is summarised in Figure 1A. For each fragment library, we calculated the percentage of all DNA fragments with promoter activity. These data are plotted against percentage AT-content in Figure 1B. There was a clear correlation between library AT-content and the number of promoters identified. Few promoters were generated in DNA fragments with an AT-content <50%. We also measured LacZ activity in lysates of cultures derived from each lac+ colony. This allowed us to determine the average activity of all promoters in each library (Figure 1C). Whilst a positive correlation was evident, there was no increase in average promoter activity when the AT-content exceeded 60% (Figure 1C). Each active promoter was sequenced and a DNA sequence logo was generated (Figure 1D, top). For comparison, we also made DNA sequence logos representing intragenic promoters subject to repression by H-NS (middle) and canonical intergenic promoters (bottom) (10,25). In all logos, the best conserved feature was the promoter -10 element, particularly bases one, two and six. Conversely, -35 elements were poorly conserved; only the 5′-TT-3′ dinucleotide at positions one and two was evident. Randomly generated and intragenic promoters had an AT-tract between promoter positions -17 and -23. This element was not enriched at canonical promoters. Note that AT-tract sequence differed in randomly generated and intragenic promoters (Figure 1D, compare top two panels). We subsequently refer to these variants as ATi and ATii. Promoters are more prevalent and active in random DNA sequences with a higher AT-content. (A) Experimental strategy for generation and selection of promoters from random DNA sequences of defined AT-content. Briefly, 43 bp DNA fragments of random sequence, but a defined AT-content between 40% and 75%, were fused to the lacZ gene in plasmid pRW50. The resulting plasmid libraries were used to transform the E. coli Δlac strain JCB387. Transformants expressing LacZ were identified as red or pink colonies on MacConkey agar. Such lac+ colonies were selected and their random 43 bp DNA insertion was sequenced. (B) The number of lac+ colonies increases as AT-content increases. A total of 10,735 colonies were examined and the percentage of lac+ colonies is plotted against the % AT-content of the corresponding insertion library. (C) Average promoter activity increases in AT-rich DNA sequences. The LacZ activity, quantified for lac+ colonies by β-galactosidase assay, is plotted against library % AT-content. (D) DNA sequence motifs associated with different classes of promoter

AT-tracts can activate cryptic -10 elements

As noted above, the promoter -10 element alone is ineffective. Hence, transcription factors play a key role by activating canonical promoters. We reasoned that the AT-tract may circumvent the need for transcription factors, or a promoter -35 element, at happenstance promoters. To test this, we generated a set of synthetic promoters. The promoters all had a consensus -10 element. This was augmented with combinations of ATi, ATii and a partial -35 hexamer (5′-TT-3′). The different promoters are illustrated schematically alongside LacZ activity measurements in Figure 2A. As expected, the -10 element alone was unable to drive lacZ expression. Addition of the partial -35 element had no impact. Conversely, addition of either AT-tract variant resurrected promoter activity. This activation increased further when a partial -35 element was also present. To confirm our observations, selected promoters were cloned upstream of the λoop terminator in plasmid pSR. Transcripts terminating at λoop can be detected following electrophoresis. The RNAI transcript is derived from the pSR replication origin and serves as an internal control. No transcripts were produced from promoters lacking AT-tracts (Figure 2B, lanes 1–3). Addition of an AT-tract allowed transcription (lane 4) that increased further upon inclusion of the partial -35 element (lane 5).
Figure 2.

AT-tracts activate transcription from cryptic -10 elements and alter DNA bending. (A) AT-tracts increase promoter activity. Synthetic promoter sequences containing a consensus -10 element (yellow box), -35 element (blue box) and two different AT-tract sequences (ATi and ATii, red box) were cloned upstream of the lacZ gene. For each promoter, LacZ activity was measured in triplicate. Activity values are shown as a percentage relative to the ‘-10/ATi/-35’ promoter. The absolute activity of this promoter was 236 Miller units. Error bars show standard deviation. (B) An AT-tract is required for transcription initiation in vitro. Bands on the gel are RNA transcripts produced in vitro using the indicated promoter sequences. The RNAI transcript serves as an internal control. The control lane shows transcripts generated from empty pSR plasmid. (C) AT-tracts alter DNA bending. Bands on the gel correspond to DNA fragments with or without an AT-tract. All DNA fragments are the same length but have different electrophoretic mobility due to altered curvature.

AT-tracts activate transcription from cryptic -10 elements and alter DNA bending. (A) AT-tracts increase promoter activity. Synthetic promoter sequences containing a consensus -10 element (yellow box), -35 element (blue box) and two different AT-tract sequences (ATi and ATii, red box) were cloned upstream of the lacZ gene. For each promoter, LacZ activity was measured in triplicate. Activity values are shown as a percentage relative to the ‘-10/ATi/-35’ promoter. The absolute activity of this promoter was 236 Miller units. Error bars show standard deviation. (B) An AT-tract is required for transcription initiation in vitro. Bands on the gel are RNA transcripts produced in vitro using the indicated promoter sequences. The RNAI transcript serves as an internal control. The control lane shows transcripts generated from empty pSR plasmid. (C) AT-tracts alter DNA bending. Bands on the gel correspond to DNA fragments with or without an AT-tract. All DNA fragments are the same length but have different electrophoretic mobility due to altered curvature.

AT-tracts alter promoter DNA bending

Changes to the DNA sequence, particularly the introduction of AT-tracts, can alter DNA bending (17). Altered bending affects electrophoretic mobility of DNA during native PAGE. We examined the mobility of different promoters with or without an AT-tract (Figure 2C). DNA fragments containing a -10 element, with or without a partial -35 hexamer, had the same electrophoretic mobility (Figure 2C, lanes 1 and 2). Addition of an AT-tract reduced mobility of DNA fragments during electrophoresis (lanes 3 and 4).

Most AT-tract sequences stimulate transcription

Interestingly, both ATi and ATii were able to activate transcription despite having different sequences (Figure 2A). We reasoned that many AT-tracts may be able to stimulate transcription. To test this we made a new library of promoter DNA fragments. The fragments had a consensus -10 hexamer in the presence or absence of a partial -35 element. Promoter positions -17 to -23 were a random combination of As and Ts (denoted ATR). We examined 103 promoters containing ATR; all were transcriptionally active (Figure 3). This was evident both in the presence (44 promoters) and absence (59 promoters) of the partial -35 element. We conclude that most, and potentially all, appropriately positioned AT-tracts stimulate transcription. In turn, this suggests that a broad range of DNA conformations can be beneficial.
Figure 3.

AT-tracts of any sequence can activate transcription from a cryptic -10 element. Promoter sequences containing a -10 element and a randomised AT-tract, with or without a partial -35 element, were fused to the lacZ gene in plasmid pRW50. The resulting library was used to transform the ΔlacE. coli strain JCB387 and LacZ activity was determined for each transformant. Data are presented as a box plot with each point representing one colony. Measured LacZ activity is also shown for a promoter without an AT-tract (-10/-35) and the empty plasmid.

AT-tracts of any sequence can activate transcription from a cryptic -10 element. Promoter sequences containing a -10 element and a randomised AT-tract, with or without a partial -35 element, were fused to the lacZ gene in plasmid pRW50. The resulting library was used to transform the ΔlacE. coli strain JCB387 and LacZ activity was determined for each transformant. Data are presented as a box plot with each point representing one colony. Measured LacZ activity is also shown for a promoter without an AT-tract (-10/-35) and the empty plasmid.

Activation by AT-tracts requires σ70 residue R451

We previously showed that an A or T at promoter position -18 could stimulate transcription by enhancing a DNA backbone contact with σ70 side chain R451 (17). We predicted that σ70 R451 would also be important at promoters dependent on the AT-tract. To test this, we repeated our in vitro transcription analysis and compared wild type RNA polymerase with the σ70 R451A derivative. The R451A mutation resulted in a total loss of transcription at all promoters dependent on an AT-tract (Figure 4, lanes evenly numbered up to 8). However, the mutant σ70 was unimpaired at a control promoter (lanes 9 and 10). The control promoter is dependent on a near consensus -35 hexamer and has a G at position -18 (17).
Figure 4.

Sidechain R451 of σ70 is required for transcription at promoters dependent on an AT-tract. (A) The gel image shows transcripts generated by RNA polymerase associated with either WT σ70 or the R451A mutant. The RNAI transcript is derived from the pSR replication origin and serves as an internal control. The -9A-10T promoter is constitutively active and does not have an AT-tract.

Sidechain R451 of σ70 is required for transcription at promoters dependent on an AT-tract. (A) The gel image shows transcripts generated by RNA polymerase associated with either WT σ70 or the R451A mutant. The RNAI transcript is derived from the pSR replication origin and serves as an internal control. The -9A-10T promoter is constitutively active and does not have an AT-tract.

Many H-NS repressed intragenic promoters require AT-tracts

We speculated that many naturally occurring promoters, within H-NS silenced genes, would be dependent on AT-tracts and σ70 R451 for activity. To test this prediction, we used six intragenic promoters from our previous analysis of horizontally acquired genes (10,26). The promoters were within the coding sequences of ygaQ, yigG, wcaD, lpxD, yqiI or wzxB. We determined the size of transcripts generated from each promoter in vitro (Supplementary Figure S2). This allowed transcription start sites to be mapped. The annotated promoter sequences are shown in Figure 5A. All six promoters contain an appropriately positioned AT-rich sequence. However, only the promoters within yqiI and wzxB had -35 and -10 sequences near to the consensus. Transcription was measured in vitro using RNA polymerase or the σ70 R451A derivative. Production of the ygaQ, yigG, wcaD and lpxD derived transcripts was greatly reduced by the R451A mutation (Figure 5B, lanes 1–8). Conversely, transcription from the yqiI and wzxB DNA fragments was unchanged (lanes 9–14). We next replaced the AT-tract upstream of each -10 element with a GC-rich sequence. Only promoters requiring σ70 R451 were inactivated when the AT-tract was removed (Figure 5C, lanes 1 and 3, 5 and 7, 9 and 11, 13 and 15). Conversely, promoters not requiring R451 functioned independently of the AT-tract (lanes 17–24).
Figure 5.

Effect of the σ70 R451A mutation on spurious intragenic promoters. (A) Intragenic promoter DNA sequences. Transcription start sites and promoter elements are highlighted. The number of base matches to the consensus sequence for each element is also indicated. (B) The gel image shows transcripts generated by RNA polymerase associated with either WT σ70 or the R451A mutant. Bands at 107/108 nt are RNAI transcripts derived from the pSR replication origin. The -9A-10T is constitutively active and functions independently of the AT-tract. (C) Effect of replacing AT-tracts with GC-rich sequences (the -35 and -10 elements were unchanged). The sequence immediately upstream of the -10 element was replaced with 5′-CTTCGCAGTCAGCTGAC-3′ (or 5′-CTTCGCAGTCAGCT-3′ for extended -10 elements).

Effect of the σ70 R451A mutation on spurious intragenic promoters. (A) Intragenic promoter DNA sequences. Transcription start sites and promoter elements are highlighted. The number of base matches to the consensus sequence for each element is also indicated. (B) The gel image shows transcripts generated by RNA polymerase associated with either WT σ70 or the R451A mutant. Bands at 107/108 nt are RNAI transcripts derived from the pSR replication origin. The -9A-10T is constitutively active and functions independently of the AT-tract. (C) Effect of replacing AT-tracts with GC-rich sequences (the -35 and -10 elements were unchanged). The sequence immediately upstream of the -10 element was replaced with 5′-CTTCGCAGTCAGCTGAC-3′ (or 5′-CTTCGCAGTCAGCT-3′ for extended -10 elements).

Many H-NS repressed intragenic promoters function with σ38 associated RNA polymerase

Our search for promoter motifs provided evidence that AT-tracts play a key role at happenstance promoters (Figure 1D). Hence, we have focused on understanding this DNA element and its interaction with RNA polymerase. However, during our initial promoter motif analysis, we identified a second DNA logo associated with horizontally acquired genes (Figure 6A, top). The logo depicts a 5′-TGn-3′ motif upstream of the sequence 5′-TATACT-3′. Previous work has shown that promoters used by the alternative σ38 factor, encoded by rpoS, often have 5′-TGn-3′ motifs (27). Furthermore, ChIP-seq analysis identified 5′-TATACT-3′ as the consensus -10 element for σ38 (Figure 6D, bottom) (23). Hence, σ38 might also serve promoters within horizontally acquired genes. To test this, we used ChIP-seq and compared chromosome-wide σ38 binding in E. coli RPB104 and the Δhns derivative. We identified 890 σ38 binding peaks in the starting strain and 905 peaks in the Δhns derivative (Supplementary Table S1). The proportion of σ38 binding peaks within genes increased in cells lacking H-NS (Figure 6B, blue charts). Similarly, there was increased binding of σ38 in H-NS targeted regions (labelled H-NS high) if H-NS was absent (Figure 6B, purple charts). We previously showed that deleting hns reorganised global RNA polymerase positioning; binding increased at H-NS silenced genes but was reduced at most other loci due to titration of the limited RNA polymerase pool (8). To understand if this was also the case for σ38 we divided the genome into 500 bp sections. We then calculated the σ38 binding signal for each DNA segment in wild type and Δhns cells. The values are plotted in Figure 6C. The diagonal line indicates where data points fall if the σ38 binding signal is the same in each strain. In the Δhns strain, σ38 binding signals increased at regions formerly bound by H-NS (H-NS high). Conversely, σ38 binding decreased at most other genomic regions (H-NS low). Figure 6D illustrates ChIP-seq peaks for σ38 binding within H-NS targeted genes. Multiple peaks for intragenic σ38 binding are evident in Δhns (red) but not wild type (blue) cells.
Figure 6.

The alternative σ38 factor recognises many intragenic promoters. (A) Sequence logo showing the imperfect extended -10 element conserved in randomly generated promoters (top). The σ38 promoter logo defined using ChIP-seq analysis is included for comparison (23). (B) The proportions of σ38 ChIP-seq peaks found inside/outside of genes (blue charts) or regions highly H-NS bound by H-NS (purple charts) in a wild-type (WT) or Δhns E. coli strain. (C) Examples of σ38 ChIP-seq peaks inside genes in Δhns (red) but not WT (blue) cells. H-NS binding signals are derived from the ChIP-seq data of Kahramanoglou et al. (32). (D) Scatter plot showing changes in σ38 binding when hns is deleted. Each data point represents the log10 of σ38 ChIP-seq signal in a 500 bp bin. Sequences usually bound by H-NS are shown in dark purple (high H-NS) and areas without H-NS are shown in pale purple (low H-NS).

The alternative σ38 factor recognises many intragenic promoters. (A) Sequence logo showing the imperfect extended -10 element conserved in randomly generated promoters (top). The σ38 promoter logo defined using ChIP-seq analysis is included for comparison (23). (B) The proportions of σ38 ChIP-seq peaks found inside/outside of genes (blue charts) or regions highly H-NS bound by H-NS (purple charts) in a wild-type (WT) or Δhns E. coli strain. (C) Examples of σ38 ChIP-seq peaks inside genes in Δhns (red) but not WT (blue) cells. H-NS binding signals are derived from the ChIP-seq data of Kahramanoglou et al. (32). (D) Scatter plot showing changes in σ38 binding when hns is deleted. Each data point represents the log10 of σ38 ChIP-seq signal in a 500 bp bin. Sequences usually bound by H-NS are shown in dark purple (high H-NS) and areas without H-NS are shown in pale purple (low H-NS).

DISCUSSION

We previously noted the widespread spurious transcription of AT-rich horizontally acquired genes (10). The phenomenon was attributed to increased occurrence of sequences resembling promoter -10 elements (8,10). However, promoter -10 elements alone are unable to drive transcription; the sequence cannot recruit RNA polymerase to the DNA (4). Structural analysis provides a rationale for this observation; base specific interactions between σ70 and the -10 hexamer only occur after DNA unwinding (5). In this work we show that spurious intragenic promoters frequently depend on an AT-tract located between -17 and -23 base pairs upstream of the transcription start site (Figures 2–5). Consistent with this, we and others have previously noted an A or T at positions -17 and -18 can be stimulatory (17,28–30). The AT-tract alters nucleic acid bending and facilitates a contact between σ70 side chain R451 and the double helix backbone. Hence, otherwise cryptic -10 elements are able to participate in the process of transcription initiation. Many intragenic promoters also function with σ38 bound RNA polymerase. This may explain why inactivation of rpoS is a pre-requisite for deletion of hns in Salmonella spp. (31). The simplest explanation for the abundance of promoters within AT-rich genes is chance occurrence (8–10). However, this hypothesis is difficult to test. In an effort to address the issue, we compared DNA sequence properties of spurious and randomly generated promoters (Figure 1). Whilst not proof of accidental origin, both types of promoter frequently depend on the presence of an AT-tract. Furthermore, the AT-tract is not enriched at canonical promoters. Presumably, this is to avoid constitutive promoter activity that short circuits regulation by transcription factors. We suggest that the AT-tract occurs frequently because of its low information content; any AT-rich DNA sequence is stimulatory (Figure 3). Conversely, promoter -35 elements have a precise consensus that is less likely to arise spontaneously. Intriguingly, the frequency of chance promoter occurrence increases only when the DNA AT-content exceeds that of the E. coli genome (i.e. ∼50%, Figure 1B). This may be indicative of adaptations that reduce the initiation of transcription at unwanted locations. For instance, the E. coli RNA polymerase could be hardwired to initiate transcription inefficiently at DNA sequences with an AT-content resembling coding DNA. We speculate that the AT-content threshold, above which promoters spontaneously arise, will differ depending on the genome AT-content of a given organism. Similarly, we predict that RNA polymerases isolated from bacteria with AT-rich genomes could be adapted to such templates and generate fewer spurious transcripts compared to the E. coli enzyme. In summary, we provide an explanation for the widespread occurrence of spurious promoters within horizontally acquired sections of the E. coli genome. Our data also have implications for our understanding of RNA polymerase specificity and promoter evolution.

DATA AVAILABILITY

ArrayExpress accession E-MTAB-8778. Click here for additional data file.
  32 in total

Review 1.  The regulation of bacterial transcription initiation.

Authors:  Douglas F Browning; Stephen J Busby
Journal:  Nat Rev Microbiol       Date:  2004-01       Impact factor: 60.633

Review 2.  The molecular basis of selective promoter activation by the sigmaS subunit of RNA polymerase.

Authors:  Athanasios Typas; Gisela Becker; Regine Hengge
Journal:  Mol Microbiol       Date:  2007-03       Impact factor: 3.501

Review 3.  Transcription regulation at the core: similarities among bacterial, archaeal, and eukaryotic RNA polymerases.

Authors:  Kimberly B Decker; Deborah M Hinton
Journal:  Annu Rev Microbiol       Date:  2013-06-13       Impact factor: 15.500

Review 4.  Xenogeneic Silencing and Its Impact on Bacterial Genomes.

Authors:  Kamna Singh; Joshua N Milstein; William Wiley Navarre
Journal:  Annu Rev Microbiol       Date:  2016-06-17       Impact factor: 15.500

Review 5.  Chromosome organization in bacteria: mechanistic insights into genome structure and function.

Authors:  Remus T Dame; Fatema-Zahra M Rashid; David C Grainger
Journal:  Nat Rev Genet       Date:  2019-11-25       Impact factor: 53.242

6.  Genome-Wide Transcriptional Response to Varying RpoS Levels in Escherichia coli K-12.

Authors:  Garrett T Wong; Richard P Bonocora; Alicia N Schep; Suzannah M Beeler; Anna J Lee Fong; Lauren M Shull; Lakshmi E Batachari; Moira Dillon; Ciaran Evans; Carla J Becker; Eliot C Bush; Johanna Hardin; Joseph T Wade; Daniel M Stoebel
Journal:  J Bacteriol       Date:  2017-03-14       Impact factor: 3.490

7.  A mutant spacer sequence between -35 and -10 elements makes the Plac promoter hyperactive and cAMP receptor protein-independent.

Authors:  Mofang Liu; Michael Tolstorukov; Victor Zhurkin; Susan Garges; Sankar Adhya
Journal:  Proc Natl Acad Sci U S A       Date:  2004-04-26       Impact factor: 11.205

8.  Direct and indirect effects of H-NS and Fis on global gene expression control in Escherichia coli.

Authors:  Christina Kahramanoglou; Aswin S N Seshasayee; Ana I Prieto; David Ibberson; Sabine Schmidt; Jurgen Zimmermann; Vladimir Benes; Gillian M Fraser; Nicholas M Luscombe
Journal:  Nucleic Acids Res       Date:  2010-11-21       Impact factor: 16.971

9.  Widespread suppression of intragenic transcription initiation by H-NS.

Authors:  Shivani S Singh; Navjot Singh; Richard P Bonocora; Devon M Fitzgerald; Joseph T Wade; David C Grainger
Journal:  Genes Dev       Date:  2014-01-21       Impact factor: 11.361

10.  H-NS can facilitate specific DNA-binding by RNA polymerase in AT-rich gene regulatory regions.

Authors:  Shivani S Singh; David C Grainger
Journal:  PLoS Genet       Date:  2013-06-20       Impact factor: 5.917

View more
  5 in total

1.  Widespread divergent transcription from bacterial and archaeal promoters is a consequence of DNA-sequence symmetry.

Authors:  Emily A Warman; David Forrest; Thomas Guest; James J R J Haycocks; Joseph T Wade; David C Grainger
Journal:  Nat Microbiol       Date:  2021-05-06       Impact factor: 17.745

2.  Xenogeneic silencing strategies in bacteria are dictated by RNA polymerase promiscuity.

Authors:  David Forrest; Emily A Warman; Amanda M Erkelens; Remus T Dame; David C Grainger
Journal:  Nat Commun       Date:  2022-03-03       Impact factor: 14.919

3.  Rhodobacter sphaeroides CarD Negatively Regulates Its Own Promoter.

Authors:  Kemardo K Henry; Wilma Ross; Richard L Gourse
Journal:  J Bacteriol       Date:  2021-08-09       Impact factor: 3.490

4.  H-NS-like proteins in Pseudomonas aeruginosa coordinately silence intragenic transcription.

Authors:  Andrew M Lippa; Michael J Gebhardt; Simon L Dove
Journal:  Mol Microbiol       Date:  2020-12-18       Impact factor: 3.979

5.  The zinc-finger bearing xenogeneic silencer MucR in α-proteobacteria balances adaptation and regulatory integrity.

Authors:  Jian Jiao; Biliang Zhang; Meng-Lin Li; Ziding Zhang; Chang-Fu Tian
Journal:  ISME J       Date:  2021-09-28       Impact factor: 10.302

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.