| Literature DB >> 32297955 |
Emily A Warman1, Shivani S Singh1, Alicia G Gubieda1, David C Grainger1.
Abstract
RNA polymerases initiate transcription at DNA sequences called promoters. In bacteria, the best conserved promoter feature is the AT-rich -10 element; a sequence essential for DNA unwinding. Further elements, and gene regulatory proteins, are needed to recruit RNA polymerase to the -10 sequence. Hence, -10 elements cannot function in isolation. Many horizontally acquired genes also have a high AT-content. Consequently, sequences that resemble the -10 element occur frequently. As a result, foreign genes are predisposed to spurious transcription. However, it is not clear how RNA polymerase initially recognizes such sequences. Here, we identify a non-canonical promoter element that plays a key role. The sequence, itself a short AT-tract, resides 5 base pairs upstream of otherwise cryptic -10 elements. The AT-tract alters DNA conformation and enhances contacts between the DNA backbone and RNA polymerase.Entities:
Year: 2020 PMID: 32297955 PMCID: PMC7229825 DOI: 10.1093/nar/gkaa244
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Strains, plasmids and oligonucleotides
| Name | Descriptiona,b,c | Source |
|---|---|---|
|
| ||
| JCB387 | Δ | ( |
| RPB104 | RPB104MG1655 with C-terminal SPA-tagged | ( |
| RPB104 | This work | |
|
| ||
| pRW50 | low copy number | ( |
| pSR | pBR322-derived vector with | ( |
| Encodes AmpR. | ||
| pSR Δ45-9A-10T | pSR carrying an optimised derivative of the | ( |
|
| ||
| Random F | GGCTGCGAATTCNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN | This work |
| NNNNNAGGAGGATGCGGACTATG | ||
| Random R | CGCCCGAAGCTTcatagtccgcatcctcct | This work |
|
| ||
| -10 F | GGCTGCGAATTCgaccggcgagcttcgcagtcagctgac | This work |
| -10 R | CGCCCGAAGCTTcatagtccgcatcctcctgcgcgcggcaattatagtcagctgac | This work |
| -10/-35TT F | GGCTGCGAATTCgacc | This work |
| -10/ATi F | GGCTGCGAATTCgaccggcgagcttcgc | This work |
| -10/ATi R | CGCCCGAAGCTTcatagtccgcatcctcctgcgcgcggcaattatagtcaataaat | This work |
| -10/ATi/-35TT F | GGCTGCGAATTCgacc | This work |
| -10/ATii F | GGCTGCGAATTCgaccggcgagcttcgcag | This work |
| -10/ATii R | CGCCCGAAGCTTcatagtccgcatcctcctgcgcgcggcaattatagtcagctgac | This work |
| -10/ATii/-35TT F | GGCTGCGAATTCgacc | This work |
| -10 general R | GCCCGAAGCTTCatagtccgcatcctcctgcgcgcggcaattatagtc | This work |
|
| ||
| -10/ATR F | GGCTGCGAATTCgaccggcgagcttcgcwwwwwwwtgactataattgccgcgcgc | This work |
| -10/ATR/-35TT F | GGCTGCGAATTCgacc | This work |
|
| ||
|
| GGCTGCGAATTCacgttactttatctttactatctgc | This work |
|
| GCCCGAAGCTTCCTCCTttgtaagaacacttggtcctgaaaa | This work |
|
| GGCTGCGAATTCtactccattatctcgtcatcaacatg | This work |
|
| GCCCGAAGCTTCCTCCTcattgcctgaacaggcaaaatcttc | This work |
|
| GGCTGCGAATTCataagttacaccgaaagtataagag | This work |
|
| GCCCGAAGCTTCCTCCTgaatattttatgaatgttttctg | This work |
|
| GGCTGCGAATTCcggttacacaatactaacttatttaac | This work |
|
| GCCCGAAGCTTCCTCCTtgaaaaatcaatggcgcttaaatcatc | This work |
|
| GGCTGCGAATTCTcaaacagtttggtatcaaaacg | This work |
|
| GCCCGAAGCTTCATAGTCCGCATCCTCCTcccctgaaaacgatccgg | This work |
|
| GGCTGCGAATTCAccagtgccagattgcacataacg | This work |
|
| GCCCGAAGCTTCATAGTCCGCATCCTCCTtcaggctgcccgccataatgacg | This work |
|
| ||
|
| GGCTGCGAATTCtcaaacagtttggtatcaaacttcgcagtcag | This work |
|
| AGCCCGAAGCTTcctcctcccctgaaaacgatccggataatattatccctgcgagaat | This work |
| catagca | This work | |
|
| GGCTGCGAATTCaccagtgccagattgcac | This work |
|
| AGCCCGAAGCTTcctccttcaggctgcccgccataatgacgccaccggcaaccgccgt | This work |
| attgtcgtcagctgactgcgaaggtgcaa | This work | |
|
| ||
|
| CTTGAGTCCACGCTAGATCTGGCTGCGAATTCAcgttactttatctttactatctgctg | This work |
| ctttggcaatactctgagttgctgtgagattgaaa | ||
| cttggcaaaaaccgaatataccgaaattttcaggaccaagtgttcttacaaaggaggAAGCTTCGG | ||
| GCTTGTCAGTGCGCAAAAAGAT | ||
|
| CTTGAGTCCACGCTAGATCTGGCTGCGAATTctactccattatctcgtcatcaacatga | This work |
| attgccagcgactccgtgatagtggtttcatctatata | ||
| atctctatcatcacaatgatagccgaagattttgcctgttcaggcaatgaggaggAAGCTTCGGGCTT | ||
| GTCAGTGCGCAAAAAGAT | ||
|
| CTTGAGTCCACGCTAGATCTGGCTGCGAATTCataagttacaccgaaagtataagagtt | This work |
| ttgattataaaagtcttgacct | ||
| atgggcctggtagtgcattattacagaaaacattcataaaatattcaggaggAAGCTTCGGGCTTGT | ||
| CAGTGCGCAAAAAGAT | ||
|
| CTTGAGTCCACGCTAGATCTGGCTGCGAATTCcggttacacaatactaacttatttaac | This work |
| ccaaaatatcataaaaaagccgttatgaat | ||
| atgtcatcactgctttatgaaagagatgatttaagcgccattgatttttcaaggaggAAGCTTCGGGCTT | ||
| GTCAGTGCGCAAAAAGAT |
aN is either A, C, G or T incorporated into the oligonucleotide at random but supplied at a defined % of each nucleotide. Used to generate the DNA fragment library described in Figure 1.
bW is A or T, with an equal likelihood of either base being incorporated.
cSynthetic promoter -10 elements are underlined and key base changes introduced by oligonucleotides are in bold.
Figure 1.Promoters are more prevalent and active in random DNA sequences with a higher AT-content. (A) Experimental strategy for generation and selection of promoters from random DNA sequences of defined AT-content. Briefly, 43 bp DNA fragments of random sequence, but a defined AT-content between 40% and 75%, were fused to the lacZ gene in plasmid pRW50. The resulting plasmid libraries were used to transform the E. coli Δlac strain JCB387. Transformants expressing LacZ were identified as red or pink colonies on MacConkey agar. Such lac+ colonies were selected and their random 43 bp DNA insertion was sequenced. (B) The number of lac+ colonies increases as AT-content increases. A total of 10,735 colonies were examined and the percentage of lac+ colonies is plotted against the % AT-content of the corresponding insertion library. (C) Average promoter activity increases in AT-rich DNA sequences. The LacZ activity, quantified for lac+ colonies by β-galactosidase assay, is plotted against library % AT-content. (D) DNA sequence motifs associated with different classes of promoter
Figure 2.AT-tracts activate transcription from cryptic -10 elements and alter DNA bending. (A) AT-tracts increase promoter activity. Synthetic promoter sequences containing a consensus -10 element (yellow box), -35 element (blue box) and two different AT-tract sequences (ATi and ATii, red box) were cloned upstream of the lacZ gene. For each promoter, LacZ activity was measured in triplicate. Activity values are shown as a percentage relative to the ‘-10/ATi/-35’ promoter. The absolute activity of this promoter was 236 Miller units. Error bars show standard deviation. (B) An AT-tract is required for transcription initiation in vitro. Bands on the gel are RNA transcripts produced in vitro using the indicated promoter sequences. The RNAI transcript serves as an internal control. The control lane shows transcripts generated from empty pSR plasmid. (C) AT-tracts alter DNA bending. Bands on the gel correspond to DNA fragments with or without an AT-tract. All DNA fragments are the same length but have different electrophoretic mobility due to altered curvature.
Figure 3.AT-tracts of any sequence can activate transcription from a cryptic -10 element. Promoter sequences containing a -10 element and a randomised AT-tract, with or without a partial -35 element, were fused to the lacZ gene in plasmid pRW50. The resulting library was used to transform the ΔlacE. coli strain JCB387 and LacZ activity was determined for each transformant. Data are presented as a box plot with each point representing one colony. Measured LacZ activity is also shown for a promoter without an AT-tract (-10/-35) and the empty plasmid.
Figure 4.Sidechain R451 of σ70 is required for transcription at promoters dependent on an AT-tract. (A) The gel image shows transcripts generated by RNA polymerase associated with either WT σ70 or the R451A mutant. The RNAI transcript is derived from the pSR replication origin and serves as an internal control. The -9A-10T promoter is constitutively active and does not have an AT-tract.
Figure 5.Effect of the σ70 R451A mutation on spurious intragenic promoters. (A) Intragenic promoter DNA sequences. Transcription start sites and promoter elements are highlighted. The number of base matches to the consensus sequence for each element is also indicated. (B) The gel image shows transcripts generated by RNA polymerase associated with either WT σ70 or the R451A mutant. Bands at 107/108 nt are RNAI transcripts derived from the pSR replication origin. The -9A-10T is constitutively active and functions independently of the AT-tract. (C) Effect of replacing AT-tracts with GC-rich sequences (the -35 and -10 elements were unchanged). The sequence immediately upstream of the -10 element was replaced with 5′-CTTCGCAGTCAGCTGAC-3′ (or 5′-CTTCGCAGTCAGCT-3′ for extended -10 elements).
Figure 6.The alternative σ38 factor recognises many intragenic promoters. (A) Sequence logo showing the imperfect extended -10 element conserved in randomly generated promoters (top). The σ38 promoter logo defined using ChIP-seq analysis is included for comparison (23). (B) The proportions of σ38 ChIP-seq peaks found inside/outside of genes (blue charts) or regions highly H-NS bound by H-NS (purple charts) in a wild-type (WT) or Δhns E. coli strain. (C) Examples of σ38 ChIP-seq peaks inside genes in Δhns (red) but not WT (blue) cells. H-NS binding signals are derived from the ChIP-seq data of Kahramanoglou et al. (32). (D) Scatter plot showing changes in σ38 binding when hns is deleted. Each data point represents the log10 of σ38 ChIP-seq signal in a 500 bp bin. Sequences usually bound by H-NS are shown in dark purple (high H-NS) and areas without H-NS are shown in pale purple (low H-NS).