| Literature DB >> 16260475 |
Mitsuoki Kawano1, Gisela Storz, B Sridhar Rao, Judah L Rosner, Robert G Martin.
Abstract
The search for promoters has largely been confined to sequences upstream of open reading frames (ORFs) or stable RNA genes. Here we used a cloning approach to discover other potential promoters in Escherichia coli. Chromosomal fragments of approximately 160 bp were fused to a promoterless lacZ reporter gene on a multi-copy plasmid. Eight clones were deliberately selected for high activity and 105 clones were selected at random. All eight of the high-activity clones carried promoters that were located upstream of an ORF. Among the randomly-selected clones, 56 had significantly elevated activity. Of these, 7 had inserts which also mapped upstream of an ORF, while 49 mapped within or downstream of ORFs. Surprisingly, the eight promoters selected for high activity matched the canonical sigma70 -35 and -10 sequences no better than sequences from the randomly-selected clones. For six of the nine most active sequences with orientations opposite to that of the ORF, chromosomal expression was detected by RT-PCR, but defined transcripts were not detected by northern analysis. Our results indicate that the E.coli chromosome carries numerous -35 and -10 sequences with weak promoter activity but that most are not productively expressed because other features needed to enhance promoter activity and transcript stability are absent.Entities:
Mesh:
Substances:
Year: 2005 PMID: 16260475 PMCID: PMC1275588 DOI: 10.1093/nar/gki928
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Activities and genomic locations of eight fragments selected for high activity
aRO refers to the relative orientation of fragment and the nearest gene. O = opposite, S = same.
bOrientation of potential promoter relative to the ORF.
cArrows indicate orientations of promoter or nearest gene. Forward arrows represent the clockwise or Watson strand and reverse arrows represent the counterclockwise or Crick strand. Numbers refer to chromosomal positions (21).
dUnderlined strain numbers indicate those with ‘immediately upstream’ promoters, arbitrarily defined as having the 3′ end of the sequence no farther than 200 bp upstream of the ORF and the 5′ end no closer than 50 bp from the ORF, assuming that a minimal promoter plus leader would be ∼50 bp long.
eThe sequence ∼80 bp upstream of the aroP promoter in our strain and in MG1655 (21) differs from the strain used by Pittard and coworkers (22,23). The sequence in our strain is TTGAGAGGGGTTGAGGCTGAGCTTTACA which matches (in bold) the optimal hexamers in 8 of 12 positions (separated by 16 bp), whereas the sequence in their strain is TACGGGGATCTGTTGAGGCTGAGCTTTACA which matches in only 5 positions.
fThe insert upstream of rpsA does not fit the definition of ‘immediately upstream’ but contains the known transcription start site of rpsA at 960 940 (24).
gOnly the P2 promoter of rsd is contained in the fragment.
hA possible alternative ORF start codon is GTG at 1 019 345.
Figure 1The distribution of promoter (β-galactosidase) activities of 105 random fragments of E.coli as assayed in the promoterless lacZ tester plasmid, pRS551. We defined a sequence as having potential promoter activity when the β-galactosidase activity of the strain was increased by 3-fold or more over that of the strain with the parental plasmid pRS551 (i.e. to >10 MU). We found that 56 (53%) of the inserts (tested in only one orientation) exhibited promoter activity by this criterion. However, some of the inserts may contain more than one promoter. Applying the Poisson distibution, P(0) = e−λ, where P(0) is the probability of finding no activity (i.e. the null class of plasmids expressing 0–10 MU = 47%), and solving for λ (the average number of potential promoters per insert), we calculate there are on average 0.76 promoters per fragment (in one of the two possible orientations). We therefore conclude that there are ∼1.52 (twice 0.76) promoters per 163 bp or one full promoter in either direction per ∼107 bp.
Activities and genomic locations of 20 most active fragments selected at random
aIn some cases, alternate possibilities are listed.
bRO refers to the relative orientation of fragment and the nearest gene. O = opposite, S = same. Alternate possibilities are indicated in brackets.
cOrientation of potential promoter relative to the ORF.
dArrows indicate orientations of promoter or nearest gene. Forward arrows represent the clockwise or Watson strand and reverse arrows represent the counterclockwise or Crick strand. Numbers refer to chromosomal positions (21).
eUnderlined strain numbers indicate those with ‘immediately upstream’ promoters, arbitrarily defined as having the 3′ end of the sequence no farther than 200 bp upstream of the ORF and the 5′ end no closer than 50 bp from the ORF, assuming that a minimal promoter plus leader would be ∼50 bp long.
fInsert is 302 bp upstream of an unannotated ORF of 237 codons.
gInsert is 213 bp upstream of an unannotated ORF of 119 codons.
Figure 2Detection of antisense transcripts by RT–PCR and northern analysis. (A) RT–PCR was performed with primer sets listed in Supplementary Table S1 and with (plus) and without (minus) RT. The products were analyzed by electrophoresis on 2% agarose gels. Lanes 1 and 2, ygeH antisense transcript (expected size, 124 bp); lanes 3 and 4, rhsE antisense transcript (expected size, 95 bp); lanes 5 and 6, yfjN antisense transcript (expected size, 92 bp); lanes 7 and 8, yiaO antisense transcript (expected size, 87 bp); lanes 9 and 10, yjcE antisense transcript (expected size, 103 bp); lanes 11 and 12, ydiM antisense transcript (expected size, 114 bp); lanes 13 and 14, ecpD antisense transcript (expected size, 119 bp); lanes 15 and 16, topA antisense transcript (expected size, 101 bp); lanes 17 and 18, yehI antisense transcript (expected size, 108 bp); lane M for 50 bp ladder size marker and lanes 19 and 20, RyjC RNA (expected size, 77 bp). The lack of correlation between the RT–PCR signal and β-galactosidase activity may be due to differences in the hybridization efficiencies for the primer pairs used. (B) Total RNA separated on 1% agarose gels and transferred to nylon membranes was probed with primers to the antisense strands of rhsE, yfjN, yiaO and yjcE as well as to the 77 nt antisense RNA RyjC. RNA molecular weight markers were run with each set of samples for direct estimation of RNA transcript length, but the RNA marker lane for only one of the panels is shown.
Figure 3Identification of the transcriptional start site of the antisense RNA from yfjN by primer extension analysis. (A) Lane 1, control host strain (DJ480); lane 2, DJ480 carrying single-copy λ prophage with the pRS551-derived anti-yfjN–lacZ fusion; lane 3, DJ480 carrying single-copy λ prophage with the pRS552-derived anti-yfjN–lacZ fusion; lane 4, DJ480 harboring the pRS551-anti-yfjN-lacZ plasmid; lane 5, DJ480 harboring the pRS552-anti-yfjN –lacZ plasmid. Reverse transcription reactions were performed by using 30 µg of total RNA from the parent and prophage-containing strains (lanes 1–3) and 5 µg of total RNA from plasmid-containing strains (lanes 4 and 5); The sequence ladder was generated using the primer used in the primer extension reactions. Although the levels of the primer extension products for the prophage strains are extremely low, the same relative levels were seen in all three repetitions of the experiment. (B) Top strand sequence from nt 2 763 939 to 2 764 368 of the E.coli chromosome (1–430 of yfjN). The yfjN ORF extends to 2 765 012. The ATG start codon of yfjN is boxed. The sequence cloned in pRS551 and pRS552 is surrounded by brackets (between 2 764 127 and 2 764 357). The +1 transcriptional start site is shown by an arrow. The −10 and −35 sequences of the yfjN antisense promoter are underlined.
Expression from multi-copy and single-copy fusions to potential antisense promoters and to known promoters
| Promoter | Plasmid | Prophage | Plasmid/prophage |
|---|---|---|---|
| Potential antisense promoters | |||
| | 2900 | 120 | 25 |
| | 2400 | 84 | 29 |
| | 1700 | 56 | 30 |
| | 670 | 33 | 20 |
| | 680 | 25 | 28 |
| | 200 | 6 | 35 |
| | 170 | 5 | 34 |
| | 170 | 6 | 28 |
| | 120 | 4 | 35 |
| | 150 | 4 | 39 |
| Average (SD) | 30 (5.3) | ||
| Known promoters | |||
| | 11 000 | 830 | 16 |
| | 9700 | 630 | 15 |
| | 9700 | 860 | 11 |
| | 7400 | 400 | 19 |
| | 7100 | 340 | 21 |
| | 6400 | 530 | 12 |
| | 5800 | 350 | 17 |
| | 4700 | 260 | 18 |
| Average (SD) | 16 (3.2) | ||
aThe ratios were calculated prior to rounding of the assay results.
Figure 4Correlation between match to optimal σ70 promoter (−35 and −10 sequences) measured by the pftools 2.2 program (homology score, X) and log10 of the promoter (β-galactosidase, Y) activity. The regression line for the 39 randomly isolated promoters that have homology scores of 45 or greater (open circles) has the formula Y = −5.5496 + 0.14772X and an R2 of 0.417. Two SDs are indicated by the dashed lines. The eight inserts selected on the basis of high activity are shown by filled circles.