| Literature DB >> 21569465 |
Eleftherios Pilalis1, Aristotelis A Chatziioannou, Asterios I Grigoroudis, Christos A Panagiotidis, Fragiskos N Kolisis, Dimitrios A Kyriakidis.
Abstract
BACKGROUND: Studies on bacterial signal transduction systems have revealed complex networks of functional interactions, where the response regulators play a pivotal role. The AtoSC system of E. coli activates the expression of atoDAEB operon genes, and the subsequent catabolism of short-chain fatty acids, upon acetoacetate induction. Transcriptome and phenotypic analyses suggested that atoSC is also involved in several other cellular activities, although we have recently reported a palindromic repeat within the atoDAEB promoter as the single, cis-regulatory binding site of the AtoC response regulator. In this work, we used a computational approach to explore the presence of yet unidentified AtoC binding sites within other parts of the E. coli genome.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21569465 PMCID: PMC3118216 DOI: 10.1186/1471-2164-12-238
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Workflow chart of the . A) The chart describes the algorithmic steps for the ab initio motif detection process. The thick arrows correspond to algorithmic flows after decision points, the dashed arrows represent initialization steps which are performed uniquely and the dotted arrows (Motif Searching part) concern independently implemented alternative data processing scenarios. The details of the procedure are described in Materials and Methods. B) Description of the workflow symbols.
The three executions of the motif detection procedure
| Executions | Specified positions upstream TSS | Initial promoter set | Qualified promoter set |
|---|---|---|---|
| 1 | from -184 to -165 | All | atoD, galU, yhjR-bcsE, narZ, puuP, mdlA, dmsA, crr |
| 2 | from -184 to -165 | sigma-54 factor-related promoters | atoD, rtcR, ibpB |
| 3 | from -184 to -165 | Two-component target promoters | atoD, dmsA, borD, csgD, pstS, glnH, dctA, yebE, nirB, torC, asr, tppB, acrD, |
The computational workflow was performed many times varying different parameters. Here are shown the three successful executions that corresponded to three different initial promoter sets.
The four different motifs matching the palindromic AtoC binding site
| Motif No. | Log-likelihood score | Consensus | Logo |
|---|---|---|---|
| Motif 1 | 117.6 | GCkATrCrrnnATTTrCwCA | |
| Motif 2 | 51.8 | GnGnAAnTTyCTGCAwAGCC | |
| Motif 3 | 168.6 | wnnnwrsAwAwmrnknmAnr | |
| Motif 4 | 66.3 | nCknATnmGrnnnTnACGCm | |
Motifs 1-3 are the top scoring motifs derived of each of the three aforementioned executions. Motif 4 is an intermediary motif of unexpectedly high conservation among different intergenic regions.
Number of hits per motif at 0.7 prior probability
| Motif | Hits (promoters) | Hits (ORFs |
|---|---|---|
| 1 | 55 | 234 |
| 2 | 153 | 694 |
| 3 | 331 | 688 |
| 4 | 346 | 266 |
While motifs 1-3 matched more ORFs than intergenic sites, motif 4 matched an uncommonly conserved intergenic region.
Results of the Gene Ontology Term Analysis
| GO id | Term | Corrected p-value | FDR (%) | Enrichment in hits | Percentage in hits | Enrichment in genome | Percentage in genome |
|---|---|---|---|---|---|---|---|
| GO:0016638 | oxidoreductase activity, acting on the CH-NH2 group of donors | 0.066 | 12.00 | 3/67 | 4.47 | 10/4339 | 0.23 |
| GO:0015749 | monosaccharide transport | 0.045 | 10.00 | 4/154 | 2.59 | 891101 | 0.23 |
| GO:0022891 | substrate-specific transmembrane transporter activity | 0.006 | 0.00 | 22/154 | 14.28 | 254/4339 | 5.85 |
| GO:0043169 | cation binding | 0.012 | 0.00 | 37/154 | 24.02 | 570/4339 | 13.13 |
| GO:0008324 | cation transmembrane transporter activity | 0.076 | 2.33 | 13/154 | 8.44 | 128/4339 | 2.94 |
| GO:0045449 | regulation of transcription | 0.001 | 0.00 | 53/324 | 16.35 | 372/4339 | 8.57 |
| GO:0007155 | cell adhesion | 0.027 | 0.00 | 10/324 | 3.08 | 34/4339 | 0.78 |
| GO:0001101 | response to acid | 0.042 | 0.12 | 4/324 | 1.23 | 5/4339 | 0.11 |
| GO:0009289 | fimbrium | 0.032 | 0.00 | 8/324 | 2.46 | 28/4339 | 0.64 |
| GO:0003700 | transcription factor activity | 0.01 | 0.00 | 32/324 | 9.87 | 208/4339 | 4.79 |
| GO:0051234 | establishment of localization | 7.82E-04 | 0.00 | 40/110 | 36 | 800/4339 | 18 |
| GO:0005215 | transporter activity | 3.91E-03 | 0.00 | 30/110 | 27 | 561/4339 | 13 |
Results of the GOT analysis of motifs 1, 2, 3 and 4, at prior probability set to 0.7 (corrected p-value cut-off set to 0.1).
Figure 2Motif 5. A) The sequence represents the highly conserved site of ~40 bp (based on 85 hits at 0.7 prior probability). B) The positions in the E. coli genome of the 85 hits of motif 5. All hits are situated in intergenic regions. The gene names correspond to the genes downstream of these intergenic regions. The arrow directions indicate the strand of each hit (to the right: strand +1; to the left: strand -1).
Figure 3Alignment of 20 top hits of motif 5. The alignment of 20 top hits of motif 5 highlights the high sequence conservation of a ~40 base pair-site located exclusively in intergenic regions (85 in total) and may have a putative global regulatory role in E. coli and other Enterobacteria.
Sites experimentally tested for in vivo AtoC binding
| Result | Site | Gene | Motif | Strand | Site distance from TSS (Refseq) | Gene product | |
|---|---|---|---|---|---|---|---|
| + | GCTATGCAGAAATTTGCACA | atoD | 1,2,3,4 | (+) | -184 | -165 | acetyl-CoA: acetoacetyl-CoA transferase, subunit |
| + | GCTATGCAGAAAATTGCGCA | atoD | 1,2,4 | (-) | -163 | -144 | acetyl-CoA: acetoacetyl-CoA transferase, subunit |
| + | GCTATAGAAATAATTACACA | dmsA | 1,2 | (-) | -69 | -50 | dimethyl sulfoxide reductase, anaerobic, subunit A [ |
| + | GCCATGCGGGGATTTAATCA | puuP | 1 | (+) | -148 | -129 | putrescine importer [ |
| + | GCCGTCCAGATGTTTACACA* | metR-metE | 1 | (-) | -87, -170 | -68, -151 | transcriptional activator [ |
| + | GCGATTGTAGGGATTGCTCA* | trmA-btuB | 1,(2) | (+) | -49, -340 | -30, -321 | tRNA (uracil-5-)-methyltransferase/ outer membrane transporter [ |
| + | ACTATACGGAAAATTCCACT* | ykgA-ykgQ | 1,2 | (-) | -235, -55 | -216, -36 | Putative transcriptional activator [ |
| + | GTGATGGGATTATTTGATCT* | aegA-narQ | 1 | (-) | -147, -79 | -128, -60 | predicted fused oxidoreductase: FeS binding subunit and sensory HK in TCS with NarP [ |
| + | GCCTCGGAGGTATTTAAACA* | cpxR-cpxP | 1 | (-) | -21, -149 | -2, -130 | response regulator in TCS with CpxA/ periplasmic protein; combats stress [ |
| + | GCGAGGCGGGTAATTAGACA | ymiA | 1 | (-) | -199 | -180 | small predicted membrane protein [ |
| + | GGCCCAATTTACTGCTTAGG | crr | 1,2 | (+) | -28 | -9 | glucose-specific PTS component [ |
| + | GCGGTGCAGGAGATTGCACA | 1 | (+) | predicted chaperone (flagellum) [ | |||
| + | CCGGTCCAGGAATTTACTCA | 1 | (-) | nitrate reductase 1, alpha subunit [ | |||
| + | GCGAAACGATTAATTACACA | 1 | (+) | glucosyltransferase predicted [ | |||
| + | ACTGTACAGAAAGTTGCTCA | 1 | (-) | transcription regulator (predicted) [ | |||
| + | GCGGGTCAGGTAATTGCACA | 1 | (-) | glutamate decarboxylase A, PLP-dependent [ | |||
| + | GACATGCAGTTGATTACACA | 1 | (+) | predicted transcription regulator [ | |||
| + | GCGATCCAAAAGTTTACTCA | narZ | 1 | (+) | 0 | + 19 | nitrate reductase 2 alpha subunit [ |
| + | CGGAGATTTCCCGCAAAGCC* | rtcB-rtcR | 2 | (+) | -145, -64 | -126, -45 | sigma-54-dependent transcriptional regulator of rtcBA [ |
| + | CGGCAAGTTTCGACATTGCC* | putA-putP | 2 | (+) | -398, -45 | -379, -26 | fused transcriptional regulator/proline dehydrogenase/ carboxproline:sodium symporter [ |
| - | GGGCATTTTCCTGCAAAACC* | rhaT-sodA | 3 | (+) | -167, -138 | -148, -119 | L-rhamnose:proton symporter/superoxide dismutase [ |
| - | GGATGATGTTCTGCATAGCA | adhP | 3 | (+) | -35 | -16 | ethanol-active dehydrogenase/acetaldehyde-active reductase [ |
| - | TTTATAGAAATAGATGCACG | nirB | 4 | (-) | -205 | -186 | nitrite reductase, large subunit, NAD(P)H-binding [ |
| + | TATGTGTAGAAAATTAAACA | borD | 4,3 | (+) | -59 | -40 | DLP12 prophage; predicted lipoprotein [ |
| + | GTTCAGTATAAAAGGGCATG | acrD | 2 | (+) | -128 | -109 | aminoglycoside/multidrug efflux system [ |
| - | GAAATAGAAATAGTTGAAAG | ykgE | 4 | (-) | -472 | -453 | predicted oxidoreductase [ |
| + | CCGGATAAGGCGTTTACGCC | yjcH | 4 | (+),(-) | -114, -122 | -95, -103 | conserved inner membrane protein/ acetate transport [ |
| - | CCGGATAAGACGTTTACGCC | yafJ | 4 | (+) | -193 | -174 | Unknown function, similarity to type II amidotransfease |
| + | CCGGATAAGGCGTTTACGCC | sseB | 4 | (+) | -125 | -106 | rhodanese-like enzyme [ |
Motif hits experimentally tested by ChIP analysis for in vivo AtoC binding. All annotations were provided by the Refseq database (Escherichia coli K12, Refseq id: NC_000913). Underlined: hits in gene-encoding regions. *These hits represent a unique site per pair, which could be attributed as cis-regulatory element to both flanking genes. As predicted by the GO Term analysis many of the targets are transcription factors, transporters and enzymes involved in oxidoreduction.
Figure 4. Results of a ChIP analysis by agarose gel electrophoresis (2% w/v) of PCR amplification products generated by primers corresponding to motif 1 hits and qualified promoters, included in Tables 1 and 5. All ChIP preparations were carried out following acetoacetate induction. Above each PCR product, generated by the specific designated primers, the name of each gene or promoter region comprising the motif sequence is denoted for each pair of lanes. All AtoC additional binding targets are PCR tested (25 cycles) in two ChIP preparations carried out without (-, left lanes) or with (+, right lanes) AcAc induction (AA).
Figure 5Monitoring the AcAc effect on AtoC . A) All predicted targets matching motifs 2, 3 and 4 (Table 5), that gave a positive signal in genomic DNA, are ChIP analysed. All AtoC additional binding targets are PCR tested (25 cycles) in two ChIP preparations carried out without (-, left lanes) or with (+, right lanes) AcAc induction (AA). Input Chromatin samples are also included representing positive genomic DNA signals for the sampled targets. B) Three selected targets representing each of the afore-mentioned groups (metR-metE, narG and acrD) were also PCR tested together with atoD (25 cycles) using as templates input chromatin controls (Input Chromatin) and immunoprecipitated preparations (ChIPs) from non-induced (-AA) or AcAc induced (+AA) cultures.
Figure 6Detection of putative AtoC binding targets in gene encoding regions. Selected motif 1 hits in gene encoding regions (Table 5) were ChIP analyzed for in vivo AtoC binding by electrophoresis of PCR products generated by primers specifically designed to comprise the target motif sequence. The name of each gene comprising the motif sequence is denoted above each lane. The AcAc effect on the binding to each target is also denoted, as stated in Figure 4.
Figure 7Verification of AtoC . A) Sample gel retardation experiments illustrating His10-AtoC protein binding to DNA fragments atoD, metR-metE, acrD (promoter) and narG (gene encoding) regions. As described in Materials and Methods, following electrophoresis, gels were first stained with Gel-Red dye and then with Coomassie blue, and side-by-side photographs are shown. The lines above the Gel-Red stained lanes indicate the different probes combined with the indicated His10-AtoC (AtoC) quantities. Arrows indicate bands that were stained with both Gel-Red and Coomassie blue, corresponding to band-shifting caused by the AtoC binding. Lines indicate free proteins, not bound to corresponding DNA particles. B) Band shift assay of His10-AtoC with a biotinylated fragment of the upstream region of the atoDAEB operon. EMSAs were performed as described in Materials and Methods with 10 ng of biotinylated atoD. The addition of His10-AtoC (0.35 μM) and certain amounts of competitive, non-biotinylated atoD, as well as metR-metE, acrD and narG fragments is indicated. 2 μg of sonicated calf thymus competitive, non-specific DNA were added in each reaction.
Figure 8Location of . The positions of the additional twenty-two experimentally verified AtoC binding sites, are shown on the circular E. coli genomic map, with the arrows indicating the direction of transcription. The atoSC and atoDAEB regions are expanded and the sequence of the inverted palindromic repeat, representing the AtoC original target, is shown in the boxes.