Literature DB >> 16717279

Genome prediction of PhoB regulated promoters in Sinorhizobium meliloti and twelve proteobacteria.

Ze-Chun Yuan1, Rahat Zaheer, Richard Morton, Turlough M Finan.   

Abstract

In proteobacteria, genes whose expression is modulated in response to the external concentration of inorganic phosphate are often regulated by the PhoB protein which binds to a conserved motif (Pho box) within their promoter regions. Using a position weight matrix algorithm derived from known Pho box sequences, we identified 96 putative Pho regulon members whose promoter regions contained one or more Pho boxs in the Sinorhizobium meliloti genome. Expression of these genes was examined through assays of reporter gene fusions and through comparison with published microarray data. Of 96 genes, 31 were induced and 3 were repressed by Pi starvation in a PhoB dependent manner. Novel Pho regulon members included several genes of unknown function. Comparative analysis across 12 proteobacterial genomes revealed highly conserved Pho regulon members including genes involved in Pi metabolism (pstS, phnC and ppdK). Genes with no obvious association with Pi metabolism were predicted to be Pho regulon members in S.meliloti and multiple organisms. These included smc01605 and smc04317 which are annotated as substrate binding proteins of iron transporters and katA encoding catalase. This data suggests that the Pho regulon overlaps and interacts with several other control circuits, such as the oxidative stress response and iron homeostasis.

Entities:  

Mesh:

Substances:

Year:  2006        PMID: 16717279      PMCID: PMC1464414          DOI: 10.1093/nar/gkl365

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


INTRODUCTION

Dissection of regulatory networks that control gene transcription is among the primary goals of the post-genomic era of biology. Whether gene expression is measured from microarrays or reporter gene fusions or other methodologies, it is generally not possible to distinguish between the direct and indirect modulation of transcription. Bioinformatic approaches to identify the regulatory networks have included the design of algorithms for genome-wide prediction of conserved regulatory DNA binding motifs (1,2). A promising approach in the delineation of transcriptional networks lies in combining genomic scanning or in silico analysis with experimental transcription data obtained from cells grown under diverse experimental conditions (1,3–12). In this report, we combine in silico prediction with experimental data obtained from reporter gene fusions and through comparisons with published microarray data. We also explore cross-species comparative genomics as a tool to identify genes whose expression is controlled by a transcriptional regulator, PhoB, in response to the phosphate starvation. Inorganic phosphate (Pi) plays key roles in cells. In ATP, it is involved in energy metabolism, in protein phosphorylation it is responsible for regulation of transcription and many other cellular processes including chemotaxis and cell division, and perhaps most importantly, Pi is a major structural component of nucleic acids and membrane phospholipids. In many gram-negative bacteria, the transport and metabolism of Pi and phosphorous containing compounds is regulated at the transcriptional level by a two-component PhoR-PhoB signal transduction system. The Pho regulon consists of genes or operons regulated by PhoB and this has been well studied in Escherichia coli (13–15). Under Pi limiting conditions, the PhoR histidine kinase sensor undergoes autophosphorylation and subsequently donates its phosphate group to its cognate response regulator PhoB. Phosphorylated PhoB (PhoB-Pi) then modulates transcription of its targets by binding to a highly conserved 18 nt DNA sequence called the Pho box (or PhoB binding motif) which usually overlaps the −35 region of PhoB-regulated promoters (16,17). The majority of identified Pho boxes essentially comprised two 7 nt direct repeats of 5′-CTGTCAT-3′ separated by a conserved 4 nt spacer in the middle. It was postulated that the PhoB and Pho box binding complex interacts with the σ70 subunit of RNA polymerase to control transcription initiation (18–23). Over the past 30 years, about 30 Pho regulon members, which predominantly encompass an ensemble of genes involved in Pi uptake and metabolism, have been identified in E.coli as reviewed by Wanner (14). We are studying the gram negative α-proteobacterium Sinorhizobium meliloti. This organism forms N2-fixing root-nodules on alfalfa (Medicago sativa) and its genome is unusual as in addition to a 3.4 Mb chromosome it contains two megaplasmids 1.3 and 1.7 Mb in size. Previous studies have identified several Pho regulon members in S.meliloti including the pstSCAB and phoCDET operons which encode ABC-type high affinity transport systems for Pi and in the case of phoCDET likely phosphonates (24–27). The orfA-pit operon encodes a low affinity Pi transport system whose expression is negatively regulated by PhoB (28). Other Pho regulons include the exp, phn and pta-ackA operons (16,29,30). Using DNA microarray and promoter analysis, Krol and Becker (31) identified several novel putative Pho regulon members including afuA which is annotated as an iron transport binding protein. In ongoing studies to understand the response of S.meliloti to Pi limitation, we constructed a Pho box weight matrix based on known E.coli and S.meliloti PhoB binding sites and used this matrix to predict new PhoB binding sites in the S.meliloti genome. Expression of predicted Pho regulon members then was examined through the analysis of transcriptional reporter gene fusions and through the previously reported microarray data (31). The frequency weight matrix was also employed to predict PhoB binding motifs across 12 closely related proteobacterial genomes with a goal to identifying a common set of PhoB regulated genes as might be expected from a conserved biological response to Pi-limitation.

MATERIALS AND METHODS

Construction of the Pho box weight matrix for prediction of PhoB binding sites

A total of fifteen known Pho boxes from S.meliloti and E.coli were used for weight matrix construction. Five of those Pho box sequences were collected from previously identified PhoB binding sites from S.meliloti. Of those five, four were from S.meliloti strain 1021 including one PhoB binding site upstream orfA-pit; two sites from the phoC promoter; one from the phnG promoter (24,28,32) and one Pho box was taken from the orfA-pta-ackA promoter of S.meliloti strain 104A14 (16). Ten PhoB binding sites from E.coli were phoA, phoB, phoE, phoH, phnC, pstS1, pstS2, ugpB1, ugpB2 and ugpB3 (18,33–35) (see Table 1). Following their alignment a matrix was constructed from the relative frequencies of A, T, C or G at each position of the 18 nt Pho box sequence (Table 1). This matrix was used to determine an information-based measure of potential binding sites according to the method of Schneider et al. (36). An 18 bp window was moved over the entire genome on both strands and the score (Si) at each nucleotide position (having base i) was calculated according to S = (1/18) Σ [2 + log2(F)], where F is the frequency matrix for base i at position j. This score, which ranges from −2.62 (the score of the worse match) to 1.39 (the score of the consensus sequence), is a measure of the information content of a potential binding site measured against the example set. The lowest example score, that of orfA-pit, is 0.36 and a threshold of 0.35 was used to define a ‘hit’. A scan of the entire S.meliloti genome produced about 1500 hits on each strand. These were filtered to retain only those that were between −500 to +100 bp on the coding strand from an annotated translational start site.
Table 1

Pho-box matrix

Position#1#2#3#4#5#6#7#8#9#10#11#12#13#14#15#16#17#18
S.meliloti orfA-pta:TTGTCAAACCGCCGTAAC
S.meliloti phnG:ATGTCACAAGCCTGTCAT
S.meliloti phoC1:CTGACACTGCGCTTTCAT
S.meliloti phoC2:CTGTTACAGAACCTACAC
S.meliloti orfA-pit:CTGTGGGAAAGCCGTTTT
E.coli phoACTGTCATCACTCTGTCAT
E.coli phoBCTGTCATAAAGTTGTCAC
E.coli phoECTGTAATATATCTTTAAC
E.coli phoHCTGTCATCACTCTGTCAT
E.coli phnCCTGTTAGTCACTTTTAAT
E.coli pstS1CTGTCATAAAACTGTCAT
E.coli pstS2CTTACATATAACTGTCAC
E.coli ugpB1TTGTCATCTTTCTGACAC
E.coli ugpB2CTATCTTACAAATGTAAC
E.coli ugpB3AAGTTATTTTTCTGTAAT
Frequencies adjusted by adding 0.03 for zero count
    A0.130.060.070.130.070.840.070.580.40.530.270.070.030.030.130.320.880.03
    T0.130.880.070.810.20.070.60.190.270.130.330.130.750.250.810.070.060.03
    C0.710.030.070.030.670.030.20.190.20.270.130.770.190.030.030.580.030.44
    G0.030.030.840.030.070.070.130.030.130.070.270.030.030.690.030.030.030.03
Consensus sequenceCTGTCATAAATCTGTCAT

S.meliloti and E.coli PhoB binding sites were extracted from the following references: (16,18,24,28,32,33–35).

Pho-box matrix S.meliloti and E.coli PhoB binding sites were extracted from the following references: (16,18,24,28,32,33–35).

Generation of gusA transcriptional gene fusions to the PCR amplified Pho box containing promoters

To construct the gusA reporter gene fusions to the Pho box containing promoters, each promoter region was PCR amplified using the primers as listed in Supplementary Table 2, and the PCR amplified promoter fragments were digested with appropriate restriction enzymes and cloned into either pFUS1 vector which is a broad host replicable vector containing promoterless gusA (uidA) gene (37) or into a suicide plasmid pTH1360 [modified pVO155 (38) by replacement of gusA coding and upstream sequences with the ones in pFUS1]. The corresponding gene fusion plasmids were verified by sequencing and subsequently introduced into S.meliloti wild-type strains RCR2011 and its derivative RmP559 (RCR2011, PhoB::TnV) strains or RmP110 and RmH852 (Rm1021, phoB) by tri-parental mating using MT616 as the helper strain as described previously (39).

β-Glucuronidase and alkaline phosphatase assays

To determine the expression of the predicted Pho box containing genes or operons in response to Pi limitation, the S.meliloti wild-type strains and its PhoB mutant harbouring the plasmid borne promoter::gusA gene fusion were inoculated in 2 ml LBmc containing 2.5 µg/ml tetracycline and grown overnight aerobically at 30°C to OD600 of ∼1.0. Luria–Bertani (LB) broth was supplemented with 2.5 mM MgSO4 and 2.5 mM CaCl2 (LBmc) (40). A total of 0.5 m of cultures were spun down in a 1.5 ml microcentrifuge tube, washed twice in 1 ml phosphate free MOPS minimal medium (P0 medium) and resuspended in 250 µl of the P0 medium. MOPS-buffered minimal medium contains 40 mM morpholinopropane sulfonic acid/20 mM potassium hydroxide; 20 mM NH4Cl; 2 mM MgSO4; 2 mM CaCl2; 100 mM NaCl; 15 mm filter-sterilized glucose as carbon source and supplied with 0.3 µg/ml biotin and 10 ng/ml CoCl2 (24,41). Ten microliter aliquots of washed cells were subcultured into 5 ml of P0 medium, or MOPS minimal medium supplied with 2 mM KH2PO4 (P2 medium). After 32 h incubation at 30°C, 2 ml cultures were spun down at 10 000 r.p.m. for 1 min and resuspended in 1 M TrisHCl (pH 8.0) for alkaline phosphatase assays as described by Bardin et al. (24), and 3 ml cultures were left for β-glucuronidase assays according to the protocol described by Reeve et al. (37).

RESULTS AND DISCUSSION

Weight matrix prediction of potential PhoB regulated genes

A weight matrix to identify potential PhoB binding sites was generated from five S.meliloti and ten E.coli PhoB box example sequences of 18 nt length (Table 1). The nucleotide frequency matrix was used to calculate an information-based score for potential binding sites in a scan of the S.meliloti genome. Putative PhoB binding sites were defined by a score of greater than 0.35 and a location between +100 and −500 nt of the translational start codon on the transcribed strand of an annotated gene (see Materials and Methods). One hundred and three putative PhoB binding sites were found and are shown with their downstream annotated genes in Supplementary Table S1. Seven of these promoter regions contained two putative PhoB boxes, so that 96 distinct genes were found. Three out of four genes whose Pho boxes were used for matrix construction were also among those 96 genes. No orthologue of orfA-pta-ackA from S.meliloti strain 104A14 was found in Rm1021 strain. The threshold score (0.35) used to identify putative PhoB binding sites was derived from the lowest score (that of orfA-pit) among the example sequences. With this threshold, 18 of the top 20 scores were upstream of genes found to be induced by phosphate starvation, in a PhoB-dependent manner, by gusA fusion analysis in S.meliloti (see next section). However, most putative PhoB binding sites with scores above the cut-off level did not show phosphate-dependent regulation of transcription. Possible explanations in addition to false positives are that the matrix method did not include other important features of a PhoB binding site, such as appropriately positioned −10 and −35 promoter elements. It is also possible that some genes with PhoB binding sites require interaction with additional regulatory proteins before the gene can be regulated by phosphate limitation. Blanco et al. (23) showed that the C-terminal domain of PhoB interacts with a 22 bp region of dsDNA that consists of two direct repeats of 11 bp. Each 11 bp repeat has a conserved 7 bp region (consensus, CTGTCAT) followed by a less conserved 4 bp segment. Our weight matrix is comprised of two conserved 7 bp repeats separated by a single, less conserved 4 bp spacer, and omits the terminal 4 bp segment. However, this terminal segment is not well conserved (23) and will therefore contribute little to the weight matrix score. Furthermore, our weight matrix will reliably identify overlapping PhoB sites provided that they are separated by 4 bp ‘spacers’ and individually have component scores greater than 0.35.

Experimental validation of the predicted Pho regulon members by analysis of transcriptional gene fusions

To directly examine whether the S.meliloti genes identified by the frequency matrix were subject to phosphate-dependent regulation, we generated transcriptional reporter gene fusions to seventy-two of these candidate genes and examined their expression in defined MOPS-buffered minimal medium during growth under Pi-excess (2 mM Pi) and Pi-starvation (no Pi added) conditions (see Materials and Methods). Gene expression in a wild-type phoB+ background was compared with expression in an otherwise isogenic phoB− background (Table 2). Eighteen of the 72 promoter gene fusions were induced upon Pi-starvation in a PhoB-dependent fashion (Table 2). In addition, regardless of the media Pi concentration, gene fusions to smb20427 (putative amino acid ABC transport system), smc02886 and smc02675 (rrna) showed 3-, 2- and 10-fold more expression respectively in the wild-type background relative to the phoB− background.
Table 2

β-Glucuronidase activitiesa from predicted Pho-regulon gusA gene fusionsb

GenecWild-type 0 PiWild-type 2 mM PiphoB 0 PiphoB 2 mM PiDistanceScore
    SMa004187.4 ± 1.3110.3 ± 5.270.7 ± 4.6113.2 ± 4.5−101, −900.36, 0.48
    SMa030288 ± 4.6123.6 ± 8.2134.6 ± 7187.7 ± 1.2−210.45
    SMa0567139.7 ± 4.866 ± 2.6116.7 ± 5.465.9 ± 18.5−1300.37
    SMa0570220.7 ± 3.5155.9 ± 3.3169.3 ± 33.5128.4 ± 35.9−410.38
    SMa1355275.2 ± 44.2231.2 ± 40.2229.9 ± 21.8277.4 ± 24.3−1600.41
    SMa1456726.2 ± 14.2763 ± 41.2804.2 ± 56858.5 ± 34.6−300.45
    SMa1836139.1 ± 13.6162.9 ± 9.4122.1 ± 10.8146.4 ± 23.8−2850.39
    SMa2012101.1 ± 6100.1 ± 1.4112.2 ± 14102.8 ± 23.3−3810.39
    SMa2025132 ± 0.7206.5 ± 21.6121.8 ± 2.8207.8 ± 13.6−220.36
    SMa206337.7 ± 19.531.4 ± 17.155.2 ± 12.875.1 ± 7.9+3490.55
    SMb20106592.2 ± 8.7915 ± 52.5460.6 ± 23.3910.4 ± 84.1−600.43
    SMb2041042.6 ± 24.748.1 ± 16.270.2 ± 13.387.5 ± 12.8−60.44
    SMb204271895.2 ± 78.21943.1 ± 78.7671.8 ± 34.2606 ± 43−1290.37
SMb20483 (crp)95 ± 45.592.7 ± 24.592.1 ± 15.3102.2 ± 12.7−80, −620.36, 0.37
    SMb2049388 ± 14.8131.7 ± 38.6156.7 ± 16216.4 ± 10.6−410.63
SMb20759(phnG)*1622.6 ± 25.9136.6 ± 8.572.8 ± 4.1115.4 ± 5.9−660.40
    SMb208241062.7 ± 54.81974.06 ± 229.8996.45 ± 10.21709.57 ± 76.1−1540.38
    SMb20843*1112 ± 149239.7 ± 11.1179.5 ± 28.5200.5 ± 21.8−3400.38
    SMb20876*2092.4 ± 13444.5 ± 19.676.8 ± 13.769.9 ± 7.5−980.43
    SMb20935722.8 ± 29.51182.3 ± 67.9624.6 ± 66.31339.9 ± 105.2−1830.41
    SMb2098079.9 ± 24.141 ± 18.4101.3 ± 7.491.4 ± 12.9−2710.39
    SMb21144752 ± 50777 ± 6.7582 ± 35.1624.9 ± 41.9−4410.36
    SMb21154478.8 ± 51.4423.4 ± 15.4403 ± 17.3385.5 ± 35.5−1650.37
SMb21177 (phoC)*1777.4 ± 98.351.6 ± 5.465.4 ± 8.460.5 ± 5.3−85, −630.54, 0.48
    SMb211921231 ± 472424.6 ± 122.11165.6 ± 104.32752.7 ± 88.2−3560.42
    SMb21210258.8 ± 14.6278 ± 6205.2 ± 7.9278.3 ± 13.6−1200.36
    SMb2121653.7 ± 13.978.8 ± 28.785.7 ± 18.186 ± 15.7−3550.37
    SMb21222123.9 ± 16.4172.3 ± 15.4160 ± 14.2194.4 ± 16.1−3520.37
    SMb21270*1974.1 32861.1 ± 4.11253.1 ± 8.01343.7 ± 29.4−91, −800.39, 0.37
    SMb2130736.4 ± 21.748.6 ± 15.739.2 ± 7.163.5 ± 12.2−4240.42
    SMb21555296.8 ± 22.4343.7 ± 37.4303.9 ± 2.2463 ± 24.5+1650.39
    SMc000091829 ± 2.5.63616.6 ± 329.71631.8 ± 107.72613.7 ± 157.8−3020.37
    SMc0002761.7 ± 24.948.6 ± 15.957.8 ± 14.197.9 ± 8.6−3680.39
    SMc00042604.5 ± 3.51059.1 ± 76.1631 ± 38.41030.7 ± 40−620.43
    SMc00161619.6 ± 37.8909.3 ± 98.8347.6 ± 16522.1 ± 10.6−228, −2170.38, 0.59
    SMc00171*1530.9 ± 19.3124.1 ± 10.694 ± 1.4128.6 ± 3.4−520.47
    SMc004855771.6 ± 682.65146.4 ± 238.43506.4 ± 90.43809.5 ± 254.4−1960.39
SMc00618 (ppk1)*6034 ± 272366.8 ± 29.6266.7 ± 33.5255.2 ± 9.7−470.46
    SMc00801**673.2 ± 92.91824.7 ± 1631352.5 ± 1201719 ± 153−570.87
SMc00819 (katA)*1415.4 ± 91.2100 ± 12.3111.8 ± 16.5117.2 ± 14.6−500.60
    SMc00978161.6 ± 5.7117.1 ± 7.7158.5 ± 12.9106.9 ± 10.1−480.59
    SMc0098261.2 ± 2.344.6 ± 6.471.5 ± 25.365.8 ± 5.1−3590.39
    SMc01605*1242.6 ± 4.4270.2 ± 0.795.4 ± 3.688.6 ± 0.4−580.52
    SMc0163580.5 ± 63.665.1 ± 16.269.4 ± 7.878.6 ± 5.3−2070.36
    SMc01723*894.3 ± 3.1314.8 ± 1.2190.0 ± 3.6308.2 ± 7.3−640.41
    SMc01849188.5 ± 10.9171.9 ± 4.6175.1 ± 39.7188.2 ± 22.7−3210.42
SMc01852 (pfk)*734.3 ± 126288.9 ± 26.5204.6 ± 9217.6 ± 23.7−1050.49
    SMc01907*955.9 ± 24.3186.8 ± 17.585.9 ± 2.8125.2 ± 4.4−820.72
    SMc01934406.5 ± 22.4706.4 ± 44.4440.7 ± 21.7607.6 ± 26.9−2830.39
    SMc01952101.7 ± 10.2100.6 ± 13.4123.9 ± 10.9118.6 ± 12.8−2990.37
SMc02146 (pstS)*2658.6 ± 193100.7 ± 23.481.3 ± 19.878.7 ± 12.4−115, −1040.72, 0.38
    SMc0231554 ± 5.649.5 ± 2.676.4 ± 16.856.5 ± 8.9−1340.44
    SMc02601**282.4 ± 19818.9 ± 47.31036.6 ± 571426 ± 37.9−990.50
    SMc02634*2315.4 ± 146111.4 ± 9.890.4 ± 0.3114.9 ± 4.2−770.56
    SMc02675342.7 ± 9.4308.9 ± 30.530.7 ± 0.944.6 ± 1.9−4100.78
    SMc02689977.9 ± 54677.5 ± 116.71204.5 ± 86.4717.7 ± 4.3−1060.5
SMc02862 (orfA-pit)**46.2 ± 0.18111.3 ± 3.999.8 ± 6.6123.8 ± 6.4−830.37
SMc02863 (recF)42.3 ± 40.836.4 ± 1271.8 ± 573.6 ± 12.5−1950.45
    SMc02886618.4 ± 12.8731.9 ± 45.3290.6 ± 24.6347.7 ± 13.3−900.51
    SMc02976398.6 ± 19.2393 ± 47233.3 ± 19.3307.5 ± 15.3−730.36
    SMc03124*4081.4 ± 13.2130.5 ± 1.0126.5 ± 2.6109.7 ± 2.6−930.44
    SMc03174137.6 ± 9196.3 ± 14.7140.4 ± 15.1190.7 ± 29.1−4230.41
SMc03243(phoA1)*1761.4 ± 159108.4 ± 8.376 ± 5.3109.6 ± 5.1−64, −970.41, 0.37
    SMc03823153.3 ± 17.8587.6 ± 26.5218.9 ± 8.1728.8 ± 45.7−2060.46
    SMc03844260.57 ± 1.2438.1 ± 51.2397.9 ± 43.3557.4 ± 23.9−330.42
    SMc03975367.3 ± 25.1544.2 ± 67.3704.6 ± 10.81085 ± 29.5+220.56
    SMc04053*1048 ± 80.1149.2 ± 18125.6 ± 29.4164.6 ± 17.7−1420.39
    SMc04144235.3 ± 8247.6 ± 24.8173.5 ± 25220.7 ± 13.8−460.39
    SMc04213161.3 ± 6.7170 ± 1.4185.8 ± 11.2168.3 ± 20.1−1690.36
SMc04317 (afuA)*3284.3 ± 29950.6 ± 23.776.8 ± 9.359.7 ± 15.7−510.58
    SMc0445826.4 ± 9.126.7 ± 1.452.4 ± 6.754.3 ± 7.9−2300.37
    SMc044781498.2 ± 100.63333.8 ± 118.61632 ± 62.73952.4 ± 312.8−520.57

aGusA transcriptional gene fusion and expression demonstrating altered gene expression in response to Pi conditions. β-glucuronidase activities are expressed as Miller units and corresponding alkaline Phosphatase activities (data not shown) were determined as described in Materials and Methods. Each value corresponds to the mean of triplicate assays with (±) standard error. Background β-glucuronidase values from negative control (S.meliloti strains with no promoter fusions) were in the range of 42–55 Miller units. SMc02862 (orfA-pit) was a lacZ fusion.

bGene fusions integrated into the genome are shown in grey, others were generated in the plasmid pFUS1.

c*PhoB-dependent up-regulated genes, **PhoB-dependent down-regulated genes.

The Pho boxes from genes in ‘bold’ were also part of the weight matrix (Table 1).

β-Glucuronidase activitiesa from predicted Pho-regulon gusA gene fusionsb aGusA transcriptional gene fusion and expression demonstrating altered gene expression in response to Pi conditions. β-glucuronidase activities are expressed as Miller units and corresponding alkaline Phosphatase activities (data not shown) were determined as described in Materials and Methods. Each value corresponds to the mean of triplicate assays with (±) standard error. Background β-glucuronidase values from negative control (S.meliloti strains with no promoter fusions) were in the range of 42–55 Miller units. SMc02862 (orfA-pit) was a lacZ fusion. bGene fusions integrated into the genome are shown in grey, others were generated in the plasmid pFUS1. c*PhoB-dependent up-regulated genes, **PhoB-dependent down-regulated genes. The Pho boxes from genes in ‘bold’ were also part of the weight matrix (Table 1). Three reporter gene fusions were found to be repressed upon Pi-starvation in a PhoB-dependent manner. These were smc00801 (transmembrane protein of unknown function), smc02601 (nadABC) and smc02862 (orfA-pit). In the wild-type background these fusions were expressed at higher levels in media containing 2 mM Pi than in Pi-starved cells. Also, in the phoB mutant background the expression level was elevated and did not alter with media Pi. We have previously reported that expression of the low-affinity Pi-transport system encoded by the orfA-pit genes is repressed by PhoB (28). The repression of smc02601- smc02602- smc02603 (nadABC) expression suggests that the observed down regulation of NAD+ synthesis in S.meliloti possibly corresponds to a slight down regulation of smc00161 expression that appears to occur upon Pi-limited growth (Table 2). The smc00161 is annotated to encode an NH3-dependent NAD+ synthetase and the promoter region of this genes was predicted to carry a promoter Pho box (Supplementary Table S1).

Comparison of Pho box predictions with DNA microarray data

Employing DNA microarrays, Krol and Becker (31) identified 98 genes (some of which were in operons) that were more than 3-fold induced in a phoB-dependent manner upon Pi limitation. An additional 50 genes showed a strong increase in expression under phosphate limitation in a partially phoB-dependent or phoB-independent manner. Krol and Becker (31) also identified potential Pho-box sequences with ≤2 mismatches from the Pho-box consensus sequence TG(A/T)CA (C/A)-NNNN-C(C/T)(G/T)TCA(C/T) defined by Summers et al. (16). Of the 19 Pho-box promoters identified by Krol and Becker (31), 14 were also identified with our weight matrix (Table 3) and data from our gene fusion experiments (Table 2) revealed that 13 of these 14 genes were regulated by media Pi in a PhoB dependent manner (Table 2). A reporter fusion to the remaining gene, sma0612 has yet to be examined. Of the five Pho-boxes not identified by our weight matrix, four were unusual (sma1809, sma1822, smc00170 (sinR) and smc00429) as they contained 3 or 5 nt in the region flanked by the 7 nt direct repeats instead of the 4 (see Table 1). The Pho-box upstream of the remaining gene, sma0045, lies on the opposite strand to sma0045 and thus would not be included in our predictions. However reporter gene fusions to these genes should be analyzed as the microarray experiments suggested these genes were induced in a phoB-dependent manner (31).
Table 3

Pho regulon members in S.meliloti as identified by this study and the microarray analysis of Krol and Becker (31).

No.GeneStudyPho box motifdDistScoreDescription*Genomes
1SMb20759KnownATGTCACAAGCCTGTCAT−660.62phnG, carbon-phosphorus lyase component*4
2SMb21177KnownCTGTTACAGAACCTACAC−850.54phoC, phosphate ABC transporter ATP binding*6
KnownCTGACACTGCGCTTTCAT−630.48
3SMc02862KnownCTGTGGGAAAGCCGTTTT−830.37orfA-pit, inorganic phosphate transporter*4
4SMa0612OverlapaTTGTCAGGGGCCTTTCAT−180.36FixN3, cytochrome c oxidase subunit 1
5Smc00171Overlapa,bATGTCATCTTCCTGAAAC−520.47Conserved hypothetical protein*5
6Smc01605Overlapa,bCTGTCACATAGGCTTCAT−580.52Putative ABC transporter periplasmic binding*6
7Smc01907Overlapa,bCTGTCATCCAGCTGTTAC−820.72phoA2, putative alkaline phosphatase* 3
8Smc02146Overlapa,bGTTTCACCGAACTGACAT−1040.38pstS, phosphate-binding periplasmic protein*11
Overlapa,bCTGTCATAAATGTTTCAC−1150.72
9SMc02174OverlapaTTGTGAACGATCTGTCGC−760.36Hypothetical transmembrane protein*1
10Smc02634Overlapa,bTTGTCGCAGAGCTGTCAT−770.56phoA3, putative alkaline phosphatase*2
11Smc03124Overlapa,bCTGTCACGAAAGTTTCAC−930.44Putative ABC transporter periplasmic binding*5
12SMc03243Overlapa,bCTGTAAAGCCGCTGACAT−640.41phoA1, putative alkaline phosphatase*5
Overlapa,bCTGTCGCCAAACCGTCCG−970.37
13SMc03961OverlapaTTGTCACAATCCTATCCT−400.36sqdB, sulfolipid biosynthesis protein
14SMc04317Overlapa,bCTGTCACATCCCTGTAAA−510.58afuA, iron-binding periplasmic protein*3
15SMb20843Gene fusionb,cATTTTAAATCGCCGTCAC−3400.38algI, involved in acetylating surface saccharide
16SMb21270Gene fusionb,cCTGGCATCTTGATGTAAT−910.38Putative transcriptional regulator
17SMc00618Gene fusionbCTGTCACAGTTCCGTCGT−470.46ppk, putative polyphosphate kinase protein*10
18SMc00801Gene fusionbTTGTCATAAAACTGTCAC−570.87Hypothetical transmembrane protein
19SMc01723Gene fusionbCCGTCACTACCCTGACAT−640.41Hypothetical transmembrane global homology
20SMc01852Gene fusionbCTGTCATTTAATTTCCAT−1050.49pfk, pyrophosphate-fructose 6-Pi 1-phosphotransferase
21SMc02601Gene fusionbCTGTCACAAAACTTTCTA−990.5nadA, putative quinolinate synthetase A protein
22SMc04053Gene fusionbTTGACATAAAATAGTAAT−1420.39Hypothetical protein
23SMb20876Gene fusionb,cAATTCGTCAATCTGTCAC−980.43Hypothetical protein*1
24SMc00819Gene fusionb,cCTGTCGTTCAGCCGTCAC−500.6katA, catalase hydroperoxidase HPII*4
25SMa1961MicroarraycCTGACAAACCGCCTTAGC−1280.37Putative polyhydroxyalkanoate depolymerase
26SMb20037MicroarraycATGACGCTCACCCGTCAT−3890.39aroE2, putative shikimate 5-dehydrogenase
27SMb21317MicroarraycCTGTCATGCACCTGCATC−3850.39expG, regulator of exopolysaccharide II synthesis*1
28SMc00620MicroarraycCTTTCACCTCGCTGTAAG−700.41Hypothetical signal peptide protein* 2
29SMc00772MicroarraycAGGTCGTCAAGGCGTCAT−1030.36potH, probable putrescine transport permease*1
30SMc02490MicroarraycTTGTCATTGTTCGATCAT−410.38Hypothetical protein
31SMc03994MicroarraycCTGTCAGACAGGTTCAAT−470.37suhB, putative inositol monophosphatase
32SMc04239MicroarraycCTGTCGTGAAGGCTTCAT−980.4Hypothetical protein
33SMc04280MicroarraycCTTTTGTAAAGATTTCAT−810.37Hypothetical signal peptide protein
34SMc01848MicroarraycTCGTCATCAAAGTGTAGC−470.41Hypothetical protein (btaA-like)* 2

aPho-box sequences detected in this study and also by Krol and Becker (31).

bPho-box sequences detected in this study and gene fusions tested for PhoB-dependent regulation.

cPho-box sequences detected only by our matrix and PhoB-dependent regulation shown by microarray data [Krol and Becker (31)]. No gene fusions were tested for #25–34.

d‘Dist’ indicates the relative distance from the annotated translational start sites.

*Indicates the conserved Pho regulon homologs having predicted Pho box in other scanned genomes (see Tables 5–8).

Pho regulon members in S.meliloti as identified by this study and the microarray analysis of Krol and Becker (31). aPho-box sequences detected in this study and also by Krol and Becker (31). bPho-box sequences detected in this study and gene fusions tested for PhoB-dependent regulation. cPho-box sequences detected only by our matrix and PhoB-dependent regulation shown by microarray data [Krol and Becker (31)]. No gene fusions were tested for #25–34. d‘Dist’ indicates the relative distance from the annotated translational start sites. *Indicates the conserved Pho regulon homologs having predicted Pho box in other scanned genomes (see Tables 5–8). In addition to the 13 Pho regulon members predicted both here and by Krol and Becker (31), both the weight matrix data and data from reporter fusion assays identified an additional 10 genes whose expression was PhoB-regulated in response to Pi limitation (Tables 2 and 3). With the exceptions of smb20843 (algI), smc00618(ppk), smc02601(nadA) and smc00801(hypothetical, global homology), these genes also showed PhoB-dependent transcription in microarray studies. The failure to detect repression of smc02601(nadA) and smc00801expression in microarray experiments is not surprising as the microarray experiments also failed to detect orfA-pit repression and this operon is known to be repressed by PhoB (25). The failure to detect induction of smb20843 (algI), smc00618 (ppk), upon Pi-starvation is more surprising as these genes appear to be highly regulated in the gene fusion experiments. Moreover expression of ppk is known to be Pi-starvation induced in many organisms. The differences between the microarray and gene fusion data could result from several factors including differences in experimental growth conditions as in microarray experiments cells were grown in 100 µM Pi source as the Pi-limitation condition. Alternatively, it is possible that the particular probes employed for ppk and smb20843 yielded low signals. Through our weight matrix scan, Pho-box sequences were also found upstream of three more genes, sma2410 (rhbF), smc01296 (rpsN) and smc01820 (putative N-carbamyl-L-amino acid aminohydrolase) (Supplementary Table S1). Promoter fusions to these three genes have not been tested yet. These genes however are shown to be repressed in a PhoB-independent manner in microarray studies (31). Further studies are required to analyze the regulation of these genes and the nature of their associated Pho-box sequences. We note that in the case of orfA-pit, the Pho-box identified by Krol and Becker (31) lay on the opposite strand to the orfA-pit genes and is different from that identified by Bardin et al. (28). Since orfA-pit expression is negatively regulated by PhoB, it is of interest to determine the actual PhoB binding site as little is known regarding how PhoB represses transcription. In summary, of 96 genes with upstream Pho-boxes predicted by the frequency matrix genome analysis, 34 appear to be Pi and PhoB regulated as revealed from gene fusion and microarray analysis data (Table 3).

Analysis of predicted Pho regulon members across proteobacterial genomes

It is reasonable to assume that at least part of the physiological response to Pi-limitation will be conserved. As the Pho-box sequence identified by the PhoB proteins of different organisms appears to be conserved (14,16,24,28,42–44), we used the Pho-box frequency matrix described above (Table 1) to search the genomes of twelve gram negative bacteria (Table 4) for PhoB-binding sites using the same criteria as employed for S.meliloti. Genes that lay downstream of a predicted Pho-box with scores greater than 0.35 were further examined. We identified genes, such as pstSCAB, phoA, ugpA, phn and ppk that are known to be associated with phosphate metabolism (Tables 5–7). The pstS gene encodes the Pi-binding protein of the high affinity PstSCAB transport system (18,27) and expression of this system in E.coli, S.meliloti and Pseudomonas aeruginosa is known to be highly induced under Pi-limiting conditions and is PhoB dependent (27). In a number of organisms, such as Caulobacter crescentus, the pstS gene transcript is separate from the pstCAB-phoUB transcript and in these cases predicted Pho-boxes are also located upstream of the pstC gene (see Table 5).
Table 4

List of proteobacterial genomes scanned for the presence of Pho-boxes

No.Bacterial genomeAccession no.Number of predicted Pho regulons
1Acinetobacter sp. ADP1NC_00596656
2A.tumefaciens C58 Chromosome circularNC_00330451
A.tumefaciens C58 Chromosome linearNC_00330535
A.tumefaciens C58 Plasmid ATNC_0033067
A.tumefaciens C58 Plasmid TiNC_0033086
3B.japonicum USDA 110NC_00446387
4B.melitensis 16 M chromosome INC_00331725
B.melitensis 16 M chromosome IINC_00331833
5B.suis 1330 chromosome INC_00431027
B.suis 1330 chromosome IINC_00431132
6C.crescentus CB15NC_00269662
7E.coli K12NC_000913107
8E.coli O157:H7 ChromosomeNC_00269596
E.coli O157:H7 Plasmid pO157NC_0021285
E.coli O157:H7 Plasmid pOSAK1NC_0021272
9M.loti MAFF303099 for chromosomeNC_00267871
M.loti MAFF303099 plasmid pMLaNC_00267916
M.loti MAFF303099 plasmid pMLbNC_0026827
10P.aeruginosa PAO1NC_00251673
11P.putida KT2440NC_00294771
12S.meliloti 1021 chromosomeNC_00304756
S.meliloti 1021 plasmid pSymANC_00303716
S.meliloti 1021 plasmid pSymBNC_00307824
13Rhizobium sp. NGR234 plasmid pNGR234aNC_00091417
Rhizobium sp. NGR234 megaplasmid 2 contig 1AY3167474
Rhizobium sp. NGR234 megaplasmid 2 contig 2AY3167463
Table 5

Predicted Pho-regulon orthologue groups related to Pi metabolism

GroupGenePho box motifDistScoreGenome sourceDescription-predicted function
APhosphate transport (high affinity)
1SMc02146CTGTCATAAATGTTTCAC−1150.72S.melilotipstS, putative periplasmic phosphate-binding protein
GTTTCACCGAACTGACAT−1040.38
2ACIAD0279CTGTCATTAAAGTTTCAT−640.71Acinetobacter sp.ADP1phoU, transcriptional repressor for high affinity phosphate uptake
3ACIAD1212GCGTCATTAATTTGTCAC−980.38Acinetobactersp. ADP1pstS, putative ABC phosphate transporter (periplasmic-binding)
TTGTCACTGAATTGTCAT−870.54
4Atu0421TTGACATTTCCCATTCAT−1510.37A.tumefacienspstC, ABC transporter, membrane spanning protein
5Atu0420TTGTCACAAATCTTTCGT−1170.54A.tumefacienspstS, ABC transporter, substrate binding protein [phosphate]
CTTTCGTCAAAGTGACAT−1060.36
6blr109CTGTCATCCGACTGTCAC−880.71B.japonicumpstS, ABC transporter phosphate-binding protein
CTGTCACGAAACCTTCGT−770.48
7BMEI1989GTGTCATATAAGTGTAAT−760.56B.melitensispstS, putative periplasmic phosphate-binding protein
CTGTAATATTCGTGTCAT−870.53
TTGCAACAAACCTGTAAT−980.37
8BR2138TTGCAACAAACCTGTAAT−890.36B.suispstS, phosphate ABC transporter, phosphate-binding protein
CTGTAATATTAGTGTCAT−780.56
GTGTCATATGAGTGTAAT−670.41
9CC0290TTGTCGTCAAACTGTCAT−1040.68C.crescentuspstCAB-phoUB, phosphate ABC transporter, permease protein
CTGTCATGTAATTGTCGC−930.48
10CC1515CTGTCGTCAAACTGTGAC−620.57C.crescentuspstS, putative ABC transporter, periplasmic phosphate-binding
CTGCCTTAAAACTGTCGT−730.38
11b3728CTGTCACCTGTTTGTCCT−560.36E.coli K12pstS, high-affinity phosphate transport phosphate-binding protein
CTTACATATAACTGTCAC−670.62
CTGTCATATTCCTTACAT−780.64
CTGTCATAAAACTGTCAT−890.97
12ECs4664CTGTCACCTGTTTGTCCT−560.36E.coli O157:H7pstS, periplasmic phosphate-binding protein
CTTACATATAACTGTCAC−670.63
CTGTCACATTCCTTACAT−780.62
GTGTCATCAAACTGTCAC−890.67
CTTTCCTTTGGGTGTCAT−1000.37
13mll3723CTGTCATGCGACTGTAAT−790.58M.lotipstS, periplasmic phosphate-binding protein, (PBP)
CTGACACGAAACTGTCAT−900.65
14PP5328TTGTAATGTTTTTGTCAC−3020.38P.putidapstC, phosphate ABC transporter, permease protein
15PP5329TTGTCACAATGAAGTCAT−750.41P.putidapstS, ABC transporter, periplasmic phosphate-binding protein
CTTTCATCCAATTGTCAC−860.56
16PP2656CTGCCATTCAATAGTCAC−420.37P.putidapstS, phosphate ABC transporter, periplasmic phosphate-binding
TAGTCACAAAGTTGTAAC−310.49
TTGTAACACAGCTGATGC−200.36
17PA5368CTGTCATCTGTCTGTCAT−2770.74P.aeruginosapstC, membrane protein component of ABC
GCGTCATGTTGCTGTCAT−2880.39phosphate transporter
18PA5369CTTTCATAGAGTCTTCAT−600.53P.aeruginosapstS-like, hypothetical protein
CTGTCATATTCCTTTCAT−710.78
ATTTCATCCAACTGTCAT−820.56
BPhosphate transport (low affinity)
1SMc02862CTGTGGGAAAGCCGTTTT−830.37S.melilotiorfA-pit, phosphate transport transmembrane protein
2ACIAD1047TTGTCATATAATTGTCAT−510.73Acinetobacter sp. ADP1pit, phosphate transporter
3bll3022TTGTCATCCAGCCTTCAA−490.53B.japonicumpit-like, low-affinity inorganic phosphate transporter
4mll3637TCGTCATACAGGGTTCAT−560.36M.lotipit, phosphate transporter
5PP1373CTGTCATCTGCCTGTTAC−670.56P.putidaLow-affinity inorganic phosphate transporter
CPhosphonate/Phosphate metabolism
1SMb21177CTGACACTGCGCTTTCAT−630.54S.melilotiphoC, phosphate uptake ABC transporter ATP-binding protein
CTGTTACAGAACCTACAC−850.48
2SMb20759ATGTCACAAGCCTGTCAT−660.4S.melilotiphnG, putative C-P (carbon-phosphorus lyase component protein
3ACIAD0719TTGATATCAAGCTTCCAT−710.38Acinetobacter sp. ADP1phnA, putative alkylphosphonate uptake protein (PhnA)
4Atu0181ATGTCACACGCTTGTCAT−670.47A.tumefaciensphnG, hypothetical protein
5Atu0174CTGACACTCCCGCTTCAC−480.36A.tumefaciensphnC, ABC transporter, nucleotide binding/
TCTTCATCAAACTGACAC−590.37ATPase [phosphonate]
ATGTAATATTTTCTTCAT−700.36
6Atu6108CTGTCAGAACACCCTGAT−810.37A.tumefaciensphnA, alkylphosphonate uptake protein
7blr7947CTGTCATCGCTGCGTCAT−990.62B.japonicumphoC, phosphonate metabolism protein
8blr1221TTGTCACGAAACAGTCAT−120.47B.japonicumphnG, phosphonate metabolism protein
9bll7947ATGTCACACAAGTGTCAA−440.52B.japonicumphoC, phosphonates transport system nucleotide binding/ATPase
10CC0361CCGTCACAAATCCGTTGC−680.36C.crescentusPhosphonates ABC transporter, ATP-binding protein
11ECs5088CTGTTAGTCACTTTTAAT−600.43E.coli O157:H7phnC, ATP-binding component of phosphonate transport
12b4106CTGTTAGTCACTTTTAAT−600.42E.coli K12phnC, ATP-binding component of phosphonate transport
13mll9156TAGTCATCTCACTGTCAT−3150.56M.lotiphnM homolog, phosphonate metabolism protein
14mlr3342CTGTCATCCACCCGTCAT−800.73M.lotiphnG, similar to phnG gene product (phosphonate metabolism)
15PP2209CTGACTTATAGCTGGGAT−850.37P.putidaphnW, 2-aminoethylphosphonate:pyruvate aminotransferase
16PA3384CTGTCATCGTCACTTCAC−840.44P.aeruginosaphnC, ATP-binding component of ABC phosphonate transporter
TTGTCACTCGACTGTCAT−950.59
DPho response regulator
1ACIAD3557TTGCAATGATACTGTCAT−740.37Acinetobacter sp. ADP1phoB-phoR, positive response regulator for the pho regulon PhoR
TTGTAATAAGCCTGTCAT−470.49Acinetobacter sp. ADP1
2Atu0419ATGATGAATCTCTGTCAT−580.36A.tumefaciensphoR, two component sensor kinase
CTGTCATGAAGCTGGCCT−470.38
3blr1090CTGTAATCTTGCTGACGT−1790.37B.japonicumphoR, phosphate regulon, two-component sensor histidine kinase
4BMEI1624CTGTCTTTGAACTGTCAC−2190.68B.melitensisphoR, phosphate regulon sensor protein
5BR0298CTGTCTTTGAACTGTCAC−1370.68B.suisphoR, putative sensor histidine kinase
6b0399TTTTCATAAATCTGTCAT−640.41E.coli K12phoB, positive response regulator for pho regulon, sensor is PhoR
CTGTCATAAATCTGACGC−530.83
7ECs0449TTTTCATAAATCTGTCAT−640.72E.coli O157:H7phoB-phoR operon, positive response regulator for pho regulon
CTGTCATAAATCTGACGC−530.65
8PP5320CTGTCACACAGCTGCAAT−150.68P.putidaphoB-phoR, DNA-binding response regulator
CTGCAATAATTCCGTTAT−40.39
9PA5360GTGTCACATACCTGACAC−500.54P.aeruginosaphoB-phoR, two-component response regulator PhoB
CTGACACAATTTCGTTAT−390.42
EGlycerol 3-phosphate transport
1ACIAD1317CAGTCATTGAATCTTCAT−1680.42Acinetobacter sp. ADP1gpsA, glycerol-3-phosphate dehydrogenase, biosynthetic
2Atu5058CTGTCATCAAAACGTCGC−750.47A.tumefaciensugpB, ABC transporter, substrate binding protein
TTGCTAGACAGCCGTCAT−290.37
3Atu0305GTGTAATAAATCTGACAC−760.44A.tumefaciensugpA, ABC transporter, substrate binding protein
CTGACACGGAACTGTCAA−870.47
CTGTCAAAGAGGCGTCAT−980.63
4bll0733CAGTCATGTGAACGTCAT−550.36B.japonicumABC transporter glycerol-3-phosphate-binding protein
5blr2436CCTTCGCCTCTCTTTCAT−760.37B.japonicumglpD, glycerol-3-phosphate dehydrogenase
CTTTCATCTTGCTTTCGC−650.39
6b3453AAGTTATTTTTCTGTAAT−750.37E.coli K12ugpB, sn-glycerol 3-phosphate transport periplasmic binding protein
CTATCTTACAAATGTAAC−970.39
TTGTCATCTTTCTGACAC−1190.41
7ECs4299AAGTTATTTTTCTGTAAT−750.37E.coli O157:H7ugpB, sn-glycerol 3-phosphate transport periplasmic binding protein
CTATCTTACAAATGTAAC−970.39
CTGACACCTTACTATCTT−1080.41
CCGTCACCGCCTTGTCAT−1300.4
8mll3503CTGTCACATACCTTCTCT−610.37M.lotiugpB, sn-glycerol 3-phosphate transport system; periplasmic binding
FPhosphatases
1SMc03243CTGTAAAGCCGCTGACAT−640.41S.melilotiphoA, putative alkaline phosphatase
CTGTCGCCAAACCGTCCG−970.37
2Smc01907CTGTCATCCAGCTGTTAC−820.72S.melilotiHypothetical transmembrane protein
3Smc02634TTGTCGCAGAGCTGTCAT−770.56S.melilotiHypothetical protein, phosphatase
4Atu1263ATGTCATGCCACTGTCAC−510.59A.tumefaciensConserved hypothetical protein
5BMEII0655CCGTCATTCCTGTGTAAT−530.38B.melitensisphoA, alkaline phosphatase
6BR1200CTGTCACACGCCTGCAAT−800.44B.suisphoA, alkaline phosphatase
7BRA0616CTGTCATCGTTTCTTCGT−750.39B.suisHypothetical protein, phoA like gene
CCGTCATTCCTGTGTAAT−530.37
8b0383CTGTCATAAAGTTGTCAC−630.88E.coli K12phoA, alkaline phosphatase
9ECs0433CTTTTCAACAGCTGTCAT−50.39E.coli O157:H7phoA, alkaline phosphatase
CTGTCATAAAGTTGTCAC+60.86
10ECs4053TTGTCAGGTATCTGTATC−3610.37E.coli O157:H7phoA, putative alkaline phosphatase I
11mll2704GTGTCACATGGCCGTCAC−450.46M.lotiProbable acid phosphatase
12mll4115CTGTTATCAAAACTTCAT−730.52M.lotiphoA, secreted alkaline phosphatase
13PP1044CTGCCATCAAACTGTAAT−450.66P.putida(uxpA) lipoprotein UxpA
CTGTCGGATTTCTGCCAT−560.41
14PA3296TTGTCACAAGCCCGTCAT−820.53P.aeruginosaphoA, alkaline phosphatase
15PA2635CTGTCATCGTCCCGTCGC−530.39P.aeruginosaHypothetical protein
GMembrane lipids
1SMc01848TCGTCATCAAAGTGTAGC−470.41S.melilotiHypothetical protein (btaA-like)
2Atu2119CTGTCATCAAACTGTAGC−440.58A.tumefaciensHypothetical protein (btaA-like)
3mlr1574CTGTCACCGGCCTGTCAT+10.55M.lotiHypothetical protein (btaA-like)
HExopolysaccharide
1SMb21317CTGTCATGCACCTGCATC−3850.39S.melilotiexpG, activator of exopolysaccharide II synthesis
2SMc02851CTTTCAAAGAGCCGCCAC−1580.37S.melilotiPutative transcription regulator protein
3bll5036TTGTGGCACAGGCTTAAT−1450.36B.japonicummucS, transcriptional regulatory protein

‘Dist’ indicates the relative distance from the annotated translational start sites.

List of proteobacterial genomes scanned for the presence of Pho-boxes Predicted Pho-regulon orthologue groups related to Pi metabolism ‘Dist’ indicates the relative distance from the annotated translational start sites. It was striking that multiple18 bp Pho-box sequences were predicted upstream of the pst genes in all of the genomes examined (Table 5). Multiple Pho-boxes consisted of overlapping 7 bp direct repeats separated by 4 bp spacers. The frequency matrix detected consecutive 18 bp elements and adding a terminal 4 bp spacer formed consecutive 22 bp PhoB binding sites as defined by Blanco et al. (23). The two 11 bp direct repeat sequences bind the PhoB monomers head to tail (23). The pstS promoters from E.coli K12 and O157:H7 are predicted to contain five and six of these 11 bp direct repeats, respectively. The large number of Pho-boxes in all of the pstS promoter regions presumably reflects the importance of the PstSCAB high affinity transport system in the uptake of Pi under Pi-limiting conditions. Other genes associated with phosphate metabolism for which multiple Pho-boxes sequences were detected included alkaline phosphatase-like proteins (phoA), genes involved in phosphonate uptake and metabolism (phn), in glycerol-3-phosphate uptake (ugp and glp), the regulatory genes phoB and phoR (Table 5) and genes encoding polyphosphate kinase (Table 6).
Table 6

Conserved ppk Pho box motifs in various proteobacteria

No.GeneSource/Pho box motifGene annotation/Distance from ATGaScore
1ACIAD1782Acinetobacter sp. ADP1ppk polyphosphate kinase
TTGTAACTCGTTTGTAAC−1070.36
2Atu1144A.tumefaciensppk polyphosphate kinase
CTGTCACAGTTCCGTCGT−840.46
CCGTCGTCAAACTGTTAT−730.39
3bll2813B.japonicumppk2 hypothetical protein
CTGCCATCGCCGTGTCAA−3650.36
4bll4122B.japonicumppk polyphosphate kinase
ATGTCATCGAAACGTCAT−590.48
5BMEI1205B.melitensisppk polyphosphate kinase,
TTGTCATATGACAGCCAT+100.37
TTGCTATCAAATTGTCAT−10.39
6BR0748B.suisppk polyphosphate kinase
TTGTCATATGACAGCCAT−1010.37
TTGCTATCAAATTGTCAT−1120.4
7CC1710C.crescentusppk polyphosphate kinase,
CTGTCTTCACCGCGTCAT+1060.37
8mlr8161M.lotippk polyphosphate kinase
ATGTTAGAGCACCGCCAT+180.36
9mlr8387M.lotippk2 hypothetical protein
CTGCAATAAAACCGTCAC+740.51
10PA5243P.aeruginosappk delta-aminolevulinic acid dehydratase
TTGTTGCCAATCCGTCAT−560.39
11PA2428P.aeruginosappk2 hypothetical protein
CCGTCACCAAACCGTCAT−520.53
TTTTCATCTAACCGTCAC−630.51
12PP5216P.putidappk exopolyphosphatase
CTGTCATATGGCCGTCAT−1190.73
13PP5217P.putidappk polyphosphate kinase
GTGTAACACGGGCGTCAT−1220.36
14PP1752P.putidappk2 hypothetical protein
CTGTGGGAGCGCGTTCAT−700.37
15SMc00618S.melilotippk putative polyphosphate kinase
CTGTCACAGTTCCGTCGT−470.46

aIndicates the relative distance from the annotated translational start sites.

Conserved ppk Pho box motifs in various proteobacteria aIndicates the relative distance from the annotated translational start sites. In addition to the previously reported Pho-box in the orfA-pit promoter region of S.meliloti, Pho-box sequences were also detected in the promoter region of the orfA-pit orthologues in the α-proteobacteria, Bradyrhizobium japonicum and Mesorhizobium loti and the γ-proteobacteria Pseudomonas putida and Acinetobacter sp (Table 5). Bardin et al. (28) showed that the expression of orfA-pit in S.meliloti is repressed upon Pi-starvation, unlike in E.coli where the pit genes appear to be constitutively expressed (45) and for which no Pho-boxes were detected. The identification of putative Pho-boxes upstream of the orfA-pit genes in other bacteria suggests that these may also be repressed by Pi-starvation and that such repression may be a widespread phenomenon. A number of predicted Pho regulon members not normally associated with Pi metabolism were identified in several genomes (Table 7). One of the genes in this category was katA encoding catalase and was recently shown to be PhoB dependent in S.meliloti and P.aeruginosa in Pi-starvation conditions (46). The detection of Pho-box elements upstream of the katA genes of C.crescentus and P.putida suggests that katA expression in these organisms is also PhoB regulated. Pho-boxes upstream of several S.meliloti ABC-class transport systems were also detected upstream of homologous clusters in other bacteria. These were smc01605, smc04317 (afuA) and smc03124 (Table 7). Both the afuABC and smc01605 gene clusters in S.meliloti are annotated as putatively involved in Fe+3 transport, however definitive evidence is lacking. Choa et al., (47) did not find either of these ABC transport systems to be up-regulated in S.meliloti when grown in iron-limiting conditions. Therefore, it appears unlikely that they are actually involved in iron transport. A third ABC system in S.meliloti, smc03124, with conserved Pho-box sequences in other proteobacteria (Table 7), is annotated as a putative peptide binding protein. The actual substrate(s) transported by this system is unknown.
Table 7

Conserved Pho regulon members either of unknown and/or not clearly associated with Pi metabolism

GroupGenePho box motifDistScoreGenome sourceDescription:(predicted) function
A
1SMc01605CTGTCACATAGGCTTCAT−580.52S.melilotiABC transporter, periplasmic substrate-binding Fe+3
2Atu2147TTGCAATCCAACCGTCAC−630.39A.tumefaciensABC transporter, substrate binding protein
3BMEII1120CTGTCATCCAACCGTCAT−960.73B.melitensisIron(III)-binding periplasmic protein
4BRA0115CTGTCATCCAACCGTCAT−790.77B.suisABC transporter, periplasmic substrate-binding
5mll3069CTTTCATCTGGCTGTCAC−580.55M.lotiABC transporter, periplasmic binding protein
6PA3250CTGACATGAAACCGTCAT−1240.58P.aeruginosaHypothetical protein (smc01605 homolog)
7PP1726CCTACATCAAACTGTCAC−1190.39P.putidaABC transporter, periplasmic binding protein
B
1SMc04317CTGTCACATCCCTGTAAA−510.58S.melilotiafuA, iron-binding periplasmic protein
2Atu0202CTGGCATCCATCCGATAT−660.37A.tumefaciensABC transporter, substrate binding protein (iron)
3afuA2CTGTCACATACATGTTAT−460.54A.tumefaciensAtu2014 ABC transporter, substrate binding
4mll3626CTGTCATCCACCAGTCAT−790.62M.lotiABC transporter, periplasmic substrate-binding (iron)
5y4fPATGTAATTAAACTGACAT+240.53Rhizobium sp. NGR234Probable ABC transporter periplasmic binding
C
1SMb20876AATTCGTCAATCTGTCAC−980.43S.melilotiHypothetical protein
2Atu2329CGGTCACACGCCTGTCAT−1610.51A.tumefaciensConserved hypothetical protein
D
1Smc00171ATGTCATCTTCCTGAAAC−520.47S.melilotiConserved hypothetical protein
2Atu1649CTGTCATGTTGCTGAAAC−640.51A.tumefaciensConserved hypothetical protein
3bll5904CGGTCATCCCCCTGTCAC−240.52B.japonicumHypothetical protein
4CC3344CTGTCAGAAAGTCCGCAT−1520.38C.crescentusHypothetical protein
5mll0806CTGACATCGCGATTTCAC−270.44M.lotiHypothetical protein
7PP4510TTGCCATGGCGCTGTCAT−430.37P.putidaConserved hypothetical protein
E
1SMc00819CTGTCGTTCAGCCGTCAC−500.6S.melilotikatA, monofunctional catalase HPII
2Atu4642TAGTCATCTTCATGACAG−1330.37A.tumefacienskatA, bifunctional catalase/peroxidase HPI
3CC3043CGGTCGGTAAGGTGTCAC−600.36C.crescentusCatalase/peroxidase
4PA4236CTGTCATTCATCCTTAAC−1570.67P.aeruginosakatA, monofunctional catalase HPII
5PP3668CTGAAAGAACACTGGCAT−410.41P.putidaCatalase/peroxidase HPI
F
1SMc03124CTGTCACGAAAGTTTCAC−930.44S.melilotiPutative periplasmic binding ABC transporter
2Atu6138TTGCAACAAAACTGTCAC−770.43A.tumefaciensaccA, ABC transporter, substrate binding protein
CTGTCACCAAACTTTCAT−660.76
3blr0308CTGTCGGCTCACTGACAC−4450.45B.japonicumABC transporter peptide-binding protein
4BMEI1934ATGTCATATTTCTGAAAT−4180.56B.melitensisPutative periplasmic binding ABC transporter
5BMEII0505CTGACACATGGCTGGAAC−230.37B.melitensisOligopeptide transport system permease protein
6BRA0786CTGACACATGGCTGGAAC−230.37B.suisPeptide ABC transporter, substrate binding protein
7mlr9269CCCTCATAAAACCGTCAT−850.47M.lotiExtracellular solute-binding protein
8mll9149TTGTCATTCGACTGTCAT−890.62M.lotiOligopeptide ABC transporter oligopeptide-binding
G
SMc00620CTTTCACCTCGCTGTAAG−700.41S.melilotiHypothetical signal peptide protein
blr7070CTTTCACAAAACTGAAAC−440.57B.japonicumHypothetical protein
mlr1518CTGTCACGCAAGCATCAT−470.38M.lotiHypothetical protein
H
SMc00772AGGTCGTCAAGGCGTCAT−1030.36S.melilotipotH, putrescine transport system permease
BR1610TTGTCATCGCAGTGCCAT−510.39B.suisSpermidine/putrescine ABC transporter, permease
I
SMc02174TTGTGAACGATCTGTCGC−760.36S.melilotiHypothetical transmembrane protein
mll8170CTGTGAACAATGTGTCGC−720.36M.lotiHypothetical protein

‘Dist’ indicates the relative distance from the annotated translational start sites.

Conserved Pho regulon members either of unknown and/or not clearly associated with Pi metabolism ‘Dist’ indicates the relative distance from the annotated translational start sites. We identified a putative Pho-box upstream of smc00772 (potH)- gene clusters well as orthologues in M.loti and Brucella suis (Table 7). Although fusion data for smc00772 is unavailable, the potFGHI ABC-class, putative putrescine transporter cluster was identified as upregulated by Pi-limitation in the microarray analysis (31), although no Pho-box was identified by them. The putative Pho-box upstream of smc00772 (potH) lies within the coding region of potG (smc00771), instead of upstream of the regulator (potF). The fact that Pho-box-like sequences were identified upstream of genes similar to S.meliloti potH in M.loti and B.suis suggest that putrescine transport may be PhoB-regulated across a range of organisms and should be further investigated. In response to Pi-starvation, S.meliloti replaces phospholipids with other non-Pi-containing lipids sulphoquinovosyl diacylglycerols (SL), ornithine-containing lipids (OL) and diacylglyceryl-N,N,N-trimethylhomoserines (DGTS) (48,49). In Rhodobacter sphaeroides it was demonstrated that the smc01848 homolog btaA is directly involved in DGTS biosynthesis (50) and recently Lopez-Lara et al. (51) established that smc01848 and smc01849 (btaAB) are required for DGTS synthesis. A Pho-box is predicted 64 nt from the smc01848 start codon and orthologs of smc01848 in M.loti (mlr1574) and Agrobacterium tumefaciens (atu2119) also have predicted Pho boxes in the corresponding promoter regions (Table 5). These data strongly suggest that DGTS synthesis induced upon Pi limitation is mediated directly via PhoR-PhoB system.

Pi starvation and polyphosphate metabolism

Inorganic polyphosphates (polyPi) are linear polymers of orthophosphate residues linked by high-energy phosphoanhydride bonds. These polymers can vary in size from 3 to over 1000 phosphate residues. PolyPi is ubiquitous and the enzyme primarily responsible for polyPi synthesis in E.coli is polyP kinase (PPK), which uses the gamma phosphate of ATP to make the polymer. PolyPi can also be hydrolyzed to Pi either by exopolyhosphatases (PPX) or by endopolyphosphatases (PPN). The identification and assignment of Pho-boxes was sometimes complicated by differences in genome annotation, as in the case of genes encoding polyphosphate kinase (ppk) (Table 6). Here Pho boxes were predicted in the ppk promoter regions of 10 of the 12 genomes examined. However, the predicted Pho-box from both M.loti (52) and C.crescentus (53) were located within the annotated gene coding regions. Alignment of the Ppk amino acid sequence suggests that the actual start codons of the ppk genes in M.loti and C.crescentus are downstream of the annotated start codons (data not shown). Our reporter gene fusion data showed that the S.meliloti ppk gene was strongly induced member of the Pho regulon. However, most strikingly, the weight matrix did not detect a ppk Pho box either from the E.coli K12 genome or the E.coli O157 genome, even at very low cut-off (0.18). In E.coli there is genetic evidence demonstrated that polyphosphates accumulate upon Pi starvation and depend on PhoB, although the E.coli ppk promoter has never been mapped. Therefore, it is likely that E.coli PhoB regulates ppk indirectly as suggested elsewhere (54).

CONCLUSION

Several complementary approaches were integrated to investigate the cellular response to Pi starvation. As a first step, computational identification of PhoB binding motifs predicted 96 potential Pho regulon members from the entire S.meliloti genome. These were subsequently investigated by genetic screening of transcriptional reporter gene fusions and through comparisons with recently available microarray data (31). It was found that 34 out of the 96 in silico predicted Pho regulon members were regulated by Pi concentration in a PhoB dependent manner (Table 3). These 34 Pho regulon members were analyzed in silico for conservation or co-occurrence across 12 genomes scanned (Tables 5 and 7). Nineteen of these 34 candidates were also predicted as having upstream Pho-boxes in at least one of the other genomes scanned in this study. The in silico analysis provided evidence for the conservation of a core Pho regulon in bacteria and suggests that these organisms share a common response to Pi limitation. Such a conservation is not surprising as for example in both plants and yeast one of the major responses to Pi-limitation is the induction of a high affinity Pi transport system and the induction of scavenging enzymes, such as alkaline phosphatases. Extending the Pho-box analysis to many more genomes should define the core group of genes that respond to Pi-starvation. Further it will allow the identification of subgroups of genes, such as katA, whose expression is regulated by PhoB in some organisms but not in others. Analysis of the distribution of such data may lead to the recognition of associations between particular regulatory patterns and other phenotypic properties of the organisms.

SUPPLEMENTARY DATA

Supplemental Data are available at NAR online.
  53 in total

1.  Structural comparison of the PhoB and OmpR DNA-binding/transactivation domains and the arrangement of PhoB molecules on the phosphate box.

Authors:  H Okamura; S Hanaoka; A Nagadoi; K Makino; Y Nishimura
Journal:  J Mol Biol       Date:  2000-02-04       Impact factor: 5.469

Review 2.  DNA binding sites: representation and discovery.

Authors:  G D Stormo
Journal:  Bioinformatics       Date:  2000-01       Impact factor: 6.937

3.  Predicting regulons and their cis-regulatory motifs by comparative genomics.

Authors:  A Manson McGuire; G M Church
Journal:  Nucleic Acids Res       Date:  2000-11-15       Impact factor: 16.971

4.  Analysis of a 1600-kilobase Rhizobium meliloti megaplasmid using defined deletions generated in vivo.

Authors:  T C Charles; T M Finan
Journal:  Genetics       Date:  1991-01       Impact factor: 4.562

Review 5.  Inorganic polyphosphate: a molecule of many functions.

Authors:  A Kornberg; N N Rao; D Ault-Riché
Journal:  Annu Rev Biochem       Date:  1999       Impact factor: 23.643

6.  Identification and characterization of two chemotactic transducers for inorganic phosphate in Pseudomonas aeruginosa.

Authors:  H Wu; J Kato; A Kuroda; T Ikeda; N Takiguchi; H Ohtake
Journal:  J Bacteriol       Date:  2000-06       Impact factor: 3.490

7.  Complete genome sequence of Caulobacter crescentus.

Authors:  W C Nierman; T V Feldblyum; M T Laub; I T Paulsen; K E Nelson; J A Eisen; J F Heidelberg; M R Alley; N Ohta; J R Maddock; I Potocka; W C Nelson; A Newton; C Stephens; N D Phadke; B Ely; R T DeBoy; R J Dodson; A S Durkin; M L Gwinn; D H Haft; J F Kolonay; J Smit; M B Craven; H Khouri; J Shetty; K Berry; T Utterback; K Tran; A Wolf; J Vamathevan; M Ermolaeva; O White; S L Salzberg; J C Venter; L Shapiro; C M Fraser; J Eisen
Journal:  Proc Natl Acad Sci U S A       Date:  2001-03-20       Impact factor: 11.205

8.  Complete genome structure of the nitrogen-fixing symbiotic bacterium Mesorhizobium loti.

Authors:  T Kaneko; Y Nakamura; S Sato; E Asamizu; T Kato; S Sasamoto; A Watanabe; K Idesawa; A Ishikawa; K Kawashima; T Kimura; Y Kishida; C Kiyokawa; M Kohara; M Matsumoto; A Matsuno; Y Mochizuki; S Nakayama; N Nakazaki; S Shimpo; M Sugimoto; C Takeuchi; M Yamada; S Tabata
Journal:  DNA Res       Date:  2000-12-31       Impact factor: 4.458

9.  Two enzymes of diacylglyceryl-O-4'-(N,N,N,-trimethyl)homoserine biosynthesis are encoded by btaA and btaB in the purple bacterium Rhodobacter sphaeroides.

Authors:  R M Klug; C Benning
Journal:  Proc Natl Acad Sci U S A       Date:  2001-05-01       Impact factor: 11.205

10.  A comparative genomics approach to prediction of new members of regulons.

Authors:  K Tan; G Moreno-Hagelsieb; J Collado-Vides; G D Stormo
Journal:  Genome Res       Date:  2001-04       Impact factor: 9.043

View more
  56 in total

1.  Two-component PhoB-PhoR regulatory system and ferric uptake regulator sense phosphate and iron to control virulence genes in type III and VI secretion systems of Edwardsiella tarda.

Authors:  Smarajit Chakraborty; J Sivaraman; Ka Yin Leung; Yu-Keung Mok
Journal:  J Biol Chem       Date:  2011-09-27       Impact factor: 5.157

2.  Mutations in Escherichia coli Polyphosphate Kinase That Lead to Dramatically Increased In Vivo Polyphosphate Levels.

Authors:  Amanda K Rudat; Arya Pokhrel; Todd J Green; Michael J Gray
Journal:  J Bacteriol       Date:  2018-02-23       Impact factor: 3.490

Review 3.  Global regulation by the seven-component Pi signaling system.

Authors:  Yi-Ju Hsieh; Barry L Wanner
Journal:  Curr Opin Microbiol       Date:  2010-02-18       Impact factor: 7.934

Review 4.  Genomes of the symbiotic nitrogen-fixing bacteria of legumes.

Authors:  Allyson M MacLean; Turlough M Finan; Michael J Sadowsky
Journal:  Plant Physiol       Date:  2007-06       Impact factor: 8.340

Review 5.  Comparative genomic reconstruction of transcriptional regulatory networks in bacteria.

Authors:  Dmitry A Rodionov
Journal:  Chem Rev       Date:  2007-07-18       Impact factor: 60.622

6.  An Iterative, Synthetic Approach To Engineer a High-Performance PhoB-Specific Reporter.

Authors:  Julie L Stoudenmire; Tara Essock-Burns; Erena N Weathers; Sina Solaimanpour; Jan Mrázek; Eric V Stabb
Journal:  Appl Environ Microbiol       Date:  2018-07-02       Impact factor: 4.792

7.  Phosphate Limitation Induces Drastic Physiological Changes, Virulence-Related Gene Expression, and Secondary Metabolite Production in Pseudovibrio sp. Strain FO-BEG1.

Authors:  Stefano Romano; Heide N Schulz-Vogt; José M González; Vladimir Bondarev
Journal:  Appl Environ Microbiol       Date:  2015-03-13       Impact factor: 4.792

8.  A bifunctional glycosyltransferase from Agrobacterium tumefaciens synthesizes monoglucosyl and glucuronosyl diacylglycerol under phosphate deprivation.

Authors:  Adrian Semeniuk; Christian Sohlenkamp; Katarzyna Duda; Georg Hölzl
Journal:  J Biol Chem       Date:  2014-02-20       Impact factor: 5.157

9.  Sinorhizobium meliloti phospholipase C required for lipid remodeling during phosphorus limitation.

Authors:  Maritza Zavaleta-Pastor; Christian Sohlenkamp; Jun-Lian Gao; Ziqiang Guan; Rahat Zaheer; Turlough M Finan; Christian R H Raetz; Isabel M López-Lara; Otto Geiger
Journal:  Proc Natl Acad Sci U S A       Date:  2009-12-14       Impact factor: 11.205

10.  The PhoB regulatory system modulates biofilm formation and stress response in El Tor biotype Vibrio cholerae.

Authors:  Syed Zafar Sultan; Anisia J Silva; Jorge A Benitez
Journal:  FEMS Microbiol Lett       Date:  2009-10-28       Impact factor: 2.742

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.