| Literature DB >> 18187507 |
Alberto Sola-Landa1, Antonio Rodríguez-García, Alexander Kristian Apel, Juan F Martín.
Abstract
Expression of genes belonging to the pho regulon in Streptomyces coelicolor is positively regulated (as shown by comparing the wild-type and a DeltaphoP mutant) by binding of the response regulator PhoP to 11-nt direct repeats (DRus). These sequences have been found in over 100 genes of Streptomyces coelicolor; 20 of them were cloned and the binding of PhoP(DBD) to most of their promoters has been shown by electrophoretic mobility shift assays. Deletion experiments showed that at least two DRus are required for proper binding of PhoP(DBD). Deletion of 1 nt leaving a 10-nt direct repeat reduced drastically binding of PhoP(DBD). Three different types of operators have been identified. Complex operators (class III) contain up to six DRus, some of them with poor conservation of the 11-nt consensus sequence, which however were protected by PhoP(DBD) in footprinting analyses. A cooperative binding of PhoP(DBD) molecules initiated at conserved core DRus appears to be the mechanism involved in binding of several PhoP(DBD) monomers to those complex operators. The information theory-based model that incorporates the positive or negative contribution to the binding of PhoP(DBD) of adjacent sequences has been used to deduce the structure of PHO boxes and the relevance of each DRu.Entities:
Mesh:
Substances:
Year: 2008 PMID: 18187507 PMCID: PMC2275107 DOI: 10.1093/nar/gkm1150
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Promoters recognized by PhoP
| Promoter region | Regulated gene and function | Operator structure | Number of EMSA complexes | Strand | Distance to ATG | Distance to +1 | Profile | PhoP regulation | |
|---|---|---|---|---|---|---|---|---|---|
| A. Class I: operators with 2 well-conserved DRu | |||||||||
| SCO2286 | 1 | Coding | 128 | 103 | AA | >0.25 | Activation | ||
| SCO3790-1 | SCO3790, putative phosphatase | 1 | Coding | 79 | N.D. | 0D | 0.165 | Activation | |
| SCO4142 | (EE) | 1 | Coding | 114 | 39 | A0 | <0.000 | Activation | |
| SCO6169-70 | SCO6169, possible regulatory protein | 1 | Coding | 195 | N.D. | 00 | >0.25 | N.D. | |
| SCO6169-70 | SCO6170, putative oxidoreductase | 1 | Non-coding | 101 | N.D. | 00 | >0.25 | N.D. | |
| B. Class II: operators with 3 conserved DRu | |||||||||
| SCO0033-4 | SCO0034, unknown | 1 | Non-coding | 127 | N.D. | 0D | 0.034 | Activation | |
| SCO1906 | SCO1906, putative phosphatase | 1 | Coding | 92 | N.D. | 00 | > 0.25 | N.D. | |
| SCO4261-3 | SCO4261, possible response regulator | 1 | Non-coding | 43 | N.D. | aA | 0.088 | Repression | |
| C. Class III: operators of complex structure | |||||||||
| SCO1393-4 | SCO1394, possible glycosyl hydrolase | (E) | 2 | Non-coding | 30 | N.D. | A0 | 0.049 | Activation |
| SCO2262-3 | SCO2262, possible oxidoreductase | E | 2 | Coding | 70 | N.D. | 0A | 0.013 | Repression |
| SCO2465-6 | E | 2 | Coding | 80 | N.D. | 0A | 0.052 | Repression | |
| SCO5447 | SCO5447, putative metalloprotease | E | 3 | Non-coding | 214 | N.D. | A0 | 0.26 | Activation |
| SCO7697 | SCO7697, possible secreted phytase | 3 | Coding | 122 | N.D. | A0 | 0.001 | Activation | |
| SCO4228-9 | 2 | Coding | 109 | 38 | A0 | 0.143 | Activation | ||
| SCO4228-9 | 2 | Non-coding | 63 | 65 | A0 | <0.000 | Activation | ||
| SCO2878 | SCO2878, unknown | CCCEUES | 2-3 | Coding | 42 | N.D. | AD | <0.000 | Activation |
| SCO1196 | SCO1196, putative secreted protein | 2 | Non-coding | 83 | N.D. | A0 | <0.000 | Activation | |
| SCO1845 | ES[ | 4 | Coding | 48 | 3 | 00 | >0.25 | Activation/ repression | |
| SCO2068 | 4 | Non-coding | 10 | 10 | 00 | >0.25 | Activation/ repression | ||
| ES[ | Coding | ||||||||
| D. Unclassified operators | |||||||||
| SCO4878-9 | SCO4878, putative glycosyltransferase | 4 DRus | 1 | Coding | 40 | N.D. | 00 | >0.25 | N.D. |
| SCO4878-9 | SCO4879, hypothetical protein | 5 DRus | 1 | Coding | 30 | N.D. | A0 | 0.001 | Activation |
aEach letter symbolizes a DRu and denotes the functional class. Class ‘C’ comprises DRus that form the core of the operator; ‘E’ comes from ‘extension’, ‘EU’ from ‘extension unstable’ and ‘ES’ from ‘extension support’; see text for a complete explanation. The DRus that originated the information matrix are represented by underlined letters. In two operators, brackets enclose possible DRus that were not protected in the footprinting assays, but help to explain the number of EMSA complexes. Numbers in square brackets are the separation in base pairs between contiguous DRus.
bTranscription profiles of the response to phosphate limitation. The first letter summarizes the transcription response of the S. coelicolor wild-type strain, and the second, the response of the ΔphoP mutant. ‘A’ means augmented expression, ‘D’ means decreased expression and ‘0’, no detected change (P > 0.1). Data obtained from Rodríguez-García et al. (15).
cThese P-values refer to the interaction contrast (or contrast 5; see Rodríguez-García et al. (15) for details). This comparison quantifies the differences between the response to phosphate limitation of the S. coelicolor wild type and that of the mutant ΔphoP strain. Low P-values support the regulatory role of PhoP.
dThe type of the transcriptional control was inferred from the transcription profile or from the reference provided. In the cases of bidirectional promoter regions, which genes are controlled was decided from the transcriptomic results; exceptionally, there is no data for SCO2263, because no probe was present in the microarrays used.
eAccording to Apel et al. (28).
fAccording to Sola-Landa et al. (23).
gAccording to Rodríguez-García et al. (15).
hThe ORF SCO4262 was annotated as doubtful (6). Since it lies overlapping the putative promoter region of SCO4261, we considered that this ORF is not real.
iAccording to Santos-Beneit, F. unpublished data.
jThe fragment that contains both operators of the SCO4878–SCO4879 intergenic region, showed a unique complex in EMSA experiments. Since it is not possible to dissect the contribution of each operator to the mobility shift, only the number of DRus is indicated.
Figure 1.Promoter activity in S. coelicolor M145 (dark bars) and ΔphoP mutant (open bars) of phoRP, phoU and pstS using the xylE gene (encoding a catechol dioxygenase) as reporter, growing in R5 liquid medium with 40 µM phosphate. The activity in ΔphoP mutant is also represented in a larger scale (insets) for easier visualization. Vertical lines on top of each bar indicate SD values. No activity was detected with pstS promoter in the ΔphoP mutant.
Figure 2.Analysis by EMSA of P-1 (native pstS promoter), P-0 and P-00 promoters (with deletions of one or two DRus, respectively) using increasing concentrations of GST-PhoPDBD protein. Lane C, control without protein; 1, 2, 3 and 4 reactions with 3.12, 6.25, 12.5 and 25 pmol of protein, respectively. The two shift bands of the pstS P-1 promoter are indicated by arrows. Note that there is no band in the gels with P-0 or P-00 promoters.
Figure 3.Analysis by EMSA of the binding of GST-PhoPDBD to several promoters studied in the present work. Lane C, control without protein; 1, 3.12 pmol of protein; 2, 6.25 pmol; 3, 12.5 pmol; 4, 25 pmol of protein. The phoRP promoter was used as control. Only promoters giving positive shifts are shown. Promoter names are indicated under the picture. In the right side of every shift, a scheme is represented explaining the probable DRus structure in the bands detected. C, E and EU correspond with the different types of DRu, Core, Extension and Unstable (see text for explanation). Boxes presumably bound to PhoP are in grey and free boxes (not bound to PhoP) are in white.
Figure 4.(A) Analysis of binding of PhoPDBD to the SCO1854 and SCO1993 promoters. The native promoters were mutated (top) and the unmatching bases were replaced by consensus ones. Changed bases are shown in boldface and underlined. The new promoters are named with the same SCO number and an asterisk. Lower panels, EMSA using GST-PhoPDBD protein. Lane C, control without protein; 1, 3.12 pmol of protein; 2, 6.25 pmol; 3, 12.5 pmol; 4, 25 pmol of protein. Note that at least a shift band (indicated by arrows) can be seen in mutated promoters, even with the lowest protein concentration (lane 1), whereas no band is detected in the wild-type promoters with the highest concentration (lane 4). (B) Analysis of the native pstS promoter and a mutant promoter containing a 10-nt DRu. The base deleted between the two DRus is shown in boldface and underlined (top). The new promoter is named pstS*. Note the drastic reduction in binding to the mutant promoter.
Figure 5.Sequence logo of the direct repeats of 11 nt that form the PhoP-binding-site unit. This logo corresponds to model I that comprises 37-DRus of Streptomyces coelicolor (see Materials and Methods section). The height of each letter is proportional to the frequency of the base, and the height of the letter stack is the conservation in bits at that position (25). Note the high conservation of the first seven positions of the DRu. Error bars are shown at the top of the stacks. The total information (Rsequence) for the DRu is 8.08 ± 0.61 bits. The sine wave represents the accessibility of a face of the DNA (B-form, 10.6 bases of helical pitch) with the major groove centred at position 4. C in position 1 or an A or a C at position 2 have a clear negative effect on PhoP binding.
Figure 6.(A) Individual information analysis of the PhoP-binding sites, identified in this and previous works, using the sequence walker method (29). Each binding site is formed with DRus of 11 nt, which are marked by boxes on its top. Each box contains the individual information (Ri) in bits, and is filled with tones of grey in a scale of 10% steps. Thus, the box of a DRu with a Ri value in the range of 90–100% of the highest possible Ri content (14.63 bits, the Ri of the consensus sequence), are coloured in black; boxes of Ri ≤ 1.46 bits, are shown in white. Hypothetical DRu, which were not comprised in the footprinting-protected area, are represented by boxes with dashed lines. The height of the letters is the information content in bits, and represents the contribution of each base to the conservation of the sequence. Letters extending downward represent unfavourable contacts. The walker limits are 2 bits, which is also the top of the sine wave, and −3 bits at the bottom; 0 bits is at the middle of the walker. If a base does not occur in the set of sequences that forms the model (model I), a black box is given. A grey box indicates that the upside-down letter extends beyond the lower limit. For simplicity, in this figure the sine wave has a periodicity of 11 bases. (B) Analysis of the promoter-region sequences that resembled PhoP-binding sites, but failed to bind PhoP. The mutagenized SCO1854* and SCO1993* sequences are shown below the wild-type sequences. Information scores (bits) of the sequences are indicated.