| Literature DB >> 14641908 |
David J Studholme1, Richard N Pau.
Abstract
BACKGROUND: The transition metal molybdenum is essential for life. Escherichia coli imports this metal into the cell in the form of molybdate ions, which are taken up via an ABC transport system. In E. coli and other Proteobacteria molybdenum metabolism and homeostasis are regulated by the molybdate-responsive transcription factor ModE.Entities:
Mesh:
Substances:
Year: 2003 PMID: 14641908 PMCID: PMC317290 DOI: 10.1186/1471-2180-3-24
Source DB: PubMed Journal: BMC Microbiol ISSN: 1471-2180 Impact factor: 3.605
Figure 2Domain architectures of some example members of the HTH_9 family as defined by Pfam (Bateman et al., 2002). The domains are described in the text. A. Q9I631 (Pseudomonas aeruginosa), Q88AA0 (Ps. syringae), Q88QX4 (Ps. putida), Q9CMR6 (Pasteurella multocida), MODE_HAEIN (Haemophilus influenzae), Q8Z8A6 (Salmonella Typhi), Q8ZQR8 (Salmonella Typhimurium), MODE_ECOLI (E. coli), MODE_YERPE (Yersinia pestis), Q8EAN6 (Shewanella oneidensis), Q8XQR8 (Ralstonia solanacearum), Q8KC82 (Chlorobium tepidum), MODE_AZOVI (Azotobacter vinelandii), Q9ABA4 (Caulobater crescentus), MOPA_RHOCA (Rhodobacter capsulatus), MOPA_RHOCA (Rhodobacter capsulatus), Q9F4K4 (Herbaspirillum seropedicae). B. Q8PWN4 (Methanosarcina mazei), and Q8TTZ2 (M. acetivorans). C. Q9RBF7 (Alcaligenes eutrophus), Q8XXM1 (Ralstonia solanacearum), and Q8ZZY3 (Pyrobaculum aerophilum). D. Q9PMF6 (Campylobacter jejuni). E. Q97Z66 (Sulfolobus solfataricus), Q97ET9 (Sulfolobus tokodaii), Q8ZYE6 (P. aerophilum), O29240 (Archaeglobus fulgidus), Q8TVF9 (Methanopyrus kandleri), and Q98KI4 (Rhizobium loti).
Figure 1Alignment of the helix-turn-helix (HTH) domains homologous to the molybdenum-responsive transcriptional regulator ModE. Amino acid residues predicted to directly interact with the target DNA are marked with asterisks (*). The alignment was taken from the Pfam database .
Figure 3Alignment of known and strongly suspected ModE-binding sites used to generate the weight matrix.
Figure 4Sequence logo representation of the alignment of known and strongly suspected ModE-binding in Figure 3. A graphic representation of an aligned set of binding sites. The relative heights of the letters are proportional to the frequencies of bases at each position. The degree of sequence conservation is measured in bits of information and is indicated by the total height of a stack of letters. The vertical scale is in bits, with a maximum of 2 bits possible at each position. The logo was generated using WebLogo [27].
Figure 5Position-specific weight matrix derived from the alignment of known and strongly suspected ModE-binding sites used to generate the weight matrix in Figure 1.
Potential ModE-binding sites identified by scanning archaeal genome sequences against the weight matrix derived from alignment in Figure 3. Sites that fall immediately upstream of genes implicated in molybdenum metabolism are indicated in bold. Scores are Kullback-Leibler distances that have been normalised such that the maximum possible score is 100. Essentially, the higher the score is, the greater the magnitude of the theoretical binding energy [26]. All sites scoring more than 75 are listed. Distances are given (number of bases) between the downstream end of the putative ModE-binding site and the predicted translational start codon.
| Organism | Sequence | Score | Distance | Annotation of potential target gene |
| TAATGTTATATATCTTAATAAATAACTTT | 78 | 524 | MM1328 Two component response regulator | |
| TAACGATATATTAATAATTAGATTTAGAT | 78 | 299 | MM1783 hypothetical protein | |
| TATCCATATACTAATGATTATATATCCAT | 78 | 69 | MM1790 conserved protein | |
| AATATTTATATAGTACACAATATATCAAT | 78 | 39 | MM2248 conserved protein | |
| CACCCATATATTAGTTTATAAATAACGCT | 78 | 948 | MM2931 hydrolase | |
| GAAGCGTATATATAAGAGTATATAAAGAG | 78 | 53 | MM3131 Fructokinase | |
| TTTCTCTCTATACTAGTCTACAGATAGAG | 78 | 191 | MM3331 conserved protein | |
| GAACAATATATCGTTAACAATATATCCAT | 77 | 48 | MM0378 sugar phosphate nucleotydyl transferase | |
| AATCTTTATATCATGGAATTTATCGTGAA | 77 | 290 | MM0957 Ammonium transporter | |
| CATCTTTTTATACTTCCCTATATACTTAA | 77 | 31 | tlpC MM1658 methyl accepting chemotaxis protein | |
| AAAACATATATAGAAACATACATATCGCA | 77 | 153 | MM2159 hypothetical protein | |
| TCTCGTTATATTTCTATTTATATATATTT | 77 | 344 | MM2777 Acylphosphatase | |
| TTTCGCTGTATATCTTATTTCATAAATAG | 76 | 1436 | MM0016 translation initiation factor 1A | |
| TGACAGTATATAAATTAATAGTTAGAGAT | 76 | 76 | MM0211 Cysteine proteinase | |
| AGTCATTATCTATATCAATATAAATAGTT | 76 | 527 | MM0338 putative phosphomethylpyrimidine kinase | |
| AAACATTAAATATTTTAATACATAGTTAT | 76 | 39 | MM0659 GDP mannose 4,6 dehydratase | |
| TTTAGATTTATATAAGAATTTAAAACTAT | 76 | 276 | MM1157 conserved protein | |
| TATCGGGATATTCTATGGATTATATCGAA | 76 | 13 | MM1281 conserved protein | |
| AAACGTTATATACAAGCGAATATGAGTAT | 76 | 156 | MM1355 conserved protein | |
| CATCAATAAATTAATTTCTATATAAGGTA | 76 | 30 | MM1847 hypothetical protein | |
| GATTGATATATAATTATTTTCAAACTGAT | 76 | 25 | iorA MM2093 Indolepyruvate oxidoreductase, subunit | |
| AATTAATATATGATGGATTTTATATAGAT | 76 | 290 | MM2747 hypothetical protein | |
| TAAATATATATAAATAGAAATATAACGAG | 76 | 101 | MM2776 hypothetical protein | |
| AAGCGTTATTTATAAACTAATATATGGGT | 76 | 35 | MM2932 conserved protein | |
| AATCGCTCTATAGAAGTAAACTGAGCGGA | 76 | 377 | MM2945 Mannosyltransferase | |
| AAACTTTATATATTAAAATATAAATAAAA | 76 | 499 | MM3203 hypothetical protein | |
| TAAAGTTATTTATTAGTATACATAACTAT | 80 | 764 | cheY2 MA0016 chemotaxis response regulator | |
| AAATGAAATATATATATATAGATAACGAT | 80 | 185 | MA0724 predicted protein | |
| TTTCAATATATATTTCAATATATAAAATA | 80 | 378 | vhtG MA1146 F420 nonreducing hydrogenase | |
| AATCTGTATATAATTAACCAGATAGAGTT | 80 | 9 | MA2899 conserved hypothetical protein | |
| CACCGTTATGTTTAAATAAACATAACGAC | 79 | 1483 | MA0303 conserved hypothetical protein | |
| ATTAATTATATAAATGTGTATATAAATAT | 79 | 1375 | hypC MA1140 hydrogenase expression/formation protein | |
| GAACCTTATATATTTTTCTACAGAGAGCT | 79 | 114 | MA3892 hypothetical protein (multi domain) | |
| AGTGGCTATATTTAGCTATATATAACAAA | 79 | 710 | MA3957 ABC transporter, ATP binding protein | |
| ACTCGATATATTATTCAACGAATAGTGAT | 78 | 118 | MA1301 predicted protein | |
| ATCCGTTATGTATGAATGAACATAACGTT | 78 | 27 | MA1663 predicted protein | |
| TATCTTTATGTTTATCCGAACATATCGAT | 78 | 6 | MA4536 ABC transporter, solute binding protein | |
| ATTTACTTTATATCTGTATATATATTGAA | 77 | 529 | MA0519 conserved hypothetical protein | |
| TATCGTTATCTATATATATATATTTCATT | 77 | 114 | MA0725 conserved hypothetical protein | |
| CTATTTTATATATTGAAATATATATTGAA | 77 | 122 | MA1145 hypothetical protein (multi domain) | |
| GTTCCTTTTATATTGCAAATCATAACGTT | 77 | 43 | MA2861 response regulator receiver | |
| AAACGTTATATACAAGCGGATATGAGTAT | 76 | 147 | MA0056 conserved hypothetical protein | |
| AATTCTTTTATATAAATCCATATAACGGT | 76 | 130 | MA0459 conserved hypothetical protein | |
| TACCGTTATATGGATTTATATAAAAGAAT | 76 | 229 | MA0458 predicted protein | |
| CTAGGTTATATAACAGAAATCATAAAGAG | 76 | 17 | MA1630 sensory transduction histidine kinase | |
| TTACGATATATATAAATTTATCTAAAAAA | 76 | 87 | MA1757 conserved hypothetical protein | |
| ATTTTTTAGATAAATTTATATATATCGTA | 76 | 441 | MA1756 cell surface protein | |
| ATCCATTAGATACAAATATTTATATAGAA | 76 | 1467 | MA3192 conserved hypothetical protein | |
| TATATTTATATAAAAATCAACATATCTAT | 76 | 555 | cpa MA3604 carboxypeptidase A | |
| GTACAGTATATATTTTAAAATATAGTTAT | 76 | 507 | atpH MA4152 H(+) transporting ATP synthase, subunit H |
Frequencies of matches to the ModE-binding consensus in several complete genomes. The number of sites scoring greater than 75 for similarity to the ModE-binding consensus was counted for each of the complete genome sequences. Note that the consensus sequence is a nearly palindromic, so only sites on on one strand were counted. Matches were designated intragenic if the center of the 28 base site was located within a protein-coding region and intergenic otherwise.
| Organism | Number of intergenic matches (number implicated in molybdenum metabolism) | Number of intragenic matches | Genome Size (Mb) |
| 2 (1) | 0 | 2.84 | |
| 1 (1) | 1 | 9.11 | |
| 1 (0) | 13 | 1.64 | |
| 2 (1) | 4 | 2.15 | |
| 17 (5) | 9 | 4.64 | |
| 4 (0) | 10 | 1.70 | |
| 9 (3) | 13 | 1.83 | |
| 6 (2) | 14 | 1.80 | |
| 0 (0) | 10 | 1.67 | |
| 0 (0) | 6 | 1.64 | |
| 4 (2) | 2 | 7.04 | |
| 28 (4) | 3 | 5.75 | |
| 28 (2) | 6 | 4.10 | |
| 13 (3) | 11 | 2.26 | |
| 3 (1) | 0 | 6.26 | |
| 3 (1) | 1 | 6.18 | |
| 3 (1) | 0 | 6.4 | |
| 1 (1) | 0 | 5.80 | |
| 16 (6) | 6 | 4.81 | |
| 15 (4) | 7 | 4.86 | |
| 10 (2) | 12 | 4.97 | |
| 20 (5) | 5 | 4.61 | |
| 2 (0) | 1 | 3.65 | |
| 19 (5) | 4 | 4.65 |
Figure 6Formula used to calculate , the Kullback-Leibler distance [26], where is the position within the site, is the frequency of that base in the genome, and is the observed frequency of each base at that position (from the weight matrix). Values for were calculated from the percentage G+C content of the genome sequence.