| Literature DB >> 28348857 |
Semen A Leyn1, Inna A Suvorova1, Alexey E Kazakov2, Dmitry A Ravcheev3, Vita V Stepanova1, Pavel S Novichkov2, Dmitry A Rodionov4,1.
Abstract
Comparative genomics approaches are broadly used for analysis of transcriptional regulation in bacterial genomes. In this work, we identified binding sites and reconstructed regulons for 33 orthologous groups of transcription factors (TFs) in 196 reference genomes from 21 taxonomic groups of Proteobacteria. Overall, we predict over 10 600 TF binding sites and identified more than 15 600 target genes for 1896 TFs constituting the studied orthologous groups of regulators. These include a set of orthologues for 21 metabolism-associated TFs from Escherichia coli and/or Shewanella that are conserved in five or more taxonomic groups and several additional TFs that represent non-orthologous substitutions of the metabolic regulators in some lineages of Proteobacteria. By comparing gene contents of the reconstructed regulons, we identified the core, taxonomy-specific and genome-specific TF regulon members and classified them by their metabolic functions. Detailed analysis of ArgR, TyrR, TrpR, HutC, HypR and other amino-acid-specific regulons demonstrated remarkable differences in regulatory strategies used by various lineages of Proteobacteria. The obtained genomic collection of in silico reconstructed TF regulons contains a large number of new regulatory interactions that await future experimental validation. The collection provides a framework for future evolutionary studies of transcriptional regulatory networks in Bacteria. It can be also used for functional annotation of putative metabolic transporters and enzymes that are abundant in the reconstructed regulons.Entities:
Keywords: Proteobacteria; amino acid metabolism; comparative genomics; transcription factor
Mesh:
Substances:
Year: 2016 PMID: 28348857 PMCID: PMC5343134 DOI: 10.1099/mgen.0.000061
Source DB: PubMed Journal: Microb Genom ISSN: 2057-5858
Statistics for the studied TF regulons in Proteobacteria
Initially, we studied 21 TFs that are present in E. coli and/or Shewanella species and that are conserved in five or more taxonomic groups of Proteobacteria. Additionally, we studied TFs that represent non-orthologous replacements of the initial set of TFs in some taxonomic groups (marked by an asterisk).
| TF† | Protein | Metabolic pathways controlled by a TF | Genomes | TFBSs, | Genes, | Genes, |
|---|---|---|---|---|---|---|
| ArgR | ArgR | Arginine metabolism | 62 (6) | 1079 | 1223 | 19.7 |
| BioR* | GntR | Biotin biosynthesis | 13 (2) | 34 | 59 | 4.5 |
| BirA | BirA | Biotin biosynthesis | 94 (11) | 185 | 495 | 5.3 |
| FabR | TetR | Fatty acid biosynthesis | 74 (10) | 361 | 392 | 5.3 |
| FadP* | TetR | Fatty acid degradation | 25 (3) | 194 | 448 | 17.9 |
| FadR | GntR | Fatty acid degradation | 61 (6) | 374 | 423 | 6.9 |
| GlcC | GntR | Glycolate utilization | 23 (7) | 83 | 133 | 5.8 |
| HexR | RpiR | Central carbohydrate metabolism | 87 (11) | 897 | 1178 | 13.5 |
| HmgQ* | LysR | Tyrosine degradation | 17 (2) | 35 | 50 | 2.9 |
| HmgR* | IclR | Tyrosine degradation | 5 (1) | 12 | 24 | 4.8 |
| HmgS* | MarR | Tyrosine degradation | 3 (1) | 6 | 9 | 3.0 |
| HutC | GntR | Histidine utilization | 113 (18) | 386 | 857 | 7.6 |
| HypR | GntR | Proline/4-hydrohyproline utilization | 44 (12) | 190 | 265 | 6.0 |
| LiuQ* | TetR | Branched-chain amino acid utilization | 14 (3) | 54 | 73 | 5.2 |
| LiuR | MerR | Branched-chain amino acid utilization | 103 (15) | 671 | 1411 | 13.7 |
| LldR | GntR | Lactate utilization | 55 (13) | 146 | 241 | 4.4 |
| MetJ | MetJ | Methionine metabolism | 62 (6) | 1026 | 857 | 13.8 |
| MetR | LysR | Methionine metabolism | 117 (14) | 570 | 480 | 4.1 |
| NadR* | NadR | NAD biosynthesis | 11 (1) | 27 | 35 | 3.2 |
| NadQ* | NadQ | NAD biosynthesis | 30 (7) | 67 | 109 | 3.6 |
| NagC | ROK | 31 (5) | 304 | 431 | 13.9 | |
| NagQ* | GntR | 31 (10) | 81 | 225 | 7.3 | |
| NagR* | LacI | 25 (4) | 168 | 288 | 11.5 | |
| NrdR | NrdR | Deoxyribonucleotide biosynthesis | 186 (20) | 638 | 591 | 3.2 |
| NrtR | NrtR | NAD biosynthesis | 28 (11) | 75 | 96 | 3.4 |
| NtrC | Fis | Nitrogen assimilation | 169 (19) | 921 | 1804 | 10.7 |
| PdhR | GntR | Pyruvate metabolism | 55 (6) | 200 | 399 | 7.3 |
| PsrA | TetR | Fatty acid degradation | 76 (12) | 673 | 845 | 11.1 |
| RutR | TetR | Pyrimidine catabolism | 68 (13) | 273 | 743 | 10.9 |
| SahR* | ArsR | Methionine metabolism | 62 (9) | 156 | 266 | 4.3 |
| SamR* | ArsR | Methionine metabolism | 4 (1) | 17 | 30 | 7.5 |
| TrpR | TrpR | Aromatic amino acid metabolism | 53 (8) | 142 | 314 | 5.9 |
| TyrR | TyrR | Aromatic amino acid metabolism | 67 (7) | 618 | 896 | 13.4 |
‡Number of genomes and taxonomic groups (in parentheses) of Proteobacteria that contain the studied TF regulon. The detailed distribution of regulons and regulogs is provided in Fig. S2.
§Total number of candidate TFBSs and TF-regulated genes (target genes) in all studied genomes.
||Average number of candidate TFBSs per genome.
Fig. 1.Taxonomic distribution of 33 studied TF regulons in four major classes of Proteobacteria. Circles include the number of TFs that are either taxonomic class-specific or shared between several classes.
Fig. 2.Examples of motif changes in four orthologous groups of studied TFs. Conservative positions between motifs within groups are boxed with the same color.
Fig. 3.Arginine metabolism and its regulation by ArgR in Proteobacteria.
Fig. 4.Aromatic amino acid metabolism and its regulation by TyrR, TrpR and other transcription factors in Proteobacteria.
Fig. 5.Hydroxyproline and proline utilization pathway and its regulation by HypR in Proteobacteria.
Fig. 6.Histidine metabolism and its regulation by HutC in Proteobacteria.