| Literature DB >> 31435161 |
Abstract
MocR bacterial transcriptional regulators are a subfamily within the GntR family. The MocR proteins possess an N-terminal domain containing the winged Helix-Turn-Helix (wHTH) motif and a C-terminal domain whose architecture is homologous to the fold type-I pyridoxal 5'-phosphate (PLP) dependent enzymes and whose archetypical protein is aspartate aminotransferase (AAT). The ancestor of the fold type-I PLP dependent super-family is considered one of the earliest enzymes. The members of this super-family are the product of evolution which resulted in a diversified protein population able to catalyze a set of reactions on substrates often containing amino groups. The MocR regulators are activators or repressors of gene control within many metabolic pathways often involving PLP enzymes. This diversity implies that MocR specifically responds to different classes of effector molecules. Therefore, it is of interest to compare the AAT domains of MocR from six bacteria phyla. Multi dimensional scaling and cluster analyses suggested that at least three subgroups exist within the population that reflects functional specialization rather than taxonomic origin. The AAT-domains of the three clusters display variable degree of similarity to different fold type-I PLP enzyme families. The results support the hypothesis that independent fusion events generated at least three different MocR subgroups.Entities:
Keywords: Aspartate aminotransferase; Cluster analysis; MocR; Multidimensional scaling analysis; Pyridoxal 5'-phosphate; Structural bioinformatics
Year: 2019 PMID: 31435161 PMCID: PMC6677901 DOI: 10.6026/97320630015151
Source DB: PubMed Journal: Bioinformation ISSN: 0973-2063
Distribution of the taxonomic origin of AAT-like domain sequences at 50% pairwise sequence identity level
| Phylum | counts |
| Actinobacteria | 248 |
| Alpha proteobacteria | 260 |
| Bacteroidetes | 57 |
| Beta proteobacteria | 143 |
| Firmicutes | 369 |
| Gamma proteobacteria | 254 |
Reference MocRs
| Databank accession number | Labela | Functionb | Source organism |
| P49309 | MocR | Probable rhizopine catabolism regulatory protein | Rhizobium meliloti |
| D5AKX9 | TauR | Transcriptional activator, which is essential for taurine-dependent expression of the tpa-tauR-xsc operon | Rhodobacter capsulatus |
| P94426 | GabR | Activates the transcription of the gabTD operon. | Bacillus subtilis |
| Q8NS92 | PdxR | regulatory function in pyridoxine biosynthesis | Corynebacterium glutamicum |
| Q2YUS3 | NorG | Positively regulates the expression of the NorB efflux pump and negatively regulates the expression of the AbcA efflux pump | Staphylococcus aureus |
| P40193 | PtsJ | Transcriptional repressor of the pdxK gene | Salmonella typhimurium |
| C0ZDG2 | DdlR | Transcriptional regulator of D-alanyl-D-alanine ligase | Brevibacillus brevis |
| A6T5K2| | YczR | Putative transcriptional regulator of the expression of YczE genes | Klebsiella pneumoniae |
| WP_104279468 | VatR2 | Regulates expression of virulence factors, membrane and secreted proteins, and signal transducing proteins | Clavibacter michiganensis |
| YP_002824530 | EhuR | Negative regulation of Ectoine uptake and catabolism | Sinorhizobium meliloti |
| AMR55826 | EnuR | Transcriptional regulation of ectoine catabolism | Ruegeria pomeroyi |
Figure 1Three-dimensional representation of sequence space obtained with the multidimensional scale analysis implemented in the R package "bios2mds" of the aligned AAT-like domain. 3D space is defined by the first three components of the MDS (PC1, PC2, PC3). Distances were based on the pairwise BLOSUM30 alignment score. Colours indicate phylum: Actinobacteria green dot; Firmicutes red dot; Alphaproteobacteria cyan dot; Betaproteobacteria orange dot; Gammaproteobacteria blue dot; Bacteroidetes grey dot
Figure 2Bivariate plot visualizing the clustering of the AAT-domain sequences. Clusters are delimited by ellipses and the corresponding members denoted by different symbols. Labels denote the positions of the reference MocRs.
Taxonomic composition of the three subgroups as resulting from K-means clustering
| Cluster name | Sequence counts | Reference MocRsa | Actinobacteria | Alphab | Bacteroidetes | Betac | Gammad | Firmicutes |
| GabR | 535 | GabR | 106 | 99 | 22 | 45 | 76 | 187 |
| PdxR | ||||||||
| TauR | ||||||||
| MocR | ||||||||
| VatR2 | ||||||||
| PtsJ | 555 | PtsJ | 132 | 45 | 34 | 76 | 123 | 145 |
| NorG | ||||||||
| DdlR | ||||||||
| YczR | ||||||||
| EnuR | 242 | EnuR | 10 | 116 | 1 | 23 | 55 | 37 |
| EhuR |
Pfam family hits retrieved by the Profile HMM calculated for each cluster, after a HMMsearch over the Pfam-A databank
| Pfam hitsc | |||
| Pfam codesa | GabRb | PtsJb | EnuRb |
| PF00155.21 Aminotran_1_2 (Aminotransferase class I and II) | 98.47 | 78.96 | 82.81 |
| PF12897.7 Aminotran_MocR (Alanine-glyoxylate aminotransferase) | 1.43 | 0.98 | 1.07 |
| PF01053.20 Cys_Met_Meta_PP (Cys/Met metabolism PLP-dependent enzyme family) | 0 | 2.77 | 2.54 |
| PF00266.19 Aminotran_5 (Aminotranferase class V) | 0.09 | 5.64 | 2.9 |
| PF00202.21 Aminotran_3 (Aminotransferase class III) | 0 | 2.9 | 1.53 |
| PF01041.17 DegT_DnrJ_EryC1 (DegT/DnrJ/EryC1/StrS aminotransferase family) | 0.02 | 7.37 | 6.35 |
| PF01276.20 OKR_DC_1 (Group III pyridoxal-dependent decarboxylases) | 0 | 0.8 | 2.48 |
| PF01212.21 Beta_elim_lyase (Beta-eliminating lyase) | 0 | 0.59 | 0.31 |
Figure 3Logos comparison calculated through the Web site "Seq2Logo" for the block portions of GabR, PtsJ, and EnuR clusters (indicated by red boxes). Residues are represented by their one-letter code and coloured according to chemical properties. Labels indicate the sites discussed in the text according to the GabR 5x03 sequence numbering framework.
Figure 4: Three-dimensional structure of GabR from Bacillus subtilis in complex with PLP and γ-aminobutyric acid (PDB code 5x03). Transparent grey ribbon depicts the entire 5x03 monomer while the cyan segments denote the portions corresponding to the AAT-domain blocks. Lower panel displays a detail of the active site region where residues discussed in text are represented by stick models and labelled according to 5x03 numbering system. Cyan and grey stick residues mark the positions differing in the three MocR clusters and those discussed in the text, respectively. Yellow stick model represents the PLP bound to γ-aminobutyrate.
Sequence sites differing among clusters
| Positiona | GabRb | GabRc | PtsJc | EnuRc | Functiond |
| 205 | Tyr | Tyr/Phe | Tyr/Phe | Tyr/Phe | Stacking with PLP pyridine ring |
| 207 | Arg | Gly | Gly/Asn | Gly/Asn | Salt-bridge to GABA carboxylate |
| 248 | His | His/Arg | Hphobic | Hphobic | Interaction with His400 |
| 250 | Phe | Phe/Tyr | Asn | Asn | Interaction with PLP |
| 260 | Arg | Arg | Variable | Arg | First turn of helix 257-271 |
| 261 | Arg | Arg | Variable | Arg | ' |
| 281 | Tyr | Tyr | Val/Ile | Val/Ile | Stacking with PLP |
| 282 | Asp | Asp | Hphobic | Tyr | Interaction with position 248 and 400 |
| 284 | Glu | Asp/Glu | Asp/Glu | Hphobic | Interact with Tyr360 |
| 285 | Phe | Phe | Ile | Ile | Interact with Glu284 |
| 362 | Lys | Lys | Variable | Variable | Basic surface patch on the helix connecting the two domains |
| 363 | His | His | Variable | Hphobic | " |
| 365 | Lys | Lys/Arg | Variable | Variable | " |
| 366 | Lys | Lys/Arg | Variable | Variable | " |
| 368 | Lys | Lys/Arg | Arg/Variable | Arg/variable | " |
| 400 | His | His | Variable | His | Interaction with Asp282 |
| 446 | Ile | Hphobic | Arg | Arg | Interaction with His400 |