| Literature DB >> 15888209 |
Jian Song1, Carol A Bonner, Murray Wolinsky, Roy A Jensen.
Abstract
BACKGROUND: The TyrA protein family includes members that catalyze two dehydrogenase reactions in distinct pathways leading to L-tyrosine and a third reaction that is not part of tyrosine biosynthesis. Family members share a catalytic core region of about 30 kDa, where inhibitors operate competitively by acting as substrate mimics. This protein family typifies many that are challenging for bioinformatic analysis because of relatively modest sequence conservation and small size.Entities:
Mesh:
Substances:
Year: 2005 PMID: 15888209 PMCID: PMC1173090 DOI: 10.1186/1741-7007-3-13
Source DB: PubMed Journal: BMC Biol ISSN: 1741-7007 Impact factor: 7.431
Figure 1Composite of alternative biochemical routes from chorismate (CHA) to L-tyrosine (TYR) in nature. An antibiotic synthesis branch from CHA is also shown (dimmed). Here the intermediates shown to intervene between chorismate and pristinamycin or chloramphenicol are p-aminochorismate (ADC), p-aminoprephenate (ADP), p-aminophenylpyruvate (APP), and p-aminophenylalanine (APA). PPA may be transaminated by prephenate aminotransferase (PAT) to yield L-arogenate (AGN). The four TyrA homologs and the reactions they catalyze are colored differently. Arogenate dehydrogenase (TyrA) converts AGN to TYR. Alternatively, prephenate dehydrogenase (TyrA) converts PPA to 4-hydroxyphenylpyruvate (HPP) which is then transaminated to TYR via an homolog of TyrB, AspC, HisH, or Tat [49]. A broad-specificity cyclohexadienyl dehydrogenase (TyrA) is competent to catalyze either the TyrAor the TyrAreaction. PapC converts the 4-amino analog of PPA to the 4-amino analog of HPP. AroQ, AroH, and AroR are distinct homologs known to exist in nature for performance of the chorismate mutase reaction. Other abbreviations: AA, amino acid donor, KA, keto-acid accepter.
Abbreviations used to designate substrate specificities of tyrA/TyrA homologs
| Abbreviationa | ||
| Gene | Gene Product | Description of specificityb |
| TyrAx | Specificity for cyclohexadienyl substrate is unknown | |
| TyrAc | Broad-specificity cyclohexadienyl dehydrogenase (CDH) | |
| TyrAp | Narrow-specificity prephenate dehydrogenase (PDH) | |
| TyrAc_Δ | TyrAc_Δ | Broad-specificity cyclohexadienyl dehydrogenase having catalytic-core indels in correlation with an extra-core extension |
| TyrAa | Narrow-specificity arogenate dehydrogenase (ADH) | |
|
| NADTyrAa | TyrA homolog is AGN-specific and NAD+-specific |
|
| NADPTyrAa | TyrA homolog is AGN-specific and NADP+-specific |
| NAD(P)TyrAa | TyrA homolog is AGN-specific but utilizes either NAD+ or NADP+ | |
| x | xTyrAx | Specificity for both the cyclohexadienyl and pyridine nucleotide substrates is unknown |
aAbbreviations in the upper-table (upper 5 rows) indicate the specificities for the cyclohexadienyl substrate. Abbreviations in the lower-table (lower 4 rows) indicate specificities for both cyclohexadienyl (right subscripts) and pyridine nucleotide substrates (left subscripts). Combinations not shown can be deduced from the examples given, e.g., a TyrA homolog specific for prephenate and NAD+ would be designated NADTyrAp.
bThe abbreviations CDH, PDH, and ADH (shown parenthetically) have been used frequently in the literature.
Figure 2Phylogenetic tree for trimmed core domains of selected members of the TyrA Superfamily. Acronyms used for the various organisms are given in alphabetical order in Table 2. (A more extensive listing that includes organisms not shown in Fig. 2 and which also is hyperlinked to all of the individual GenBank records is given in Table S1. A similar table that also includes compilation of known and predicted substrate specificities is maintained at AroPath [73]. Lineages possessing experimentally established TyrA, TyrAp, TyrAc or TyrAc_Δ proteins are indicated by fill colors specified in the legend. Three specificity patterns for the pyridine nucleotide substrate are shown by line colors (see figure box). Although the cyanobacteria are depicted as having NADP+-specific TyrA proteins, some of them can also accept NAD, albeit to a lesser degree. All proteins having an aspartate residue homologous to D-32 of the E. coli NADTyrAc_Δ domain are presumed specific for NAD+. Fusion of TyrA domains with other catalytic domains is indicated within grey boxes (AroQ•TyrA, TyrA•AroF, HisHb•TyrA, and TyrA•AroQ•PheA•ACT) using the convention of a bullet to represent the interdomain area. The boxes overlap any relevant lineages. TyrA proteins having carboxy-terminal fusions with regulatory domains (TyrA•ACT and TyrA•REG) are also shown. The distance scale bar at the bottom left represents substitutions per site.
Curated TyrA amino-acid sequence files at AroPath [35]
| Complete TyrA sequences |
| Catalytic-core domainsa |
| Pyridine-nucleotide discriminator segmentsb |
| NAD+-specific |
| NADP+-specific |
| Broad specificity |
| Cyclohexadienyl-substrate core segments |
| Arogenate-specific (TyrAa) |
| Prephenate-specific (TyrAp) |
| Broad specificity |
| TyrAc |
| TyrAc_Δ |
| Pseudogene TyrA sequences |
aTrimmed free of N-terminal or C-terminal extensions, including any fusions with regulatory domains or other catalytic domains.
bHigh-glycine βαβ Rossmann fold at the N-terminus.
Figure 3Multiple alignment of the HMM consensus sequences obtained for different substrate-specificity groupings within cyclohexadienyl-substrate core segments (see Table 3). Invariant anchor residues are highlighted in yellow, conserved residues in grey. These consensus sequences will change continuously as corrections and refinements are made. The version shown was current as of April, 2005.
Figure 4Alignment of the N-terminal glycine-rich P-loop of TyrA•ACT proteins from the Class Actinobacteria. These are specific for L-arogenate as substrate, but fall into two groups with respect to the pyridine nucleotide co-substrate. The top NAD+-specific group possesses an aspartate (D) at position 32 (E. coli numbering), whereas the bottom NAD+/NADP+ group possesses an asparagine at the homologous position. Residue numbers are shown at the left. The species in the middle are color coded to match the hierarchical taxon positions obtained from NCBI. The variable loop of the Wierenga fingerprint [26], which in E. coli contains five residues (22–26), contains the minimal two residues in all of the Actinobacteria shown. The organisms on the right are color coded according to the taxonomic position indicated on the left (NCBI). The Rubrobacter xylanophilus TyrAa sequence is an orphan in the tree displayed in Fig. 2, as consistent with its outlying position in the taxonomy scheme.
Cyclohexadienyl substrates and inhibitors of TyrA proteins possess identical sidechains
| NADP+ | AGN | TYR | [25] | |
| NADP+ | AGN | TYR | [27, 28] | |
| NADP+ | AGN | None | [21] | |
| NAD(P)+ | AGN | None | [42, 43] | |
| NAD+ | PPAb | HPP | [21] | |
| NAD+ | PPA/AGN | HPP/TYR | [1] | |
| NAD+ | PPA/AGN | HPP/TYR | [17] | |
| NAD+ | PPA/AGN | None | [18] |
aAbbreviation: HPP, 4-hydroxyphenylpyruvate. bThis TyrAc enzyme has an overwhelming preference for PPA, but will use AGN poorly.
Figure 5Context of gene organization for tyrA, profiled against the 16S rRNA tree of the domain Bacteria. pheA, hisHb, tyrA, and aroF are color coded. Lineages typified by complete dispersal of aromatic-pathway genes are indicated by "GENE DISPERSAL". Gmet refers to Geobacter metallireducens; Dace refers to Desulfuromonas acetoxidans; and Ddes refers to Desulfovibrio desulfuricans. Consensus gene organizations are shown for the alpha and beta divisions of the Proteobacteria. The gamma division is subdivided to yield consensus gene organizations for the upper- and lower-gamma (enteric lineage) organisms. Genes that are adjacent and share a common transcriptional direction appear to reside in operons (or supraoperons). Any white spacing indicates substantial separation of the gene clusters shown in the genome. Genes of special interest are color coded, other genes of aromatic biosynthesis are shown in gray and all other genes are shown in white.
Figure 6Zoom-in from Fig. 5 showing tyrA synteny for the beta Proteobacteria and the upper-gamma Proteobacteria. The tree shown, based upon 16S rRNA sequences of the indicated organisms, indicates correct branching orders, but (to facilitate presentation) is not strictly correct in proportion. Circled numbers (in violet) indicate deduced evolutionary events for the beta Proteobacteria (see top of Table 5), whereas circled numbers (in pink; see bottom of Table 5) correspond to deduced evolutionary events for the upper-gamma Proteobacteria. Gene organizations of organisms indicated are shown on the right. The dotted outlining of some gene boxes in Coxiella burnetii and in P. syringae indicates pseudogene status.
Figure 7Zoom-in from Fig. 5 showing tyrA synteny for the lower-gamma Proteobacteria (enteric lineage). Deduced phylogenetic events numbered on the left are described in Table 6. The branching position for Buchnera is as suggested in ref. [7]. Dotted horizontal lines near the top of the tree indicate branch lengths that were shortened for convenience of presentation. Dotted outlining of boxes around some genes indicates their pseudogene status. It is unknown if the various open reading frame (ORF) insertions are functional.
Key to evolutionary events asserted in Figure 6
| Beta | 1 | Dispersal of |
| 2 | Complete dispersal of all nine genes originally in the | |
| 3 | Insertion of | |
| 4 | Expulsion of | |
| 5 | Fusion of | |
| 6 | Loss of | |
| Upper-Gamma | 1 | Insertion of |
| 2 | Translocation of | |
| 3 | Fusion of | |
| 4 | Loss of | |
| 5 | N-terminal deletion of • | |
| 6 | Separation of | |
| 7 | Insertion of 4 unknown genes between | |
| 8 | Loss of | |
aSince both Nitrosomonas (beta Proteobacteria) and Acidothiobacillus (upper-gamma Proteobacteria) emerge at deep positions in the tree of Fig. 5, an almost equally parsimonius possibility is that the ancestral serA was retained in this syntenic position in these two genera, but was transposed elsewhere shortly after early divergence.
Key to evolutionary events asserted in Figure 7
| 1 | Escape of |
| 2 | Fusion of |
| 3 | Dissociation of |
| 4 | Removal of all genes intervening between |
| 5 | Dissociation of |
| 6 | Dissociation of |
| 7 | Loss of |
| 8 | |
| 9 | |
| 10 | |
| 11 | |
| 12 | |
| 13 | |
| 14 | |
| 15 | All intervening genes between |
| 16 | |
| 17 | Insertion of |
| 18 | Insertion of ORF between |
| 19 | Insertion of ORF between |
Figure 8Distribution of modules of aromatic catabolism mapped on a 16S rRNA tree. In this figure, only presence or absence (not gene order) is indicated. The Phh module (orange) consists of phenylalanine hydroxylase (PhhA), carbinolamine dehydratase (PhhB), and tyrosine aminotransferase (not shown, see Text), and accomplishes the overall conversion of PHE to 4-hydroxyphenylpyruvate. The Hpd module (yellow) is 4-hydroxyphenylpyruvate dioxygenase, which converts 4-hydroxyphenylpyruvate to homogentisate. The Hmg module (blue) catalyzes the 3-step conversion of homogentisate to acetoacetate and fumarate. The distribution of PhhR and TyrR is shown in boxes. In some cases the HmgC member is shaded light blue to indicate that the gene encoding this isomerase could not be found and is probably encoded by an as yet unknown analog. Some long branches are drawn with gaps that represent 25% of the length of the scale bar.
Figure 9Protein tree of TyrR homologs. Nodes supported by bootstrap values of 998 or more are marked with solid circles, and the bootstrap values for nodes internal to these are shown. Generic names relevant to the organism abbreviations can be viewed in Fig. 8. A conserved region containing the σ54 contact motif GAFTGA is highlighted as an orange band. "Imperfect" residues in this region are shown in lower-case fonts. Residue numbers are shown at the right. TyrR and PhhR are regulators of σ70 and σ54 promoters, respectively. Four σ54 proteins of unknown function have very long branches, and to facilitate the visual presentation, the gaps in branch continuity shown represent a scale-bar distance of 0.1. Clades possessing σ54 regulators are indicated with blue stripes, and σ70 regulators are indicated with green stripes.
Putative attenuatorsa associated with tyrA
| ¬ | 4d | |
| ¬ | (11) | |
| ¬ | (11) | |
| ¬ | (11) | |
| ¬ | 4 | |
| ¬ | 5 | |
| ¬ | 4 | |
| ¬ | 4 | |
| ¬ | 4 | |
| ¬ORF > | ||
| ¬ | 4,(11) | |
| ¬ | 4 | |
| ¬ | ||
| ¬ | ||
| ¬ | 6 | |
| ¬ | 6 |
aAttenuators were extracted from the website of Merino [66]. Links are provided for viewing the complete data, including a visualization of the putative attenuator structures.
bThe symbol ¬ is used for attenuators. Genes encoding the alternative biochemical steps that were recently reported for formation of dehydroquinate from aspartate semialdehyde and ketohexose 1-phosphate [52] are designated aroA' and aroB'.
cRefers to figures within this manuscript or, if enclosed within parentheses, to the figure in ref. [7].
dSee the consensus gene organization for α Proteobacteria.
Key to organism acronyms
| Aferr | ||
| ACIN | ||
| Aact | ||
| Anae | ||
| Atei | ||
| Atum | ||
| Abal | ||
| Aori | ||
| ANAB | ||
| Atha | ||
| Aful | Aful_1 | |
| Avin | Avin_1 | |
| Bant | ||
| Bcer | ||
| Bhal | Bhal_2 | |
| Bste | ||
| Bsub | ||
| Bthu | ||
| Blon | Blon_1 | |
| Bflo | ||
| Bbro | ||
| Bcep | ||
| Bfun | ||
| Bmal | ||
| Bpse | Bpse_6 | |
| Cjej | ||
| Cvio | ||
| Cdip | ||
| Ceff | ||
| Cglu | Cglu_1 | |
| Ddes | ||
| Dvul | ||
| Dace | Dace_5 | |
| Efae_2 | ||
| Efae_1 | Efae_1 | |
| Ecar | ||
| Ecol | ||
| Gmet | ||
| Gsul | ||
| Gvio | ||
| Hinf | ||
| Hhep | ||
| Hpyl | ||
| Kpne | ||
| Lxyl | ||
| Linn | ||
| Lmon | ||
| Lcor | Lcor_3 | |
| Lesc | ||
| Mjan | ||
| Mkan | Mkan_1 | |
| Mbar | ||
| Mthe | Mthe_7 | |
| Mdeg | ||
| Mavi | ||
| Mbov | Mbov_2 | |
| Mlep | ||
| Mtub | ||
| Mxan | ||
| Ngon | ||
| Neur | ||
| Nfar | ||
| NONO | ||
| Npun | Npun_1 | |
| Naro | ||
| Oihe | ||
| Osat | ||
| Pagg | ||
| Pmul | ||
| Plum | ||
| Pmar_1 | Pmar_3 | |
| Pmar_2 | Pmar_10 | |
| Pacn | ||
| Paer | Paer_1 | |
| Pflu | ||
| Pput | ||
| Pstu | ||
| Reut | ||
| Rsol | ||
| Rcap | ||
| Rsph | ||
| Rpal | ||
| Rrub | Rrub_1 | |
| Rxyl | ||
| Scer | ||
| Styp | Styp_1 | |
| Spom | ||
| Sone | ||
| Spu | ||
| Saur | Saur_2 | |
| Sgor | ||
| Spne | ||
| Save | ||
| Scae | Scae_2 | |
| Scoe | Scoe_1 | |
| Slav | ||
| Spri | ||
| Sros | Sros_1 | |
| Stoy | ||
| Ssol | ||
| Stok | ||
| SYNE_1 | SYNE_1 | |
| SYNE_2 | ||
| SYNE_3 | SYNE_3 | |
| Tfus | ||
| Telo | ||
| Tery | ||
| Twhi | ||
| Vcho | ||
| Vpar | ||
| Wsu | ||
| Xcam | ||
| Xfas | ||
| Yent | ||
| Zmob |
aThe system of acronym usage is: the first letter (capital) is the first letter of the genus followed by the first three letters (lower-case) of the species. If there is no species designation the first four letters of the genus are used (all in capitals). Redundant 4-letter acronyms are distinguished by unique following numbers. See [74] for a comprehensive listing with hyperlinks to the Taxonomy database records and the GenBank records at NCBI.