| Literature DB >> 24148833 |
Teresa Milano, Alessandro Paiardini, Ingeborg Grgurina, Stefano Pascarella1.
Abstract
BACKGROUND: Pyridoxal 5'-phosphate (PLP)-dependent enzymes of fold type I, the most studied structural class of the PLP-dependent enzyme superfamily, are known to exist as stand-alone homodimers or homotetramers. These enzymes have been found also embedded in multimodular and multidomain assembly lines involved in the biosynthesis of polyketides (PKS) and nonribosomal peptides (NRPS). The aim of this work is to provide a proteome-wide view of the distribution and characteristics of type I domains covalently integrated in these assemblies in prokaryotes.Entities:
Mesh:
Substances:
Year: 2013 PMID: 24148833 PMCID: PMC3870968 DOI: 10.1186/1472-6807-13-26
Source DB: PubMed Journal: BMC Struct Biol ISSN: 1472-6807
List of structures of PLP-dependent type-I domains utilized for the calculation of the PLP_domain profile
| 1. 1AX4 | Tryptophanase | 2.10 | |
| 2. 1B9H | 3-amino-5-hydroxybenzoate synthase | 2.00 | |
| 3. 1BJ4 | Serine hydroxymethyltransferase | 2.65 | |
| 4. 1BJN | Phosphoserine aminotransferase | 2.29 | |
| 5. 1BJW | Aspartate aminotransferase | 1.80 | |
| 6. 1BS0 | 8-amino-7-oxononanoate synthase | 1.65 | |
| 7. 1C7N | Cystalysin | 1.90 | |
| 8. 1CL1 | Cystathionine beta-lyase | 1.83 | |
| 9. 1D2F | MalY protein | 2.50 | |
| 10. 1DGD | Dialkylglycine decarboxylase | 2.80 | |
| 11. 1DTY | Adenosylmethionine aminotransferase | 2.14 | |
| 12. 1ECX | NifS-like protein | 2.70 | |
| 13. 1ELQ | L-cysteine/L-cystine C-S lyase | 1.80 | |
| 14. 1FG3 | Histidinol-phosphate aminotransferase | 2.20 | |
| Alanine-glyoxylate aminotransferase | 2.50 | ||
| 1-aminocyclopropane-1-carboxylate synthase | 2.80 | ||
| 17. 1JS6 | Dopa decarboxylase | 2.60 | |
| 18. 1LK9 | Alliin lyase | 1.53 | |
| 19. 1MDX | ArnB aminotransferase | 1.96 | |
| Ornithine aminotransferase | 2.50 | ||
| 21. 1QGN | Cystathionine gamma-synthase | 2.90 | |
| 4-aminobutyrate-aminotransferase | 2.40 | ||
| Acetylornithine aminotransferase | 2.25 | ||
| 5-aminolevulinate synthase | 2.70 | ||
| Threonine Aldolase | 2.25 | ||
| 26. 2GSA | Glutamate-1-semialdehyde 2,1-aminomutase | 2.40 | |
| Serine Palmitoyltransferase | 1.30 | ||
| Cystathionine gamma lyase | 2.60 | ||
| L-tryptophan aminotransferase | 2.40 | ||
| LL-diaminopimelate aminotransferase | 1.99 | ||
| Glutamate decarboxylase beta | 3.00 |
a)Boldfaced codes denote the new structures added to the original set.
Figure 1Structurally Conserved Regions. Alignment of the Structurally Conserved Regions (SCR) of the 31 fold type I structures considered. Colors indicate conservation of residue physico-chemical properties. Each structure is labeled by its PDB code flanked by the sequence positions encompassing the reported SCRs. “SCR line” numbers the 13 conserved regions; below is the conservation histogram and the consensus sequence. The identically conserved residues in position 69 and 85 are the Asp interacting with the cofactor pyridine nitrogen and the Lys forming the Schiff base, respectively. Indels are not shown for easing the interpretation of the figure. Indel positions are denoted by the all-gap columns separating the different SCRs.
Figure 2Topology of the unrooted consensus tree calculated from the multiple alignment of the non-redundant set of type I domains. The percentage of replicate trees in which the associated taxa clustered together in the bootstrap test (1000 replicates) is shown next to the branches whenever the value was greater than 50. Sequences are labeled by their UniProt code and the following information, in order: specie name, phylum, specificity, cluster type and product (definitions refers to those reported in Table 2). Question mark denotes unknown information. Red circles indicate reference structures identified by their PDB id codes: [PDB:2E7U] is glutamate-1-semialdehyde 2,1-aminomutase from Thermus termophilus; 1DGE, dialkylglycine decarboxylase from Burkholderia cepacia; [PDB:1VEF], acetylornithine aminotransferase from Thermus termophilus; [PDB:3A2B], serine palmitoyltransferase from Sphyngobacterium multivorum; [PDB:1BS0], 8-amino-7-oxononanoate synthase from Escherichia coli; [PDB:3TQX], 2-amino-3-ketobutyrate coenzyme A ligase from Coxiella burnetii; [PDB:1C7G], tyrosine phenol-lyase from Erwinia herbicola; [PDB:2JIS], cysteine sulfinic acid decarboxylase from Homo sapiens. Subtrees defining the four families are drawn with different colours. The tree is unrooted.
List of non redundant NRPS/PKS proteins containing type I PLP-dependent domains of the four families
| ? | CP–KS–CP–CP–ABHy6– | ? | T1pks | Mmal-Mmal-Mal-Mmal-Mal-Mmal-Mal-Mal-Pk | ||
| GSA | KS–CP– | ? | Nrps-transatpks | Asn-Mal-Gly-Nrp-Nrp-Nrp-Thr-Gly | ||
| TPA | CP–ABHy– | ? | ? | ? | ||
| GSA | A–CP–KS–AT–CP– | Nrp | Nrps-t1pks | |||
| DAT | KS–AT–ADHN–ADHznN–KR–CP– | Mal | Terpene-t1pks | Mal | ||
| GSA | CP– | Nrp | T1pks-nrps | Nrp-Mal- | ||
| GSA | CP– | ? | Nrps | Nrp-Mal-Nrp-Orn-Asp-Nrp-Nrp-Thr | ||
| DAT | CP– | ? | ? | ? | ||
| DOT | A–CP– | ? | ? | ? | ||
| OAT | CP–ABHy6– | ? | ? | ? | ||
| GSA | C–A–CP–KS–AT–CP– | Gly,- | Nrps-t1pks | |||
| GSA | KS–AT–CP– | Mal | Nrps-t1pks | Nrp-Nrp- | ||
| GSA | T1pks-nrps | Mal-Thr | ||||
| GSA | A–CP–KS–AT–CP– | Mal, Thr | Nrps-t1pks | Nrp-Tyr-Pk- | ||
| GSA | KS–AT–CP– | Asp | Nrps-t1pks-hserlactone | |||
| AOS | A–CP– | Nrp | ? | ? | ||
| AOS | A–CP– | ? | ? | ? | ||
| AOS | CP– | ? | ? | ? | ||
| AOS | A–CP–KS–AT– | Mal | T1pks | Nrp-Pk- | ||
| AOS | CP–CP– | ? | T1-pks | Nrp-Pk-Mal | ||
| AOS | KR–CP– | ? | T1pks | Gly-Mal-Mal | ||
| AOS | A–CP–Luc– | ? | Terpene | ? | ||
| TPL | CP–KR–KS–ECH–CP–KS–KR–MT–CP–KS–KR–CP–KS–CP–CP–DUF2156– | ? | T2pks-nrps-transatpks | Nrp-Mal-Cys-Mal-Thr | ||
| TPL | CP–KR–KS–ECH–CP–KS–KR–MT–CP–KS–KR–CP–KS–CP–DUF2156– | ? | Nrps-transatpks-t1pks-t2pks | Nrp-Mal-Thr-Cys-Mal | ||
| TPL | CP–KR–KS–CP–KS–KR–CP–MT–CP–KS–KR–CP–KS–CP–CP–DUF2156– | ? | Nrps-transatpks | Ser-Nrp-Thr-Pk-Ser-Mal-Cys-Ala-Nrp | ||
| TPL | CP–DUF2156– | ? | transatpks-t2pks | Pk-Mal | ||
| TPL | CP–KS–KR–CP– | ? | T2pks-transatpks-nrps | Mal-Gly-Nrp | ||
| TPL | CP–DUF2156– | ? | Lantipeptide-nrps-transatpks | Leu-Leu-Phe-Leu-Leu-Nrp-Ile-Thr-Gly-Asn-Ser | ||
| DDC | KS–AT–ADH–KR–CP–C–HxxPF–A– | Mal, Nrp | T1pks-t2pks-nrps | Pk-Pk-Pk-Mal-Mal- | ||
| GDC | A–CP–C– | Gly | Nrp | Gly | ||
a)Family assignment with B6db. Abbreviations mean: GSA = Glutamate-1-semialdehyde 2,1-aminomutase; TPA = Taurine--pyruvate aminotransferase; DAT = Diamine aminotransferase; DOT = Diaminobutyrate-2-oxoglutarate transaminase; OAT = Ornithine--oxo-acid aminotransferase; AOS = 8-amino-7-oxononanoate synthase, TPL = Tyrosine phenol-lyase; DDC = Diaminobutyrate decarboxylase; GDC = Glutamate decarboxylase. Question mark means “not assigned”.
b)Domain codes with Pfam id code in parentheses: A = AMP–binding enzyme (PF00501); CP = Phosphopantetheine attachment site (PF00550); KS = beta–ketoacyl synthase (PF00109); AT = Acyltransferase domain (PF00698); MT = Methyltransferase domain (PF13489); C = Condensation domain (PF00668); HxxPF = HxxPF–repeated domain (PF13745); Luc = Luciferase (PF00296); ABHyX = Alpha/beta hydrolase fold where X is the subgroup number; FMO = Flavin–containing monooxygenase (PF00743); ADHN = Alcohol dehydrogenase GroES–like domain (PF08240); ADHZnN = Zinc–binding dehydrogenase (PF00107); KR = Ketoreductase domain (PF08659); DUF4009 = Domain of unknown function (PF13193); SDH = Shikimate/quinate 5–dehydrogenase (PF01488); ECH = Enoyl–CoA hydratase/isomerase family (PF00378). Type-I domains are: PLP1 = Aminotran_3 (PF00202); PLP2 = Aminotran_1_2 (PF00155); PLP3 = Beta_elim_lyase (PF01212); PLP4 = Pyridoxal_deC (PF00282).
c)Predicted specificity of the AT or A domains of the chain containing the type-I domain. Nrp means generic amino acid (no consensus among the predictive methods). Question mark denotes no prediction. Mal and Pk represent malonyl and a generic polyketide, respectively.
d)Gene cluster type as predicted by antiSMASH. Pks and Nrps stand for Polyketide synthase and Non-ribosomal polypeptide synthetase, respectively. T1pks, T2pks, transatpks refer to Type-I, Type-II and trans-AT pks, respectively. Question mark denotes no prediction.
e) Predicted cluster product. Monomers incorporated by the module containing the type-I domain are boldfaced. Question mark denotes no prediction.
Figure 3Sequence alignment between a representative sequence of each family of type I domains and the most similar structural template. Sequences are labeled by their databank code. Aminotran_3 (a): [UniProt: Q2T5Z2] indicates polyketide synthase from Burkholderia thailandensis; [PDB:2E7U] is the glutamate-1-semialdehyde 2,1-aminomutase from Thermus thermophilus HB8. Aminotran_1_2 (b): [UniProt:B1XHP8] indicates AMP-binding enzyme from Synechococcus sp. (strain ATCC 27264 / PCC 7002 / PR-6); [PDB:3A2B] denotes serine palmitoyltransferase from Sphingobacterium multivorum. Beta_elim_lyase (c): [UniProt:F8TUA6] corresponds to keto-hydroxyglutarate-aldolase/polyketide synthase from Lysobacter sp.; [PDB:1C7G] labels the tyrosine phenol-lyase from Erwinia herbicola. Pyridoxal_deC (d): [UniProt:B6IZA3] is the non-ribosomal peptide synthetase module from Coxiella burnetii; [PDB:2JIS] stands for the cysteine sulfinic acid decarboxylase from Homo sapiens. Secondary structures are charted below the template sequence. Helices (alpha and 310 helices are designated by α or η respectively) are displayed as squiggles and beta strands (β) are rendered as arrows. Beta turns are denoted as “TT” and strict α turns as “TTT”. Dots indicate gaps. Identically conserved residues are displayed on a red background; red letters indicate conservative substitutions. Triangles mark residues known to be functionally important in the template enzyme. Black circles tag important residues from the other subunit. Stars label the Asp and the Lys residue involved in interaction with pyridine nitrogen and in Schiff-base forming, respectively. The black square in the panel (c) indicates the Arg381 of the template missing in the homologous PLP domain.
Figure 4Structural superposition of the model-template pairs. Structural superposition of the model-template pairs for families Aminotran_3 (a), Aminotran_1_2 (b), Beta_elim_lyase (c) and Pyridoxal_deC (d) reported in Figure 3. Ribbon representation is used. Structural templates are colored in grey. Green and cyan indicate the model subunits. Cofactor is represented by transparent yellow spheres. Arrows point to insertion or deletion regions that are distinguished by magenta or yellow colors. The conformation of inserted magenta loop in the model of Aminotran_3 group (a) has no structural meaning: it has been modeled only with the purpose to indicate its approximate location on the protein surface.
Figure 5Comparisons of the models of the active site of the domains representative of each type I subfamily. Grey drawing indicates the reference structural template, while orange and cyan depict the two subunits of the models. Cofactor is drawn as transparent yellow spheres encapsulating stick models. Relevant side chains are rendered as sticks. Numbering refers to Figure 3. (a) Type I domain from polyketide synthase from Burkholderia thailandensis [UniProt:Q2T5Z2], and glutamate-1-semialdehyde 2,1-aminomutase from Thermus thermophilus HB8 (internal aldimine) [PDB:2E7U]. (b) AMP-binding enzyme [UniProt:B1XHP8] from Synechococcus sp. (strain ATCC 27264/PCC 7002/PR-6), and serine palmitoyltransferase from Sphingobacterium multivorum (external aldimine with serine) [PDB-3A2B]. (c) keto-hydroxyglutarate-aldolase/polyketide synthase from Lysobacter sp. [UniProt:F8TUA6] and tyrosine phenol-lyase from Erwinia herbicola (internal aldimine) [PDB:1C7G]. (d) Nonribosomal peptide synthetase module from Coxiella burnetii [UniProt: B6IZA3] and cysteine sulfinic acid decarboxylase from Homo sapiens (internal aldimine) [PDB:2JIS].