| Literature DB >> 29998051 |
Mária Martinovičová1, Štefan Janeček1,2.
Abstract
Glycoside hydrolases (GHs) have been classified in the CAZy database into 153 GH families. Currently, there might be four α-amylase families: the main family GH13, the family GH57 with related GH119 and, eventually, also GH126. The family GH57 was established in 1996 as the second and smaller α-amylase family. In addition to α-amylase, it contains 4-α-glucanotransferase, α-glucan branching enzyme, amylopullulanase, dual-specificity amylopullulanase-cyclomaltodextrinase, non-specified amylase, maltogenic amylase and α-galactosidase. The family GH57 enzymes employ the retaining reaction mechanism, share five typical conserved sequence regions and possess catalytic (β/α)7-barrel succeeded by a four-helix bundle with the catalytic machinery consisting of catalytic nucleophile and proton donor (glutamic acid and aspartic acid at strands β4 and β7, respectively). The present bioinformatics study delivers a detailed sequence comparison of 1602 family GH57 sequences with the aim to highlight the uniqueness of each enzyme's specificity and all eventual protein groups. This was achieved by creating the evolutionary tree focused on both the enzyme specificities and taxonomical origin. The substantial increase of numbers of sequences from recent comparisons done more than 5 years ago has allowed to refine the details of the sequence logos for the individual enzyme specificities. The study identifies a new evolutionary distinct group of α-galactosidase-related enzymes with until-now-undefined enzyme specificity but positioned on the evolutionary tree on a branch adjacent to α-galactosidases. The specificity of α-galactosidase is, moreover, the only one of the entire family GH57 for which there is no structural support for the proposal of the proton donor based on sequence analysis. The analysis also suggests a few so-called "like" protein groups related to some family GH57 enzyme specificities but lacking one or both catalytic residues.Entities:
Keywords: Bioinformatics analysis; Conserved sequence regions; Evolutionary relatedness; Unique sequence/structural features; α-Amylase family GH57
Year: 2018 PMID: 29998051 PMCID: PMC6037648 DOI: 10.1007/s13205-018-1325-9
Source DB: PubMed Journal: 3 Biotech ISSN: 2190-5738 Impact factor: 2.406
Enzymes and proteins from the family GH57 used in the present study
| Enzyme | Number | Archaea | Bacteria | Characterizeda | Lengthb |
|---|---|---|---|---|---|
| α-Amylases | 154 | 99 | 55 | 1 | 414 |
| AAMY-like proteins | 126 | 60 | 66 | 443 | |
| 4-α-Glucanotransferases | 107 | 38 | 69 | 5 | 670 |
| 4AGT-like proteins | 63 | 63 | 623 | ||
| Amylopullulanases | 268 | 74 | 194 | 8 | 814 |
| Amylopullulanases–cyclomaltodextrinases | 40 | 20 | 20 | 4 | 529 |
| APU-CMD-like proteins | 5 | 5 | 728 | ||
| Maltogenic amylases | 34 | 34 | – | 3 | 590 |
| AGAL-related enzymes | 15 | 15 | 660 | ||
| α-Galactosidases | 14 | 14 | – | 1 | 362 |
| Maltogenic amylase-like proteins | 1 | 1 | 479 | ||
| Non-specified amylases | 170 | 18 | 152 | 1 | 785 |
| GBE like | 60 | 60 | 520 | ||
| α-Glucan branching enzymes | 545 | 26 | 519 | 4 | 546 |
| Total | 1602 | 383 | 1219 | 27 |
The set was created based on sequences classified in the CAZy family GH57 completed by maltogenic amylases (currently kept in CAZy as “non-classified” sequences) using also the BLAST search
AAMY α-Amylase, 4AGT 4-α-glucanotransferase, APU-CMD amylopullulanase–cyclomaltodextrinase, AGAL α-galactosidase, GBE α-glucan branching enzyme
aCharacterized GH57 enzymes: α-amylase (1)—Methanocaldococcus jannaschii (Kim et al. 2001); 4-α-glucanotransferases (5)—Dictyoglomus thermophilum (Fukusumi et al. 1988; Nakajima et al. 2004), Pyrococcus furiosus (Laderman et al. 1993a, b), Thermococcus kodakaraensis (Tachibana et al. 1997), Thermococcus litoralis (Jeon et al. 1997; Imamura et al. 2003), Archaeoglobus fulgidus (Labes and Schonheit 2007; Paul et al. 2015); amylopullulanases (8)—Pyrococcus furiosus (Dong et al. 1997), Thermococcus hydrothermalis (Erra-Pujada et al. 1999), Thermococcus litoralis (Imamura et al. 2004), Spirochaeta thermophila (Angelov et al. 2010), Dictyoglomus turgidum (Brumm et al. 2011), Thermococcus siculi (Jiao et al. 2011), Thermococcus kodakaraensis (Guan et al. 2013), Sulfolobus acidocaldarius (Choi and Cha 2015); amylopullulanases–cyclomaltodextrinases (4)—Staphylothermus marinus (Li et al. 2013), Caldivirga maquilingensis (Li and Li 2015), Desulfurococcus amylolyticus (Park et al. 2018), Thermophilum pendens (Li et al. 2018); maltogenic amylases (3)—Pyrococcus furiosus (Comfort et al. 2008), Pyrococcus sp. ST04 (Jung et al. 2014; Park et al. 2014), Thermococcus cleftensis (Jeon et al. 2014); α-galactosidases (1)—Pyrococcus furiosus (van Lieshout et al. 2003); non-specified amylases (1)—uncultured bacterium (Wang et al. 2011); and α-glucan branching enzymes (4)—Thermococcus kodakarensis (Murakami et al. 2006; Santos et al. 2011), Thermotoga maritima (Ballschmiter et al. 2006; Dickmanns et al. 2006), Thermus thermophilus (Palomo et al. 2011), Pyrococcus horikoshii (Na et al. 2017)
bThe “Length” indicates the average length
Fig. 1Evolutionary tree of the α-amylase family GH57. The analyzed set contains 1602 sequences (for details, see Table 1). The tree is based on the alignment of conserved sequence regions (cf. Fig. 2) and calculated using the neighbor-joining method. The picture shows the same tree emphasizing a its division according to individual enzyme specificities and protein groups and b the division reflecting the taxonomic origin—bacteria (blue) and archaea (red). The 27 experimentally characterized GH57 enzymes (cf. Table 1) are signified by black dots in both versions of the tree. GBE α-glucan branching enzyme, AGAL α-galactosidase, APU-CMD amylopullulanase–cyclomaltodextrinase, 4AGT 4-α-glucanotransferase, AAMY α-amylase
Fig. 2Sequence fingerprints of GH57 enzyme specificities and their “like” protein homologues. Sequence logos of enzyme specificities (on the left) are based on 154 α-amylases, 268 amylopullulanases, 40 amylopullulanase–cyclomaltodextrinases, 107 4-α-glucanotransferases, 545 α-glucan branching enzymes, 34 maltogenic amylases, 170 non-specified amylases, 14 α-galactosidases and 15 α-galactosidase-related enzymes. Sequence logos of their “like” protein homologues (on the right) are based on 126 sequences of α-amylase-like proteins, five sequences of amylopullulanase–cyclomaltodextrinase-like proteins, 63 sequences of 4-α-glucanotransferase-like proteins, 60 sequences of α-glucan branching enzyme-like proteins and one sequence of maltogenic amylase-like protein. CSR-1 residues 1–5, CSR-2 residues 6–11, CSR-3 residues 12–17, CSR-4 residues 18–27, CSR-5 residues 28–36. The catalytic nucleophile (No. 15, Glu) with the proton donor (No. 20, Asp) and their corresponding positions in the “like” protein counterparts are indicated by asterisks
Fig. 3Structure comparison of 4-α-glucanotransferase and α-galactosidase-related enzymes. a Real structure of Thermococcus litoralis 4-α-glucanotransferase (Imamura et al. 2003; PDB: 1K1Y); b and c modelled structures of α-galactosidase-related enzymes from Shewanella baltica and Clostridium kluyveri. The individual domains are coloured as follows: catalytic incomplete TIM-barrel—red, succeeding helical bundle—blue, and C-terminal β-sheet domain—green. Detailed view illustrates the active site with glutamic acid and aspartic acid as catalytic nucleophile and proton donor, respectively, with bound acarbose (from the 4-α-glucanotransferase structure)