| Literature DB >> 22992189 |
Henrik Aspeborg1, Pedro M Coutinho, Yang Wang, Harry Brumer, Bernard Henrissat.
Abstract
BACKGROUND: The large Glycoside Hydrolase family 5 (GH5) groups together a wide range of enzymes acting on β-linked oligo- and polysaccharides, and glycoconjugates from a large spectrum of organisms. The long and complex evolution of this family of enzymes and its broad sequence diversity limits functional prediction. With the objective of improving the differentiation of enzyme specificities in a knowledge-based context, and to obtain new evolutionary insights, we present here a new, robust subfamily classification of family GH5.Entities:
Mesh:
Substances:
Year: 2012 PMID: 22992189 PMCID: PMC3526467 DOI: 10.1186/1471-2148-12-186
Source DB: PubMed Journal: BMC Evol Biol ISSN: 1471-2148 Impact factor: 3.260
Figure 1Phylogenetic tree of family GH5. In this circular phylogram, the branches corresponding to subfamilies 1–53 are shown in color and the subfamily numbers are indicated next to the exterior color circle. The branches corresponding to sequences not included into subfamilies are in black. A detailed version of this tree is found in Additional file 1: Figure S1.
Newly defined subfamilies within glycoside hydrolase family GH5
| GH5_1 | A1b | 133 | 3.2.1.4 | 2ZUN | |
| 3.2.1.73 | |||||
| 3.2.1.91 | |||||
| GH5_2 | A2b | 245 | 3.2.1.4 | 2A3H | |
| 3.2.1.132 | |||||
| GH5_4 | A3 + A4b | 160 | 3.2.1.4 | 2JEQ | |
| 3.2.1.151 | |||||
| 3.2.1.73 | |||||
| 3.2.1.8 | |||||
| GH5_5 | A5 + A6b,c,e | 123 | 3.2.1.4 | IGZJ | |
| GH5_7 | A7d | 133 | 3.2.1.25 | IRH9 | |
| 3.2.1. 78 | |||||
| 2.4.1.- | |||||
| GH5_8 | A8d | 71 | 3.2.1.78 | 2WHL | |
| GH5_9 | A9e | 107 | 3.2.1.58 | 3N9K | |
| 3.2.1.75 | |||||
| 3.2.1.21 | |||||
| GH5_10 | A10e,f | 19 | 3.2.1.78 | 2C0H | |
| GH5_11 | | 19 | ND | | |
| GH5_12 | | 42 | 3.2.1.21 | | |
| 3.2.1.45 | |||||
| GH5_13 | | 59 | ND | | |
| GH5_14 | | 15 | 3.2.1.58 | | |
| GH5_15 | | 10 | 3.2.1.75 | | |
| GH5_16 | | 10 | 3.2.1.164 | | |
| GH5_17 | | 5 | 3.2.1.78 | | |
| GH5_18 | | 24 | ND | | |
| GH5_19 | | 23 | ND | | |
| GH5_20 | | 17 | ND | | |
| GH5_21 | | 10 | 3.2.1.8 | | |
| GH5_22 | | 12 | 3.2.1.4 | | |
| GH5_23 | | 5 | 3.2.1.149 | | |
| 3.2.1.168 | |||||
| GH5_24 | | 5 | ND | | |
| GH5_25 | | 16 | 3.2.1.4 | 3MMW | |
| 3.2.1.78 | |||||
| GH5_26 | | 17 | 3.2.1.4 | | |
| 3.2.1.73 | |||||
| GH5_27 | | 5 | 3.2.1.123 | | |
| GH5_28 | | 8 | 3.2.1.123 | 2OSX | |
| GH5_29 | | 5 | 3.2.1.123 | | |
| GH5_30 | | 5 | ND | | |
| GH5_31 | | 5 | 3.2.1. | | |
| GH5_32 | | 5 | Eukaryota (Plants; Stramenopiles) | ND | |
| GH5_33 | | 9 | Eukaryota (Stramenopiles) | ND | |
| GH5_34 | | 5 | 3.2.1. | 2Y8K | |
| GH5_35 | | 5 | ND | | |
| GH5_36 | | 23 | 3.2.1.73 | IVJZ | |
| 3.2.1.78 | |||||
| GH5_37 | | 19 | 3.2.1.4 | ICEN | |
| 3.2.1.73 | |||||
| 3.2.1.74 | |||||
| GH5_38 | | 10 | 3.2.1. | | |
| GH5_39 | | 7 | 3.2.1.4 | | |
| GH5_40 | | 8 | ND | | |
| GH5_41 | | 14 | ND | | |
| GH5_42 | | 9 | ND | | |
| GH5_43 | | 11 | ND | | |
| GH5_44 | | 24 | ND | | |
| GH5_45 | | 5 | ND | | |
| GH5_46 | | 15 | 3.2.1. | | |
| GH5_47 | | 6 | ND | | |
| GH5_48 | | 18 | 3.2.1. | | |
| GH5_49 | | 20 | ND | | |
| GH5_50 | | 7 | ND | | |
| GH5_51 | | 5 | ND | | |
| GH5_52 | | 6 | 3.2.1.74 | | |
| GH5_53 | 6 | 3.2.1.74 |
ND Activity not determined yet.
aExperimentally determined.
bsee [16].
csee [17].
dsee [18].
esee [19].
fsee [20].
gEC numbers not yet defined.
hActive enzyme(s) present but with unclear EC number(s).
* See http://www.cazy.org for most recent information.
Historical names for some subfamilies are provided, along with the taxonomical range, the characterization level and structural information from representative structures. Known enzyme activities in family GH5 are provided using the following Enzyme Classification (EC) numbers and corresponding activities: 3.2.1.4 – endo-β-1,4-glucanase or cellulase; 3.2.1.8 – endo-β-1,4-xylanase; 3.2.1.21 – β-glucosidase; 3.2.1.25 – β-mannosidase; 3.2.1.45 – β-glucocerebrosidase; 3.2.1.58 – glucan β-1,3-glucosidase; 3.2.1.73 – licheninase; 3.2.1.74 – cellodextrinase; 3.2.1.75 – glucan endo-β-1,6-glucosidase; 3.2.1.78 – mannan endo-β-1,4-mannosidase or endo-β-1,4-mannanase; 3.2.1.91 – cellulose β-1,4-cellobiosidase or cellobiohydrolase; 3.2.1.123 – endoglycoceramidase; 3.2.1.132 – chitosanase; 3.2.1.149 – β-primeverosidase; 3.2.1.151 – xyloglucan-specific endo-β-1,4-glucanase; 3.2.1.164 – endo-β-1,6-galactanase; 3.2.1.168 – hesperidin 6-O-α-L-rhamnosyl-β-glucosidase; 3.2.1.- – undefined EC numbers for β-1,3-mannanase/β-1,3-glucomannanase or for arabinoxylan-specific β-xylanase or still unclear EC numbers depending on the subfamily (see notes and text); 2.4.1.- – β-mannan transglycosidase.
Figure 2Examples of modular GH5 proteins. (a) Diverse modular arrangements of putative monofunctional modular enzymes from subfamily GH5_8. (b) Same for putative bifunctional GH5 enzymes containing a subfamily GH5_8 module. (c) Other putative bifunctional enzymes containing at least a single GH5 module. (d) Selected examples of proteins containing GH5 modules having lost one or more catalytic residues. For a given protein, each GH5 module is identified by a number of fields separated by “|” indicating: (i) the organism, with 3 letters for the genre and either 5 letters for the species or full strain code; (ii) the GenBank protein accession; (iii) if attributed, the subfamily number or other information; (iv) EC numbers if available. These individual tags are analogous to what is found in Additional file 1: Figure S1. The module types and other protein segments present are: GHx_y – glycoside hydrolase family x subfamily y (pink); CEx – carbohydrate esterase module of family x (light brown); Cip21 – chitin-binding protein type 21 module with putative carbohydrate oxidative cleaving activity, formerly CBM33 (dark gray); CBMx – carbohydrate binding modules of family x (light green); FN3 – fibronectin type III modules (dark green); DOC – cellulosomal dockerin modules (light violet); EXPN – expansin modules (dark purple); signal peptides (purple); transmembrane segments (yellow); linkers (light blue); other regions (light grey).
Characterized carbohydrate-active enzymes of family GH5 not yet classified into subfamilies
| β-mannanase A (ManA;CelA) | 3.2.1.78 | | AAD09354 | GH5 CBM16 CBM16 | B-Firmicutes_Clostridia | |
| endo-β-1,4-glucanase D (CelD; CelCCD; EGCCD; Ccel_0840) | 3.2.1.4 | | BAA14354 ACL75216 | GH5 CBM11 DOC | B-Firmicutes_Clostridia | |
| endo-β-1,4-glucanase/b-1,3:1,4-glucanase H (CelH) | 3.2.1.4 3.2.1.73 | | AAA23225 | GH26 GH5 CBM11 DOC | B-Firmicutes_Clostridia | |
| cellulase (EBI-244) | 3.2.1.4 | | AEB53062 | GH5 | A-Crenarchaeota | |
| endo-β-1,4-glucanase 3 (Cel3;Cel-3; Eg3; Fisuc_2230; FSU_2772) | 3.2.1.4 | | AAA24893 ACX75816 ADL25000 | GH5 | B-Fibrobacteres_Acidobacteria group | |
| Fisuc_2933/FSU_0196 | | AGM | ACX76513 ADL26912 | GH5 CBM4 | B-Fibrobacteres_Acidobacteria group | |
| Fisuc_1523/FSU_2005 | | MUC | ACX75120 ADL26743 | GH5 | B-Fibrobacteres_Acidobacteria group | |
| endoglucanase (CelA; lpg1918) | 3.2.1.4 | AHEC | AAU27988 | GH5 | B-Gammaproteobacteria | |
| endo-β-1,4-glucanase 5B (Sde_2490) | 3.2.1.4 | | ABD81750 | CBM6 GH5 | B-Gammaproteobacteria | |
| endo-β-1,4-glucanase 5E (Sde_2929) | 3.2.1.4 | | ABD82186 | CBM6 CBM6 GH5 | B-Gammaproteobacteria | |
| endo-β-1,6-glucanase (Exg3; SPBC2D10.05) | 3.2.1.75 | | CAA21163 NP_596224 | GH5 | E-Fungi | |
| endo-β-1,4-glucanase (Cel5G) | 3.2.1.4 | | ADD71777 | GH5 | B-environmental samples | |
| SARM_0034/694713_55880/TW39 | | LIC CMC | ADX05705 | GH5 | U-unclassified sequences | |
| SARM_0047/1057205_158590/TW-15 | | CMC | ADX05718 | GH5 | U-unclassified sequences | |
| SARM_0086/0_06533/TW-18 | | PCW | ADX05761 | GH5 | U-unclassified sequences | |
| β-glucanase (RR.06; RR.06-1; BglC) | | BBG Cel5 LIC | CAJ19140 | GH5 | U-unclassified sequences | |
| β-glucanase (RR.10; RR.10-1) | BBG | unidentified microorganism | CAJ19146 | U-unclassified sequences |
The active enzymes are tagged by their Enzyme Classification (EC) number (see Table 1) or by the significant positively assayed substrates that are: AEHC - AZCL-HE cellulose; AGM - AZCL-galactomannan; BBG – barley β-glucan; Cel5 – cellopentose; CMC - carboxymethyl cellulose; LIC –lichenan; MUC - 4-methylumbelliferyl-β-D-cellobioside; PCW – plant cell wall.