| Literature DB >> 21622642 |
Caitlin Murphy1, Justin Powlowski, Min Wu, Greg Butler, Adrian Tsang.
Abstract
Fungi produce a wide range of extracellular enzymes to break down plant cell walls, which are composed mainly of cellulose, lignin and hemicellulose. Among them are the glycoside hydrolases (GH), the largest and most diverse family of enzymes active on these substrates. To facilitate research and development of enzymes for the conversion of cell-wall polysaccharides into fermentable sugars, we have manually curated a comprehensive set of characterized fungal glycoside hydrolases. Characterized glycoside hydrolases were retrieved from protein and enzyme databases, as well as literature repositories. A total of 453 characterized glycoside hydrolases have been cataloged. They come from 131 different fungal species, most of which belong to the phylum Ascomycota. These enzymes represent 46 different GH activities and cover 44 of the 115 CAZy GH families. In addition to enzyme source and enzyme family, available biochemical properties such as temperature and pH optima, specific activity, kinetic parameters and substrate specificities were recorded. To simplify comparative studies, enzyme and species abbreviations have been standardized, Gene Ontology terms assigned and reference to supporting evidence provided. The annotated genes have been organized in a searchable, online database called mycoCLAP (Characterized Lignocellulose-Active Proteins of fungal origin). It is anticipated that this manually curated collection of biochemically characterized fungal proteins will be used to enhance functional annotation of novel GH genes. Database URL: http://mycoCLAP.fungalgenomics.ca/.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21622642 PMCID: PMC3263737 DOI: 10.1093/database/bar020
Source DB: PubMed Journal: Database (Oxford) ISSN: 1758-0463 Impact factor: 3.451
The types of information extracted from the characterization literature
| Column Number | Title | Type of information |
|---|---|---|
| 1 | Entry name | The unique identifier representing the enzyme. It incorporates the enzyme activity, the GH family it belongs to, and the phylogenetic origin of the enzyme. |
| 2 | Gene name | The assigned gene name based on the standardized naming convention adopted for this study (see ‘Methods’ section). |
| 3 | Species | The genus and species of the enzyme’s natural host. |
| 4 | Strain | The strain of fungus used to obtain the gene and/or enzyme. |
| 5 | Gene name | The assigned gene name based on the standardized naming convention adopted for this study (see ‘Methods’ section). |
| 6 | Gene alias | Any other names the gene is referred to in the literature or sequence databases. |
| 7 | Enzyme name | The name most commonly used to identify an enzyme of a specific activity type. |
| 8 | Enzyme alias | Any other names the gene product (enzyme) is referred to in literature or public databases. |
| 9 | Systematic name | The systematic enzyme name according to the EC. < |
| 10 | The EC number | A numerical classification of enzymes based on the reactions they catalyze. < |
| 11 | Gene ID (GenBank) | The nucleotide sequence ID issued by the GenBank database. < |
| 12 | UniProt ID | The ID issued to each protein in the UniProt database. < |
| 13 | Protein ID (GenBank) | The protein ID issued by the GenBank database. < |
| 14 | Characterization literature | The ID of the literature describing the enzyme’s characterization and properties recorded as PMID (PubMed ID) or CSFGID (Centre for Structural and Functional Genomics). |
| 15 | Structure literature | The PMID or CSFGID of any literature describing the structure of the enzyme if available. |
| 16 | GH family | The GH family the enzyme belongs to. < |
| 17 | Assay | The experiment used to determine the function and/or properties of the enzyme. |
| 18 | Activity assay conditions | The buffer, pH and temperature used in the assay to determine the enzyme activity |
| 19 | Kinetic assay conditions | The buffer, pH and temperature in which the |
| 20 | Substrates | The chemical substrates used to assay that the enzyme was assayed on. |
| 21 | Host (recombinant expression) | The organism used to produce the recombinant enzyme for the experimental assay. |
| 22 | Specific activity | The activity of the purified enzyme on the given substrate. Recorded in U/mg where 1 U (unit) = 1 μmol/min/mg = 16.67 nkat/mg. |
| 23 | Substrate specificity | The activity of the enzyme on a given substrate compared to other substrates tested. Expressed as a percentage with the highest activity usually equal to 100%. |
| 24 | The Michaelis–Menten constant ( | |
| 25 | The maximum number of reactions the enzyme catalyzes in one second (s−1). | |
| 26 | The maximum velocity measured in U/mg at which an enzyme catalyzes a reaction. Reported in different ways, often as U/mg. | |
| 27 | pH optimum | The pH at which enzyme activity is maximal highest. |
| 28 | pH stability | The pH range over which the enzyme is able to remain active. retains maximal activity (usually ≥80%) under the conditions defined in the paper |
| 29 | Temperature optimum | The temperature (°C) at which enzyme activity is maximal. |
| 30 | Temperature stability | The temperature (°C) beyond which the enzyme activity (usually ≥20%) is lost under the conditions defined in the study. |
| 31 | Isoelectric point (theoretical) | The pI of the enzyme calculated from its amino acid composition. |
| 32 | Isoelectric point (experimental) | The pI of the enzyme determined by isoelectric focusing. |
| 33 | Molecular weight (theoretical) | The molecular weight (kDa) of the enzyme calculated from its amino acid composition |
| 34 | Molecular weight (experimental) | The molecular weight (kDa) of the enzyme estimated using SDS–PAGE, gel filtration, etc. |
| 35 | Protein length | The number of amino acids in the enzyme before cleavage of the signal peptide (unless stated otherwise). |
| 36 | Signal peptide | The number of amino acids comprising the signal peptide, which targets the enzyme for secretion. |
| 37 | CBD | Carbohydrate binding domain if present as part of the enzyme. |
| 38 | Glycosylation | Type of glycosylation (only if experimentally determined) |
| 39 | Other features | Any other information regarding the enzyme’s activity. |
| 40 | GO (molecular) | The GO term defining the molecular function of the enzyme |
| 41 | Evidence (molecular) | The type of information supporting the annotation of the molecular function of the enzyme. |
| 42 | GO (process) | The GO term defining the biological process the enzyme participates in. |
| 43 | Evidence (process) | The type of information supporting the annotation of the biological process. The biological process is only assigned the evidence code ‘Inferred by Direct Assay (IDA)’ when assayed on its natural substrate |
| 44 | GO (component) | The GO term defining the cellular compartment in which the enzyme acts. |
| 45 | Evidence (component) | The type of information supporting the enzyme’s component annotation. |
This table lists the names of the columns used to organize the collected data in a spreadsheet. The type of data each heading encompasses is explained on the right.
Activities of the characterized GH
| Enzyme name | Code | Activity |
|---|---|---|
| α-1,2-mannosidase (2-α-mannosyl-oligosaccharide α- | MSD | Catalyzes the hydrolysis of terminal, non-reducing-end glucose in mannosyl-oligosaccharides |
| α-amylase (4-α- | AMY | Catalyzes the hydrolysis of internal α-1,4-glucosidic linkages in polysaccharides and releases products in α-configuration |
| α-arabinofuranosidase (α- | ABF | Catalyzes terminal, non-reducing-end hydrolysis of α- |
| α-galactosidase (α- | MEL | Catalyzes the hydrolysis of non-reducing-end α- |
| α-glucosidase (4-α- | AGL | Releases glucose by catalyzing the hydrolysis of non-reducing-end α- |
| α-glucuronidase (α- | AGU | Catalyzes the hydrolysis of glucuronic acid branches from hemicellulose |
| α- | RHA | Catalyzes the hydrolysis of non-reducing-end α- |
| α-xylosidase | AGD | Catalyzes the hydrolysis of terminal α-linked xylosides |
| Arabinogalactanase (arabinogalactan 4-β- | GAN | Catalyzes the hydrolysis of internal β-1,4-linked galactosidic linkages |
| Arabinoxylan–arabinofuranosidase | AXH | Catalyzes the removal of arabinosides from xylan main chains |
| β-galactosidase (β- | LAC | Catalyzes the hydrolysis of terminal, non-reducing-end β- |
| β-glucosidase (β- | BGL | Releases glucose by acting on terminal, non-reducing-end β- |
| Beta-mannanase (4-β- | MAN | Catalyzes the hydrolysis of β-1,4-mannosidic linkages in mannans, galactomannans and glucomannans |
| β-mannosidase (β- | MND | Catalyzes the hydrolysis of terminal, non-reducing-end β- |
| β-xylosidase (4-β- | XYL | Catalyzes the hydrolysis of the bond joinholding xylose sugars together in xylobiose |
| Cellobiohydrolase (4-β- | CBH | Acts on non-reducing-end 1,4-β- |
| Cellulase-enhancing protein | CEP | Exact function unknown but enhances hydrolysis of cellulose by cellulases |
| Chitinase ((1-4)-2-acetamido-2-deoxy-β- | CHI | Catalyzes the random hydrolysis of |
| Chitosanase (chitosan | CSN | Catalyzes the hydrolysis of β-1,4 linkages in acetylated chitosans |
| Dextranase (6-α- | DEX | Acts on 1,6-α-glucosidic linkages in dextrins |
| Endo-arabinanase (5-α- | ABN | Catalyzes the hydrolysis of internal α-1,5-arabinofuranosidic linkages in arabinans |
| Endo-β-1,6-glucanase (6-β- | BGN | Catalyzes the random hydrolysis of β-1,6 linkages in β-1,6-linked glucans |
| Endo-β- | END | Catalyzes the removal of acetylated glycoprotein branches forming mannosyl-oligosaccharides |
| Endo-inulinase (1-β- | INU | Catalyzes the hydrolysis of internal fructosidic linkages in inulin |
| Endo-polygalacturonase (1,4-α- | PGA | Catalyzes the random hydrolysis of 1,4-α-galactosiduronic linkages in pectate and galacturonans |
| Endo-rhamnogalacturonase | RHG | Catalyzes the hydrolysis of links between galacturonic acid and rhamnopyranosyl residues in pectins |
| Endoglucanase (4-β- | EGL | Catalyzes the hydrolysis of β-1,4-glucosidic linkages in cellulose |
| Exo-1,3-β-glucanase (3-β- | EXG | Catalyzes the hydrolysis of glucose from the non-reducing-ends of β-1,3-glucans |
| Exo-arabinanase | ARB | Catalyzes the hydrolysis of α-1,5-arabinofuranosidic linkages from the ends of arabinans |
| Exo-glucosaminidase (Chitosan exo-1,4-β- | GLS | Catalyzes the hydrolysis of glucosamine residues from the non-reducing ends of chitosans |
| Exo-inulinase (β- | INX | Catalyzes the hydrolysis of terminal, non-reducing 2,1- and 2,6-linked fructofuranose in fructans |
| Exo-polygalacturonase (poly{1,4-α- | PGX | Catalyzes the hydrolysis of |
| Exo-rhamnogalacturonase | RGX | Catalyzes the hydrolysis of rhamnoside residues from the ends of pectin |
| Galactanase (galactan endo-1,6-β-galactosidase) | GAL | Catalyzes the hydrolysis of internal β-1,6-galactosidic linkages in arabinopgalactans and the hydrolysis of β-1,3- and β-1,6-galactosidic linkages in mixed galactans |
| Hexosaminidase (β-N-acetyl- | HEX | Catalyzes the hydrolysis of terminal, non-reducing-end |
| Invertase (β- | SUC | Catalyzes the hydrolysis of β- |
| Isopullulanase (pullulan 4-glucanohydrolase) | IPU | Catalyzes the hydrolysis of pullulan to isopanose |
| Laminarinase (3-β- | LAM | Catalyzes the hydrolysis of β-1,3-glucosidic linkages in β-1,3-glucans |
| Licheninase (1,3-, 1,4-β- | LIC | Catalyzes the hydrolysis of β-1,4-glucosidic linkages in mixed-link glucans |
| Mixed-link glucanase (3(or 4)-β- | MLG | Catalyzes the hydrolysis of β-1,3 or β-1,4 linkages in mixed glucans when the glucose involved in the linkage is substituted at the 1,3 position |
| Mutanase (3-α- | MUT | Catalyzes the internal hydrolysis of α-1,3-glycosidic linkages |
| Oligo-1,6-glucosidase (oligosaccharide 6-α-glucohydrolase) | OGL | Catalyzes the hydrolysis of 1,6-glycosidic linkages in oligosaccharides |
| Oligoxyloglucan cellobiohydrolase (oligoxyloglucan reducing-end cellobiohydrolase) | XBH | Catalyzes the hydrolysis of cellobiose from the reducing ends of xyloglucans with |
| Trehalase (α, α-trehalose glucohydrolase) | TRE | Catalyzes the hydrolysis of trehalose to release two |
| Xylanase (4-β- | XYN | Acts on 1,4-β-xylosidic linkages in xylan |
| Xylogalacturonase | XGH | Catalyzes the hydrolysis of xylosyl substitutions on pectins |
| Xyloglucanase ([(1-6)-α- | XEG | Catalyzes the hydrolysis of bonds involved in xyloglucan chains |
This table lists the different enzyme activities collected in the literature survey. A combination of BRENDA
Fungal species having characterized glycoside hydrolases
| Code | Number of enzymes characterized | Alternate names | |
|---|---|---|---|
| Ascomycota species | |||
| | ACRBL | 1 | |
| | ACRNA | 1 | |
| | APHAL | 1 | |
| | ARXAD | 2 | |
| | ASPAC | 9 | |
| | ASPAW | 11 | |
| | ASPFL | 2 | |
| | ASPFU | 5 | |
| | ASPKA | 11 | |
| | ASPNG | 47 | |
| | ASPOR | 16 | |
| | ASPPH | 1 | |
| | ASPSH | 2 | |
| | ASPSO | 1 | |
| | ASPSP | 1 | |
| | ASPSU | 2 | |
| | ASPTE | 2 | |
| | ASPTU | 6 | |
| | AURPU | 4 | |
| | BIOOC | 4 | |
| | BISSP | 1 | |
| | BOTFU | 7 | |
| | CANAL | 6 | |
| | CANOL | 1 | |
| | CANTS | 1 | |
| | CANWI | 1 | |
| | CHABR | 1 | |
| | CHAGR | 2 | |
| | CHATH | 1 | |
| | CLAPU | 2 | |
| | COCIM | 2 | |
| | COCCA | 8 | |
| | COCSA | 1 | |
| | CRYPA | 1 | |
| | DALES | 1 | |
| | DEBOC | 3 | |
| | EMEDE | 1 | |
| | EMENI | 34 | |
| | FUSEQ | 1 | |
| | FUSOX | 2 | |
| | FUSSO | 3 | |
| | GEOSP | 2 | |
| | GIBSP | 1 | |
| | GIBZE | 3 | |
| | HANAN | 1 | |
| | HORRE | 1 | |
| | HUMGT | 4 | |
| | HUMIN | 6 | |
| | HYPSC | 1 | |
| | ISAJA | 1 | |
| | KLULA | 3 | |
| | KLUMA | 2 | |
| | KURMO | 1 | |
| | LIPKO | 2 | |
| | LIPST | 1 | |
| | MAGGR | 5 | |
| | MELAO | 3 | |
| | METAN | 1 | |
| | NEOSP | 1 | |
| | NEUCR | 1 | |
| | PAETH | 1 | |
| | PENBR | 1 | |
| | PENCA | 1 | |
| | PENCH | 3 | |
| | PENCI | 3 | |
| | PENEN | 1 | |
| | PENFN | 6 | |
| | PENJA | 2 | |
| | PENMI | 2 | |
| | PENOL | 2 | |
| | PENPU | 6 | |
| | PENSI | 1 | |
| | PENSQ | 8 | |
| | PERSP | 1 | |
| | PICAN | 2 | |
| | PICJA | 1 | |
| | PICPA | 1 | |
| | ROBSP | 1 | |
| | YEAST | 9 | |
| | SACFI | 3 | |
| | SCHPO | 5 | |
| | STAEC | 1 | |
| | TALEM | 2 | |
| | THEAU | 5 | |
| | THELA | 2 | |
| | THIHE | 1 | |
| | THITE | 1 | |
| | TRIAS | 3 | |
| | TRIHA | 11 | |
| | TRIKO | 2 | |
| | TRILO | 1 | |
| | TRIRE | 23 | |
| | TRISP | 1 | |
| | TRIVI | 2 | |
| | VERDA | 1 | |
| | YARLI | 1 | |
| Basidiomycota species | |||
| | AGABI | 4 | |
| | ATHRO | 2 | |
| | CHOPU | 1 | |
| | COPCI | 3 | |
| | CRYAL | 1 | |
| | CRYFL | 1 | |
| | CRYSP | 2 | |
| | FOMPA | 2 | |
| | FOMPI | 1 | |
| | IRPLA | 6 | |
| | MERGI | 1 | |
| | PHARA | 2 | |
| | PHACH | 11 | |
| | SCHCO | 1 | |
| | SPOSI | 1 | |
| | TRAHI | 1 | |
| | UROFA | 1 | |
| Mucoromycotinia species | |||
| | GONSP | 1 | |
| | MORAL | 1 | |
| | MUCCI | 2 | |
| | MUCHI | 1 | |
| | MUCJA | 1 | |
| | MYCCO | 1 | |
| | PHYNI | 1 | |
| | RHIOL | 1 | |
| | RHIOR | 17 | |
| | RHISP | 1 | |
| | SYNRA | 1 | |
| Neocallimastigomycota species | |||
| | NEOFR | 1 | |
| | NEOPA | 4 | |
| | ORPJO | 1 | |
| | ORPSP | 3 | |
| | PIREQ | 2 | |
| | PIRSP | 7 |
This table lists the species of fungi and the number of characterized glycoside hydrolases collected from each. They are listed according to phylum. The species name used in this research is listed on the left, while any other names used for the same species are listed under ‘Alternative Names’. The five-letter codes used for the standardized naming of genes are also listed here. The codes follow a naming system used by UniProt. The first three letters represent the genus and the last two letters represent the species of the fungus.
GH families having characterized enzymes of fungal origin
| Family | Number of Enzymes Characterized | Activity |
|---|---|---|
| GH1 | 7 | β-glucosidase (7) |
| GH2 | 5 | β-mannosidase (2), chitosanase (1), exo-glucosaminidase (1), β-galactosidase (1) |
| GH3 | 30 | β-glucosidase (22), β-xylosidase (8) |
| GH5 | 45 | Endoglucanase (22), exo-1,3-β-glucanase (12), β-mannanase (8), galactanase (2), endo-1,6-β-glucanase (1) |
| GH6 | 12 | Cellobiohydrolase (11), endoglucanase (1) |
| GH7 | 29 | Cellobiohydrolase (18), endoglucanase (10), xylanase (1) |
| GH9 | 1 | Endoglucanase (1) |
| GH10 | 19 | Xylanase (19) |
| GH11 | 44 | Xylanase (44) |
| GH12 | 24 | Endoglucanase (20), xyloglucanase (3), licheninase (1) |
| GH13 | 10 | α-glucosidase (3), |
| GH15 | 14 | Glucoamylase (14) |
| GH16 | 5 | Mixed-link glucanase (3), laminarinase (1), licheninase (1) |
| GH17 | 2 | Laminarinase (1), exo-1,3-β-glucanase (1) |
| GH18 | 13 | Chitinase (13) |
| GH20 | 6 | Hexosaminidase (6) |
| GH26 | 3 | β-mannanase (3) |
| GH27 | 6 | α-galactosidase (6) |
| GH28 | 54 | Endo-polygalacturonase (40), exo-polygalacturonase (9), endo-rhamnogalacturonase (3), exo-rhamnogalacturonase (1), xylogalacturonase (1) |
| GH30 | 3 | Endo-1,6-β-glucanase (3) |
| GH31 | 10 | α-glucosidase (8), |
| GH32 | 22 | Invertase (10), exo-inulinase (7), endo-inulinase (4) |
| GH35 | 1 | β-galactosidase (1) |
| GH36 | 7 | |
| GH37 | 4 | Trehalase (4) |
| GH43 | 6 | Endo-1,5- |
| GH45 | 8 | Endoglucanase (8) |
| GH47 | 5 | |
| GH49 | 4 | Dextranase (3), isopullulanase (1) |
| GH51 | 5 | |
| GH53 | 6 | Arabinogalactanase (6) |
| GH54 | 9 | |
| GH55 | 4 | Exo-1,3-β-glucanase (3), laminarinase (1) |
| GH61 | 3 | Cellulase-enhancing protein (3) |
| GH62 | 2 | Arabinoxylan arabinofuranosidase (2) |
| GH65 | 2 | Trehalase (2) |
| GH67 | 4 | |
| GH71 | 3 | Mutanase (3) |
| GH74 | 6 | Xyloglucanase (3), oligoxyloglucan cellobiohydrolase (2), endoglucanase (1) |
| GH75 | 2 | Chitosanase (2) |
| GH78 | 3 | |
| GH81 | 2 | Laminarinase (2) |
| GH85 | 1 | |
| GH93 | 2 | Exo-arabinanase (2) |
The GH families and the total number of biochemically characterized enzymes collected for each are listed here. The GH families that did not have any characterized fungal enzymes were not included. The column titled ‘Activity’ shows the different types of activities the collected enzymes from each GH have. The numbers in brackets show the distribution of activity types in each family.
Figure 1.mycoCLAP search page. The main search page from mycoCLAP is shown here. Keywords such as enzyme activity, glycoside hydrolase family or a substrate name can be entered as search terms. Leaving the field blank and clicking on ‘search’ allows a user to browse the database. The information recorded during the curation process has been divided into six categories shown here; Enzyme Name, Biochemical Properties, Annotation, External Resources, Protein Features and Sequence. By checking boxes under these categories a user can determine which types of information will be displayed on the results page. The default settings are shown here. The tabs along the top allow quick and easy navigation through all of mycoCLAP’s features.
Figure 2.Gene page example. This screen shot illustrates the set-up and types of data available on mycoCLAP. This is part of the gene page for the glycoside hydrolase XYN10A_ASPNG (a family 10 xylanase from Aspergillus niger). The ‘Names and Origin’ section includes any names or abbreviations used to identify the enzyme in the literature or on other databases. A search by any of those names will deliver this enzyme as a hit. The next section contains biochemical properties extracted from the literature. This entry has the enzyme’s specific activity, pH optima and temperature optima on birch wood xylan when expressed from two different hosts. Other information recorded on gene pages includes nucleotide and amino acid sequences, protein domains, assay conditions, enzyme family, literature citations and other features recorded in the literature that make the entry unique.
Some properties of characterized fungal cellulases
| Activity | GH family | Total | Optimal pH | Optimal temperature (°C) | Temperature stability (°C) | Mass (kDa) |
|---|---|---|---|---|---|---|
| β | GH1 | 7 | 3.5–6.3 | 40–55 | 40–55 | 52–94 |
| GH3 | 22 | 3.5–8.0 | 37–72 | 30–70 | 74–145 | |
| GH6 | 11 | 4.8–9.0 | 40–50 | 30–60 | 40–60 | |
| GH7 | 18 | 5.0–6.0 | 35–65 | 50–55 | 47–90 | |
| GH5 | 22 | 3.5–8.5 | 40–75 | 37–80 | 35–56 | |
| GH6 | 1 | 5.5 | NA | NA | 38 | |
| GH7 | 10 | 4.0–5.5 | 452–57 | 40–50 | 46–56 | |
| GH9 | 1 | NA | NA | NA | 90 | |
| GH12 | 20 | 2.0–5.0 | 55–70 | 40–55 | 25–32 | |
| GH45 | 8 | 5.0–7.0 | 30–65 | 60 | 20–47 | |
| GH74 | 1 | 4.5 | 55 | 30 | 90 | |
| GH61 | 3 | 4.0–6.0 | NA | NA | 36–56 |
The three different types of cellulase activities; β-glucosidase, endoglucanase and cellobiohydrolase are listed here. Cellulose-enhancing proteins have been included as well. A range of the biochemical properties for each activity type is presented according to GH family. ‘NA’ indicates that the information was not available. The summarized data presented in the table were compiled from this study.