| Literature DB >> 25461894 |
Peter K Busk1, Mette Lange1, Bo Pilgaard1, Lene Lange1.
Abstract
The cellulose-degrading fungal enzymes are glycoside hydrolases of the GH families and lytic polysaccharide monooxygenases. The entanglement of glycoside hydrolase families and functions makes it difficult to predict the enzymatic activity of glycoside hydrolases based on their sequence. In the present study we further developed the method Peptide Pattern Recognition to an automatic approach not only to find all genes encoding glycoside hydrolases and lytic polysaccharide monooxygenases in fungal genomes but also to predict the function of the genes. The functional annotation is an important feature as it provides a direct route to predict function from primary sequence. Furthermore, we used Peptide Pattern Recognition to compare the cellulose-degrading enzyme activities encoded by 39 fungal genomes. The results indicated that cellobiohydrolases and AA9 lytic polysaccharide monooxygenases are hallmarks of cellulose-degrading fungi except brown rot fungi. Furthermore, a high number of AA9, endocellulase and β-glucosidase genes were identified, not in what are known to be the strongest, specialized lignocellulose degraders but in saprophytic fungi that can use a wide variety of substrates whereas only few of these genes were found in fungi that have a limited number of natural, lignocellulotic substrates. This correlation suggests that enzymes with different properties are necessary for degradation of cellulose in different complex substrates. Interestingly, clustering of the fungi based on their predicted enzymes indicated that Ascomycota and Basidiomycota use the same enzymatic activities to degrade plant cell walls.Entities:
Mesh:
Substances:
Year: 2014 PMID: 25461894 PMCID: PMC4252092 DOI: 10.1371/journal.pone.0114138
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Flow scheme for the Hotpep analysis.
Blue lines represent DNA sequences. Red arrows represent protein sequences. Short purple lines represent short peptides. See “Materials and Methods” for details.
CAZymes in fungal genomes found by PPR and annotated in CAZy.
| Species | PPR genes | CAZy genes | Ratio (PPR/CAZy) |
|
| 44 | 42 | 1.05 |
|
| 254 | 260 | 0.98 |
|
| 203 | 216 | 0.94 |
|
| 214 | 227 | 0.94 |
|
| 241 | 251 | 0.96 |
|
| 187 | 186 | 1.01 |
| Total | 1143 | 1182 | 0.97 |
Comparison of GH43 genes found by PPR and annotated in CAZy.
| CAZy annotated genes found by: |
|
|
|
|
| Total |
| PPR in genomes | 11 | 11 | 8 | 17 | 5 | 52 |
| PPR in full-length protein sequences | 14 | 12 | 9 | 18 | 7 | 60 |
| Not found by PPR | 1 | 1 | 0 | 1 | 0 | 3 |
| Found by PPR but not annotated in CAZy | 2 | 0 | 0 | 0 | 0 | 2 |
| All domains found by PPR | 15 | 12 | 9 | 17 | 7 | 60 |
| All domains annotated in CAZy | 14 | 13 | 9 | 18 | 7 | 61 |
Two CAZy hits to one PPR hit.
List of fungi whose genomes were mined for GH- and LPMO-encoding genes.
| Fungus | Division | Description | Cellulose degrader |
|
| Asco | Soil and skin | No |
|
| Asco | Soil and skin | No |
|
| Asco | Environmental saprophyte | No |
|
| Asco | Soil-borne pathogen | No |
|
| Asco | Baker's yeast | No |
|
| Asco | Plants, animals, waste water | No |
|
| Asco | Gut flora, pathogen | No |
|
| Asco | Plant pathogen | No |
|
| Basidio | Ectomycorrhizal and saprobic | No |
|
| Basidio | Plant pathogen | No |
|
| Basidio | Soil-borne pathogen | No |
|
| Chytridio | Soil | No |
|
| Chytridio | Amphibians | No |
|
| Chytridio | Leaf litter | No |
|
| Chytridio | Humid soil | No |
|
| Zygo | dead organic matter/pathogen | No |
|
| Zygo | Municipal compost | No |
|
| Asco | Saprophyte | Yes |
|
| Asco | Saprophyte | Yes |
|
| Asco | Saprophyte | Yes |
|
| Asco | Saprophyte | Yes |
|
| Asco | Cell-wall degrading plant pathogen | Yes |
|
| Asco | Saprobic wood-degrading | Yes |
|
| Asco | Soft rot | Yes |
|
| Asco | Soft rot | Yes |
|
| Asco | Soft rot | Yes |
|
| Asco | Soft rot | Yes |
|
| Asco | Soft rot, soil | Yes |
|
| Asco | Soft rot, soil | Yes |
|
| Asco | Wood-degrading | Yes |
|
| Basidio | Brown rot | Yes |
|
| Basidio | Brown rot | Yes |
|
| Basidio | Endophyte | Yes |
|
| Basidio | Saprophyte | Yes |
|
| Basidio | White rot | Yes |
|
| Basidio | White rot | Yes |
|
| Basidio | White rot | Yes |
|
| Basidio | White rot | Yes |
|
|
|
|
|
Figure 2Number of GH- and LPMO-encoding genes and functionally predicted genes in the 39 fungi.
All GH and LPMOs (CAZy auxiliary activity families AA9, AA10 and AA11) were annotated in the genome sequences of the 39 fungi. Next, as many as possible of these genes were assigned an EC number as described in “Materials and Methods”.
Figure 3Cluster analysis based on GH- and LPMO-encoding genes of the cellulose-degrading and non-cellulose-degrading fungi.
A) Cluster analysis based on the number and types of GH-families and AA families 9–11. B) Cluster analysis based on EC numbers for the GH genes an on the AA families 9 and 11. Cluster analysis was performed as described in “Materials and Methods”. Light blue is used as background color for non-cellulose-degraders, brown for brown rot and light red for other cellulose-degraders.
Figure 4Average number of genes encoding each cellulose-degrading enzyme in the fungi.
The average number of genes predicted to encode each of the activities endoglucanase (EC 3.2.1.4), β-glucosidase (EC 3.2.1.21) and cellobiohydrolases (EC 3.2.1.91, non-reducing end and EC 3.2.1.176, reducing end) and the LPMOs (AA9 and AA11) in the cellulose-degraders and non-cellulose degraders.
Figure 5Distribution of endoglucanases and β-glucosidases in many different GH families.
A) Average number of endoglucanase-genes in each GH families per fungal genome. B) Average number of β-glucosidase-genes in GH families per fungal genome. The GH families of each functionally annotated endoglucanase (EC 3.2.1.4) and β-glucosidase (EC 3.2.1.21) was counted.
Figure 6Average number of genes encoding glycoside hydrolase and LPMO enzymes in the 39 fungi.
The average number of genes predicted to encode cellulose-degrading activities (EC 3.2.1.4, 3.2.1.21, EC 3.2.1.91 and EC 3.2.1.176 and LPMOs), hemicellulose-degrading activities (EC 3.2.1.37, 3.2.1.151, 3.2.1.8, 3.2.1.55, 3.2.1.23, 3.2.1.59, 3.2.1.131, 3.2.1.177, 3.2.1.78, 3.2.1.67, 2.4.1.183, 3.2.1.25, 3.2.1.31, 3.2.1.15, 3.2.1.39, 3.2.1.58, 3.2.1.63, 3.2.1.22, 3.2.1.99, 3.2.1.75, 3.2.1.52, 3.2.1.6, 3.2.1.106, 3.2.1.51, 3.2.1.113, 2.4.1.18, 2.4.1.25, 3.2.1.28, 3.2.1.45, 3.2.1.18, 3.2.1.3, 3.2.1.26, 3.2.1.24, 2.4.1.20, 3.2.1.40 and 3.2.1.171) or other activities (EC 3.2.1.1, 3.2.1.20, 3.2.1.10, 3.2.1.165, 3.2.1.132, 3.2.1.14, 3.2.1.145, 3.2.1.89 and 3.2.1.164) in the cellulose-degraders and non-cellulose degraders. The division of enzyme activities (EC numbers) on cellulose-degradation, hemicellulose-degradation or other was described previously [2].