| Literature DB >> 33799907 |
Lene Lange1, Kristian Barrett2, Anne S Meyer2.
Abstract
Fungal genome sequencing data represent an enormous pool of information for enzyme discovery. Here, we report a new approach to identify and quantitatively compare biomass-degrading capacity and diversity of fungal genomes via integrated function-family annotation of carbohydrate-active enzymes (CAZymes) encoded by the genomes. Based on analyses of 1932 fungal genomes the most potent hotspots of fungal biomass processing CAZymes are identified and ranked according to substrate degradation capacity. The analysis is achieved by a new bioinformatics approach, Conserved Unique Peptide Patterns (CUPP), providing for CAZyme-family annotation and robust prediction of molecular function followed by conversion of the CUPP output to lists of integrated "Function;Family" (e.g., EC 3.2.1.4;GH5) enzyme observations. An EC-function found in several protein families counts as different observations. Summing up such observations allows for ranking of all analyzed genome sequenced fungal species according to richness in CAZyme function diversity and degrading capacity. Identifying fungal CAZyme hotspots provides for identification of fungal species richest in cellulolytic, xylanolytic, pectinolytic, and lignin modifying enzymes. The fungal enzyme hotspots are found in fungi having very different lifestyle, ecology, physiology and substrate/host affinity. Surprisingly, most CAZyme hotspots are found in enzymatically understudied and unexploited species. In contrast, the most well-known fungal enzyme producers, from where many industrially exploited enzymes are derived, are ranking unexpectedly low. The results contribute to elucidating the evolution of fungal substrate-digestive CAZyme profiles, ecophysiology, and habitat adaptations, and expand the knowledge base for novel and improved biomass resource utilization.Entities:
Keywords: CUPP; eco-physiological enzyme hotspot; enzyme function specificity diversity; enzyme profiles; peptide-based functional annotation; taxonomic enzyme hotspots
Year: 2021 PMID: 33799907 PMCID: PMC8000046 DOI: 10.3390/jof7030207
Source DB: PubMed Journal: J Fungi (Basel) ISSN: 2309-608X
Figure 1Flow diagram describing the steps from downloading the full fungal genomes from NCBI and prediction of genomic proteins using AUGUSTUS. The carbohydrate-active enzymes were assigned a CAZy family and molecular function prediction via CUPP (see text). The predicted Function;Family observations were then assigned to their respective target substrates. The * indication added as fourht number in some EC numbers (molecular function numbers) indicate that no full EC number is available in the CAZy database. A similar molecular function (same EC number) from two different CAZy families are considered as two different observations by combining the EC number and the CAZy family into a combined string.
Listing of the ten top-ranked species with regard to biomass degrading capacity, (all observations included), specifying the number of “Function;Family” observations. The table includes ranking according to “Total” (A), followed by ranking according to enzyme observations with affinity for each of the target substrates, Cellulose (B), Xylan (C), Pectin (D) and Lignin (E). The total list of observations for the analyzed 1932 genomes is given in Table S1.
| A Ranking: Total | Taxonomy | Target Substrate of Encoded CAZymes | |||||
|---|---|---|---|---|---|---|---|
| Species | Class | Phylum | Cellulose | Pectin | Xylan | Lignin | Total |
|
| Neocallimastigomycetes | Chytridiomycota | 248 | 85 | 208 | 0 | 541 |
|
| Neocallimastigomycetes | Chytridiomycota | 232 | 122 | 172 | 0 | 526 |
|
| Agaricomycetes | Basidiomycota | 91 | 204 | 50 | 149 | 494 |
|
| Sordariomycetes | Ascomycota | 139 | 176 | 74 | 95 | 484 |
| Sordariomycetes | Ascomycota | 117 | 102 | 108 | 98 | 425 | |
|
| Sordariomycetes | Ascomycota | 106 | 163 | 63 | 79 | 411 |
| Leotiomycetes | Ascomycota | 105 | 138 | 75 | 91 | 409 | |
|
| Sordariomycetes | Ascomycota | 116 | 129 | 58 | 97 | 400 |
|
| Sordariomycetes | Ascomycota | 111 | 128 | 56 | 90 | 385 |
| Sordariomycetes | Ascomycota | 111 | 118 | 58 | 94 | 381 | |
|
|
|
| |||||
|
|
|
|
|
|
|
|
|
|
| Neocallimastigomycetes | Chytridiomycota | 248 | 85 | 208 | 0 | 541 |
|
| Neocallimastigomycetes | Chytridiomycota | 232 | 122 | 172 | 0 | 526 |
|
| Sordariomycetes | Ascomycota | 139 | 176 | 74 | 95 | 484 |
| Neocallimastigomycetes | Chytridiomycota | 128 | 47 | 117 | 0 | 292 | |
| Sordariomycetes | Ascomycota | 117 | 102 | 108 | 98 | 425 | |
|
| Sordariomycetes | Ascomycota | 116 | 129 | 58 | 97 | 400 |
|
| Sordariomycetes | Ascomycota | 111 | 128 | 56 | 90 | 385 |
| Sordariomycetes | Ascomycota | 111 | 118 | 58 | 94 | 381 | |
|
| Sordariomycetes | Ascomycota | 107 | 118 | 58 | 93 | 376 |
|
| Sordariomycetes | Ascomycota | 106 | 163 | 63 | 79 | 411 |
|
|
|
| |||||
|
|
|
|
|
|
|
|
|
|
| Neocallimastigomycetes | Chytridiomycota | 248 | 85 | 208 | 0 | 541 |
|
| Neocallimastigomycetes | Chytridiomycota | 232 | 122 | 172 | 0 | 526 |
| Neocallimastigomycetes | Chytridiomycota | 128 | 47 | 117 | 0 | 292 | |
| Sordariomycetes | Ascomycota | 117 | 102 | 108 | 98 | 425 | |
| Leotiomycetes | Ascomycota | 105 | 138 | 75 | 91 | 409 | |
|
| Sordariomycetes | Ascomycota | 139 | 176 | 74 | 95 | 484 |
|
| Dothideomycetes | Ascomycota | 68 | 32 | 72 | 56 | 228 |
|
| Neocallimastigo | Chytridiomycota | 91 | 30 | 71 | 0 | 192 |
|
| Agaricomycetes | Basidiomycota | 82 | 61 | 70 | 114 | 327 |
|
| Leotiomycetes | Ascomycota | 94 | 82 | 69 | 79 | 324 |
|
|
|
| |||||
|
|
|
|
|
|
|
|
|
|
| Agaricomycetes | Basidiomycota | 91 | 204 | 50 | 149 | 494 |
|
| Sordariomycetes | Ascomycota | 139 | 176 | 74 | 95 | 484 |
|
| Sordariomycetes | Ascomycota | 106 | 163 | 63 | 79 | 411 |
|
| Sordariomycetes | Ascomycota | 90 | 150 | 59 | 72 | 371 |
|
| Sordariomycetes | Ascomycota | 90 | 139 | 65 | 77 | 371 |
| Sordariomycetes | Ascomycota | 90 | 139 | 63 | 76 | 368 | |
|
| Sordariomycetes | Ascomycota | 90 | 139 | 57 | 71 | 357 |
|
| Sordariomycetes | Ascomycota | 89 | 139 | 63 | 77 | 368 |
| Leotiomycetes | Ascomycota | 105 | 138 | 75 | 91 | 409 | |
|
| Eurotiomycetes | Ascomycota | 95 | 137 | 53 | 58 | 343 |
|
|
|
| |||||
|
|
|
|
|
|
|
|
|
|
| Agaricomycetes | Basidiomycota | 91 | 204 | 50 | 149 | 494 |
|
| Agaricomycetes | Basidiomycota | 82 | 61 | 70 | 114 | 327 |
|
| Agaricomycetes | Basidiomycota | 77 | 70 | 44 | 104 | 295 |
|
| Agaricomycetes | Basidiomycota | 77 | 36 | 44 | 104 | 261 |
|
| Agaricomycetes | Basidiomycota | 58 | 18 | 33 | 102 | 211 |
| Sordariomycetes | Ascomycota | 117 | 102 | 108 | 98 | 425 | |
| Agaricomycetes | Basidiomycota | 62 | 31 | 34 | 98 | 225 | |
|
| Sordariomycetes | Ascomycota | 116 | 129 | 58 | 97 | 400 |
|
| Agaricomycetes | Basidiomycota | 58 | 55 | 34 | 96 | 243 |
|
| Sordariomycetes | Ascomycota | 139 | 176 | 74 | 95 | 484 |
Figure 2Attack of American Coffee Spot, by Mycena citricolor. (a) on coffee leaves, leading to total degradation of attacked areas, forming conspicuous holes in the leaves. (b) Minute fruiting bodies (caps) of M. citricolor invading coffee berries. The berry is strongly affected by the fungal infection, causing brown necrotic tissue. The entire infected area is sunken in, flattening the shape of the infected coffee fruit. In short, the figure hints the effect of a plant pathogen with a powerful, pectinolytic as well as ligninolytic secretome of biomass degrading enzymes. Photos: Andrew Dominick. University of Wisconsin-La Crosse and Direcction General de Sanidad Vegetal, Sagarpa (Secretaria de Agricultura Ganaderia Desarrollo Rural Pesca y Alimentacion), Mexico.
Listing of the ten top-ranked species with regard to enzyme function specificity diversity (including only unique observations), specifying the number of “Function;Family” observations. The table includes ranking according to “Total” (A), followed by ranking according to enzyme observations with affinity for each of the target substrates, Cellulose (B), Xylan (C), Pectin (D) and lignin (E). (*) indicates more species with same number of observations. The total list of observations for the analyzed 1932 genomes is given in Table S1.
| A Ranked: Total | Taxonomy | Target Substrate of Encoded CAZymes | |||||
|---|---|---|---|---|---|---|---|
| Species | Class | Phylum | Cellulose | Pectin | Xylan | Lignin | Total |
| Sordariomycetes | Ascomycota | 17 | 53 | 24 | 18 | 112 | |
|
| Sordariomycetes | Ascomycota | 16 | 53 | 25 | 18 | 112 |
|
| Sordariomycetes | Ascomycota | 16 | 53 | 25 | 18 | 112 |
|
| Sordariomycetes | Ascomycota | 16 | 53 | 24 | 18 | 111 |
| Sordariomycetes | Ascomycota | 16 | 53 | 24 | 18 | 111 | |
|
| Sordariomycetes | Ascomycota | 16 | 53 | 23 | 18 | 110 |
|
| Sordariomycetes | Ascomycota | 18 | 49 | 25 | 17 | 109 * |
|
| Sordariomycetes | Ascomycota | 16 | 51 | 23 | 19 | 109 * |
|
| Sordariomycetes | Ascomycota | 15 | 52 | 24 | 18 | 109 * |
|
| Sordariomycetes | Ascomycota | 15 | 52 | 24 | 18 | 109 * |
|
|
|
| |||||
|
|
|
|
|
|
|
|
|
|
| Sordariomycetes | Ascomycota | 18 | 49 | 25 | 17 | 109 |
|
| Sordariomycetes | Ascomycota | 18 | 40 | 24 | 15 | 97 |
|
| Leotiomycetes | Ascomycota | 18 | 43 | 22 | 19 | 102 |
|
| Dothideomycetes | Ascomycota | 18 | 41 | 22 | 16 | 97 |
|
| Sordariomycetes | Ascomycota | 18 | 41 | 21 | 17 | 97 |
|
| Sordariomycetes | Ascomycota | 18 | 35 | 19 | 18 | 90 |
|
| Sordariomycetes | Ascomycota | 17 * | 43 | 25 | 17 | 102 |
| Sordariomycetes | Ascomycota | 17 * | 53 | 24 | 18 | 112 | |
|
| Dothideomycetes | Ascomycota | 17 * | 41 | 24 | 17 | 99 |
|
| Dothideomycetes | Ascomycota | 17 * | 46 | 23 | 18 | 104 |
|
|
|
| |||||
|
|
|
|
|
|
|
|
|
| Leotiomycetes | Ascomycota | 16 | 45 | 26 | 16 | 103 | |
|
| Sordariomycetes | Ascomycota | 18 | 49 | 25 | 17 | 109 |
|
| Sordariomycetes | Ascomycota | 17 | 43 | 25 | 17 | 102 |
|
| Sordariomycetes | Ascomycota | 16 | 53 | 25 | 18 | 112 |
|
| Sordariomycetes | Ascomycota | 16 | 53 | 25 | 18 | 112 |
|
| Sordariomycetes | Ascomycota | 16 | 45 | 25 | 15 | 101 |
| Dothideomycetes | Ascomycota | 16 | 39 | 25 | 17 | 97 | |
|
| Sordariomycetes | Ascomycota | 18 | 40 | 24 * | 15 | 97 |
| Sordariomycetes | Ascomycota | 17 | 53 | 24 * | 18 | 112 | |
|
| Dothideomycetes | Ascomycota | 17 | 41 | 24 * | 17 | 99 |
|
|
|
| |||||
|
|
|
|
|
|
|
|
|
|
| Sordariomycetes | Ascomycota | 16 | 53 | 25 | 18 | 112 |
|
| Sordariomycetes | Ascomycota | 16 | 53 | 25 | 18 | 112 |
| Sordariomycetes | Ascomycota | 17 | 53 | 24 | 18 | 112 | |
|
| Sordariomycetes | Ascomycota | 16 | 53 | 24 | 18 | 111 |
| Sordariomycetes | Ascomycota | 16 | 53 | 24 | 18 | 111 | |
|
| Sordariomycetes | Ascomycota | 16 | 53 | 23 | 18 | 110 |
|
| Sordariomycetes | Ascomycota | 15 | 52 | 24 | 18 | 109 |
|
| Sordariomycetes | Ascomycota | 15 | 52 | 24 | 18 | 109 |
|
| Sordariomycetes | Ascomycota | 16 | 51 * | 23 | 19 | 109 |
|
| Sordariomycetes | Ascomycota | 16 | 51 * | 23 | 18 | 108 |
|
|
|
| |||||
|
|
|
|
|
|
|
|
|
|
| Sordariomycetes | Ascomycota | 16 | 51 | 23 | 19 * | 109 |
|
| Sordariomycetes | Ascomycota | 16 | 47 | 22 | 19 * | 104 |
|
| Sordariomycetes | Ascomycota | 16 | 44 | 22 | 19 * | 101 |
|
| Leotiomycetes | Ascomycota | 18 | 43 | 22 | 19 * | 102 |
|
| Sordariomycetes | Ascomycota | 15 | 42 | 21 | 19 * | 97 |
|
| Sordariomycetes | Ascomycota | 16 | 39 | 21 | 19 * | 95 |
|
| Agaricomycetes | Basidiomycota | 16 | 23 | 21 | 19 * | 79 |
|
| Leotiomycetes | Ascomycota | 16 | 42 | 20 | 19 * | 97 |
|
| Leotiomycetes | Ascomycota | 16 | 40 | 20 | 19 * | 95 |
|
| Agaricomycetes | Basidiomycota | 17 | 25 | 20 | 19 * | 81 |
Enzyme profile of the 10 species with highest “Pectin+Lignin” biomass degrading capacity. Number of Pectin plus lignin observations is listed in right hand column. One basidiomycete (M. citricolor) and one Ascomycete (V. longisporum) stands out to be unique among all 1932 species analyzed. The heatmap shows the highly varying enzyme profile found in the species with highest total degrading capacity on pectin and lignin.
| Species | Class | Phylum | Cellulose | Pectin | Xylan | Lignin | Total | Pectin + Lignin |
|---|---|---|---|---|---|---|---|---|
|
| Agaricomycetes | Basidiomycota | 91 | 204 | 50 | 149 | 494 | 353 |
|
| Sordariomycetes | Ascomycota | 139 | 176 | 74 | 95 | 484 | 271 |
|
| Sordariomycetes | Ascomycota | 106 | 163 | 63 | 79 | 411 | 242 |
| Leotiomycetes | Ascomycota | 105 | 138 | 75 | 91 | 409 | 229 | |
|
| Sordariomycetes | Ascomycota | 116 | 129 | 58 | 97 | 400 | 226 |
|
| Sordariomycetes | Ascomycota | 90 | 150 | 59 | 72 | 371 | 222 |
|
| Leotiomycetes | Ascomycota | 92 | 124 | 60 | 95 | 371 | 219 |
|
| Sordariomycetes | Ascomycota | 111 | 128 | 56 | 90 | 385 | 218 |
|
| Sordariomycetes | Ascomycota | 90 | 139 | 65 | 77 | 371 | 217 |
|
| Sordariomycetes | Ascomycota | 89 | 139 | 63 | 77 | 368 | 217 |
Left column is names of top scoring sets of species with total-ranked observations (including redundant observations); the scores are given in second column. The third column from the left is the number of unique “Function;Family” observations of the same sets of degrading capacity Top ranking species. The right-hand column is the ratio, Redundant over Unique observations, named as Redundancy Multiplication Score. * marks the species/substrate with the strongest (>8) Redundancy Multiplication factor; ¤ marks the species with the lowest (<3) Redundancy Multiplication Score.
| All Observations | Only Unique Observations | Redundancy Multiplication Score | |||
|---|---|---|---|---|---|
|
| |||||
|
| 208 | 16 | 13.0 * | ||
|
| 172 | 15 | 11.5 * | ||
|
| 117 | 15 | 7.8 | ||
|
| 108 | 22 | 4.9 | ||
| 75 | 26 | 2.9 | |||
|
| 74 | 18 | 4.1 | ||
| 72 | 18 | 4.0 | |||
|
| 71 | 10 | 7.1 | ||
|
| 70 | 20 | 3.5 | ||
| 69 | 23 | 3.0 | |||
|
| |||||
|
| 204 | 24 | 8.5 * | ||
|
| 176 | 37 | 4.8 | ||
|
| 163 | 49 | 3.3 | ||
| 150 | 45 | 3.3 | |||
| 139 | 51 | 2.7¤ | |||
|
| 139 | 53 | 2.6¤ | ||
|
| 139 | 49 | 2.8¤ | ||
| 139 | 53 | 2.6¤ | |||
|
| 138 | 45 | 3.1 | ||
|
| 137 | 39 | 3.5 | ||
|
| |||||
|
| 149 | 18 | 8.3 * | ||
|
| 114 | 16 | 7.1 | ||
| 104 | 17 | 6.1 | |||
| 104 | 15 | 6.9 | |||
| 102 | 16 | 6.4 | |||
|
| 98 | 13 | 7.5 | ||
|
| 98 | 14 | 7.0 | ||
|
| 97 | 18 | 5.4 | ||
|
| 96 | 19 | 5.1 | ||
|
| 95 | 18 | 5.3 | ||
|
| |||||
|
| 204 | 24 | 8.5 * | ||
|
| 176 | 37 | 4.8 | ||
|
| 163 | 49 | 3.3 | ||
|
| 150 | 45 | 3.3 | ||
|
| 139 | 51 | 2.7¤ | ||
| 139 | 53 | 2.6¤ | |||
|
| 139 | 49 | 2.8¤ | ||
|
| 139 | 53 | 2.6¤ | ||
| 138 | 45 | 3.1 | |||
|
| 137 | 39 | 3.5 | ||
|
| |||||
|
| 149 | 18 | 8.3 * | ||
|
| 114 | 16 | 7.1 | ||
|
| 104 | 17 | 6.1 | ||
|
| 104 | 15 | 6.9 | ||
|
| 102 | 16 | 6.4 | ||
| 98 | 13 | 7.5 | |||
| 98 | 14 | 7.0 | |||
|
| 97 | 18 | 5.4 | ||
|
| 96 | 19 | 5.1 | ||
|
| 95 | 18 | 5.3 | ||
Figure 3Pie chart representation of representation (in number of species) of fungal phyla in the upper most part of the list of Total biomass-degrading capacity: (a): 1–500, (b): 500–1000 and (c): 1000–1932. Ascomycota species dominates from 1–1000; Basidiomycota grow from 22 to 86 to 335 from upper part to lower part of the ranking list. All 8 fungal phyla are represented in the second half of the ranked listing of in all 1932 species, although some are in such low numbers that values are not visible in the charts, so numbers have been added with the phylum name. Sequencing projects are still biased towards over-representation of Ascomycota and under-representation of especially Chytridiomycota, and Blastocladiomycota.
Overview of the fungal species, which on two or more substrates rank among Top10 of all 1932 fungal genomes analyzed; Total carbohydrate biomass degrading capacity: According to total degrading capacity, including all observations;Function specificity diversity: according to richness in function specificity diversity of CAZymes. Two species stands out, V. longisporum in total degrading capacity on all four substrates; and Colletotrichum COLG25 in enzyme function specificity diversity.
| Total Carbohydrate Biomass Degrading Capacity | |||||
|---|---|---|---|---|---|
| Target Substrate of Encoded CAZymes | |||||
|
| Ascomycota | Cellulose | Xylan | Lignin | Pectin |
| Ascomycota | Cellulose | Xylan | Lignin | ||
|
| Neocallimastigo | Cellulose | Xylan | ||
|
| Neocallimastigo | Cellulose | Xylan | ||
|
| Basidiomycota | Pectin | Lignin | ||
|
| Ascomycota | Cellulose | Xylan | ||
| Ascomycota | Xylan | Pectin | |||
|
| Ascomycota | Cellulose | Lignin | ||
|
| Basidiomycota | Xylan | Lignin | ||
|
| |||||
|
| |||||
|
| Ascomycota | Cellulose | Xylan | Pectin | |
|
| Ascomycota | Cellulose | Xylan | ||
|
| Ascomycota | Cellulose | Xylan | ||
|
| Ascomycota | Pectin | Lignin | ||
|
| Ascomycota | Xylan | Lignin | ||
|
| Ascomycota | Xylan | Pectin | ||
|
| Ascomycota | Cellulose | Lignin | ||
|
| Ascomycota | Cellulose | Xylan | ||
|
| Ascomycota | Cellulose | Xylan | ||