| Literature DB >> 28198423 |
Salvatore Montella1, Valeria Ventorino2, Vincent Lombard3,4, Bernard Henrissat3,4,5, Olimpia Pepe2, Vincenza Faraco1.
Abstract
In this study, a high-throughput sequencing approach was applied to discover novel biocatalysts for lignocellulose hydrolysis from three dedicated energy crops, Arundo donax, Eucalyptus camaldulensis and Populus nigra, after natural biodegradation. The microbiomes of the three lignocellulosic biomasses were dominated by bacterial species (approximately 90%) with the highest representation by the Streptomyces genus both in the total microbial community composition and in the microbial diversity related to GH families of predicted ORFs. Moreover, the functional clustering of the predicted ORFs showed a prevalence of poorly characterized genes, suggesting these lignocellulosic biomasses are potential sources of as yet unknown genes. 1.2%, 0.6% and 3.4% of the total ORFs detected in A. donax, E. camaldulensis and P. nigra, respectively, were putative Carbohydrate-Active Enzymes (CAZymes). Interestingly, the glycoside hydrolases abundance in P. nigra (1.8%) was higher than that detected in the other biomasses investigated in this study. Moreover, a high percentage of (hemi)cellulases with different activities and accessory enzymes (mannanases, polygalacturonases and feruloyl esterases) was detected, confirming that the three analyzed samples were a reservoir of diversified biocatalysts required for an effective lignocellulose saccharification.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28198423 PMCID: PMC5309792 DOI: 10.1038/srep42623
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Quality and statistical summary of sequencing and assembling.
| Parameter | Biomass sample | ||
|---|---|---|---|
| Total reads (bp) | 11,208,388,400 | 11,274,127,600 | 2,392,000 |
| High quality reads (bp) | 10,010,000,000 (89%) | 10,010,000,000 (88%) | 2,100,000 (88%) |
| Number of contigs | 95,292 | 159,184 | 33,805 |
| Length of contigs (bp) | 111,530,551 | 143,424,210 | 40,937,098 |
| N50 contig length (bp) | 1,326 | 914 | 1,452 |
| N90 contig length (bp) | 574 | 546 | 583 |
| Largest contig (bp) | 49,245 | 650,642 | 85,030 |
| Shortest contig (bp) | 500 | 500 | 500 |
| Mapping | |||
| PE | 6,497,524 | 7,683,451 | 2,852,867 |
| SE | 1,790,597 | 2,097,376 | 1,253,515 |
| Total (%) | 14.77 | 17.45 | 33.14 |
Relative abundance of dominant taxa at the phylum and class rank mapping the high quality reads to the NT database (NCBI).
| Arundo donax | Eucalyptus camaldulensis | Populus nigra | ||||
|---|---|---|---|---|---|---|
| Phylum (%) | Class (%) | Phylum (%) | Class (%) | Phylum (%) | Class (%) | |
| Bacteria | Proteobacteria (47.4%) | γ-Proteobacteria (27.1%) | Proteobacteria (29.6%) | γ-Proteobacteria (17.4%) | Proteobacteria (29.6%) | γ-Proteobacteria (25.2%) |
| α-Proteobacteria (13.5%) | α-Proteobacteria (9.0%) | α-Proteobacteria (3.2%) | ||||
| β-Proteobacteria (4.0%) | β-Proteobacteria (2.9%) | β-Proteobacteria (1.0%) | ||||
| δ-Proteobacteria (2.4%) | ||||||
| Actinobacteria (40.1%) | Actinobacteria (40.1%) | Actinobacteria (57.6%) | Actinobacteria (57.6%) | Actinobacteria (59.8%) | Actinobacteria (59.8%) | |
| Bacteroidetes (1.0%) | Firmicutes (10.4%) | Bacilli (10.3%) | ||||
| Fungi | Ascomycota (2.7%) | Sordariomycetes (1.6%) | Ascomycota (5.3%) | Eurotiomycetes (3.4%) | ||
| Sordariomycetes (1.1%) | ||||||
Only taxa with an incidence ≥1% in each sample are shown.
Figure 1Abundance of bacterial and fungal genera in A. donax, E. camaldulensis and P. nigra lignocellulosic biomass.
Only taxa with an incidence ≥1% in each sample are shown. Other bacteria and other fungi represent the aggregate of other bacterial and fungal genera, respectively; not assigned means that these reads cannot be annotated at the genus level.
Figure 2Relative abundance of eggNOG categories related to the predicted ORFs from T3ADSB, T3ESB and T3PSB sample.
Figure 3KEGG pathway classification of the predicted ORFs from T3ADSB, T3ESB and T3PSB samples.
CAZYmes classification of predicted ORFs from T3ADSB, T3ESB and T3PSB sample.
| CAZymes classification | T3ADSB | T3ESB | T3PSB | |||
|---|---|---|---|---|---|---|
| # ORFs | % | # ORFs | % | # ORFs | % | |
| Auxiliary Activities enzymes (AAs) | 76 | 4.2% | 23 | 1.8% | 45 | 2.1% |
| Carbohydrate-binding modules (CBMs) | 94 | 5.2% | 148 | 11.6% | 285 | 13.5% |
| Carbohydrate Esterases (CEs) | 110 | 6.1% | 68 | 5.3% | 159 | 7.5% |
| Glycoside Hydrolases (GHs) | 1059 | 59.1% | 750 | 58.6% | 1136 | 53.8% |
| Glycosiltransferases (GTs) | 460 | 25.7% | 320 | 25.0% | 555 | 26.3% |
| Polysaccharide Lyases (PLs) | 24 | 1.3% | 37 | 2.9% | 48 | 2.3% |
| Total CAZYmes | 1792 | 1279 | 2113 | |||
*The total numbers of CAZYmes is less than the sum (AAs + CBMs + CEs + GHs + GTs + PLs) due to the fact that some multimodular predicted proteins were detected.
Figure 4GH family percentage of predicted ORFs from T3ADSB (A), T3ESB (B) and T3PSB (C) samples. The GHs with more than 1% of abundance are reported.
Comparison of GH family percentage of predicted ORFs from T3ADSB, T3ESB and T3PSB sample.
| CAZy Family | Main Known activities | T3ADSB | T3ESB | T3PSB |
|---|---|---|---|---|
| GH1 | β-glucosidases, β-galactosidases, 6-phospho-β-glucosidase and 6-phospho-β-galactosidase, β-mannosidase, β-D-fucosidase and β-glucuronidase | 2.50% | 2.92% | 4.10% |
| GH2 | β-galactosidases, β-glucuronidases, β-mannosidases, exo-β-glucosaminidases | 5.68% | 3.06% | 3.17% |
| GH3 | exo-acting β-D-glucosidases, α-L-arabinofuranosidases, β-D-xylopyranosidases and N-acetyl-β-D-glucosaminidases | 6.64% | 5.43% | 6.24% |
| GH16 | Xyloglucosyltransferase, keratan-sulfate endo-1,4-β-galactosidase, endo-1,3-β-glucanase, endo-1,3(4)-β-glucanase, licheninase, β-agarase, κ-carrageenase, xyloglucanase, endo-β-1,3-galactanase, β-porphyranase, hyaluronidase, endo-β-1,4-galactosidase, chitin β-1,6-glucanosyltransferase, endo-β-1,4-galactosidase | 3.66% | 2.65% | 1.21% |
| GH18 | chitinases and endo-β-N-acetylglucosaminidases | 3.85% | 4.04% | 2.98% |
| GH23 | muramidase, peptidoglycan N-acetylmuramoylhydrolase, 1,4-β-N-acetylmuramidase and N-acetylmuramoylhydrolase | 2.02% | 3.34% | 4.10% |
| GH43 | α-L-arabinofuranosidases, endo-α-L-arabinanases (or endo-processive arabinanases) and β-D-xylosidases | 5.29% | 3.90% | 4.29% |
| GH92 | exo-acting α-mannosidases | 5.20% | 6.41% | 4.57% |
The GHs with an incidence >1% in each sample are shown are reported.
Figure 5Percentage composition of bacterial (A) and fungal (B) genera related to GH families of predicted ORFs in A. donax, E. camaldulensis and P. nigra biomass. Only taxa with an incidence ≥1% in each sample are shown. Others represent the aggregate of other bacterial (A) and fungal (B) genera.
Figure 6KEGG pathway classification for the putative genes coding for Enzyme Commission (EC) number activities related to the hydrolysis of glycosidic bonds from T3ADSB, T3ESB and T3PSB samples.
Comparison of plant polysaccharides hydrolyzing enzymes in our samples T3ADSB, T3ESB and T3PSB and in samples with the highest lignocellulose-degrading potentiality.
| CAZy family | Main known activity | Pfam domain | Metagenomic samples | |||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| T3ADSB | T3ESB | T3PSB | Six years old elephant feces (Ilmberger | Yak rumen (Dai | Snail crop (Cardoso | Cow rumen (Hess | Bovine rumen(Brulc | Macropod gut (Pope | Termite hindgut (Warnecke | |||||||||||||
| Endo-Celluases | ||||||||||||||||||||||
| GH5 | cellulase | PF00150 | 40 | 3.8% | 15 | 2.0% | 23 | 2.0% | 517 | 4.7% | 1302 | 3.5% | 36 | 1.4% | 1451 | 5.2% | 7 | 0.7% | 10 | 1.8% | 56 | 8.0% |
| GH6 | endoglucanase | PF01341 | 11 | 1.0% | 5 | 0.7% | 12 | 1.1% | 0 | 0.0% | 0 | 0.0% | 4 | 0.2% | 0 | 0.0% | 0 | 0.0% | 0 | 0.0% | 0 | 0.0% |
| GH7 | endoglucanase | PF00840 | 11 | 1.0% | 1 | 0.1% | 0 | 0.0% | 0 | 0.0% | 0 | 0.0% | 0 | 0.0% | 1 | 0.0% | 0 | 0.0% | 0 | 0.0% | 0 | 0.0% |
| GH9 | endoglucanase | PF00759 | 2 | 0.2% | 3 | 0.4% | 5 | 0.4% | 119 | 1.1% | 767 | 2.0% | 15 | 0.6% | 795 | 2.9% | 6 | 0.6% | 0 | 0.0% | 9 | 1.3% |
| GH44 | endoglucanase | NA | 0 | 0.0% | 0 | 0.0% | 0 | 0.0% | 7 | 0.1% | 0 | 0.0% | 0 | 0.0% | 0 | 0.0% | 0 | 0.0% | 0 | 0.0% | 6 | 0.9% |
| GH45 | endoglucanase | PF02015 | 1 | 0.1% | 0 | 0.0% | 0 | 0.0% | 7 | 0.1% | 13 | 0.0% | 0 | 0.0% | 115 | 0.4% | 0 | 0.0% | 0 | 0.0% | 4 | 0.6% |
| GH48 | endo-processive cellulase | PF02011 | 1 | 0.1% | 1 | 0.1% | 3 | 0.3% | 0 | 0.0% | 32 | 0.1% | 2 | 0.1% | 3 | 0.0% | 0 | 0.0% | 0 | 0.0% | 0 | 0.0% |
| 66 | 6.2% | 25 | 3.3% | 43 | 3.8% | 650 | 5.9% | 2114 | 5.6% | 57 | 2.2% | 2365 | 8.5% | 13 | 1.4% | 10 | 1.8% | 75 | 10.7% | |||
| Endo-hemicellulases | ||||||||||||||||||||||
| GH8 | endo-xylanases | PF02011 | 1 | 0.1% | 1 | 0.1% | 7 | 0.6% | 85 | 0.8% | 174 | 0.5% | 46 | 1.8% | 329 | 1.2% | 4 | 0.4% | 1 | 0.2% | 5 | 0.7% |
| GH10 | endo-1,4-β xylanase | PF00331 | 15 | 1.4% | 5 | 0.7% | 8 | 0.7% | 258 | 2.3% | 2664 | 7.1% | 25 | 1.0% | 1025 | 3.7% | 7 | 0.7% | 11 | 2.0% | 46 | 6.5% |
| GH11 | xylanase | PF00457 | 4 | 0.4% | 2 | 0.3% | 4 | 0.4% | 20 | 0.2% | 244 | 0.6% | 1 | 0.0% | 165 | 0.6% | 1 | 0.1% | 0 | 0.0% | 14 | 2.0% |
| GH12 | endoglucanase & xyloglucan hydrolases | PF01670 | 3 | 0.3% | 1 | 0.1% | 3 | 0.3% | 0 | 0.0% | 0 | 0.0% | 0 | 0.0% | 0 | 0.0% | 0 | 0.0% | 0 | 0.0% | 0 | 0.0% |
| GH26 | β- mannanase & xylanase | PF02156 | 5 | 0.5% | 6 | 0.8% | 10 | 0.9% | 103 | 0.9% | 537 | 1.4% | 11 | 0.4% | 369 | 1.3% | 5 | 0.5% | 5 | 0.9% | 15 | 2.1% |
| GH28 | galacturonases | PF00295 | 11 | 1.0% | 8 | 1.1% | 20 | 1.8% | 242 | 2.2% | 244 | 0.6% | 69 | 2.7% | 472 | 1.7% | 5 | 0.5% | 2 | 0.4% | 6 | 0.9% |
| GH53 | endo-1,4-β-galactanase | PF07745 | 4 | 0.4% | 1 | 0.1% | 5 | 0.4% | 88 | 0.8% | 1066 | 2.8% | 9 | 0.3% | 0 | 0.0% | 17 | 1.8% | 9 | 1.6% | 12 | 1.7% |
| 43 | 4.1% | 24 | 3.2% | 57 | 5.0% | 796 | 7.2% | 4929 | 13.1% | 161 | 6.2% | 2360 | 8.5% | 39 | 4.1% | 28 | 5.0% | 98 | 13.9% | |||
| Debranching enzymes | ||||||||||||||||||||||
| GH51 | α-L-arabinofuranosidase | NA | 14 | 1.3% | 7 | 0.9% | 13 | 1.1% | 239 | 2.2% | 0 | 0.0% | 22 | 0.8% | 0 | 0.0% | 64 | 6.7% | 12 | 2.2% | 18 | 2.6% |
| GH54 | α-L-arabinofuranosidase | PF09206 | 0 | 0.0% | 3 | 0.4% | 0 | 0.0% | 13 | 0.1% | 111 | 0.3% | 0 | 0.0% | 0 | 0.0% | 1 | 0.1% | 0 | 0.0% | 0 | 0.0% |
| GH62 | α-L-arabinofuranosidase | PF03664 | 3 | 0.3% | 0 | 0.0% | 0 | 0.0% | 0 | 0.0% | 0 | 0.0% | 2 | 0.1% | 1 | 0.0% | 0 | 0.0% | 0 | 0.0% | 0 | 0.0% |
| GH67 | α-glucuronidase | PF07477 PF07488 | 0 | 0.0% | 3 | 0.4% | 8 | 0.7% | 0 | 0.0% | 1090 | 2.9% | 5 | 0.2% | 120 | 0.4% | 0 | 0.0% | 5 | 0.9% | 10 | 1.4% |
| GH78 | α-L-rhamnosidase | PF05592 | 0 | 0.0% | 19 | 2.5% | 7 | 0.6% | 413 | 3.7% | 426 | 1.1% | 73 | 2.8% | 1260 | 4.5% | 34 | 3.6% | 25 | 4.5% | 0 | 0.0% |
| 17 | 1.6% | 32 | 4.3% | 28 | 2.5% | 665 | 6.0% | 1627 | 4.3% | 102 | 3.9% | 1381 | 5.0% | 99 | 10.3% | 42 | 7.5% | 28 | 4.0% | |||
| Oligosaccharide-degrading enzymes | ||||||||||||||||||||||
| GH1 | β-glucosidase & other β-linked dimers | PF00232 | 26 | 2.5% | 23 | 3.1% | 46 | 4.0% | 103 | 0.9% | 331 | 0.9% | 294 | 11.4% | 253 | 0.9% | 10 | 1.0% | 61 | 11.0% | 22 | 3.1% |
| GH2 | β-galactosidases & other β-linked dimers | PF02836 PF00703 PF02837 | 59 | 5.6% | 22 | 2.9% | 37 | 3.3% | 917 | 8.3% | 942 | 2.5% | 66 | 2.5% | 1436 | 5.2% | 186 | 19.4% | 24 | 4.3% | 23 | 3.3% |
| GH3 | mainly βglucosidases | PF00933 | 69 | 6.5% | 39 | 5.2% | 70 | 6.2% | 804 | 7.3% | 5448 | 14.5% | 219 | 8.5% | 2844 | 10.2% | 176 | 18.4% | 72 | 12.9% | 69 | 9.8% |
| GH29 | α-L-fucosidase | PF01120 | 15 | 1.4% | 11 | 1.5% | 9 | 0.8% | 376 | 3.4% | 899 | 2.4% | 70 | 2.7% | 939 | 3.4% | 74 | 7.7% | 2 | 0.4% | 0 | 0.0% |
| GH35 | β-galactosidase | PF01301 | 12 | 1.1% | 7 | 0.9% | 9 | 0.8% | 123 | 1.1% | 468 | 1.2% | 32 | 1.2% | 158 | 0.6% | 12 | 1.3% | 3 | 0.5% | 3 | 0.4% |
| GH38 | α-mannosidase | PF01074 PF07748 | 15 | 1.4% | 12 | 1.6% | 22 | 1.9% | 81 | 0.7% | 90 | 0.2% | 18 | 0.7% | 272 | 1.0% | 17 | 1.8% | 3 | 0.5% | 11 | 1.6% |
| GH39 | β-xylosidase | PF01229 | 7 | 0.7% | 6 | 0.8% | 4 | 0.4% | 89 | 0.8% | 159 | 0.4% | 6 | 0.2% | 315 | 1.1% | 2 | 0.2% | 1 | 0.2% | 3 | 0.4% |
| GH42 | β-galactosidase | PF02449 PF08533 PF08532 | 16 | 1.5% | 14 | 1.9% | 24 | 2.1% | 37 | 0.3% | 207 | 0.6% | 54 | 2.1% | 374 | 1.3% | 11 | 1.1% | 8 | 1.4% | 24 | 3.4% |
| GH43 | arabinases & xylosidases | PF04616 | 56 | 5.3% | 28 | 3.7% | 52 | 4.6% | 894 | 8.1% | 2313 | 6.2% | 185 | 7.1% | 0 | 0.0% | 61 | 6.4% | 10 | 1.8% | 16 | 2.3% |
| GH52 | β-xylosidase | PF03512 | 0 | 0.0% | 0 | 0.0% | 2 | 0.2% | 0 | 0.0% | 0 | 0.0% | 0 | 0.0% | 0 | 0.0% | 0 | 0.0% | 0 | 0.0% | 3 | 0.4% |
| 275 | 26.0% | 162 | 21.6% | 275 | 24.2% | 3424 | 31.0% | 10857 | 28.9% | 944 | 36.4% | 6591 | 23.7% | 549 | 57.4% | 184 | 33.0% | 174 | 24.7% | |||
| 401 | 37.9% | 243 | 32.4% | 403 | 35.5% | 5535 | 50.1% | 19527 | 52.0% | 1264 | 48.8% | 12697 | 45.7% | 700 | 73.1% | 264 | 47.4% | 375 | 53.3% | |||
| 1059 | 750 | 1136 | 11038 | 37563 | 2590 | 27755 | 957 | 557 | 704 | |||||||||||||
GHs are grouped according to the major functional roles as classified in Allgaier et al.62. For each GH family, the total number and the percentage on total GHs detected in the respective sample were shown.
aSearches for glycoside hydrolases were performed by using HMMER hmmsearch with Pfam_Is HMMs (full-length models) to identify complete matches to the family, which were named in accordance with the CAZy nomenclature scheme.
bThe detection and assignment of glycoside hydrolases were performed by BLAST-based procedures against the CAZy database.