| Literature DB >> 25709713 |
Wuttichai Mhuantong1, Varodom Charoensawan2, Pattanop Kanokratana1, Sithichoke Tangphatsornruang3, Verawat Champreda1.
Abstract
BACKGROUND: As one of the most abundant agricultural wastes, sugarcane bagasse is largely under-exploited, but it possesses a great potential for the biofuel, fermentation, and cellulosic biorefinery industries. It also provides a unique ecological niche, as the microbes in this lignocellulose-rich environment thrive in relatively high temperatures (50°C) with varying microenvironments of aerobic surface to anoxic interior. The microbial community in bagasse thus presents a good resource for the discovery and characterization of new biomass-degrading enzymes; however, it remains largely unexplored.Entities:
Keywords: Biofuels; Biorefinery; Comparative genomics; Lignocellulose degradation; Metagenomics
Year: 2015 PMID: 25709713 PMCID: PMC4337096 DOI: 10.1186/s13068-015-0200-8
Source DB: PubMed Journal: Biotechnol Biofuels ISSN: 1754-6834 Impact factor: 6.040
Summary of bagasse fosmid pyrosequencing data
|
| ||||||
|---|---|---|---|---|---|---|
|
|
|
|
| |||
|
|
|
|
| |||
| 1. Raw reads | 1,038,205 | 591,656,071 | 569.9 | 173.3 | 40 | 1,595 |
| 2. Read screen repeats | 982,383 | 569,556,388 | 579.8 | 164.7 | 40 | 1,595 |
| 3. Read screen repeats and trim vector | 726,980 | 421,491,438 | 579.8 | 166.0 | 40 | 1,595 |
|
| ||||||
|
|
|
|
| |||
|
|
|
|
| |||
| 1. Contigs | 17,829 | 32,867,905 | 1,843.5 | 2,394.6 | 100 | 46,577 |
| 2. Singletons (non-redundant) | 185,543 | 109,290,202 | 589.0 | 163.5 | 40 | 1,595 |
The bagasse fosmid library was sequenced on one full lane of the 454 GS-FLX Titanium, resulting in approximately one million raw reads. The reads with contaminating sequences of vector or host genome were removed before contig assembling and redundant sequence cleaning.
Figure 1Taxonomic distribution of a bagasse fosmid library. A large majority of sequences in the library were classified as of bacterial origin (about 94%), followed by eukaryotes (about 4%) and archaea (about 0.5%). The taxonomic terms were obtained using BLASTN against non-redundant NT database using E-value cutoff at 1e-3. The pie chart represents percentages of reads that have the best hits (lowest E-value) to particular taxa.
Figure 2Lignocellulosic degradation pathway and its related enzymes found in our bagasse metagenome. Simplified biomass degradation process and enzymes involved. The enzyme families present in the bagasse metagenomic library are highlighted in red text. Colored pie charts show the amount of reads mapped to different GH families involving different steps of biomass degradation that belong to major bacterial phyla.
Summary of the number of reads from the bagasse metagenome mapped to lignocellulose-degrading genes
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Cellulases | GH5 | 6 | 18 | 1 | 0 | 3 | 4 | 0 | 4 | 0 | 0 | Cellulase; endoglucanase; beta-glucosidase |
| Cellulases | GH6 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | Endoglucanase; cellobiohydrolase |
| Cellulases | GH9 | 1 | 9 | 0 | 0 | 0 | 0 | 0 | 2 | 3 | 1 | Endoglucanase; beta-glucosidase |
| Cell wall elongation | GH16 | 8 | 12 | 0 | 1 | 0 | 0 | 0 | 5 | 0 | 0 | Xyloglucan; licheninase |
| Cell wall elongation | GH17 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 28 | 0 | 0 | Exo-beta-1,3-glucanase; licheninase |
| Cell wall elongation | GH74 | 0 | 0 | 0 | 0 | 0 | 2 | 0 | 0 | 0 | 0 | Endoglucanase; xyloglucanase |
| Oligosaccharide-degrading enzymes | GH1 | 9 | 3 | 0 | 2 | 0 | 0 | 0 | 9 | 2 | 1 | Beta-glucosidase; beta-galactosidase |
| Oligosaccharide-degrading enzymes | GH2 | 6 | 64 | 5 | 9 | 5 | 10 | 0 | 9 | 4 | 0 | Beta-mannosidase; beta-galactosidase |
| Oligosaccharide-degrading enzymes | GH3 | 8 | 57 | 1 | 4 | 8 | 2 | 0 | 68 | 19 | 2 | Beta-glucosidase; beta-glucosylceramidase |
| Oligosaccharide-degrading enzymes | GH29 | 0 | 14 | 1 | 0 | 7 | 4 | 1 | 0 | 0 | 0 | Alpha-L-fucosidase |
| Oligosaccharide-degrading enzymes | GH35 | 1 | 0 | 3 | 1 | 0 | 0 | 0 | 3 | 0 | 1 | Beta-galactosidase |
| Oligosaccharide-degrading enzymes | GH38 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 0 | 0 | Alpha-mannosidase |
| Oligosaccharide-degrading enzymes | GH39 | 7 | 0 | 0 | 2 | 0 | 1 | 0 | 4 | 0 | 0 | Beta-xylosidase |
| Oligosaccharide-degrading enzymes | GH42 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 0 | Beta-galactosidase |
| Oligosaccharide-degrading enzymes | GH43 | 1 | 12 | 1 | 2 | 1 | 0 | 0 | 20 | 0 | 0 | Beta-xylosidase |
| Oligosaccharide-degrading enzymes | GH52 | 0 | 0 | 0 | 0 | 0 | 5 | 0 | 1 | 0 | 0 | Beta-xylosidase |
| Endohemicellulases | GH8 | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | Endo-1,4-D-glucanase; chitosanase |
| Endohemicellulases | GH10 | 11 | 14 | 2 | 0 | 5 | 6 | 0 | 8 | 0 | 0 | Xylanase; beta-1,4-xylanase; endo-1,4-beta-xylanase |
| Endohemicellulases | GH11 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 1 | Endo-1,4-beta-xylanase; xylanase |
| Endohemicellulases | GH12 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 0 | 1 | Endoglucanase; xyloglucan hydrolase |
| Endohemicellulases | GH26 | 0 | 3 | 0 | 0 | 0 | 4 | 0 | 0 | 0 | 0 | Beta-mannanase; endo-1,4-beta-mannosidase |
| Endohemicellulases | GH28 | 0 | 15 | 0 | 0 | 4 | 9 | 0 | 2 | 4 | 0 | Polygalacturonase; pectate lyase; endopolygalacturonase |
| Endohemicellulases | GH53 | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | Endo-1,4-beta-galactosidase |
| Debranching enzymes | GH51 | 4 | 7 | 2 | 2 | 13 | 1 | 0 | 3 | 0 | 0 | Alpha-L-arabinofuranosidase; endoglucanase |
| Debranching enzymes | GH62 | 2 | 0 | 2 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | Alpha-L-arabinofuranosidase |
| Debranching enzymes | GH67 | 0 | 6 | 0 | 0 | 0 | 0 | 0 | 3 | 0 | 0 | Alpha-glucuronidase |
| Debranching enzymes | GH78 | 0 | 11 | 0 | 3 | 1 | 0 | 0 | 0 | 0 | 0 | Alpha-L-rhamnosidase |
Summary of the number of reads in the bagasse metagenome mapped to genes encoding lignocellulose-degrading enzyme homologs annotated by the CAZy database.
Figure 3Taxonomic profiles of metagenomes of lignocellulosic (blue)- and non-lignocellulosic (red)-degrading sources. A) The relative taxonomic distributions of bacterial phyla in different metagenomic datasets. Each bar represents the percentage of total reads. The numbers within the bars indicate numbers of unique genes that reads from metagenomic libraries mapped to. Proteobacteria dominate almost all metagenomic communities, except for human gut, mouse gut, and wallaby gut, which are dominated by Firmicutes, and termite gut by Spirochaetes. B) Principal component analysis (PCA) of bacterial diversity profiles (left) and metagenome profiles (right). The bacteria of many phyla are found in highly overlapping environments, except for Firmicutes and Spirochaetes, which are predominantly present in mammal and termite guts; this explains why their profiles are not clustered with other metagenomes in the PCA plot.
Figure 4Comparison of Clusters of Orthologous Groups (COGs) in lignocellulolytic and non-lignocellulolytic metagenomes. The bar plots represent the percentage of reads mapped to different COGs using BLAST, while the numbers within the bars indicate the number of unique genes. Metagenomic profiles are clustered using hierarchical clustering (complete linkage method), based on the divergence of COG profiles.
Figure 5Comparison of reads mapped to different KEGG metabolic pathways and CAZy enzyme families. Comparative genomic analysis of selected glycoside hydrolase (GH), carbohydrate-binding module (CBM), and auxiliary activities (AA) families. Color shades indicate the numbers of unique genes in the families to which metagenomic reads were mapped.
Figure 6Lignocellulolytic metagenomes are taxonomically diverse, but their carbohydrate-active enzymes are conserved. Spearman correlations were computed for metagenomic libraries within the lignocellulose-degrading environment group (C), non-lignocellulose-degrading group (N), or metagenomes from different groups (Out group), based on the taxonomy, COG, KEGG, and CAZy profiles. Error bars represent standard errors of means.