| Literature DB >> 25648696 |
Christopher W Simmons1, Amitha P Reddy2, Patrik D'haeseleer3, Jane Khudyakov3, Konstantinos Billis4, Amrita Pati4, Blake A Simmons5, Steven W Singer6, Michael P Thelen3, Jean S VanderGheynst2.
Abstract
BACKGROUND: New lignocellulolytic enzymes are needed that maintain optimal activity under the harsh conditions present during industrial enzymatic deconstruction of biomass, including high temperatures, the absence of free water, and the presence of inhibitors from the biomass. Enriching lignocellulolytic microbial communities under these conditions provides a source of microorganisms that may yield robust lignocellulolytic enzymes tolerant to the extreme conditions needed to improve the throughput and efficiency of biomass enzymatic deconstruction. Identification of promising enzymes from these systems is challenging due to complex substrate-enzyme interactions and requirements to assay for activity. In this study, metatranscriptomes from compost-derived microbial communities enriched on rice straw under thermophilic and mesophilic conditions were sequenced and analyzed to identify lignocellulolytic enzymes overexpressed under thermophilic conditions. To determine differential gene expression across mesophilic and thermophilic treatments, a method was developed which pooled gene expression by functional category, as indicated by Pfam annotations, since microbial communities performing similar tasks are likely to have overlapping functions even if they share no specific genes.Entities:
Keywords: Biofuels; Carbohydrate binding module family 2; Carbohydrate binding module family 33; Cellulase; Glycoside hydrolase family 48; Lignocellulose deconstruction; Microbial communities; Solid-state culture
Year: 2014 PMID: 25648696 PMCID: PMC4296540 DOI: 10.1186/s13068-014-0180-0
Source DB: PubMed Journal: Biotechnol Biofuels ISSN: 1754-6834 Impact factor: 6.040
Metatranscriptome sequencing and processing metrics
|
|
| |
|---|---|---|
| Total reads generated | 68,754,440 | 50,014,968 |
| mRNA reads | 43,825,869 | 24,348,655 |
| Mapped mRNA reads | 3,916,829 | 2,153,529 |
| Lignocellulolytic transcriptome reads | 6,167 | 3,481 |
Protein families containing glycoside hydrolase (GH) or carbohydrate binding module (CBM) domains relevant to lignocellulose deconstruction that were targeted during metatranscriptome analysis
|
|
|
|
|
|---|---|---|---|
| Pfam00150 | GH | 5 | β-mannosidase, endo-β-1,4-glucanase, endo-β-1,4-mannosidase, endo-β-1,4-xylanase, β-1,4-cellobiosidase, β-1,3-mannanase, xyloglucan-specific endo-β-1,4-glucanase, exo-β-1,4-glucanase |
| Pfam00232 | GH | 1 | β-glucosidase |
| Pfam00331 | GH | 10 | Endo-β-1,4-xylanase, endo-β-1,3-xylanase |
| Pfam00457 | GH | 11 | Xylanase |
| Pfam00722 | GH | 16 | Endo-β-1,3-glucanase, endo-β-1,3(4)-glucanase, xyloglucanase |
| Pfam00759 | GH | 9 | Endoglucanase, cellobiohydrolase, β-glucosidase |
| Pfam00840 | GH | 7 | Endo-1,4-β-glucanase, cellobiohydrolase |
| Pfam00933 | GH | 3 | β-glucosidase, 1,4-β-xylosidase, exo-1,3-1,4-glucanase, α-L-arabinofuranosidase |
| Pfam01270 | GH | 8 | Cellulase, endo-β-1,4-xylanase, reducing-end xylose-releasing exo-oligoxylanase |
| Pfam01341 | GH | 6 | Endoglucanase, cellobiohydrolase |
| Pfam01670 | GH | 12 | Endoglucanase, xyloglucan hydrolase, β-1,3-1,4-glucanase |
| Pfam01915 | GH | 3C | β-glucosidase, β-1,4-xylosidase, exo-1,3-1,4-glucanase, α-L-arabinofuranosidase |
| Pfam02011 | GH | 48 | Reducing end-acting cellobiohydrolase, endo-β-1,4-glucanase |
| Pfam02015 | GH | 45 | Endoglucanase |
| Pfam02156 | GH | 26 | β-mannanase, β-1,3-xylanase |
| Pfam03443 | GH | 61 | Lytic polysaccharide monooxygenase |
| Pfam03648 | GH | 67 N | α-glucuronidase xylan, α-1,2-glucuronidase |
| Pfam03664 | GH | 62 | α-L-arabinofuranosidase |
| Pfam04616 | GH | 43 | β-xylosidase, α-L-arabinofuranosidase, arabinanase, xylanase |
| Pfam07477 | GH | 67C | α-glucuronidase, xylan, α-1,2-glucuronidase |
| Pfam07488 | GH | 67 M | α-glucuronidase, xylan, α-1,2-glucuronidase |
| Pfam00553 | CBM | 2 | n/a |
| Pfam00734 | CBM | 1 | n/a |
| Pfam00942 | CBM | 3 | n/a |
| Pfam02013 | CBM | 10 | n/a |
| Pfam02018 | CBM | 4, 9 | n/a |
| Pfam03067 | CBM | 33 | n/a |
| Pfam03422 | CBM | 6 | n/a |
| Pfam03424 | CBM | 17, 28 | n/a |
| Pfam03425 | CBM | 11 | n/a |
| Pfam03426 | CBM | 15 | n/a |
| Pfam09212 | CBM | 27 | n/a |
| Pfam09478 | CBM | 49 | n/a |
Figure 1Rarefaction curves describing number of expressed genes detected versus number of reads sampled for the mesophilic community (dashed line) and thermophilic community (solid line).
Expression of lignocellulolytic glycoside hydrolase families in enriched mesophilic and thermophilic microbial communities
|
| ||
|---|---|---|
|
|
|
|
| 1 | 10.9 | 3.9 |
| 3 | 12.3 | 28.8 |
| 5 | 1.8 | 9.6 |
| 6 | 16.1 | 1.4 |
| 7 | 1.4 | 0.0 |
| 8 | 0.7 | 1.4 |
| 9 | 4.4 | 3.0 |
| 10 | 12.3 | 4.6 |
| 11 | 13.8 | 4.4 |
| 12 | 1.3 | 0.2 |
| 16 | 2.2 | 9.4 |
| 26 | 0.1 | 2.1 |
| 43 | 8.4 | 24.4 |
| 45 | 0.0 | 0.1 |
| 48 | 9.6 | 0.5 |
| 61 | 2.8 | 0.1 |
| 62 | 0.9 | 0.3 |
| 67 | 1.2 | 5.8 |
*Values are given as percentages of the total number of normalized read counts mapped to lignocellulolytic glycoside hydrolase Pfams in the corresponding metagenome.
Figure 2Total gene expression and expression of glycoside hydrolase families relevant to lignocellulose deconstruction by phylum in thermophilic and mesophilic enriched communities. TM - total metatranscriptome, all reads in metatranscriptome considered; DM - deconstructive metatranscriptome, only reads for deconstructive glycoside hydrolase Pfams considered.
Genera that express >50% of lignocellulolytic glycoside hydrolases in enriched thermophilic and mesophilic communities
|
|
| |||
|---|---|---|---|---|
|
|
|
|
|
|
| 1 |
| 83.4 |
| 23.7 |
|
| 12.1 | |||
|
| 11.4 | |||
|
| 8.9 | |||
| 3 |
| 36.1 |
| 35.2 |
|
| 21.0 |
| 23.9 | |
| 5 |
| 57.7 |
| 16.8 |
|
| 16.5 | |||
|
| 15.8 | |||
|
| 10.9 | |||
| 6 |
| 93.8 |
| 35.3 |
|
| 33.9 | |||
| 7 |
| 70.2 | n/a | |
| 8 |
| 44.7 |
| 69.3 |
|
| 40.0 | |||
| 9 |
| 79.8 |
| 45.8 |
|
| 20.8 | |||
| 10 |
| 65.5 |
| 25.1 |
|
| 20.1 | |||
|
| 17.4 | |||
| 11 |
| 94.7 |
| 47.7 |
|
| 25.1 | |||
| 12 |
| 86.7 |
| 73.6 |
| 16 |
| 27.1 |
| 29.2 |
|
| 18.5 |
| 18.7 | |
|
| 17.8 |
| 17.0 | |
| 26 |
| 65.2 |
| 24.9 |
|
| 21.1 | |||
|
| 15.3 | |||
| 43 |
| 38.9 |
| 34.3 |
|
| 23.9 |
| 18.7 | |
| 45 | n/a |
| 100 | |
| 48 |
| 100 |
| 43.1 |
|
| 22.1 | |||
| 61 |
| 49.0 |
| 100 |
| 43.2 | ||||
|
| ||||
| 62 |
| 46.0 |
| 100 |
|
| 40.5 | |||
| 67 |
| 39.5 |
| 50.7 |
|
| 26.9 | |||
Phyla are indicated as A, Actinobacteria; Ac, Acidobacteria; As, Ascomycota; B, Bacteroidetes; F, Firmicutes; P, Proteobacteria; Pl, Planctomycetes.
Differential expression of protein families associated with lignocellulolytic activity in enriched thermophilic and mesophilic communities
|
|
|
|
|
|
|
|---|---|---|---|---|---|
| Pfam00150 | GH5 | M | 2.73 | 0.960 | 1.000 |
| Pfam00232 | GH1 | T | 9.33 | 0.333 | 0.331 |
| Pfam00331 | GH10 | T | 6.90 | 0.540 | 0.369 |
| Pfam00457 | GH11 | T | 7.53 | 0.109 | 0.369 |
| Pfam00722 | GH16 | M | 2.57 | 0.769 | 1.000 |
| Pfam00759 | GH9 | T | 4.79 | 0.542 | 0.569 |
| Pfam00840 | GH7 | T | n/a2 | 0.291 | 0.256 |
| Pfam00933 | GH3 | M | 1.26 | 0.970 | 1.000 |
| Pfam01270 | GH8 | T | 1.28 | 0.975 | 1.000 |
| Pfam01341 | GH6 | T | 59.68 | 0.152 |
|
| Pfam01670 | GH12 | T | 20.49 | 0.623 | 0.466 |
| Pfam01915 | GH3C | M | 1.39 | 0.864 | 1.000 |
| Pfam02011 | GH48 | T | 129.95 |
|
|
| Pfam02015 | GH45 | M | n/a2 | 0.751 | 1.000 |
| Pfam02156 | GH26 | M | 7.94 | 0.890 | 0.878 |
| Pfam03443 | GH61 | T | 113.00 | 0.364 | 0.323 |
| Pfam03648 | GH67N | M | 1.10 | 0.859 | 1.000 |
| Pfam03664 | GH62 | T | 42.87 | 0.282 |
|
| Pfam04616 | GH43 | M | 1.16 | 0.946 | 1.000 |
| Pfam07477 | GH67C | M | 1.06 | 0.840 | 1.000 |
| Pfam07488 | GH67M | M | 1.15 | 0.667 | 1.000 |
| Pfam00553 | CBM2 | T | 30.07 |
|
|
| Pfam00734 | CBM1 | n/a3 | n/a3 | n/a3 | n/a3 |
| Pfam00942 | CBM3 | T | 29.40 | 0.179 | 0.221 |
| Pfam02013 | CBM10 | M | n/a2 | 0.485 |
|
| Pfam02018 | CBM4,9 | M | 1.47 | 0.247 | 1.000 |
| Pfam03067 | CBM33 | T | 49.61 |
|
|
| Pfam03422 | CBM6 | M | 2.31 | 0.930 | 1.000 |
| Pfam03424 | CBM17,28 | M | n/a2 | 0.956 | 1.000 |
| Pfam03425 | CBM11 | M | 1.27 | 0.988 | 1.000 |
| Pfam03426 | CBM15 | M | n/a2 | 0.656 | 1.000 |
| Pfam09212 | CBM27 | n/a3 | n/a3 | n/a3 | n/a3 |
| Pfam09478 | CBM49 | n/a3 | n/a3 | n/a3 | n/a3 |
1T, thermophilic community; M, mesophilic community.
2Fold change cannot be calculated because read count is 0 in one metatranscriptome.
3Pfam expression not detected in either metatranscriptome.
4 P-value is the mean from running the algorithm three times.
5Bold values indicate P-values ≤ to the critical value of 0.1.
Figure 3Expression levels in the enriched thermophilic community for individual genes annotated as (A) Pfam02011 (glycoside hydrolase family 48), (B) Pfam03067 (carbohydrate binding module family 33), and (C) Pfam00553 (carbohydrate binding module family 2). Genes listed account for ≥90% of all expression within a given Pfam.