| Literature DB >> 23894306 |
Patrik D'haeseleer1, John M Gladden, Martin Allgaier, Patrik S G Chain, Susannah G Tringe, Stephanie A Malfatti, Joshua T Aldrich, Carrie D Nicora, Errol W Robinson, Ljiljana Paša-Tolić, Philip Hugenholtz, Blake A Simmons, Steven W Singer.
Abstract
Thermophilic bacteria are a potential source of enzymes for the deconstruction of lignocellulosic biomass. However, the complement of proteins used to deconstruct biomass and the specific roles of different microbial groups in thermophilic biomass deconstruction are not well-explored. Here we report on the metagenomic and proteogenomic analyses of a compost-derived bacterial consortium adapted to switchgrass at elevated temperature with high levels of glycoside hydrolase activities. Near-complete genomes were reconstructed for the most abundant populations, which included composite genomes for populations closely related to sequenced strains of Thermus thermophilus and Rhodothermus marinus, and for novel populations that are related to thermophilic Paenibacilli and an uncultivated subdivision of the little-studied Gemmatimonadetes phylum. Partial genomes were also reconstructed for a number of lower abundance thermophilic Chloroflexi populations. Identification of genes for lignocellulose processing and metabolic reconstructions suggested Rhodothermus, Paenibacillus and Gemmatimonadetes as key groups for deconstructing biomass, and Thermus as a group that may primarily metabolize low molecular weight compounds. Mass spectrometry-based proteomic analysis of the consortium was used to identify >3000 proteins in fractionated samples from the cultures, and confirmed the importance of Paenibacillus and Gemmatimonadetes to biomass deconstruction. These studies also indicate that there are unexplored proteins with important roles in bacterial lignocellulose deconstruction.Entities:
Mesh:
Substances:
Year: 2013 PMID: 23894306 PMCID: PMC3716776 DOI: 10.1371/journal.pone.0068465
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Each contig was plotted against its average read coverage per base (X axis), and its GC% content (Y axis).
The surface area of each circle is proportional to the length of the contigs in bp, giving an intuitive visualization of how much metagenomic sequence is covered by each cluster. Phylogenetic bins are represented by different colors, while grey circles represent (typically smaller) contigs that were not assigned to a bin.
Phylogenetic bins and statistics.
| Bin | covg | GC% | Size bp | contigs | N50 | Best reference genome | AAI | Completenessc |
|
| 74.7 | 68% | 2,126,834 | 457 | 5,482 |
| 91% | 72% |
|
| 71.2 | 65% | 2,899,900 | 732 | 4,581 |
| 96% | 96% |
|
| 34.5 | 71% | 3,785,002 | 87 | 64,448 |
| 48% | 100% |
|
| 29.2 | 34% | 168,710 | 8 | 123,990 |
| 84% | 99% |
|
| 26.1 | 63% | 2,985,204 | 48 | 89,045 |
| 56% | 96% |
|
| 10.1 | 65% | 1,689,164 | 393 | 5,280 |
| 95% | 60% |
|
| 8.7 | 48% | 2,237,622 | 71 | 59,820 |
| 98% | 88% |
|
| 6.3 | 67% | 5,207,803 | 662 | 10,556 |
| 85% | 80% |
|
| 6.2 | 65% | 2,635,886 | 397 | 8,832 |
| 76% | 87% |
|
| 5.7 | 60% | 4,323,820 | 546 | 11,083 |
| 56% | 90% |
: Based on blastn coverage of reference genome, rather than phylogenetic markers. b: Including additional cluster at 12.0 read coverage that likely corresponds to a Sphaerobacter megaplasmid. c: See also File S3.
Carbohydrate active enzymes and ligninases in metagenome (see also File S4).
| Bin | GH | CBM | selected GH's | selected enzymes |
|
| 18 | 2 | 1 GH1, 1 GH9, 2 GH42 | 1 β-Glucosidase, 2 β-Galactosidase, 2 Laccase, |
|
| 63 | 18 | 2 GH2, 6 GH3, 2 GH5, 1 GH16, 2 GH35, 1 GH43, 2 GH67 | 2 Endoglucanase, 8 β-Glucosidase, 1 Endoxylanase, 3 β-Xylosidase, 4 α-L-Arabinofuranosidase, 2 β-Galactosidase, 2 Laccase |
|
| 37 | 13 | 1 GH1, 1 GH2, 3 GH3, 2 GH9, 1 GH74 | 1 Endoglucanase, 2 β-Glucosidase, 5 Laccase |
|
| 57 | 23 | 1 GH1, 4 GH2, 4 GH3, 1 GH5, 1 GH8, 1 GH9, 1 GH11, 1 GH39, 1 GH42, 3 GH43, 1 GH67 | 3 Endoglucanase, 2 CBH2, 4 β-Glucosidase, 6 Endoxylanase, 5 β-Xylosidase, 5 α-L-Arabinofuranosidase, 2 β-Galactosidase, 1 Feruloyl esterase, 3 Laccase |
|
| 38 | 18 | 1 GH2, 5 GH3, 1 GH5, 1 GH12, 1 GH16, 1 GH67 | 1 Endoglucanase, 5 β-Glucosidase, 1 β-Xylosidase, 1 α-L-Arabinofuranosidase, 2 Laccase |
|
| 44 | 6 | 1 GH2, 3 GH3, 2 GH4, 1 GH6 | 2 Endoglucanase, 2 β-Glucosidase, 1 Endoxylanase, 1 β-Xylosidase, 1 α-L-Arabinofuranosidase, 2 Laccase |
|
| 100 | 17 | 3 GH1, 3 GH2, 1 GH3, 2 GH4, 5 GH5, 3 GH29, 9 GH39, 2 GH42, 4 GH43, 23 GH109 | 2 Endoglucanase, 4 β-Glucosidase, 1 α-L-Arabinofuranosidase, 1 β-Galactosidase, 1 Feruloyl esterase, 9 Laccase |
|
| 26 | 8 | 2 GH1, 1 GH4, 1 GH5 | 3 Endoglucanase, 2 β-Glucosidase, 1 Endoxylanase, 3 Laccase |
|
| 31 | 5 | 1 GH9, 1 GH11, 1 GH39 | 1 Endoglucanase, 1 Laccase, 11 glutathione S-transerases (potential beta-aryl etherases) |
Figure 2Normalized protein abundance within each of the three proteome fractions, by phylogenetic bins.
Top 10 most abundant overrepresented proteins in the supernatant (see also File S5).
| Locus tag | Protein Description | Bin | Supernatant | Fold |
| sg4i_00021160 | carbohydrate ABC transporter substrate-binding protein, CUT1 family (TC 3.A.1.1.-) | Paenibacillus | 2.48E+09 | 84 |
| sg4i_00251940 | hypothetical protein | Gemmatimonadetes | 1.49E+09 | 27 |
| sg4i_00325360 | S-layer homology domain. | Paenibacillus | 5.34E+08 | 6.8 |
| sg4i_00645210 | carbohydrate ABC transporter substrate-binding protein, CUT1 family (TC 3.A.1.1.-) | Paenibacillus | 3.92E+08 | 42 |
| sg4i_00646120 | Enterochelin esterase and related enzymes | Paenibacillus | 3.2E+08 | 31 |
| sg4i_00319610 | carbohydrate ABC transporter substrate-binding protein, CUT1 family (TC 3.A.1.1.-) | Paenibacillus | 2.78E+08 | 19 |
| sg4i_00608910 | hypothetical protein | Rhodothermus | 2.65E+08 | 14 |
| sg4i_00027040 | Xaa-Pro aminopeptidase | Paenibacillus | 2.57E+08 | 9 |
| sg4i_00336790 | Predicted transcriptional regulator | Hyphomicrobium | 2.49E+08 | 49 |
| sg4i_00622770 | hypothetical protein | Thermus | 2.42E+08 | 163 |
: Not binned; assigned based on best blastp hit.
Overrepresented lignocellulolytic enzymes in the supernatant (see also File S5).
| Locus tag | Protein Description | Family | EC | Bin | Supernatant | Fold |
| sg4i_00646120 | Enterochelin esterase and related enzymes | CBM9 | 3.1.1.73 | Paenibacillus | 3.20E+08 | 31 |
| sg4i_00255110 | Putative multi-copper oxidases | Laccase | 1.10.3.2 | Gemmatimonadetes | 1.27E+08 | 150 |
| sg4i_00193360 | BNR/Asp-box repeat. | GH74 | Sphaerobacter | 6.59E+07 | 6591 | |
| sg4i_00004420 | Alpha-L-arabino-furanosidase | GH51 | 3.2.1.55 | Paenibacillus | 6.27E+07 | 8.1 |
| sg4i_00024910 | Beta-1,4-xylanase | GH10, CBM4/9 | 3.2.1.8 | Paenibacillus | 2.85E+07 | 4.3 |
| sg4i_00608930 | Beta-1,4-xylanase | GH10, CBM4/9 | 3.2.1.8 | Hyphomicrobium | 1.59E+07 | 2.7 |
| sg4i_00588670 | Carbohydrate binding domain. | CBM16 | Paenibacillus | 1.12E+07 | 3.1 | |
| sg4i_00324270 | Endoglucanase | GH5 | 3.2.1.4 | Paenibacillus | 9.65E+06 | 29 |
| sg4i_00432730 | maltooligosyl trehalose hydrolase (EC 3.2.1.141) | GH13, CBM48 | 3.2.1.141 | Sphaerobacter | 5.39E+06 | 43 |
| sg4i_00141500 | Alpha-L-fucosidase | GH29 | 3.2.1.51 | Sphaerobacter | 4.72E+06 | 472 |
| sg4i_00592760 | Alpha-glucosidases, family 31 of glycosyl hydrolases | GH31 | 3.2.1.- | Paenibacillus | 2.93E+06 | 15 |
| sg4i_00102640 | Alpha-glucosidases, family 31 of glycosyl hydrolases | GH31 | 3.2.1.- | Rhodothermus | 2.56E+06 | 3.4 |
: Not binned; assigned based on best blastp hit.