| Literature DB >> 29588665 |
Suresh Poudel1,2,3, Richard J Giannone4,2, Mirko Basen5,6, Intawat Nookaew1,2,7, Farris L Poole2,5, Robert M Kelly2,8, Michael W W Adams2,5, Robert L Hettich4,2.
Abstract
BACKGROUND: Caldicellulosiruptor bescii is a thermophilic cellulolytic bacterium that efficiently deconstructs lignocellulosic biomass into sugars, which subsequently can be fermented into alcohols, such as ethanol, and other products. Deconstruction of complex substrates by C. bescii involves a myriad of highly abundant, substrate-specific extracellular solute binding proteins (ESBPs) and carbohydrate-active enzymes (CAZymes) containing carbohydrate-binding modules (CBMs). Mass spectrometry-based proteomics was employed to investigate how these substrate recognition proteins and enzymes vary as a function of lignocellulosic substrates.Entities:
Keywords: Avicel; C5 substrates; C6 substrates; Caldicellulosiruptor bescii; Carbohydrate-active enzymes (CAZymes); Extracellular; Extracellular solute binding proteins (ESBPs); Glycosyl hydrolases (GH); Lignocellulosic; Mass spectrometry; Protein of unknown function (PUF); Switchgrass; Xylan
Year: 2018 PMID: 29588665 PMCID: PMC5865380 DOI: 10.1186/s13068-018-1076-1
Source DB: PubMed Journal: Biotechnol Biofuels ISSN: 1754-6834 Impact factor: 6.040
Fig. 1Growth of C. bescii on complex medium with different a soluble or b insoluble components of plant biomass. Cultivation was performed in 1-L closed bottles filled with 400 mL medium, under a N2/CO2 (80/20) atmosphere, with glucose (open squares), cellobiose (open circles), xylose (open triangles), crystalline cellulose (closed circles), birchwood xylan (closed triangles), and on unpretreated switchgrass (crosses), at substrate concentrations of 5 g L−1. After cultures reached the mid-to-late exponential growth phase (7.0–1.5 × 108 mL−1; indicated by arrows), supernatants containing the extracellular proteins were separated from the cells by centrifugation. Representative growth curve shown for each substrate
Fig. 2Genome atlas overlaid with abundance profiles of proteome obtained from the mass spectrometry experiments. The innermost circle with bases position is the circular genome. The outer 18 lanes are the biological samples (6 × 3 = 18). Each lane is color coded (darker color represents highly abundant proteins). Lanes A–M represent the regions of genome that correspond to highly abundant proteins
Most abundant extracellular proteome
aCore extracellular proteins
Extracellular proteome summary
| All vs. all | Simple vs. complex | C5 vs. C6 | Union | |
|---|---|---|---|---|
| SignalP | 33 | 13 | 25 | 43 |
| CAZymes | 8 | 6 | 3 | 11a |
| ESBPs | 12 | 9 | 5 | 14 |
| PUFs | 25 | 13 | 21 | 37 |
aOut of 11 CAZymes, 7 consist of CBMs
Differentially abundant CAZYmes in different growth conditions
| LocusTag_ProteinDescription | Comparison (subA_subB)a | subA–subB (fold changea) | CBM | |
|---|---|---|---|---|
| Athe_0610_glycoside hydrolase starch-binding | CB_SWG | 0.00 | − 4.66 | Yes |
| Athe_0610_glycoside hydrolase starch-binding | GLU_SWG | 0.00 | − 3.98 | Yes |
| Athe_0610_glycoside hydrolase starch-binding | GLU_XYLN | 0.00 | − 4.37 | Yes |
| Athe_0610_glycoside hydrolase starch-binding | Simple_Complex | 0.01 | − 2.81 | Yes |
| Athe_0610_glycoside hydrolase starch-binding | CB_XYLN | 0.00 | − 5.04 | Yes |
| Athe_0610_glycoside hydrolase starch-binding | C6_C5 | 0.01 | − 2.69 | Yes |
| Athe_0460_glycoside hydrolase 94 | GLU_CB | 0.02 | − 1.55 | No |
| Athe_0460_glycoside hydrolase 94 | XYLN_XYLO | 0.00 | − 2.68 | No |
| Athe_0460_glycoside hydrolase 94 | GLU_XYLN | 0.00 | 1.41 | No |
| Athe_0460_glycoside hydrolase 94 | GLU_XYLO | 0.01 | − 1.27 | No |
| Athe_0460_glycoside hydrolase 94 | AVI_SWG | 0.02 | 4.8 | No |
| Athe_0460_glycoside hydrolase 94 | AVI_XYLN | 0.00 | 4.65 | No |
| Athe_0460_glycoside hydrolase 94 | AVI_XYLO | 0.01 | 1.97 | No |
| Athe_0460_glycoside hydrolase 94 | AVI_CB | 0.03 | 1.69 | No |
| Athe_0460_glycoside hydrolase 94 | CB_XYLN | 0.00 | 2.96 | No |
| Athe_0460_glycoside hydrolase 94 | GLU_AVI | 0.00 | − 3.24 | No |
| Athe_0459_glycoside hydrolase 94 | AVI_SWG | 0.04 | 4.7 | No |
| Athe_0459_glycoside hydrolase 94 | AVI_XYLN | 0.04 | 5.96 | No |
| Athe_0459_glycoside hydrolase 94 | AVI_CB | 0.01 | 1.54 | No |
| Athe_0459_glycoside hydrolase 94 | GLU_AVI | 0.03 | − 2.91 | No |
| Athe_2028_glycoside hydrolase family 4 | Simple_Complex | 0.04 | 1.16 | No |
| Athe_1857_glycoside hydrolase family 48 | AVI_XYLN | 0.03 | − 1.63 | Yes |
| Athe_1857_glycoside hydrolase family 48 | AVI_XYLO | 0.02 | − 2.13 | Yes |
| Athe_1857_glycoside hydrolase family 48 | AVI_CB | 0.03 | − 2.96 | Yes |
| Athe_1857_glycoside hydrolase family 48 | Simple_Complex | 0.01 | 1.21 | Yes |
| Athe_1857_glycoside hydrolase family 48 | GLU_AVI | 0.04 | 1.9 | Yes |
| Athe_0089_Endo-1,4-beta-xylanase | XYLN_XYLO | 0.00 | 5.01 | Yes |
| Athe_0089_Endo-1,4-beta-xylanase | GLU_XYLN | 0.00 | − 5.05 | Yes |
| Athe_0089_Endo-1,4-beta-xylanase | XYLN_SWG | 0.00 | 5.22 | Yes |
| Athe_0089_Endo-1,4-beta-xylanase | AVI_XYLN | 0.00 | − 5.44 | Yes |
| Athe_0089_Endo-1,4-beta-xylanase | CB_XYLN | 0.01 | − 5.02 | Yes |
| Athe_0089_Endo-1,4-beta-xylanase | C6_C5 | 0.02 | − 2.66 | Yes |
| Athe_1860_glycoside hydrolase family 48 | XYLN_XYLO | 0.03 | − 0.53 | Yes |
| Athe_1860_glycoside hydrolase family 48 | AVI_SWG | 0.05 | − 2.14 | Yes |
| Athe_1860_glycoside hydrolase family 48 | XYLN_SWG | 0.05 | − 0.72 | Yes |
| Athe_1860_glycoside hydrolase family 48 | AVI_XYLO | 0.05 | − 1.95 | Yes |
| Athe_1860_glycoside hydrolase family 48 | AVI_CB | 0.05 | − 2.89 | Yes |
| Athe_1860_glycoside hydrolase family 48 | Simple_Complex | 0.03 | 1.02 | Yes |
| Athe_0609_pullulanase, type I | Simple_Complex | 0.00 | − 2.29 | Yes |
| Athe_0609_pullulanase, type I | C6_C5 | 0.03 | − 1.99 | Yes |
| Athe_0152_Acetyl xylan esterase | GLU_CB | 0.03 | 4.11 | No |
| Athe_0152_Acetyl xylan esterase | GLU_XYLN | 0.03 | 4.03 | No |
| Athe_0594_Cellulase | Simple_Complex | 0.01 | − 2.05 | Yes |
| Athe_1865_glycoside hydrolase family 9 | AVI_SWG | 0.04 | − 1.75 | Yes |
| Athe_1865_glycoside hydrolase family 9 | AVI_XYLN | 0.01 | − 1.8 | Yes |
| Athe_1865_glycoside hydrolase family 9 | AVI_XYLO | 0.01 | − 1.81 | Yes |
| Athe_1865_glycoside hydrolase family 9 | AVI_CB | 0.04 | − 1.89 | Yes |
| Athe_1865_glycoside hydrolase family 9 | C6_C5 | 0.04 | − 0.91 | Yes |
aThe fold changes are in log scale. If the fold change is positive (+), the protein is significantly more abundant in substrate (left side) when compared substrate (right side) in the comparison column. Similarly, if the fold change is negative (−), the protein is significantly more abundant in substrate (right side) when compared substrate (left side) in the comparison column
Fig. 3Scatterplot of absolute fold change (in log scale) and −log10 (p value) obtained by pairwise comparisons of ESBPs and CAZymes across all growth conditions. The most significant proteins (p value < 0.01 and fold change > ×4) are colored. Each point represents the p value and fold change obtained by the pairwise comparison. Each rectangular box represents the locus tag of protein along with the most significant comparisons. The red colored substrate means the high abundance of proteins and green color means low abundance of protein
Fig. 4Clustering of proteins in a heat-map based on fold-change difference when one substrate is compared versus all five other substrates. a Glucose; b cellobiose; c xylose; d Avicel; e xylan; and f switchgrass. The darker yellow color in the heat-map refers to a group of proteins having maximum fold change and a darker yellow color is progressively decreasing in fold-change difference when a substrate is compared with another substrate. The black rectangular box refers to the protein clusters that have highest fold-change difference versus another substrate. The functional description of the proteins in these clusters are mentioned underneath each heat-map
Fig. 5Volcano plot showing differentially abundant proteins based on fold change versus t test probability. The plot is obtained with the proteomic approach when comparing extracellular proteome metrics obtained by growing C. bescii in C5 substrates versus C6 substrates. Green dots represent the proteins that have a p value < 0.05 and > 2 fold change. The green dots on the left side of the plot are the proteins that are differentially more abundant and specific in C5 substrates. Similarly, green dots on the right side of the plot are the proteins that are differentially more abundant and specific in C6
Fig. 6Hierarchical clustering based on individual protein z-score of the significant proteins (p value < 0.05) obtained from pairwise comparison of simple substrates versus complex substrates comparison. The red rectangular box shows the proteins that were most differentially abundant and specific to complex substrates. The reported abundances are z-scores of log2 transformed intensity
Fig. 7Volcano plot showing differentially abundant proteins of unknown functions. The plot is a merged volcano plot obtained with the proteomic approach when comparing extracellular proteins of unknown functions obtained by growing C. bescii in C5 substrates versus C6 substrates and complex substrates versus simple substrates. Green dots represent the proteins that have a p value < 0.01 and > ×2 fold change. The green dots on the left side of the plot are the proteins that are differentially more abundant in C5/Complex substrates (as mentioned in the parentheses). Similarly, green dots on the right side of the plot are the proteins that are differentially more abundant in C6/Simple substrates (as mentioned in the parentheses)