| Literature DB >> 24984213 |
Richard A Heins1, Xiaoliang Cheng, Sangeeta Nath, Kai Deng, Benjamin P Bowen, Dylan C Chivian, Supratim Datta, Gregory D Friedland, Patrik D'Haeseleer, Dongying Wu, Mary Tran-Gyamfi, Chessa S Scullin, Seema Singh, Weibing Shi, Matthew G Hamilton, Matthew L Bendall, Alexander Sczyrba, John Thompson, Taya Feldman, Joel M Guenther, John M Gladden, Jan-Fang Cheng, Paul D Adams, Edward M Rubin, Blake A Simmons, Kenneth L Sale, Trent R Northen, Samuel Deutsch.
Abstract
Harnessing the biotechnological potential of the large number of proteins available in sequence databases requires scalable methods for functional characterization. Here we propose a workflow to address this challenge by combining phylogenomic guided DNA synthesis with high-throughput mass spectrometry and apply it to the systematic characterization of GH1 β-glucosidases, a family of enzymes necessary for biomass hydrolysis, an important step in the conversion of lignocellulosic feedstocks to fuels and chemicals. We synthesized and expressed 175 GH1s, selected from over 2000 candidate sequences to cover maximum sequence diversity. These enzymes were functionally characterized over a range of temperatures and pHs using nanostructure-initiator mass spectrometry (NIMS), generating over 10,000 data points. When combined with HPLC-based sugar profiling, we observed GH1 enzymes active over a broad temperature range and toward many different β-linked disaccharides. For some GH1s we also observed activity toward laminarin, a more complex oligosaccharide present as a major component of macroalgae. An area of particular interest was the identification of GH1 enzymes compatible with the ionic liquid 1-ethyl-3-methylimidazolium acetate ([C2mim][OAc]), a next-generation biomass pretreatment technology. We thus searched for GH1 enzymes active at 70 °C and 20% (v/v) [C2mim][OAc] over the course of a 24-h saccharification reaction. Using our unbiased approach, we identified multiple enzymes of different phylogentic origin with such activities. Our approach of characterizing sequence diversity through targeted gene synthesis coupled to high-throughput screening technologies is a broadly applicable paradigm for a wide range of biological problems.Entities:
Mesh:
Substances:
Year: 2014 PMID: 24984213 PMCID: PMC4168791 DOI: 10.1021/cb500244v
Source DB: PubMed Journal: ACS Chem Biol ISSN: 1554-8929 Impact factor: 5.100
Figure 1Phylogenetic tree of GH1 enzymes (n = 967) retrieved from sequence databases and metagenomic data. Branch and background color of the leaf labels indicate the phylogenetic origin of each GH1 sequence representative by kingdom (blue = bacteria, light green = archaea, red = eukaryota, purple = metagenome). Black rectangles in first outer circle show the GH1 representatives selected for synthesis on the basis of our criteria for maximizing phylogenetic space covered. Orange rectangles on second outer circle show examples of metagenome-derived representatives that define new clades.
Figure 2Overview of acoustic deposition process and NIMS data. (a) Samples are acoustically transferred from the assay plate to the NIMS chip. Individual reactions on the NIMS chip are ionized by a laser and detected by a time-of-flight mass spectrometer. (b) A representative mass spectrum from a single sample derived from the mean of 12 laser shots. The text above the peaks denotes the multiple substrates and products that can be resolved on the basis of mass tags that encode the identity of the substrate (CB = cellobiose, Lac = lactose, XB = xylobiose, Mal = maltose). (c) Enzyme assay results (n = 10,080) for all synthesized GH1 representatives (n = 175) displayed on a phylogenetic tree. Branches are colored according to kingdom. Four concentric outer circles indicate soluble expression and maximum temperature at which enzyme activity was observed for the cellobiose, lactose, and xylobiose NIMS substrates, respectively.
HPLC-Based Sugar Profiling of Selected GH1s toward a Panel of Relevant Carbohydrate Substratesa
| specific
activity (U mg–1) | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| enzyme | reaction temp (°C) | sophorose (β1,2)-Glc-Glc | laminaribiose (β1,3)-Glc-Glc | cellobiose (β1,4)-Glc-Glc | gentiobiose (β,6)-Glc-Glc | lactose (β1,4)-Gal-Glc | glucomannan (β1,4)-Glc-Man | galactobiose (β1,4)-Gal-Gal | mannobiose (β1,4)-Man-Man | laminarin (β1,3) and (β1,6) |
| Mesophillic | ||||||||||
| CR_14_Cow_Rumen | 30 | 35.1 | 54.6 | 8.6 | 0.2 | 0.4 | 1.8 | 0.1 | nd | 0.1 |
| CAA82733.1_ | 40 | 8.3 | 8.5 | 6.9 | 1.0 | 4.9 | 2.6 | 0.1 | nd | 0.1 |
| CR_19_Cow_Rumen | 40 | 25.1 | 20.0 | 3.1 | 0.7 | nd | 2.5 | nd | nd | 0.4 |
| AAF37730.1_ | 50 | 21.4 | 42.6 | 34.8 | 0.6 | 7.9 | 22.2 | 0.1 | 0.4 | 0.1 |
| AAZ81839.1_ | 55 | 86.6 | 95.1 | 70.8 | 6.9 | 16.9 | 34.0 | 0.9 | nd | 1.1 |
| ACJ34717.1_ | 60 | 52.0 | 71.6 | 4.2 | 0.6 | 0.6 | 3.3 | 0.1 | nd | 0.7 |
| ACL70277.1_ | 60 | 74.9 | 62.1 | 29.4 | 7.1 | 7.9 | 18.2 | 1.9 | nd | 1.0 |
| ADD27066.1_ | 60 | 5.1 | 4.6 | 2.8 | 1.8 | 0.4 | 0.5 | nd | nd | 3.7 |
| CAA42814.1_ | 65 | 78.8 | 127.7 | 3.2 | 1.0 | 7.3 | 5.2 | 0.9 | 0.2 | 0.4 |
| Thermophillic | ||||||||||
| ABW01253.1_ | 70 | 53.9 | 53.2 | 28.0 | 13.3 | 23.9 | 17.6 | 2.0 | nd | 0.3 |
| ACJ75238.1_ | 70 | 95.6 | 94.0 | 59.1 | 2.7 | 7.3 | 37.2 | 0.4 | 0.3 | 0.2 |
| ACJ76349.1_ | 70 | 98.5 | 100.6 | 38.2 | 13.7 | 58.7 | 27.7 | 2.4 | nd | 0.8 |
| ACZ42845.1_ | 75 | 48.8 | 75.9 | 49.8 | 15.9 | 61.7 | 46.1 | 10.2 | nd | 9.2 |
| AAA72843.1_ | 80 | 19.7 | 19.2 | 16.9 | 30.9 | 16.6 | 11.9 | 5.1 | nd | 8.5 |
| AAY81155.1_ | 80 | 38.1 | 48.4 | 8.4 | 3.6 | 1.9 | 4.5 | 1.9 | nd | 0.1 |
| AAF36392.1_ | 85 | 21.8 | 32.6 | 25.9 | 14.4 | 21.9 | 15.3 | 1.5 | 0.3 | 10.7 |
| ABS61401.1_ | 85 | 43.6 | 44.3 | 22.2 | 1.3 | 9.2 | 14.9 | 0.1 | 1.9 | 0.1 |
| AAL81332.1_ | 95 | 75.9 | 45.3 | 28.6 | 1.6 | 9.5 | 20.9 | 1.1 | 15.9 | 0.2 |
| ACM22958.1_ | 95 | 147.9 | 163.8 | 26.8 | 4.9 | 19.1 | 23.2 | 5.6 | nd | 5.1 |
A unit (U) is defined as 1 μmol of total sugar monomers produced in 1 min. Values represent the average of three measurements with a coefficient of variation less than 10%. nd = not detectable. β1,2 refers to a β1–2 glycosidic bond. Glc = glucose, Gal = galactose, Man = mannose.
Figure 3Activity of subgroup of GH1s toward phosphorylated substrates. (a) Multiple sequence alignment highlighting the SKY/SNY motif found in a subgroup of GH1s that showed no activity toward the NIMS substrates, along with the sequence from a 6-phospho-β-glucosidase GH1 from L. lactis with a known crystal structure (top). Four representative GH1s from this study that showed activity toward the NIMS and real substrates are shown for comparison (bottom). (b) A phylogenetic tree highlighting the subgroup of GH1s with the SKY/SNY motif; the three outer concentric rings show (i) expression, (ii) SKY/SNY motif identity, and (iii) their enzyme activity toward the phosphorylated chromogenic substrate, pNPβG6P.
Figure 4Activity of thermophilic GH1s in [C2mim][OAc]. (a) Specific activities of thermophilic GH1s toward cellobiose after a 24-h preincubation at 70 °C in the presence (orange) or absence (black) of 20% (v/v) [C2mim][OAc]. Text labels refer to the accession numbers for the tested enzymes. (b) Glucose and cellobiose release from [C2mim][OAc]-pretreated switchgrass after 24 h saccharification at 70 °C and 20% (v/v) [C2mim][OAc] in the presence of accessory glycosyl hydrolases. Numbers to the right denote % conversion of biomass into glucose for each enzyme.