| Literature DB >> 28598995 |
Ditte Hededam Welner1,2, David Shin1,2, Giovani P Tomaleri1,2, Andy M DeGiovanni1,2, Alex Yi-Lin Tsai1,3, Huu M Tran1,4, Sara Fasmer Hansen1,3, Derek T Green1, Henrik V Scheller1,3, Paul D Adams1,2.
Abstract
Molecular characterization of plant cell wall glycosyltransferases is a critical step towards understanding the biosynthesis of the complex plant cell wall, and ultimately for efficient engineering of biofuel and agricultural crops. The majority of these enzymes have proven very difficult to obtain in the needed amount and purity for such molecular studies, and recombinant cell wall glycosyltransferase production efforts have largely failed. A daunting number of strategies can be employed to overcome this challenge, including optimization of DNA and protein sequences, choice of expression organism, expression conditions, co-expression partners, purification methods, and optimization of protein solubility and stability. Hence researchers are presented with thousands of potential conditions to test. Ultimately, the subset of conditions that will be sampled depends on practical considerations and prior knowledge of the enzyme(s) being studied. We have developed a rational approach to this process. We devise a pipeline comprising in silico selection of targets and construct design, and high-throughput expression screening, target enrichment, and hit identification. We have applied this pipeline to a test set of Arabidopsis thaliana cell wall glycosyltransferases known to be challenging to obtain in soluble form, as well as to a library of cell wall glycosyltransferases from other plants including agricultural and biofuel crops. The screening results suggest that recombinant cell wall glycosyltransferases in general have a very low soluble:insoluble ratio in lysates from heterologous expression cultures, and that co-expression of chaperones as well as lysis buffer optimization can increase this ratio. We have applied the identified preferred conditions to Reversibly Glycosylated Polypeptide 1 from Arabidopsis thaliana, and processed this enzyme to near-purity in unprecedented milligram amounts. The obtained preparation of Reversibly Glycosylated Polypeptide 1 has the expected arabinopyranose mutase and autoglycosylation activities.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28598995 PMCID: PMC5466300 DOI: 10.1371/journal.pone.0177591
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Alphabetical list of the CWGTs that were screened.
| Protein | Gene identifier | CAZy/DUF | Source organism |
|---|---|---|---|
| ARAD1 | At2g35100 | GT47 | |
| ARAD2 | At5g44930 | GT47 | |
| EMB2756 | 100797815 | DUF616 | |
| FUT6 | At1g14080 | GT37 | |
| GALS1 | At2g33570 | GT92 | |
| GALS1 | 100787815 | GT92 | |
| Galt31A | At1g32930 | GT31 | |
| GAUT1 | At3g61130 | GT8 | |
| GAUT7 | At2g38650 | GT8 | |
| GlcAT14A | 101210575 | GT14 | |
| GUT1/IRX10L | At5g61840 | GT47 | |
| GUT1/IRX10L | 100783737 | GT47 | |
| GUT1/IRX10L | 100794632 | GT47 | |
| GUX1 | At3g18660 | GT8 | |
| IRX7 | At2g28110 | GT47 | |
| IRX8 | At5g54690 | GT8 | |
| IRX9 | At2g37090 | GT43 | |
| IRX9L | At1g27600 | GT43 | |
| IRX9L | 606306 | GT43 | |
| IRX14 | At4g36890 | GT43 | |
| IRX14 | 100777505 | GT43 | |
| MGD2 | At3g48820 | GT29 | |
| MUCI10 | 101291758 | GT34 | |
| MUCI10 | 4350696 | GT34 | |
| MUR3 | At2g20370 | GT47 | |
| NN | At1g53290 | GT31 | |
| DUF246 | 101299994 | DUF246 | |
| DUF246 | 101311123 | DUF246 | |
| DUF246 | 101776477 | DUF246 | |
| PARVUS | At1g19300 | GT8 | |
| PARVUS | 101228909 | GT8 | |
| PARVUS | 4336486 | GT8 | |
| RGP1 | At3g02230 | GT75 | |
| RGP1 | 100282614 | GT75 | |
| RGP1 | 100836426 | GT75 | |
| RGXT2 | At4g01750 | GT77 | |
| RRA | 100255856 | GT77 | |
| RRA2 | At1g75110 | GT77 | |
| DUF288 | 101253530 | DUF288 | |
| DUF288 | 101309981 | DUF288 | |
| DUF288 | 101510562 | DUF288 | |
| TBL13 | 100802467 | DUF231 | |
| TBL29 | 100825801 | DUF231 | |
| XEG113 | 100819032 | GT77 | |
| XXT1 | At3g62720 | GT34 | |
| XXT1 | 101262652 | GT34 |
46 CWGT sequences comprising 22 A. thaliana genes and 24 genes from other plants. These sequences were screened for optimal E. coli expression using the high-throughput screening pipeline established in this study.
* for uncharacterized gene sequences, protein name is inferred from sequence similarity to characterized proteins.
** proteins from four Domain of Unknown Function (DUF) protein families were included in this study since they likely represent hitherto un-recognized CWGTs [5] or cell wall polysaccharide O-acetyltransferases.
*** NN = no name annotated in TAIR (http://www.arabidopsis.org, [6]). Will be referred to as At1g53290 in this text.
Fig 1Flow diagram of the high-throughput expression screening pipeline.
Hits from the test library screening.
| Protein | Construct | Vector | Chaperone co-expression | Yield (μg/mL) |
|---|---|---|---|---|
| At1g53290 | Δ1–50 | pET55dest | DnaK, DnaJ, GrpE | 5.3 |
| At1g53290 | full-length | pET55dest | DnaK, DnaJ, GrpE | 18.0 |
| At1g53290 | Δ1–50 | pET32dest | DnaK, DnaJ, GrpE | 1.2 |
| At1g53290 | full-length | pET32dest | Trigger Factor | 0.4 |
| FUT6 | Δ1–12 and Δ507–519 | pET22dest | Trigger Factor | 1.8 |
| Galt31A | full-length | pET32dest | Trigger Factor | 0.6 |
| GUT1/IRX10L | Δ1–46 | pET55dest | DnaK, DnaJ, GrpE | 0.6 |
| GUT1/IRX10L | Δ1–46 | pET32dest | Trigger Factor | 1.0 |
| IRX9 | full-length | pET32dest | DnaK, DnaJ, GrpE | 4.6 |
| IRX9 | full-length | pET32dest | Trigger Factor | 0.8 |
| IRX9 | Δ1–72 | pET55dest | Trigger Factor | 7.0 |
| MUR3 | Δ1–100 | pET55dest | DnaK, DnaJ, GrpE | 6.8 |
| PARVUS | Δ1–39 | pET55dest | Trigger Factor | 1.6 |
| RGP1 | full-length | pET55dest | none | 111.0 |
| RGP1 | full-length | pET55dest | GroEL, GroES | 271.8 |
| RGP1 | full-length | pET32dest | Trigger Factor | 1.3 |
| RGP1 | full-length | pET32dest | DnaK, DnaJ, GrpE | 1.6 |
| XXT1 | full-length | pET32dest | Trigger Factor | 9.6 |
| XXT1 | full-length | pET32dest | DnaK, DnaJ, GrpE | 10.9 |
Fig 2The effect of salts, pH and additives in the lysis buffer on the solubility of RGP1.
Western blots against a cloning scar shows the varying amount of RGP1 in the soluble lysis fraction after lysing in different buffers. The arrow indicates the band corresponding to full-length RGP1 (42 kDa). Buffers 13 and 19–21, containing 1-2M salt, as well as the low-pH (4.3–5.5) buffers 11,12,15, 22 and 30 decrease the amount of soluble RGP1, while lower salt concentrations as well as several additives increase the amount of soluble RGP1 relative to the standard lysis buffer (std: 50 mM HEPES pH 8.0, 250 mM NaCl, 5 mM MgCl2). MSG = monosodium glutamate, Dex = dextran. *Condition 24: 100 mM triethanolamine, 80 mM sodium glutamate, 0.02% n-octyl-β-D-glucoside, 10% glycerol, 5mM MgSO4, pH 8.5. **Condition 29: 100 mM Tris, 100 mM KCl, 0.1% deoxycholate, 25% glycerol, pH 7.6. ***Condition 30: 100 mM potassium acetate, 50 mM NaCl, 0.05% dextran sulfate, 0.1% CHAPS, pH 5.5.
Fig 3SDS-PAGE of selected samples from the non-Arabidopsis library.
Coomassie-stained gels showing nickel affinity eluates of samples selected from the automated capillary electrophoresis procedure as described. Each lane is marked with the organism and protein name, as well as with the well number for reference to S6 Data. Bands that were excised for mass spectrometry are marked with a green dot (confirmed by mass spectrometry) or a red cross (not confirmed). The lower right gel has samples expressed in the periplasm (pET22DEST vector, no excisions), while the other three gels have samples expressed in the cytoplasm (pET55DEST vector). Sample b8 (upper left gel) has no detectable background on SDS-PAGE, but produces a normal band pattern in the automated capillary electrophoresis, indicating that the absence of bands on SDS-PAGE is likely due to faulty well loading. Ca = Cicer arietinum, Cs = Cucumis sativa, Fv = Fragaria vesca, Gm = Glycine max, Si = Setaria italica, Sl = Solanum lycopersicum, Vv = Vitis vinifera, Zm = Zea mays.
Hits from the non-Arabidopsis library screening.
| ID | Protein | Construct | Species | Yield (μg/mL) |
|---|---|---|---|---|
| D02_Clone01 | ZmRGP1 (a4) | Δ135–361 | 30.3 | |
| D04_Clone01 | FvMUCI10 (a6) | Δ1–91 | 35.9 | |
| D18_Clone03 | CsGlcAT14A (a9) | Δ1–29 and Δ343–396 | 86.3 | |
| D02_Clone02 | ZmRGP1 (b4) | Δ237–361 | 23.5 | |
| D08_Clone01 | SiDUF246 (b5) | Δ1–63 | 24.3 | |
| D13_Clone03 | GmIRX14 (b7) | Δ1–78 and Δ436–502 | 119.8 | |
| D04_Clone03 | FvMUCI10 (c6) | Δ1–91 and Δ393–457 | 120.6 | |
| D22_Clone01 | FvDUF288 (c7) | Δ1–95 | 110.8 | |
| D03_Clone01 | SlXXT1 (c9) | Δ1–46 | 40.2 | |
| D04_Clone04 | FvMUCI10 (d6) | Δ1–57 and Δ393–457 | 223.5 | |
| D22_Clone03 | FvDUF288 (e7) | Δ1–95 and Δ311–761 | 45.9 | |
| D06_Clone02 | GmGALS1 (e8) | Δ1–49 | 21.3 | |
| D21_Clone01 | SlDUF288 (e9) | Δ1–104 | 56.2 | |
| D13_Clone04 | GmIRX14 (f5) | Δ1–156 and Δ436–502 | 94.8 | |
| D23_Clone01 | VvRRA (g5) | Δ1–53 | 63.3 | |
| D20_Clone01 | GmEMB2756 (g6) | Δ1–104 | 20.0 | |
| D24_Clone01 | GmXEG113 (h6) | Δ1–52 | 41.2 |
The hits are ordered like in Fig 3 and marked with the well number for easy referral. The ID refers to S4 Table.
Fig 4Arabidopsis thaliana RGP1 scale-up, purification and activity determinations.
(A) Coomassie stained SDS-PAGE of the final chromatographic step yielding almost pure RGP1 (42 kDa). (B) Phosphate-release assay showing autoglycosylating or hydrolytic activity of RGP1 on UDP-glucose. (C) UDP-arabinose mutase activity of RGP1. High-pressure liquid chromatograms of authentic UDP-arabinopyranose (UDP-Arap) and UDP-arabinofuranose (UDP-Araf) standards (grey) overlaid with the chromatogram of the reaction mixture of UDP-Araf with recombinant, purified RGP1 (black).