Literature DB >> 25005081

A new crystal form of a hyperthermophilic endocellulase.

Misumi Kataoka1, Kazuhiko Ishikawa1.   

Abstract

The hyperthermophilic glycoside hydrolase family endocellulase 12 from the archaeon Pyrococcus furiosus (EGPf; Gene ID PF0854; EC 3.2.1.4) catalyzes the hydrolytic cleavage of the β-1,4-glucosidic linkage in β-glucan in lignocellulose biomass. A crystal of EGPf was previously prepared at pH 9.0 and its structure was determined at an atomic resolution of 1.07 Å. This article reports the crystallization of EGPf at the more physiologically relevant pH of 5.5. Structure determination showed that this new crystal form has the symmetry of space group C2. Two molecules of the enzyme are observed in the asymmetric unit. Crystal packing is weak at pH 5.5 owing to two flexible interfaces between symmetry-related molecules. Comparison of the EGPf structures obtained at pH 9.0 and pH 5.5 reveals a significant conformational difference at the active centre and in the surface loops. The interfaces in the vicinity of the flexible surface loops impact the quality of the EGPf crystal.

Entities:  

Keywords:  Pyrococcus furiosus; endocellulase; hyperthermophile

Mesh:

Substances:

Year:  2014        PMID: 25005081      PMCID: PMC4089524          DOI: 10.1107/S2053230X14010930

Source DB:  PubMed          Journal:  Acta Crystallogr F Struct Biol Commun        ISSN: 2053-230X            Impact factor:   1.056


Introduction

Cellulases are the most important industrial enzymes for biomass utilization, since the enzyme plays a key role in the degradation of β-glucan cellulose. Recent research into biofuel production from lignocellulose biomass has accelerated the development of cellulases optimized for efficient biomass breakdown to monosaccharides, known as saccharification. A hyperthermophilic cellulase would be very useful in industrial applications because enzymatic reactions occurring at high temperature have many merits, such as a reduced risk of microbial contamination, increased solubility of the substrate and improved transfer rate. Therefore, much research has focused on developing thermophilic cellulases with high activity. Hyperthermophilic β-1,4-endocellulases (endo-type cellulases) have been found in the genome databases of several hyperthermophilic archaea. The hyperthermophilic archaea Pyrococcus horikoshii and P. furiosus have glycoside hydrolase (GH) family 5 (EGPh) and family 12 endocellulases (EGPf), respectively. Each family of enzymes shows different substrate specificities and exhibits hydrolytic activity at high temperatures. The optimal and denaturing temperatures of EGPh are 100 and 103°C, respectively (Kim & Ishikawa, 2013 ▶), and those of EGPf are 100 and 112°C, respectively (Bauer et al., 1999 ▶). The crystal structures of two hyperthermophilic endocellulases have been determined (Kim & Ishikawa, 2010 ▶; Kim et al., 2012 ▶). The structure of EGPf was determined at an atomic resolution of 1.07 Å (Kim et al., 2012 ▶; PDB entry 3vgi). Under the crystallization conditions used by Kim and coworkers, the EGPf crystal form has symmetry consistent with space group P21212 and one EGPf molecule is present per asymmetric unit. The optimum pH of EGPf is reported to be approximately pH 6.0 (Bauer et al., 1999 ▶), but the crystal was prepared at pH 9.0 (Kataoka et al., 2012 ▶). Here, we describe a new crystal form of EGPf prepared at pH 5.5 and compare it with the previously solved EGPf structure.

Materials and methods

Protein preparation

We prepared recombinant EGPf using a method similar to that described previously (Kim et al., 2012 ▶). The plasmid containing the full-length EGPf used the vector pET-11a (Novagen, Madison, Wisconsin, USA) and was introduced into Escherichia coli strain BL21 (DE3) pLysS. The truncated protein gene (EGPfΔN30), with a deletion of 30 amino-acid residues (signal sequence and the proline and hydroxyl residue-rich regions) at the N-terminal region of EGPf, was constructed by PCR and inserted into the pET-11a vector. pET-11a was introduced into E. coli BL21 (DE3) cells for recombinant protein expression. The cells were cultured in LB containing 100 mg l−1 sodium ampicillin at 37°C to an OD600 of 0.6, and isopropyl β-d-1-thiogalactopyranoside (IPTG) was then added. After 16 h culture at 30°C, the cells were harvested by centrifugation (5000g, 15 min, 4°C). The cells were suspended in 20 mM TrisHCl pH 8.0 containing 0.7 M ammonium sulfate and then homogenized by ultrasonication (40 W, 20 kHz) for 15 min on ice. The homogenates were heated at 70°C for 15 min. After removing the cell debris by centrifugation (15 000g, 20 min, 4°C), the enzyme solution was filtered (0.2 µm) and applied to hydrophobic interaction column chromatography (using a HiTrap Phenyl column). The flow rate for the column chromatography was 2.0 ml min−1 using 20 mM TrisHCl pH 8.0 buffer with a gradient of 0.7–0 M ammonium sulfate. Gel-filtration chromatography (using a HiLoad 26/60 Superdex 200 pg column) was carried out using 20 mM TrisHCl pH 8.0 buffer with a flow rate of 2.0 ml min−1. The purity and molecular weight of the protein were analyzed by SDS–PAGE. The concentration of EGPfΔN30 (molecular weight 30 540.48 Da) was determined from the UV absorbance at 280 nm, using 81 790 as the molar extinction coefficient, which was calculated from the protein sequence (UniProt ID E7FHY8; Gill & von Hippel, 1989 ▶).

Crystallization

The purified EGPfΔN30 was concentrated to 17 mg ml−1 and then dialyzed against 20 mM TrisHCl pH 8.0 using an Amicon Centricon YM-10 (Millipore, Billerica, Massachusetts, USA) by centrifugation (5000g, 4°C). The crystals of EGPfΔN30 were grown at 22°C using a reservoir solution composed of 100 mM CHC (2:3:4 citric acid:HEPES:CHES) buffer pH 5.5, 200 mM lithium sulfate, 5%(v/v) ethanol by the hanging-drop vapour-diffusion method. Typically, drops consisting of 1 µl protein solution and 1 µl reservoir solution were equilibrated against 450 µl reservoir solution.

Data collection and processing

The selected crystals were harvested and immersed in cryoprotectant solution consisting of 30%(v/v) glycerol in mother liquor. The soaked crystal was collected using a Cryo-Loop (Hampton Research, Aliso Viejo, California, USA) and immediately flash-cooled under a stream of nitrogen gas at −173°C. X-ray diffraction data for a single crystal measurement were collected using an MX-300HE CCD detector (Rayonix, Evanston, Illinois, USA) on the SPring-8 BL44XU beamline (Hyogo, Japan). The diffraction data set extended to 1.68 Å resolution and was collected at a wavelength of 0.9 Å. The crystal-to-detector distance was 220 mm. The crystal was rotated 180° with an oscillation angle of 0.5° per frame. The data collected from diffraction measurements were merged, indexed, integrated and scaled using the programs in the HKL-2000 software package (Otwinowski & Minor, 1997 ▶). Data-collection statistics are presented in Table 1 ▶.
Table 1

Data-collection and refinement statistics for the structure of EGPfN30 at pH 5.5

Data collection
Wavelength ()0.9
Space group C2
Unit-cell parameters (, ) a = 134.7, b = 62.6, c = 86.3, = 95.1
Molecules per asymmetric unit2
Matthews coefficient (3Da1) (Matthews, 1968)2.5
Solvent content (%)51
Resolution range ()50.01.68 (1.711.68)
Total No. of observed reflections311544 (15000)
No. of unique reflections81623 (4029)
Average I/(I)14.9 (3.7)
R merge 0.068 (0.367)
Multiplicity3.8 (3.8)
Completeness (%)99.9 (99.9)
Refinement
No. of atoms
Protein4390
Glycerol102
Ca2+ 4
Water468
Resolution used in refinement ()43.01.68
R work /R free § 0.181/0.217
Wilson B factor (2)18
R.m.s.d., bond distances ()0.03
R.m.s.d., bond angles ()2.5
Mean overall B factor (2)27
Ramachandran plot
Most favoured regions (%)96.6
Disallowed regions (%)0.0
PDB code 3wq7

R merge = , where I(hkl) is the intensity of the ith measurement of reflection hkl, including symmetry-related reflections, and I(hkl) is their average.

R work = .

R free is R work for approximately 5% of the reflections that were excluded from the refinement.

R.m.s.d. bond distances and angles are r.m.s.d.s from ideal values (Engh Huber, 1991 ▶).

Structure determination and refinement

The EGPfΔN30 structure was determined by molecular replacement with MOLREP (Vagin & Teplyakov, 2010 ▶) in the CCP4 package (Winn et al., 2011 ▶), using the structure of EGPfΔN30 in the P21212 form as a search model (Kim et al., 2012 ▶; PDB entry 3vgi). Structure model building was performed with Coot (Emsley et al., 2010 ▶). The structure was refined using REFMAC5 (Murshudov et al., 2011 ▶).Water molecules were introduced at peaks over 3 r.m.s.d. in the F o − F c difference Fourier map fulfilling reasonable interactions with the protein model. A Ramachandran plot of the final structure was validated using PROCHECK (Laskowski et al., 1993 ▶). The values of the r.m.s.d. for comparisons of structures were calculated using SUPERPOSE (Krissinel & Henrick, 2004 ▶). Figures were prepared using PyMOL (DeLano, 2002 ▶). The refinement statistics are presented in Table 1 ▶. The interface between EGPfΔN30 and symmetric molecules was calculated using PISA (Protein Interfaces, Surfaces and Assemblies; Krissinel & Henrick, 2007 ▶).

Results and discussion

Structure of EGPf at pH 5.5

EGPf appears to be a secretory enzyme because of its signal sequence at the N-terminus. Recombinant EGPf without the signal sequence was expressed using the pET system. No recombinant enzyme crystals were obtained using the Crystal Screen (Hampton Research, Aliso Viejo, Califonia, USA) or Wizard I and II (Emerald Bio, Bainbridge Island, Washington, USA) crystallization screening kits. However, the recombinant product of a truncated protein gene (EGPfΔN30), in which the N-terminal 30 amino-acid residues (signal sequence and the proline and hydroxyl residue-rich regions) were deleted from EGPf, was crystallized at pH 9.0, and the structure was determined at 1.07 Å resolution (Kim et al., 2012 ▶; PDB entry 3vgi). Under these crystallization conditions, crystals grew with symmetry consistent with space group P21212, and one EGPf molecule is present per asymmetric unit. However, the enzymatic optimum pH of EGPf was reported to be approximately pH 6.0 (Bauer et al., 1999 ▶). Therefore, we attempted to prepare crystals at the more physiologically relevant pH of 5.0–6.0 in order to obtain the structure of the active site in EGPf at these conditions. Based on initial screening results, high-quality crystals of EGPfΔN30 were obtained using a reservoir solution consisting of 100 mM CHC (2:3:4 citric acid:HEPES:CHES) buffer pH 5.5, 200 mM lithium sulfate, 5%(v/v) ethanol at 22°C. The average size of each crystal was about 0.7 × 0.5 × 0.3 mm after one week (Fig. 1 ▶). This was a new crystal form with symmetry consistent with space group C2 that diffracted to 1.68 Å resolution. The data-collection statistics are summarized in Table 1 ▶. Determination of the structure of EGPfΔN30 was carried out by the molecular-replacement method using the previous structural data (Kim et al., 2012 ▶; PDB entry 3vgi). Two molecules of EGPfΔN30, labelled A pH5.5 and B pH5.5, were identified in the crystallographic asymmetric unit. The final model contains two monomer molecules with 270 amino-acid residues each. After refinement, the R factors were estimated to be R work = 0.181 and R free = 0.217. The structure of the enzyme consists of a β-jelly-roll fold (Fig. 2 ▶ a).
Figure 1

A photograph of the EGPfΔN30 crystals prepared at pH 5.5. The scale bar corresponds to 0.5 mm.

Figure 2

Comparison of the EGPfΔN30 structure at pH 5.5 and at pH 9.0 identified by colour: red, pH 5.5; blue, pH 9.0. (a) Wall-eyed stereoview of the overall crystal structure of EGPfΔN30 drawn as a ribbon model viewed from the front. The two EGPfΔN30 structures are superimposed on each other. (b) The structure of the entrance to the active-site cleft is changed between A pH5.5 and A pH9.0, as are the structures of the catalytic residues. (c) The r.m.s.d. values of the Cα atoms of A pH5.5/B pH5.5 and A pH5.5/A pH9.0. (d) B factors of the amino-acid residues of EGPfΔN30 at pH 5.5 (A pH5.5 and B pH5.5) and pH 9.0 (A pH9.0). Hydrogen bonds between the two molecules are indicated by a dotted red line. (e) Catalytic mechanism of EGPf in the first half-reaction. Here, the typical of a retaining enzyme is depicted in a schematic diagram. The two glucose residues correspond to the productive binding mode. pH 9.0: the acidic side chain of Glu178 adjacent to the nucleophile Glu197 maintains a negative charge. pH 5.5: the nucleophile Glu197 maintains a negative charge without the side chain of Glu178.

Comparison to the previously determined EGPf structure

The r.m.s.d. of the Cα atoms between A pH5.5 and B pH5.5 was 0.3 Å (Fig. 2 ▶ b). In both molecules, the DxDxDG calcium-binding motif (Asp68Glu76 and Asp142) was present in the loop region between the B1 and B2 strands and exhibited high B-factor values (Fig. 2 ▶ c). In B pH5.5, poor electron density was observed for the loop regions of B3–A5 (Gly131Asp156), B5–B6 (Thr182Asp194) and α-helix–B4 (Ser272Glu283), but the regions were interpretable. On the other hand, EGPf crystallized at pH 9.0 has symmetry consistent with P21212, with unit-cell parameters a = 58.0, b = 118.7, c = 46.8 Å (Kim et al., 2012 ▶; PDB entry 3vgi). Under the previous crystallization conditions, one molecule exists in the asymmetric unit (labelled A pH9.0). The r.m.s.d. of the Cα atoms between structures A pH5.5 and A pH9.0 was 0.2 Å (Fig. 2 ▶ b). Comparison of the structures at pH 5.5 and 9.0 showed that the structures of both main chains are the same. However, conformational changes were observed in the side chains of Trp62 and Glu178 (Fig. 2 ▶ a) located at the active-site cleft. Trp62 seems to contribute to the substrate binding (Kim et al., 2012 ▶; PDB entry 3vgi). The torsion angle (χ2) of the indole ring of Trp62 differs by approximately 20° between A pH5.5 and A pH9.0. Trp62 is not particularly mobile as the B factor of Trp62 is ∼20 Å2. Trp62 is located at subsite −4, at the entrance to the nonreducing side of the active-site cleft. On the other hand, the dihedral angle of the carboxyl group of Glu178 between A pH5.5 and A pH9.0 is approximately 80°. Because of this, the distance between Glu178 OE1 and Glu197 OE2 in A pH5.5 (3.2 Å) is larger than that in A pH9.0 (2.5 Å) by 0.7 Å. Glu178 is located to the back of two catalytic residues (Glu197, nucleophile; and Glu290, proton donor) and it is thought to be the proton donor to Glu197 (cellulase 12A from Thermotoga maritima; Cheng et al., 2011 ▶). Although the B-factor values for Glu197 of A pH5.5 and B pH5.5 are 16 and 35 Å2, respectively, the conformation of Glu178 did not change between the apo forms (A pH5.5 and B pH5.5). No other significant differences at the active centre were observed among the A pH9.0, A pH5.5 and B pH5.5 molecules. This result suggests that the conformational change of Glu178 is due to protonation of the carboxyl group of Glu178 and/or Glu197 at acidic pH. The position or state of nucleophile Glu197 is stabilized by Glu178 at pH 9.0. Glu178 seems to play a role in controlling the optimum pH of the enzymatic activity. From the catalytic mechanism of cellulase 12A from T. maritima (Cheng et al., 2011 ▶), Glu197 is identified as the catalytic nucleophile of EGPf (Fig. 2 ▶ e). In the first half-reaction, the acidic side chain of Glu178 adjacent to the nucleophile Glu197 is believed to maintain a negative charge (Fig. 2 ▶ e), as suggested by Cheng et al. (2011 ▶). This catalytic mechanism was supported by the structural data at pH 9.0 (Fig. 2 ▶ b). However, the structural data at pH 5.5 (Fig. 2 ▶ b) suggest another catalytic mechanism (Fig. 2 ▶ e). It is speculated that the side chain of Glu178 located at a distance of 3.2–3.7 Å from the nucleophile Glu197 is protonated and the nucleophile Glu197 maintains a negative charge in the first half-reaction.

Interfaces with symmetry-related molecules

The EGPfΔN30 structures at pH 5.5 and at pH 9.0 each interact with seven symmetry-related molecules (Fig. 3 ▶ a, Table 2 ▶): single primes (′) for the interacting molecules refer to molecules that are in the first layer relative to a central molecule (A pH5.5 or A pH9.0) and double primes (′′) refer to molecules that are in a second layer relative to the central molecule (B pH5.5). That is, seven interfaces (1–3, 1′–4′) at pH 5.5 and four interfaces (1′′–4′′) at pH 9.0 were formed (Table 2 ▶). The interfaces between monomers of the central molecule and the symmetry-related molecules are summarized in Table 2 ▶. The B-factor values of the amino-acid residues of the two determined structures are shown in Fig. 2 ▶(c). The average values for A pH5.5, B pH5.5 and A pH9.0 are 18, 33 and 12 Å2, respectively. In B pH5.5, the B factors of the overall and loop region of the surface are higher than for A pH5.5 and A pH9.0. In particular, the B factors of the DxDxDG calcium-binding motif and the B5–B6 loop region in A pH5.5/B pH5.5 exhibit higher values (44/54 Å2 and 19/51 Å2) than in A pH9.0 (14 and 11 Å2) (Figs. 2 ▶ c and 3 ▶ b). In B pH5.5, two interactions, 1′ (B pH5.5–B′′1) and 4′ (B–A′′2), in these regions have higher B-factor values than the other five interactions (1, 2, 3, 2′ and 3′) (Table 2 ▶). These interactions are likely to weaken the molecular packing because of the high fluctuation and flexibility of these regions. In contrast to the EGPfΔN30 structures at pH 5.5, A pH9.0 has stronger packing because of the lower flexibility of the interfaces. In conclusion, crystal packing is weaker and the quality of the EGPfΔN30 crystal at pH 5.5 is lower than at pH 9.0 because of the flexible interfaces 1′ and 4′.
Figure 3

Symmetry-related molecules and interfaces of A pH5.5 and A pH9.0. The single primes (′) refer to molecules that are in the first layer relative to A pH5.5 or A pH9.0 and the double primes (′′) refer to molecules that are in a second layer relative to B pH5.5. (a) Seven symmetry-related molecules, drawn as cartoon models, are viewed from the front and upper side. Characters in the molecules correspond to the interaction molecules in Table 2 ▶. (b) Interfaces 1′ and 4′ are drawn as tube models. Rainbow colours are used to show the high (red) and low (blue) B factors of the amino-acid residues.

Table 2

Interfaces between monomers of the determined molecule and the symmetric molecules

In the interface, single primes () refer to interactions with central molecule (A pH5.5 or A pH9.0) and double primes () refer to interactions with central molecule (B pH5.5). In the interaction molecule, single primes () of the interaction molecules refer to molecules that are in the first layer relative to a central molecule (A pH5.5 or A pH9.0) and double primes () refer to molecules that are in a second layer relative to that central molecule (B pH5.5).

InterfaceInteracting moleculeSymmetry operationInterface area (2)Solvation free energy gain (i G) (kcalmol1)Hydrogen bondsSalt bridges B factor (2)
A pH5.5
1 A1 4100.56016/16
2 A2 3606.14017/17
3 B1 2001.91027/25
B pH5.5
1 B1 4102.16038/52
2 A pH5.5 3802.06017/15
3 A1 3101.83319/18
4 A2 2200.83042/27
A pH9.0
1 A1 5002.36012/13
2 A2 4801.314212/12
3 A3 2200.51014/13
4 A4 2003.50013/16

The sum values of the gain on complex formation for the two surfaces.

Value of B factor at the interface belonging to each monomer.

PDB reference: hyperthermophilic endocellulase, 3wq7
  14 in total

1.  Crystallization and preliminary X-ray analysis of a hyperthermophilic endoglucanase from Pyrococcus furiosus.

Authors:  Misumi Kataoka; Han-Woo Kim; Kazuhiko Ishikawa
Journal:  Acta Crystallogr Sect F Struct Biol Cryst Commun       Date:  2012-02-23

2.  Inference of macromolecular assemblies from crystalline state.

Authors:  Evgeny Krissinel; Kim Henrick
Journal:  J Mol Biol       Date:  2007-05-13       Impact factor: 5.469

3.  Molecular replacement with MOLREP.

Authors:  Alexei Vagin; Alexei Teplyakov
Journal:  Acta Crystallogr D Biol Crystallogr       Date:  2009-12-21

4.  Structure of hyperthermophilic endocellulase from Pyrococcus horikoshii.

Authors:  Han-Woo Kim; Kazuhiko Ishikawa
Journal:  Proteins       Date:  2010-02-01

5.  An endoglucanase, EglA, from the hyperthermophilic archaeon Pyrococcus furiosus hydrolyzes beta-1,4 bonds in mixed-linkage (1-->3),(1-->4)-beta-D-glucans and cellulose.

Authors:  M W Bauer; L E Driskill; W Callen; M A Snead; E J Mathur; R M Kelly
Journal:  J Bacteriol       Date:  1999-01       Impact factor: 3.490

6.  Calculation of protein extinction coefficients from amino acid sequence data.

Authors:  S C Gill; P H von Hippel
Journal:  Anal Biochem       Date:  1989-11-01       Impact factor: 3.365

7.  Solvent content of protein crystals.

Authors:  B W Matthews
Journal:  J Mol Biol       Date:  1968-04-28       Impact factor: 5.469

8.  Atomic resolution of the crystal structure of the hyperthermophilic family 12 endocellulase and stabilizing role of the DxDxDG calcium-binding motif in Pyrococcus furiosus.

Authors:  Han-Woo Kim; Misumi Kataoka; Kazuhiko Ishikawa
Journal:  FEBS Lett       Date:  2012-03-06       Impact factor: 4.124

9.  The role of disulfide bond in hyperthermophilic endocellulase.

Authors:  Han-Woo Kim; Kazuhiko Ishikawa
Journal:  Extremophiles       Date:  2013-04-27       Impact factor: 2.395

10.  REFMAC5 for the refinement of macromolecular crystal structures.

Authors:  Garib N Murshudov; Pavol Skubák; Andrey A Lebedev; Navraj S Pannu; Roberto A Steiner; Robert A Nicholls; Martyn D Winn; Fei Long; Alexei A Vagin
Journal:  Acta Crystallogr D Biol Crystallogr       Date:  2011-03-18
View more
  1 in total

Review 1.  Lignocellulose degradation mechanisms across the Tree of Life.

Authors:  Simon M Cragg; Gregg T Beckham; Neil C Bruce; Timothy D H Bugg; Daniel L Distel; Paul Dupree; Amaia Green Etxabe; Barry S Goodell; Jody Jellison; John E McGeehan; Simon J McQueen-Mason; Kirk Schnorr; Paul H Walton; Joy E M Watts; Martin Zimmer
Journal:  Curr Opin Chem Biol       Date:  2015-11-14       Impact factor: 8.822

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.