Literature DB >> 25426413

Structure and stability of metagenome-derived glycoside hydrolase family 12 cellulase (LC-CelA) a homolog of Cel12A from Rhodothermus marinus.

Hiroyuki Okano1, Masashi Ozaki1, Eiko Kanaya1, Joong-Jae Kim1, Clement Angkawidjaja2, Yuichi Koga1, Shigenori Kanaya1.   

Abstract

Ten genes encoding novel cellulases with putative signal peptides at the N-terminus, termed pre-LC-CelA-J, were isolated from a fosmid library of a leaf-branch compost metagenome by functional screening using agar plates containing carboxymethyl cellulose and trypan blue. All the cellulases except pre-LC-CelG have a 14-29 residue long flexible linker (FL) between the signal peptide and the catalytic domain. LC-CelA without a signal peptide (residues 20-261), which shows 76% amino acid sequence identity to Cel12A from Rhodothermus marinus (RmCel12A), was overproduced in Escherichia coli, purified and characterized. LC-CelA exhibited its highest activity across a broad pH range (pH 5-9) and at 90 °C, indicating that LC-CelA is a highly thermostable cellulase, like RmCel12A. The crystal structure of LC-CelA was determined at 1.85 Å resolution and is nearly identical to that of RmCel12A determined in a form without the FL. Both proteins contain two disulfide bonds. LC-CelA has a 16-residue FL (residues 20-35), most of which is not visible in the electron density map, probably due to structural disorder. However, Glu34 and Pro35 form hydrogen bonds with the central region of the protein. ΔFL-LC-CelA (residues 36-261) and E34A-LC-CelA with a single Glu34 → Ala mutation were therefore constructed and characterized. ΔFL-LC-CelA and E34A-LC-CelA had lower melting temperatures (T m) than LC-CelA by 14.7 and 12.0 °C respectively. The T m of LC-CelA was also decreased by 28.0 °C in the presence of dithiothreitol. These results suggest that Glu34-mediated hydrogen bonds and the two disulfide bonds contribute to the stabilization of LC-CelA.

Entities:  

Keywords:  CD, circular dichroism; CM-cellulose, carboxymethyl cellulose; Crystal structure; DTT, dithiothreitol; FL, flexible linker; Flexible linker; GH family, glycoside hydrolase family; GdnHCl, guanidine hydrochloride; Glycoside hydrolase family 12 cellulase; LC-CelA, GH family 12 cellulase from leaf–branch compost; Leaf–branch compost; Metagenome; SP, signal peptide; Stability

Year:  2014        PMID: 25426413      PMCID: PMC4239480          DOI: 10.1016/j.fob.2014.10.013

Source DB:  PubMed          Journal:  FEBS Open Bio        ISSN: 2211-5463            Impact factor:   2.693


Introduction

A metagenomic approach is an efficient method to isolate novel enzymes useful for industrial purposes as well as for academic research [1-4]. In EXPO Park, Japan, leaves and branches cut from the trees are collected periodically, mixed with urea, and agitated for composting. The temperature increases up to ∼70 °C inside this compost [leaf–branch compost (LC)] and then decreases to ∼50 °C roughly one year later upon completion of composting. This compost is expected to be rich in various thermophilic microorganisms and is therefore a promising source of genes encoding novel thermostable enzymes. We have isolated a novel cutinase, LC-cutinase, with polyethylene terephthalate (PET) degrading activity [5], and 12 novel RNases H1, LC1 ∼ LC12-RNases H1 [6], from this compost using a metagenomic approach. Structural and functional studies of these enzymes have shown that LC-cutinase is a kinetically robust protein with a slow unfolding rate [7]; LC9-RNase H1 is a bacterial RNase H1 with an atypical DEDN active site motif, evolutionarily distinct from those with a typical DEDD active site motif [8]; and LC11-RNase H1 is a prokaryotic RNase H1 with a unique substrate recognition mechanism, which interacts with four non-consecutive, instead of consecutive, 2′-OH groups of the RNA strand of RNA/DNA substrates [9,10]. However, no cellulases have been isolated from this compost. Cellulases hydrolyze β-1,4-glycoside bonds in cellulose and have received much attention because of their potential application in the production of bioethanol from cellulosic biomass [11-15]. Cellulases consist of three types of enzymes, endo-1,4-β-d-glucanase (endoglucanase; EC 3.2.1.4), exo-1,4-β-d-glucanase (cellobiohydrolase; EC 3.2.1.91 and 3.2.1.176), and β-glucosidase (EC 3.2.1.21). It has recently be shown that lytic polysaccharide monooxygenase enzyme AA9, formerly known as glycoside hydrolase (GH) family 61, represent another class of cellulose degrading enzymes [16]. These enzymes synergistically hydrolyze cellulose to produce glucose [16,17]. According to the CAZy database (http://www.cazy.org/), endoglucanase, cellobiohydrolase and β-glucosidase are classified into twelve, five and six families respectively. GH family 12 is one of the endoglucanase families. A GH family 12 cellulase (Cel12A) from Rhodothermus marinus, termed RmCel12A, is one of the most thermostable endoglucanases identified to date [18]. RmCel12A has a putative signal peptide (SP, residues 1–17) and a flexible linker (FL, residues 18–37) at the N-terminus. The crystal structure of RmCel12A without these SP and FL regions (residues 38–261) has been determined in substrate-free [19] and substrate-bound [20] forms. It has been proposed that dimerization, and a cluster of aromatic residues in the active site cleft, contribute to the adaptation of RmCel12A to a high temperature environment [20]. Thermostable enzymes are generally more useful than thermolabile ones in industry because of their higher temperature, chemical, and pH stability, lower production cost, and longer shelf life [21-23]. In addition, as the reaction temperature increases, the reaction rate increases, the solubility of the substrate and product increases, viscosity of the reaction mixture decreases, and the risk of microbial contamination decreases. Therefore, it would be informative to examine whether thermostable enzymes with cellulose-degrading activity can be isolated from a leaf–branch compost by a metagenomic approach. In this report, we showed that ten novel cellulases were isolated from leaf–branch compost by a metagenomic approach. We also showed that one of them, LC-CelA, which has high amino acid sequence identity to RmCel12A, was highly thermostable. X-ray crystallographic studies and structure-based mutational studies indicate that hydrogen bonds formed between the C-terminal region of the FL and the central region of the protein, and the two disulfide bonds contribute to the stabilization of LC-CelA.

Materials and methods

Cells, plasmids, and enzymes

Escherichia coli BL21-CodonPlus(DE3)-RP was from Stratagene (La Jolla, CA, USA). Plasmid pET25b was from Novagen (Madison, WI, USA). E. coli BL21-CodonPlus(DE3)-RP transformants were grown in lysogeny broth (LB) medium (10 g Tryptone; 5 g Yeast extract; 10 g NaCl in 1 L H2O) supplemented with 50 mg L−1 ampicillin.

Construction of DNA library and screening

The 4-month-old leaf–branch compost made in EXPO Park, Japan was used to construct a metagenomic DNA library. The temperature and pH of this leaf–branch compost are 67 °C and pH 7.5. Extraction of DNA from this compost and construction of a DNA library for metagenomic study using CopyControl™ Fosmid Library Production kit (EPICENTRE Biotechnologies, Madison, WI, USA) were performed as described previously [5]. This DNA library was spread on LB-agar plates containing 12.5 μg mL−1 chloramphenicol, 0.01% l-arabinose, 0.5% CM-cellulose, and 0.1 mg mL−1 trypan blue. CM-cellulose and trypan blue have been used as a cellulose substrate and a chromogenic dye respectively for detection of cellulolytic activity [24]. The resultant plates were incubated at 37 °C for several days. Plasmids were extracted from colonies, which form halos around them due to hydrolysis of CM-cellulose. Genes encoding CM-cellulose degrading enzymes were identified by transposon mutagenesis using EZ-Tn5TM Promoter Insertion kit (EPICENTRE Biotechnologies), according to the procedures recommended by the supplier. Nucleotide sequence of the gene was determined by an ABI Prism 3100 DNA sequencer (Applied Biosystems, Tokyo, Japan). Oligonucleotides for sequencing were synthesized by Hokkaido System Science (Sapporo, Hokkaido, Japan).

Construction of plasmids

For construction of plasmid pET-LC-CelA used to overproduce LC-CelA (residues 20–261 of pre-LC-CelA) in a form with Met-Asp at the N-terminus, the gene encoding LC-CelA was amplified by PCR using the fosmid vector harboring the pre-LC-CelA gene as a template. The sequences of the PCR primers were 5′-GATCCATGGATCTGTTCCCAGAGAAAAATG-3′ for 5′-primer and 5′-CAAGAATTCACTTTAGCGTGCGGTC-3′ for 3′-primer, where underlines represent the NcoI site for 5′-primer and EcoRI site for 3′-primer. The resultant DNA fragment was digested with NcoI and EcoRI, and ligated into the NcoI- EcoRI sites of pET25b to generate plasmid pET-pelB-LC-CelA, which harbors the gene encoding the pelB-LC-CelA fusion protein. The small NdeI and NcoI fragment encompassing the pelB leader sequence was then deleted from this plasmid by digestion with NdeI and NcoI, the 5′-overhang filled in by Klenow DNA polymerase, and re-ligation to generate pET-LC-CelA. For constructions of plasmids pET-LC-CelA-His and pET-ΔFL-LC-CelA-His used to overproduce LC-CelA-His and ΔFL-LC-CelA-His (residues 36–261) in a form with Met at the N-terminus and a His-tag at the C-terminus, the genes encoding LC-CelA and ΔFL-LC-CelA were amplified by PCR using the fosmid vector harboring the pre-LC-CelA gene as a template. The sequences of the 5′-primers were 5′-GGATGCCATATGCTGTTCCCAGAG-3′ for LC-CelA and 5′-GAAGATCATATGACGGCTACAGTATGCG-3′ for ΔFL-LC-CelA. The sequence of the 3′-primer was 5′-TTCGTCGACGCGTGCGGTCAAAGTAAC-3′ for both LC-CelA and ΔFL-LC-CelA. In these sequences, underlines represent the NdeI site for 5′-primer and SalI site for 3′-primer. The resultant DNA fragments were digested with NdeI and SalI, and ligated into the NdeI- SalI sites of pET25b. Plasmid pET-E34A-LC-CelA-His used to overproduce E34A-LC-CelA-His in a form with Met at the N-terminus and a His-tag at the C-terminus was constructed by PCR with the KOD-plus mutagenesis kit (Toyobo Co., Ltd., Osaka, Japan), according to the procedures recommended by the supplier. Plasmid pET-LC-CelA-His was used as a template. The mutagenic primers were designed in such a way that the GAG codon for Glu34 was changed to GCT for Ala. PCR was performed in 25 cycles with Gene Amp PCR system 2400 (Applied Biosystems) using KOD DNApolymerase (Toyobo). PCR primers and mutagenic primers were synthesized by Hokkaido System Science. The nucleotide sequence was confirmed by an ABI Prism 3100 DNA sequencer (Applied Biosystems).

Overproduction and purification

E. coli BL21-CodonPlus(DE3) transformants with the pET25b derivatives were cultivated at 37 °C. When the absorbance of the culture at 600 nm reached around 0.5, isopropyl-β-d-thio galactopyranoside (IPTG) was added to the culture medium and cultivation was continued for an additional 4 h. Cells were then harvested by centrifugation at 6000g for 10 min, suspended in 10 mM TrisHCl (pH 8.0) containing 1 mM EDTA (TE buffer), disrupted by sonication lysis, and centrifuged at 30,000g for 30 min. The supernatant was collected, dialyzed against TE buffer, incubated at 70 °C for 30 min for heat treatment, and centrifuged at 30,000g for 30 min. The subsequent purification procedures were carried out at 4 °C. For purification of LC-CelA-His and its derivatives with a C-terminal His-tag, the supernatant obtained after heat treatment was dialyzed against 20 mM TrisHCl (pH 7.0) containing 10 mM imidazole and 0.3 M NaCl, and applied to a Ni Sepharose 6 Fast Flow column (GE Healthcare) equilibrated with the same buffer. The protein was eluted from the column by linearly increasing the imidazole concentration from 10 to 300 mM. The fractions containing the protein were collected and dialyzed against 10 mM TrisHCl (pH 7.0). For purification of LC-CelA without a His-tag, the supernatant obtained after heat treatment was loaded onto a HiTrap Q HP column (GE Healthcare, Tokyo, Japan) equilibrated with TE buffer containing 1 mM DTT. The protein was eluted from the column by linearly increasing the NaCl concentration from 0 to 1 M. The fractions containing the protein were collected and applied to a Hi-Load 16/60 Superdex 200 pg column (GE Healthcare) equilibrated with TE buffer containing 50 mM NaCl for gel filtration chromatography. The fractions containing the protein were pooled. The production level of the protein in E. coli cells and the purity of the protein were analyzed by SDSpolyacrylamide gel electrophoresis [25] using a 12% polyacrylamide gel, followed by staining with Coomassie brilliant blue (CBB). The amount of the protein was estimated from the intensity of the band visualized by CBB staining using the Scion Image program. The N-terminal amino acid sequence of the protein was determined by a Procise automated sequencer model 491 (Applied Biosystems). The protein concentration was determined from the UV absorption on the basis that the absorbance of a 0.1% (1.0 mg mL−1) solution at 280 nm is 3.37 for LC-CelA, 2.96 for LC-CelA-His, 3.15 for ΔFL-LC-CelA-His and 2.97 for E34A-LC-CelA-His. These values were calculated by using ε = 1,526 M−1 cm−1 for tyrosine and 5225 M−1 cm−1 for tryptophan at 280 nm [26].

Sequence analysis

Blast searches of the amino acid sequences of metagenome-derived cellulases deduced from their nucleotide sequences were performed using the DDBJ blastp search tool (http://blast.ddbj.nig.ac.jp/blastn?lang = en). The flexible and hydrophilic regions of each amino acid sequence were predicted by using the PROTSCALE tool (http://web.expasy.org/protscale/). Calculation of pI for the corresponding amino acid region is performed by using the Compute pI/Mw tool (http://web.expasy.org/compute_pi/). Domain search was performed by using the SMART tool (http://smart.embl.de/) [34].

Determination of enzymatic activity

The enzymatic activity was determined at the temperatures indicated by the dinitrosalicylic acid (DNS) stopped method [27] using CM-cellulose as a substrate. The reaction mixture (100 μL) contained 100 mM sodium phosphate (pH 7.0) and 1% (w/v) CM-cellulose (low viscosity grade, Sigma–Aldrich Co., St Louis, MO, USA). The enzymatic reaction was initiated by adding an appropriate amount of the enzyme and terminated by adding 10 μL of 10% SDS and boiling for 5 min. The reaction time was 10 min. The resultant solution was mixed with 300 μL of the DNS solution prepared as described previously [27], boiled for 5 min, and cooled on ice. After centrifugation at 17,000g for 5 min, an aliquot of the supernatant (100 μL) was withdrawn, diluted twice by distilled water, and measured for absorption at 500 nm (A500). The amount of reducing sugars released from the substrate was estimated from the A500 value by using glucose as standard. One unit of enzymatic activity was defined as the amount of enzyme that produced 1 μmol of reducing sugars per min. For analysis of the temperature dependence of activity, the activity was determined at pH 7.0 and various temperatures ranging from 40 to 100 °C. For the analysis of pH dependence of activity, the activity was determined at 90 °C and various pHs ranging from pH 4.0 to 10.5. The buffers used for this analysis were 100 mM sodium citrate (pH 4.0–6.0), 100 mM sodium phosphate (pH 6.0–8.0), and 100 mM glycineNaOH (pH 8.0–10.5).

Measurement of circular dichroism (CD) spectra

The far-UV (200–260 nm) CD spectrum of the protein was measured at 25 °C on a J-725 spectropolarimeter (Japan Spectroscopic, Tokyo, Japan). The protein was dissolved in 10 mM TrisHCl (pH 7.0). The protein concentration was 0.1 mg mL−1 and a cell with an optical path length of 2 mm was used. The mean residual ellipticity (θ, deg cm2 dmol−1) was calculated using an average amino acid molecular mass of 110 Da.

Thermal denaturation

The thermal denaturation curve of the protein was obtained by monitoring the change in CD values at 222 nm as the temperature was increased. The protein was dissolved in 10 mM TrisHCl (pH 7.0) containing 3.0 M GdnHCl. The protein concentration and optical path length were 0.1 mg mL−1 and 2 mm, respectively. The rate of temperature increase was 3.0 °C min−1. The temperature of the midpoint of the transition, Tm, was calculated by curve fitting of the resultant CD values versus temperature data on the basis of a least-square analysis.

Crystallization

For crystallization, LC-CelA was dialyzed against 10 mM TrisHCl (pH 8.0) and concentrated to 10 mg mL−1 using an ultrafiltration system Centricon (Millipore, Billerica, MA, USA). The crystallization conditions were initially screened using crystallization kits from Hampton Research (Aliso Viejo, CA, USA) (Crystal Screens I and II) and Emerald BioStructures, Inc. & Emerald BioSystems (Bainbridge Island, WA, USA) (Wizard I, II, III and IV). The conditions were surveyed using sitting-drop vapor-diffusion method at 4 °C and 20 °C. Drops were prepared by mixing 1 μL each of the protein and reservoir solutions, and were vapor-equilibrated against a 100 μL reservoir solution. Crystals of LC-CelA appeared after a few weeks in Wizard I No. 28 [0.1 M 4-(2-hydroxyethyl)-1-piperazineethanesulfonic acid (HEPES) (pH 7.5), 0.2 M NaCl and 20% (w/v) polyethylene glycol (PEG) 3000]. To improve the crystal quality, the crystallization conditions were further optimized. Diffraction-quality crystals were obtained when the protein concentration was increased to 11.7 mg mL−1 and the reservoir solution was changed to 0.1 M HEPES (pH 7.5) containing 0.2 M NaCl and 23% (w/v) PEG 3350.

X-ray diffraction data collection and structure determination

X-ray diffraction data set of LC-CelA was collected at −173 °C without cryoprotectant at a wavelength of 0.9 Å with the beam line BL44XU at SPring-8 (Hyogo, Japan). Diffraction data set was indexed, integrated and scaled using the HKL2000 program suite [28]. The structure was solved by the molecular replacement method using MOLREP [29] in the CCP4 program suite [30]. The 1.85 Å structure of RmCel12A (PDB: 1H0B) was used as a starting model. Automated model building was done by using ArpWarp [31]. Structural refinement was carried out by using REFMAC [32] of the CCP4 program and the model was corrected using COOT [33]. The statistics for data collection and refinement are summarized in Table 3. The figures were prepared using PyMol (http://www. pymol.org).
Table 3

Data collection and refinement statistics of LC-CelA.

CrystalLC-CelA
Wavelength (Å)0.900
Space groupC121



Cell parameters
a, b, c (Å)130.78, 59.58, 74.95
α = γ, β (°)90.00, 122.89
Molecules/asymmetric unit2
Resolution range (Å)50.00–1.85 (1.88–1.85)a
Reflections measured297,834
Unique reflections41,281
Redundancy7.2 (7.4)
Completeness (%)99.4 (99.6)a
Rmerge (%)b12.0 (37.1)a
Average I/σ (I)21.8 (5.6)a



Refinement statistics
Resolution limits (Å)62.9–1.9
No. of atoms
Protein/water1798/507
Rwork (%)/Rfree (%)c16.2/20.5



Rms deviations from ideal values
Bond lengths (Å)0.012
Bond angles (°)1.200



Average B factors (Å2)
Protein15.2
Water27.2



Ramachandran plot statistics
Preferred regions (%)96.9
Allowed regions (%)3.1
Molprobity scored1.35

Values in parentheses are for the highest-resolution shell.

Rmerge = ∑|I — |/∑ I, where I is an intensity measurement for reflection with indices hkl and is the mean intensity for multiply recorded reflections.

Free R-value was calculated using 5% of the total reflections chosen randomly and omitted from refinement.

MolProbity score combines the clashscore, rotamer, and Ramachandran evaluations into a single score, normalized to be on the same scale as X-ray resolution [52].

Protein data bank accession number

The coordinates and structure factors for LC-CelA have been deposited in the Protein Data Bank under code PDB: 3WX5.

Results and discussion

Cloning of cellulase genes from metagenomic DNA library

The size of the metagenomic DNA library prepared from the leaf–branch compost was approximately 2.1 × 104 colony forming units (CFU). The restriction fragment length polymorphisms of ten randomly selected clones using restriction enzyme BamHI showed non-redundant patterns and an average insert size of 35 kb. Screening of the library for genes encoding cellulose degrading enzymes was performed using CT-agar plates (agar plates containing CM-cellulose and trypan blue). Of approximately 6000 clones screened, 24 clones gave a halo on CT-agar plates when they were incubated at 37 °C for 3 days. Ten of them, which gave a halo on CT-gellan gum plates (gellan gum plates containing CM-cellulose and trypan blue) at 50 °C, were chosen to determine the nucleotide sequences of the genes responsible for halo formation. Determination of the nucleotide sequences of these genes by transposon mutagenesis indicated that all ten clones harbored genes encoding cellulases, that were different from one another. Because these cellulases have a putative signal peptide (SP) at their N-terminus (as described below), they are termed pre-LC-CelA∼J. Clones harboring the genes encoding pre-LC-CelA, pre-LC-CelD, pre-LC-CelF, pre-LC-CelG, and pre-LC-CelI produced a halo on CT-gellan gum plates even at 80 °C, suggesting that these cellulases are highly thermostable.

Amino acid sequences of pre-LC-CelA∼J

Blast searches of the amino acid sequences of pre-LC-CelA∼J deduced from their nucleotide sequences indicated that all the cellulases are novel, as summarized in Table 1. The number of constituent amino acid residues in these cellulases varies from 261 to 782. The highest identity between the amino acid sequence of either one of these cellulases and that available from the database varies from 42% to 76%. This result indicates that a metagenomic approach is useful to increase our knowledge on sequence space of cellulase. The nucleotide sequences of the genes encoding pre-LC-CelA∼J are deposited in GenBank under accession numbers KF626648–KF626657.
Table 1

List of cellulases isolated from leaf–branch compost and proteins with the highest amino acid sequence identities.

CellulasesNo. of residuesProtein with the highest sequence identity
ProteinSource organismaAccession No.Identity (%)
Pre-LC-CelA261Glycoside hydrolase family 12Rhodothermus marinus (65 °C)G2SJ2976
Pre-LC-CelB596CellulasePlesiocystis pacifica SIR-1 (18 °C)A6G3Z750
Pre-LC-CelC515α-L-Arabinofuranosidase-like proteinOpitutus terrae (30 °C)B1ZZJ545
Pre-LC-CelD400Glycosyl hydrolase family 12Thermobispora bispora (50 °C)D6Y3Q345
Pre-LC-CelE286Cellulase 12AStreptomyces sp. 11AG8 (35 °C)Q9KIH143
Pre-LC-CelF782Uncharacterized proteinStreptosporangium roseum (26 °C)D2B80862
Pre-LC-CelG577Glycosyl hydrolase family 9Microcoleus sp. PCC 7113 (25 °C)K9WM6642
Pre-LC-CelH742CellulaseCandidatus methylomirabilis oxyfera (25 °C)D5MF1364
Pre-LC-CelI592Glucosidase-like glycosyl hydrolaseXenococcus sp. PCC 7305 (25 °C)L8M23257
Pre-LC-CelJ326Glycoside hydrolase family 6Frankia sp. EUN1f (30 °C)D3D8M153

The optimum growth temperature of each organism is shown in parenthesis.

The amino acid sequences of pre-LC-CelA∼J are schematically shown in Fig. 1. Domain searches using the SMART tool [34] allow us to predict a SP, a catalytic module of cellulase, and a cellulose binding domain/module. All ten cellulases have a 16–36 residue long SP at their N-terminus, suggesting that they are secretory proteins. They also have a catalytic module of cellulase. Three (pre-LC-CelA, pre-LC-CelD and pre-LC-CelE) have a catalytic module of GH family 12 cellulase, two (pre-LC-CelF and pre-LC-CelG) have a catalytic module of GH family 9 cellulase, and two (pre-LC-CelB and pre-LC-CelJ) have a catalytic module of GH family 6 cellulase. Pre-LC-CelI, pre-LC-CelH, and pre-LC-CelC have catalytic modules of GH family 3, 44, and 51 cellulases respectively. In addition, pre-LC-CelD and pre-LC-CelF have a cellulose binding domain II (CBDII) and pre-LC-CelF has a cellulose binding module 3 (CBM3). Analyses of the hydrophilic and flexible regions using the PROTSCALE tool allow us to predict a hydrophilic flexible linker (FL). All the cellulases, except for pre-LC-CelG, have a 14–29 residue long FL between the putative SP and the catalytic domain. Pre-LC-CelG has a 28 residue long FL within a catalytic module of cellulase. These FLs are acidic, because their pI values range from 3.28 to 5.52.
Fig. 1

Schematic representation of the primary structures of metagenome-derived cellulases from leaf–branch compost. Putative signal peptides (N-terminal open boxes), putative flexible linkers (black boxes), glycoside hydrolase family (GHF) domains (gray boxes), and cellulose binding domains/modules (C-terminal open boxes) are shown. The numbers above the sequence represent the positions of the N- and C-terminal residues of each protein. The numbers below the sequence represent the positions of the N- and C-terminal residues of each region or domain.

Pre-LC-CelA∼J are probably secreted in microorganisms as LC-CelA∼J, because SPs are removed by signal peptidase upon secretion. Of them, LC-CelA is expected to be the most stable cellulase, because it shows high amino acid sequence identity to GH family 12 cellulase from R. marinus (RmCel12A) (Table 1). RmCel12A has a 20 residue long FL (residues 18–37) [35]. Removal of this FL decreases the Tm value of RmCel12A by 8.4 °C and its half-life at 90 °C from 5 h to 2 h [35], suggesting that the FL contributes to the stabilization of RmCel12A. However, this stabilization mechanism remains to be understood, because the crystal structure of RmCel12A has been determined only in a form without the FL [19,20]. Therefore, in order to examine whether LC-CelA is, like RmCel12A, a highly thermostable enzyme, and to analyze the role of the FL, we decided to overproduce LC-CelA in E. coli, purify and characterize it, and determine its crystal structure.

Comparison of amino acid sequences of pre-LC-CelA and pre-RmCel12A

The amino acid sequence of pre-LC-CelA is compared with that of pre-RmCel12A in Fig. 2. Both proteins consist of 261 residues. However, the regions predicted as SP (residues 1–19) and FL (residues 20–35) in pre-LC-CelA are slightly different from those predicted as SP (residues 1–17) and FL (residues 18–37) in pre-RmCel12A. According to the crystal structure of ΔFL-RmCel12A (RmCel12A without FL, residues 38–261) in complex with the substrate [20], Trp9, Trp26, Trp69, Trp159, Trp161, Tyr163, and Trp209 (Trp45, Trp62, Trp104, Trp195, Trp197, Tyr199, and Trp245 respectively in pre-RmCel12A) form an extensive aromatic network in the active site cleft, to which the substrate binds. Asn24, His67, Arg100, Met136, Pro137, and Gly138 (Asn60, His103, Arg136, Met172, Pro173, and Gly174 respectively in pre-RmCel12A) also form this cleft. Two catalytic residues, Glu124 and Glu207 (Glu160 and Glu243 respectively in pre-RmCel12A) are located at this cleft. All of these residues are fully conserved in pre-LC-CelA.
Fig. 2

Alignment of the amino acid sequences of pre-LC-CelA and pre-Cel12A from Rhodothermus marinus (pre-RmCel12A). The accession numbers are KF626648 for pre-LC-CelA and G2SJ29 for pre-RmCel12A. For the pre-RmCel12A sequence, the identical residues with those in the pre-LC-CelA sequence are indicated by asterisks (∗). Gaps are shown by dashes. A putative signal peptide (SP) is underlined. The region predicted as a flexible linker (FL) between a putative SP and a catalytic domain is boxed. The amino acid residues responsible for dimerization and catalytic activity of RmCel12A are denoted above the sequences by open and closed circles respectively. The aromatic and non-aromatic amino acid residues that form the active site cleft of RmCel12A and corresponding residues of LC-CelA are shown in boldface. Numbers represent the positions of the amino acid residues that start from the initiator methionine residue for each protein. Numbers in parentheses represent the positions of the amino acid residues that start from the initiator methionine for the RmCel12A derivative without SP and FL used to determine the crystal structure [Met-(Thr38-Gln261)]. The ranges of the secondary structures of LC-CelA, which are identical to those of RmCel12A, are shown above the sequences.

Overproduction and purification of LC-CelA and LC-CelA-His

Because the SP of pre-RmCel12A is cytotoxic to E. coli cells, and the production level of pre-RmCel12A greatly increases on removal of the SP [35], LC-CelA (residues 20–261) was overproduced in E. coli without the SP, either in a non-His-tagged or a His-tagged form. LC-CelA in a non-His-tagged form with Met-Asp at its N-terminus is simply designated as LC-CelA, whereas LC-CelA in a His-tagged form with Met at the N-terminus and a His-tag at the C-terminus is designated as LC-CelA-His. LC-CelA and LC-CelA-His were used for structural analysis and biochemical characterization respectively. Upon induction for overproduction, LC-CelA and LC-CelA-His equally accumulated in E. coli cells, in both soluble and insoluble forms. The fraction of the protein in soluble form was 20–30%. This protein was purified to give a single band on SDS–PAGE by heat treatment, followed by nickel affinity chromatography (LC-CelA-His) or ion-exchange and gel filtration chromatographies (LC-CelA) (data not shown). The amount of protein purified from 1 L culture was typically 3 mg for both LC-CelA and LC-CelA-His. The N-terminal amino acid sequence of LC-CelA was determined to be Met-Asp-Leu-Phe-, indicating that LC-CelA contains the entire FL region. The molecular mass of LC-CelA was estimated to be 29 kDa by gel filtration chromatography. This value is comparable to the calculated mass (26.9 kDa), suggesting that LC-CelA exists as a monomer. ΔFL-RmCel12A has been reported to exist as a dimer [20]. The interface of the two monomers is stabilized by two salt bridges formed between Glu4 (Glu40 in pre-RmCel12A) of one monomer and Arg47 (Arg83 in pre-RmCel12A) of the other. Arg47 is conserved as Arg81 in LC-CelA, whereas Glu4 is replaced by Thr38 in LC-CelA. LC-CelA does not form a dimer, probably because the salt bridges that stabilize the interface of two monomers are not formed.

Activity of LC-CelA-His

The temperature dependence of the enzymatic activity of LC-CelA-His was analyzed by measuring the activity at various temperatures ranging from 40 to 100 °C using CM-cellulose as a substrate. As shown in Fig. 3A, LC-CelA-His exhibited the highest activity at 90 °C. This temperature was slightly lower than, but comparable to, that of RmCel12A in a form without the SP, which has been reported to be ⩾100 °C [35]. However, the specific activity of LC-CelA-His determined at 60 °C, 4.2 units mg−1, was slightly higher than that of RmCel12A without the SP determined at 65 °C (3.1 units mg−1) [35]. These results indicate that LC-CelA-His is a highly thermostable enzyme, like RmCel12A, and is slightly more active than RmCel12A.
Fig. 3

Optimum temperature and pH for activity of LC-CelA-His. The temperature (A) and pH (B) dependencies of the enzymatic activity of LC-CelA-His are shown. The activity was determined at pH 7.0 and the temperatures indicated (A) or at 90 °C and the pHs indicated (B) using 1% (w/v) carboxymethyl-cellulose (CM-cellulose) as a substrate, as described in Experimental procedures. The buffers used to analyze the pH dependence of the activity were 100 mM sodium citrate (pH 4.0–6.0), 100 mM sodium phosphate (pH 6.0–8.0) and 100 mM Glycine–NaOH (pH 8.0–10.5). The experiment was carried out at least twice, and errors from the average values are indicated by vertical lines.

The pH dependence of the enzymatic activity was analyzed by measuring the activity at various pHs ranging from pH 4.0 to 10.5. As shown in Fig. 3B, LC-CelA-His exhibited nearly maximal activity across a broad pH range (pH 5.0–9.0). RmCel12A also exhibits nearly maximal activity over a broad pH range (pH 5.0–8.0), but shows only ∼50% of the maximal activity at pH 9.0 [18]. This result indicates that the pH range suitable for activity of LC-CelA-His is larger than that of RmCel12A.

Stability of LC-CelA-His against heat inactivation

To analyze the stability of LC-CelA-His against irreversible heat inactivation, the enzyme (0.05 mg mL−1) was incubated in 100 mM sodium phosphate (pH 7.0) at various temperatures ranging from 60 to 100 °C for 30 min, and the residual activity was determined at pH 7.0 and 60 °C. As shown in Fig. 4, LC-CelA-His almost fully retained activity after incubation at 90 °C for 30 min, whereas it almost totally lost activity after incubation at 95 °C for 30 min. This result is consistent with the observation that the optimum temperature for LC-CelA-His activity is 90 °C (Fig. 3A). LC-CelA-His almost completely lost activity after incubation at 100 °C for 30 min (Fig. 4), but it exhibited approximately 80% of the maximal activity in activity assays at 100 °C (Fig. 3A), probably because the enzyme hydrolyzed the substrate before it was fully denatured in those assay conditions.
Fig. 4

Stability of LC-CelA-His and its derivatives against heat inactivation. LC-CelA-His was incubated at pH 7.0 and the temperatures indicated for 30 min in the absence of DTT (filled circle) or in the presence of 1 mM DTT (open circle). ΔFL-LC-CelA-His (filled square) and E34A-LC-CelA-His (filled triangle) were also incubated at the same condition in the absence of DTT. The residual activities were determined at pH 7.0 and 60 °C using CM-cellulose as a substrate, as described in Experimental procedures. The experiment was carried out at least twice, and errors from the average values are indicated by vertical lines.

Thermal denaturation of LC-CelA-His

To analyze the stability of LC-CelA-His more quantitatively, thermal denaturation of this protein was analyzed at pH 7.0 in the presence of 3 M guanidine hydrochloride (GdnHCl) by monitoring the change in CD values at 222 nm. Thermal denaturation of the protein was reversible in this condition. It was necessary to use 3 M GdnHCl because the protein was not fully denatured even at 100 °C in the presence of ⩽2 M GdnHCl. Fig. 5 shows the thermal denaturation curve of LC-CelA-His measured in the presence of 3 M GdnHCl. The midpoint of the transition of this curve, Tm, is 86.8 °C (Table 2). The Tm value of RmCel12A has been reported to be 102.9 °C in the absence of denaturant [35]. Because LC-CelA-His is not fully denatured even at 100 °C in the absence of denaturant, LC-CelA-His may be nearly as stable as RmCel12A.
Fig. 5

Thermal denaturation curves of LC-CelA-His and its derivatives. Thermal denaturation curves of LC-CelA-His (thin dashed line), ΔFL-LC-CelA-His (thick solid line) and E34A-LC-CelA-His (thick dashed line) obtained in the absence of DTT and that of LC-CelA-His obtained in the presence of 1 mM DTT (thin solid line) are shown. These curves were obtained at pH 7.0 in the presence of 3M GdnHCl by monitoring the change in CD values at 222 nm as described in Experimental procedures.

Table 2

Activities and stabilities of LC-CelA-His and its derivatives.

Protein[DTT](mM)Specific activitya(U/mg)Relative activity(%)Tmb(°C)ΔTmc(°C)
LC-CelA-His04.210086.8
14.210058.8−28.0
ΔFL-LC-CelA-His03.99372.1−14.7
E34A-LC-CelA-His04.09674.8−12.0

The specific activity was determined at 60 °C either in the presence or absence of DTT using CM-cellulose as a substrate, as described in Experimental procedures. Each experiment was carried out at least twice and the average value is shown. Errors are within 10% of the values reported.

The melting temperature (Tm), which is the temperature of the midpoint of the transition, was determined from the thermal denaturation curves shown in Fig. 5.

ΔTm = Tm determined − 86.8 °C.

Activity and stability of LC-CelA

To examine the effects of attachment of a C-terminal His-tag, the activity and stability of LC-CelA were determined and compared with those of LC-CelA-His. The activity of LC-CelA was determined at 60 °C using CM-cellulose as a substrate. The specific activity of LC-CelA determined at this temperature was 4.3 units mg−1, which was comparable to that of LC-CelA-His (4.2 units mg−1). The stability of LC-CelA was analyzed at pH 7.0 in the presence of 3 M GdnHCl by CD spectroscopy. Thermal denaturation of LC-CelA was reversible in this condition. The Tm value of this protein was determined to be 86.0 °C, which was comparable to that of LC-CelA-His (86.8 °C). These results indicate that attachment of a C-terminal His-tag does not significantly affect the activity or stability of LC-CelA.

Crystal structure of LC-CelA

To examine whether the FL region of LC-CelA is folded into a defined structure, the crystal structure of LC-CelA was determined at 1.85 Å resolution. The asymmetric unit of the crystal structure consists of two LC-CelA molecules (A and B). Both molecules consist of residues 33–261. The N-terminal Met-Asp and most of the FL region (residues 20–32) are not visible in the electron density map probably due to structural disorder. The structures of these two molecules are nearly identical with a root-mean-square deviation (RMSD) value of 0.08 Å for 229 Cα atoms. We used the structure of molecule A in this study. Details of the data-collection statistics and refinement are summarized in Table 3. The overall structure of LC-CelA (molecule A) is shown in Fig. 6A. This structure belongs to the typical β-jelly roll fold and consists of two β-sheets that pack with each other and one α-helix. The structure of LC-CelA excluding the FL region is nearly identical to that of ΔFL-RmCel12A (PDB: 1H0B) with a RMSD value of 0.33 Å for 226 Cα atoms. Superimposition of the structure of LC-CelA around the active site onto that of the ΔFL-RmCel12A-substrate (cellopentose) complex determined by the soaking method [20] shows that the steric configurations of all aromatic and non-aromatic residues forming the active site cleft of LC-CelA, including two acidic catalytic residues (Glu158 and Glu241), are nearly identical to those of RmCel12A (Fig. 6B). In the active site cleft of LC-CelA, a band of aromatic residues, mostly tryptophan residues, probably provides a series of glucose-binding subsites through stacking interactions with glucose residues, and charged and polar residues provide specificity through hydrogen bonding interactions with sugar molecules, as reported for RmCel12A [20]. In the crystal structure of ΔFL-RmCel12A, a HEPES molecule binds to the active site [19]. However, electron density of this molecule was not clearly detected in the LC-CelA structure, suggesting that no HEPES molecule binds strongly to the active site of LC-CelA. This may be due to slight differences in the active site geometry. The LC-CelA structure shows that two disulfide bonds are formed between Cys40 and Cys67 and between Cys100 and Cys105. The Cys40Cys67 bond apparently anchors the N-terminal region of the catalytic domain (loop between βA1 and βB1 strands) to the central region (βA2 strand). The Cys100Cys105 bond forms a short loop (residues 101–104). The Cys40Cys67 bond is well conserved in GH family 12 cellulases from various organisms, whereas the Cys100Cys105 bond is conserved only in GH family 12 cellulases from R. marinus and Streptomyces species.
Fig. 6

Crystal structure of LC-CelA. (A and B) The entire structure (A) and the structure around the active site (B) of LC-CelA are superimposed on those of RmCel12A in complex with the substrate (D(+)-cellopentose) (PDB: 2BWC). The structures of LC-CelA and RmCel12A are colored cyan and gray respectively. The region predicted as a flexible linker (FL) of LC-CelA is colored orange. The central region of LC-CelA that contacts FL is colored green. Two acidic active site residues (Glu158 and Glu241), two disulfide bonds (Cys40–Cys67 and Cys100–Cys105), twelve residues forming the substrate binding site of LC-CelA, and C-terminal two residues of FL (Glu34 and Pro35) are shown as stick models with labels. The corresponding residues of RmCel12A, except for those corresponding to Glu34 and Pro35, and the substrate bound to RmCel12A are also shown as stick models. In these stick models, the oxygen, nitrogen and sulfur atoms are colored red, blue and yellow respectively. The substrate and the residues forming the substrate binding site are not shown in panel A for simplicity. Six β-strands forming sheet A (βA1–βA6), nine β-strands forming sheet B (βB1–βB9), βC1-strand, and one α-helix are shown in panel A. N and C represent the N- and C-termini. (C) Electron density around the region, where hydrogen bonds are formed between the FL and central regions of LC-CelA. The 2Fo–Fc map contoured at the 1.5σ level is shown. The side chain and main chain atoms are shown as stick models, where the oxygen and nitrogen atoms are colored red and blue respectively, and the carbon atoms are colored orange and green for the FL and central regions respectively as in panel A. Three hydrogen bonds formed between Glu34 Oδ1 and Thr74 N, between Glu34 Oδ1 and Glu73 N, and between Pro35 O and Leu72 N are shown by broken lines together with their distances. (For interpretation of the references to color in this figure legend, the reader is referred to the web version of this article.)

As noted above, most of the FL region is structurally disordered. However, the C-terminal region of the FL (Asp33-Pro35) is visible in the electron density map (Fig. 6C), indicating that this region is not included in the FL. Interestingly, this region interacts with the central region (loop between βA2 and βA3 strands) via three hydrogen bonds, formed between Glu34 Oδ1 and Thr74 N, Glu34 Oδ1 and Glu73 N, and Pro35 O and Leu72 N. Because all of these residues are conserved in RmCel12A, these hydrogen bonds are probably also formed in that enzyme. We used the terminology of FL for the region predicted as FL (Leu20-Pro35), instead of the actual one (Leu20-Glu32), in this study.

Stability of ΔFL-LC-CelA-His and E34A-LC-CelA-His

To examine whether hydrogen bonding interactions between the FL and central regions contribute to the stabilization of LC-CelA, ΔFL-LC-CelA-His and E34A-LC-CelA-His were constructed. ΔFL-LC-CelA-His and E34A-LC-CelA-His are the LC-CelA-His derivatives without the FL and with a single Glu34Ala mutation, respectively. These proteins were purified to give a single band on SDS–PAGE as was LC-CelA-His by heat treatment, followed by the nickel affinity chromatography. The amounts of these proteins purified from 1 L culture were similar to that of LC-CelA or LC-CelA-His (3 mg). The far-UV CD spectra of ΔFL-LC-CelA-His and E34A-LC-CelA-His were similar to that of LC-CelA-His (Fig. 7), suggesting that neither removal of the FL region nor a single mutation in the FL significantly affects the gross structure of LC-CelA.
Fig. 7

CD spectra of LC-CelA-His and its derivatives. The far-UV CD spectrum of LC-CelA-His was measured at pH 7.0 and 25 °C either in the absence of DTT (thin dashed line) or in the presence of 1 mM DTT (thin solid line), as described in Experimental procedures. The far-UV CD spectra of ΔFL-LC-CelA-His (thick solid line) and E34A-LC-CelA-His (thick dashed line) were also measured at pH 7.0 and 25 °C in the absence of DTT.

The stability of ΔFL-LC-CelA-His and E34A-LC-CelA-His with respect to heat inactivation was analyzed at pH 7.0 by incubating these proteins at various temperatures for 30 min and determining their residual activities at 60 °C. The results are shown in Fig. 4. Unlike LC-CelA-His, which retains maximal activity after incubation at 90 °C, ΔFL-LC-CelA-His lost approximately 60% and 100% of its activity after incubation at 85 °C and 90 °C respectively. Similarly, E34A-LC-CelA-His lost approximately 20 and 60% of its activity after incubation at 85 °C and 90 °C respectively. These results indicate that the thermal stability of the proteins decreases in the order LC-CelA-His > E34A-LC-CelA-His > ΔFL-LC-CelA-His. The stability of ΔFL-LC-CelA-His and E34A-LC-CelA-His was also analyzed at pH 7.0 in the presence of 3 M GdnHCl using CD spectroscopy as described above for LC-CelA-His. Thermal denaturation of the proteins was reversible in this condition. The thermal denaturation curves of ΔFL-LC-CelA-His and E34A-LC-CelA-His are shown in Fig. 5. The midpoints of the transitions of these curves, Tm, are summarized in Table 2. The Tm values of ΔFL-LC-CelA-His and E34A-LC-CelA-His are lower than that of LC-CelA-His by 14.7 °C and 12.0 °C, respectively, indicating that the FL region of LC-CelA contributes to protein stability (as does the FL of RmCel12A). The difference in Tm values between LC-CelA-His and E34A-LC-CelA-His accounts for 82% of that between LC-CelA-His and ΔFL-LC-CelA-His. This result suggests that Glu34-mediated hydrogen bonding interactions are the major forces responsible for the FL-dependent stabilization of LC-CelA. Hydrogen bonds have been reported to make large contributions to protein stability [36]. The actual FL region (Leu20-Glu32) may not significantly contribute to the stabilization of LC-CelA.

Stability of LC-CelA-His in the presence of DTT

Proteins are usually [7,37-40], but not always [41,42], destabilized by removal of a native disulfide bond. To examine whether the two disulfide bonds of LC-CelA contribute to protein stability, thermal denaturation of LC-CelA-His was analyzed as described above, at pH 7.0 in the presence of 1 mM dithiothreitol (DTT), which reduces disulfide bonds to thiol groups, and 3 M GdnHCl. Thermal denaturation of LC-CelA-His was reversible in this condition. The thermal denaturation curve of LC-CelA-His measured in the presence of 1 mM DTT and 3 M GdnHCl is shown in Fig. 5. The Tm value of LC-CelA-His determined in the presence of 1 mM DTT is lower than that determined in the absence of DTT by 28.0 °C (Table 2). This result indicates that disulfide bonds contribute significantly to the stabilization of LC-CelA-His. The Tm value of LC-CelA-His (59.8 °C) determined in the presence of 10 mM DTT and 3 M GdnHCl was comparable to that (58.8 °C) determined in the presence of 1 mM DTT and 3 M GdnHCl, suggesting that the disulfide bonds of LC-CelA-His are fully reduced by 1 mM DTT. It remains to be determined which disulfide bond contributes more to the stabilization of LC-CelA. However, it has been reported that the contribution of disulfide bonds to protein stability increases as the number of the residues in the loop formed by the disulfide bond increases [37]. That number is 26 for Cys40Cys67 and four for Cys100Cys105. Therefore, the Cys40Cys67 bond may contribute significantly more to the stabilization of LC-CelA than the Cys100Cys105 bond. Disulfide bonds do not account for the differences in stability between LC-CelA or RmCel12A and a GH family 12 cellulase from Streptomyces species, because the two disulfide bonds of LC-CelA are conserved in these proteins. The Tm values of RmCel12A and Cel12A from Streptomyces sp. 11AG8 (StCel12A), which represents GH family 12 cellulases from Streptomyces species, have been reported to be 102.9 °C [35] and 65.7 °C [43] respectively in the absence of denaturant, indicating that RmCel12A is more stable than StCel12A by 37.2 °C. Because LC-CelA is nearly as stable as RmCel12A, LC-CelA is also more stable than StCel12A. Proteins are stabilized relative to each other by a combination of factors, including increased number of ion pairs, increased number of hydrogen bonds, increased number of disulfide bonds, increased number of proline residues in loop regions, and increased interior hydrophobicity [36]. Therefore, we compared these structural factors in LC-CelA, RmCel12A and StCel12A (PDB: 1OA4). The ratio of interior apolar residues to total interior residues in LC-CelA (66.7%) is comparable to that in RmCel12A (65.2%), but significantly higher than that in StCel12A (60.8%). Similarly, the number of ion pairs in LC-CelA (10) is comparable to that in RmCel12A (8), but higher than that in StCel12A (5). However, the number of hydrogen bonds and proline residues in loop regions of LC-CelA and RmCel12A are not consistently higher than in StCel12A. The number of hydrogen bonds is 202 in LC-CelA, 177 in RmCel12A, and 197 in StCel12A. The number of proline residues in loop regions is 8 in LC-CelA, 7 in RmCel12A, and 13 in StCel12A. These results suggest that differences in interior hydrophobicity and in the number of ion pairs at least partially account for the difference in stability between LC-CelA or RmCel12A and StCel12A.

Activities of ΔFL-LC-CelA-His and E34A-LC-CelA-His

The enzymatic activities of ΔFL-LC-CelA-His and E34A-LC-CelA-His were determined at 60 °C in the absence of DTT using CM-cellulose as a substrate. The activities of these proteins were not determined at the optimum temperature for activity of LC-CelA-His (90 °C), because they are thermally denatured at this temperature (Fig. 4). The specific activities of ΔFL-LC-CelA-His and E34A-LC-CelA-His determined in the absence of DTT were slightly lower than, but comparable to, that of LC-CelA-His (Table 2). These data indicate that the enzymatic activity of LC-CelA-His is not significantly affected by removal of, or mutation in, the FL. The specific activity of LC-CelA determined in the presence of 1 mM DTT was identical to that determined in the absence of DTT, indicating that the enzymatic activity of LC-CelA-His is not significantly affected by reduction of the disulfide bonds.

Conclusion

LC-CelA, a homolog of the highly thermostable GH family 12 cellulase from R. marinus (RmCel12A), was isolated from leaf–branch compost by a metagenomic approach and its crystal structure was determined. This structure is the first of a GH family 12 cellulase with an acidic flexible linker (FL) at the N-terminus. The structure shows that most of the region predicted as FL is disordered. This result supports the hypothesis that the FL is necessary to separate a hydrophobic signal peptide that anchors the enzyme to the cell from the catalytic core [35]. However, Glu34 and Pro35 located at the C-terminus of the FL assume a defined structure and form hydrogen bonds with the central region of the protein. The structure also shows that LC-CelA contains two disulfide bonds (Cys40Cys67 and Cys100Cys105). LC-CelA is a highly thermostable enzyme with an optimum temperature for activity of 90 °C. The stability of LC-CelA decreases by 14.9 °C on removal of the FL, mainly due to loss of hydrogen bonds between the FL and central regions of the protein. Removal of the FL destabilizes RmCel12A by 8.4 °C [35], probably due to loss of the residues equivalent to Glu34 and Pro35 of LC-CelA, which are conserved. Database searches indicate that these residues are conserved only in LC-CelA and RmCel12A, suggesting that stabilization by hydrogen bonds formed between the FL and central regions is unique to these proteins. The stability of LC-CelA also decreases by reducing disulfide bonds by 28 °C. The Cys40Cys67 bond located near the N-terminal region may more significantly contribute to the stabilization of LC-CelA than Cys100Cys105, because the number of the residues in the loop formed by Cys40Cys67 is higher than that formed by Cys100Cys105. It has been reported for GH family 11 xylanase that a unique N-terminal 11-residue extension contributes to protein stability by forming hydrogen bonds with the core region [44]. It has also been reported for this enzyme that mutations in the N-terminal region [45] and engineering of an N-terminal disulfide bond [46-49] stabilize the protein. GH family 12 cellulases with improved activity and stability have been engineered [50,51]. However, studies of protein engineering of GH family 12 cellulases are still limited. The results presented in this work provide valuable information that serves as a structural toolbox for the engineering of GH family 12 cellulases with improved function.
  49 in total

Review 1.  Thermozymes and their applications: a review of recent literature and patents.

Authors:  M E Bruins; A E Janssen; R M Boom
Journal:  Appl Biochem Biotechnol       Date:  2001-02       Impact factor: 2.926

2.  Cloning, sequencing and overexpression of a Rhodothermus marinus gene encoding a thermostable cellulase of glycosyl hydrolase family 12.

Authors:  S Halldórsdóttir; E T Thórólfsdóttir; R Spilliaert; M Johansson; S H Thorbjarnardóttir; A Palsdottir; G O Hreggvidsson; J K Kristjánsson; O Holst; G Eggertsson
Journal:  Appl Microbiol Biotechnol       Date:  1998-03       Impact factor: 4.813

3.  Cleavage of structural proteins during the assembly of the head of bacteriophage T4.

Authors:  U K Laemmli
Journal:  Nature       Date:  1970-08-15       Impact factor: 49.962

4.  Structural analysis of a glycoside hydrolase family 11 xylanase from Neocallimastix patriciarum: insights into the molecular basis of a thermophilic enzyme.

Authors:  Ya-Shan Cheng; Chun-Chi Chen; Chun-Hsiang Huang; Tzu-Ping Ko; Wenhua Luo; Jian-Wen Huang; Je-Ruei Liu; Rey-Ting Guo
Journal:  J Biol Chem       Date:  2014-03-11       Impact factor: 5.157

5.  Isolation of a novel cutinase homolog with polyethylene terephthalate-degrading activity from leaf-branch compost by using a metagenomic approach.

Authors:  Sintawee Sulaiman; Saya Yamato; Eiko Kanaya; Joong-Jae Kim; Yuichi Koga; Kazufumi Takano; Shigenori Kanaya
Journal:  Appl Environ Microbiol       Date:  2011-12-22       Impact factor: 4.792

6.  Allosteric disulfide bonds.

Authors:  Bryan Schmidt; Lorraine Ho; Philip J Hogg
Journal:  Biochemistry       Date:  2006-06-20       Impact factor: 3.162

7.  Engineering the pH-optimum of activity of the GH12 family endoglucanase by site-directed mutagenesis.

Authors:  Vladimir I Tishkov; Alexander V Gusakov; Anna S Cherkashina; Arkady P Sinitsyn
Journal:  Biochimie       Date:  2013-06-14       Impact factor: 4.079

8.  Conformational stability and activity of ribonuclease T1 with zero, one, and two intact disulfide bonds.

Authors:  C N Pace; G R Grimsley; J A Thomson; B J Barnett
Journal:  J Biol Chem       Date:  1988-08-25       Impact factor: 5.157

9.  Efficient direct ethanol production from cellulose by cellulase- and cellodextrin transporter-co-expressing Saccharomyces cerevisiae.

Authors:  Ryosuke Yamada; Yuki Nakatani; Chiaki Ogino; Akihiko Kondo
Journal:  AMB Express       Date:  2013-06-24       Impact factor: 3.298

10.  MolProbity: all-atom structure validation for macromolecular crystallography.

Authors:  Vincent B Chen; W Bryan Arendall; Jeffrey J Headd; Daniel A Keedy; Robert M Immormino; Gary J Kapral; Laura W Murray; Jane S Richardson; David C Richardson
Journal:  Acta Crystallogr D Biol Crystallogr       Date:  2009-12-21
View more
  7 in total

1.  Structure, activity, and stability of metagenome-derived glycoside hydrolase family 9 endoglucanase with an N-terminal Ig-like domain.

Authors:  Hiroyuki Okano; Eiko Kanaya; Masashi Ozaki; Clement Angkawidjaja; Shigenori Kanaya
Journal:  Protein Sci       Date:  2015-01-13       Impact factor: 6.725

2.  Characterization of the Biomass Degrading Enzyme GuxA from Acidothermus cellulolyticus.

Authors:  Neal N Hengge; Sam J B Mallinson; Patthra Pason; Vladimir V Lunin; Markus Alahuhta; Daehwan Chung; Michael E Himmel; Janet Westpheling; Yannick J Bomble
Journal:  Int J Mol Sci       Date:  2022-05-28       Impact factor: 6.208

3.  Evaluation of Cellulolytic Endo-1,4-β-D-Glucanase Activity in the Digestive Fluid of Adult Phytophagous Beetle Hoplasoma unicolor.

Authors:  Mohammad Mosleh Uddin; Suzana Afrin Lima; Tanim Jabid Hossain; Newton Kar; Yeasmin Zahan; Babamale AbdulKareem Olarewaju
Journal:  Trop Life Sci Res       Date:  2021-09-30

4.  Genome Analysis of a New Rhodothermaceae Strain Isolated from a Hot Spring.

Authors:  Kian Mau Goh; Kok-Gan Chan; Soon Wee Lim; Kok Jun Liew; Chia Sing Chan; Mohd Shahir Shamsir; Robson Ee; Tan-Guan-Sheng Adrian
Journal:  Front Microbiol       Date:  2016-07-14       Impact factor: 5.640

5.  Molecular and biochemical characterization of recombinant cel12B, cel8C, and peh28 overexpressed in Escherichia coli and their potential in biofuel production.

Authors:  Eman Ibrahim; Kim D Jones; Keith E Taylor; Ebtesam N Hosseney; Patrick L Mills; Jean M Escudero
Journal:  Biotechnol Biofuels       Date:  2017-02-27       Impact factor: 6.040

6.  Cloning, Expression, and Characterization of a Thermophilic Endoglucanase, AcCel12B from Acidothermus cellulolyticus 11B.

Authors:  Junling Wang; Gui Gao; Yuwei Li; Liangzhen Yang; Yanli Liang; Hanyong Jin; Weiwei Han; Yan Feng; Zuoming Zhang
Journal:  Int J Mol Sci       Date:  2015-10-22       Impact factor: 5.923

Review 7.  Cellulases from Thermophiles Found by Metagenomics.

Authors:  Juan-José Escuder-Rodríguez; María-Eugenia DeCastro; María-Esperanza Cerdán; Esther Rodríguez-Belmonte; Manuel Becerra; María-Isabel González-Siso
Journal:  Microorganisms       Date:  2018-07-10
  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.