Peicheng Sun1, Xinxin Li2, Adiphol Dilokpimol2, Bernard Henrissat3,4, Ronald P de Vries2, Mirjam A Kabel1, Miia R Mäkelä5. 1. Laboratory of Food Chemistry, Wageningen University and Research, Bornse Weilanden 9, 6708 WG Wageningen, the Netherlands. 2. Fungal Physiology, Westerdijk Fungal Biodiversity Institute and Fungal Molecular Physiology, Utrecht University, Uppsalalaan 8, 3584 CT Utrecht, the Netherlands. 3. DTU Bioengineering, Technical University of Denmark, Søltofts Plads, 2800 Kongens Lyngby, Denmark. 4. Department of Biological Sciences, King Abdulaziz University, Jeddah, Saudi Arabia. 5. Department of Microbiology, University of Helsinki, Viikinkaari 9, 00790 Helsinki, Finland.
Abstract
Xyloglucan is a prominent matrix heteropolysaccharide binding to cellulose microfibrils in primary plant cell walls. Hence, the hydrolysis of xyloglucan facilitates the overall lignocellulosic biomass degradation. Xyloglucanases (XEGs) are key enzymes classified in several glycoside hydrolase (GH) families. So far, family GH44 has been shown to contain bacterial XEGs only. Detailed genome analysis revealed GH44 members in fungal species from the phylum Basidiomycota, but not in other fungi, which we hypothesized to also be XEGs. Two GH44 enzymes from Dichomitus squalens and Pleurotus ostreatus were heterologously produced and characterized. They exhibited XEG activity and displayed a hydrolytic cleavage pattern different from that observed in fungal XEGs from other GH families. Specifically, the fungal GH44 XEGs were not hindered by substitution of neighboring glucosyl units and generated various "XXXG-type," "GXXX(G)-type," and "XXX-type" oligosaccharides. Overall, these fungal GH44 XEGs represent a novel class of enzymes for plant biomass conversion and valorization.
Xyloglucan is a prominent matrix heteropolysaccharide binding to cellulose microfibrils in primary plant cell walls. Hence, the hydrolysis of xyloglucan facilitates the overall lignocellulosic biomass degradation. Xyloglucanases (XEGs) are key enzymes classified in several glycoside hydrolase (GH) families. So far, family GH44 has been shown to contain bacterial XEGs only. Detailed genome analysis revealed GH44 members in fungal species from the phylum Basidiomycota, but not in other fungi, which we hypothesized to also be XEGs. Two GH44 enzymes from Dichomitus squalens and Pleurotus ostreatus were heterologously produced and characterized. They exhibited XEG activity and displayed a hydrolytic cleavage pattern different from that observed in fungal XEGs from other GH families. Specifically, the fungal GH44 XEGs were not hindered by substitution of neighboring glucosyl units and generated various "XXXG-type," "GXXX(G)-type," and "XXX-type" oligosaccharides. Overall, these fungal GH44 XEGs represent a novel class of enzymes for plant biomass conversion and valorization.
Xyloglucan is present in all vascular plants, in particular within the primary cell walls, and is a major component of dicots (up to 25% of total dry weight) and non-commelinid monocots including conifers (10%) (Scheller and Ulvskov, 2010). The rigidity of plant cell walls is largely driven by loosening and tightening of cellulose microfibrils, in which xyloglucan plays a key role owing to its binding ability to cellulose by surface adsorption and chain intercalation (Park and Cosgrove, 2015; Pauly et al., 2001). In addition, xyloglucan is found as a major seed-storage polysaccharide in a number of terrestrial plants including tamarind and nasturtium (Kozioł et al., 2015). Enzymatic hydrolysis of xyloglucan not only facilitates the access of cellulases to cellulose but also contributes to complete saccharification of agro-industrial side streams to drive the production of biofuels and biochemicals (Benkő et al., 2008; Kaida et al., 2009). Furthermore, soluble xyloglucan and its hydrolysis products can serve as relevant food and feed additives, for example, to reduce calorie content, to increase the texture of food products, or due to their prebiotic behavior (Chen et al., 2020; Dutta et al., 2020; Mishra and Malhotra, 2009).Xyloglucan has a substituted β-(1→4)-d-glucan backbone, of which the exact type and degree of substitution varies for different plant sources and tissues (Scheller and Ulvskov, 2010). Still, most xyloglucan structures have been shown as the “XXGG-type” and “XXXG-type” (Vincken et al., 1997), in which G stands for a glucosyl residue and X for a glucosyl residue substituted with an α-(1→6)-d-xylosyl unit (Fry et al., 1993). In addition, xylosyl residues can be further substituted by β-(1→2)-d-galactosyl units (coded L) or, to a lesser extent, with α-(1→2)-L-arabinosyl units (coded S). The galactosyl residues can be fucosylated at the O-2 position (coded F). Other less common substitutions are denoted with other designations described elsewhere in detail (Fry et al., 1993; Kozioł et al., 2015).To date, characterized xyloglucanases (XEGs) have been found in glycoside hydrolase (GH) families GH5, GH9, GH12, GH16, GH44, and GH74 in the Carbohydrate-Active enZYme (CAZy) database (http://www.cazy.org) (Lombard et al., 2014). Fungal XEGs are, however, only found in families GH5 (Matsuzawa et al., 2020), GH12, and GH74 (Attia and Brumer, 2016). Fungal GH12 XEGs have been shown to catalyze the hydrolysis of the xyloglucan backbone in an endo-mode, cleaving the glycosidic linkages at the “reducing end” side of unsubstituted glucosyl units, releasing XXXG-type oligosaccharides (Damásio et al., 2012; Master et al., 2008; Vitcosque et al., 2016). Fungal GH5 and GH74 XEGs have also been shown to display an endo-type of hydrolysis of the xyloglucan backbone, cleaving at the reducing end side of unsubstituted glucosyl units like the GH12 XEGs (Master et al., 2008; Matsuzawa et al., 2020; Vitcosque et al., 2016). However, in contrast to the GH12 XEGs, the GH5 and GH74 XEGs display a processive mode of action (Arnal et al., 2019; Berezina et al., 2017; Desmet et al., 2007; Martinez-Fleites et al., 2006; Matsuzawa et al., 2020; Yaoi and Mitsuishi, 2002). As a result, from the early stage of hydrolysis, XXXG- and XXLG-building blocks are released by GH74 XEGs, which can even be further degraded into XX, XG, and LG (Arnal et al., 2019; Attia et al., 2016; Desmet et al., 2007; Yaoi and Mitsuishi, 2002). GH5 XEG generated XXXG-type oligosaccharides, in addition to several non-XXXG-type oligosaccharides such as X, XG, XX, LG, XL/LX, XXG, and XXX (Matsuzawa et al., 2020). Detailed genome analysis revealed the presence of basidiomycete proteins in enzyme family GH44 of the CAZy database, for which only bacterial endoglucanases and XEGs have been reported so far (Ariza et al., 2011). Interestingly, no members from other fungal phyla were present in GH44. In this study, we evaluated whether the basidiomycete members of GH44 are also XEGs and compared their product profile with that of XEGs of families GH12 and GH74.
Results and discussion
Discovery of new fungal GH44 enzymes through genome mining
A large set of fungal genomes (>400) were analyzed for their diversity with respect to the set of genes assigned to CAZy families in the CAZy database, including both genomes present in the public CAZy database and genomes only available at JGI MycoCosm (Grigoriev et al., 2014). This revealed that part of the basidiomycete genomes contained genes that were already assigned to GH44, a family for which, so far, only bacterial enzymes have been characterized (Ariza et al., 2011). Interestingly, our genome analyses also revealed that fungal GH44 sequences could only be identified in the phylum Basidiomycota. GH44 sequences were found in only about 28% of the basidiomycete genomes according to JGI MycoCosm (situation January 22, 2021) (Table S1). This narrow distribution among basidiomycetes suggests that the GH44 enzymes are ancient, possibly of bacterial origin, and that they were lost during evolution for most of the fungal species, including ascomycetes. Alternatively, a basidiomycete ancestor may have acquired this enzyme by horizontal gene transfer from bacteria after the split from ascomycetes.Labeling the phylogenetic tree of GH44 candidates with the fungal lifestyles (e.g., white rot, gray rot, litter decomposer, mycorrhizae, coprophilous, and endophyte) demonstrated that GH44 enzymes do not appear associated with a particular lifestyle (Figure 1). Some species contain multiple GH44-encoding genes resulting from recent gene duplications, suggesting a more important role for these enzymes in their habitat. One of these fungi is the mycorrhizal fungus Sebacina vermifera whose genome encodes five GH44 candidates. It is tempting to speculate that the putative GH44 enzymes may be involved in partial degradation of plant root cell walls that are enriched in xyloglucan, to enable the establishment of symbiosis between fungus and plant roots. Xyloglucan is also released from plant roots (Galloway et al., 2018) and could therefore be a carbon source for fungi that live in close association with roots (e.g., mycorrhizae) or inhabit soil, such as litter-decomposing basidiomycetes, explaining the presence of GH44 enzymes in these fungi (Figure 1).
Figure 1
Phylogenetic analysis of GH44 candidates from selected basidiomycete fungi
The figure is a representative Maximum Likelihood tree (500 bootstraps) of a MAFFT alignment of the amino acid sequences of the selected genes. JGI protein IDs are indicated behind the species. Lifestyles of the species are color coded as follows: blue, white rot; gray, gray rot; green, litter decomposer; orange, mycorrhizae; purple, coprophilous; red, endophyte; black, unknown/not clear. Bootstrap values are indicated at the nodes based on Maximum Likelihood, Minimal Evolution, and Neighbor Joining algorithms, respectively. Only bootstrap values that are 50 or higher are displayed. The scale bar corresponds to 0.10 amino acid substitutions per site.
Phylogenetic analysis of GH44 candidates from selected basidiomycete fungiThe figure is a representative Maximum Likelihood tree (500 bootstraps) of a MAFFT alignment of the amino acid sequences of the selected genes. JGI protein IDs are indicated behind the species. Lifestyles of the species are color coded as follows: blue, white rot; gray, gray rot; green, litter decomposer; orange, mycorrhizae; purple, coprophilous; red, endophyte; black, unknown/not clear. Bootstrap values are indicated at the nodes based on Maximum Likelihood, Minimal Evolution, and Neighbor Joining algorithms, respectively. Only bootstrap values that are 50 or higher are displayed. The scale bar corresponds to 0.10 amino acid substitutions per site.Furthermore, our analysis of the evolutionary relationships between the GH44 candidates showed that those from the two wood degrading white rot fungi, Dichomitus squalens XegA and Pleurotus ostreatus XegA, were present in distant clades in the phylogenetic tree (Figure 1). In situ expression and production of the D. squalens and P. ostreatus GH44 genes and proteins have been shown in previously published transcriptome and exoproteome datasets (Daly et al., 2018; Wu et al., 2020, 2021). In short, D. squalens CBS464.89 expressed xegA and produced the corresponding protein during growth on spruce and birch wood (Daly et al., 2018, 2020), whereas P. ostreatus PC9 expressed the XegA encoding gene on rice straw (Wu et al., 2021), and in the Δpex1 and Δgat1 mutants on beechwood sawdust (Wu et al., 2020), again supporting a role for these genes in plant biomass degradation. Therefore, these two candidates were selected for further characterization to demonstrate the function of basidiomycete GH44 enzymes.
The selected basidiomycete GH44 candidates are xyloglucan-specific XEGs
D. squalens xegA and P. ostreatus xegA were heterologously expressed in Pichia pastoris, and their recombinant enzymes were produced as C-terminal His-tag fusion proteins and purified from the culture supernatants using immobilized metal affinity chromatography via hexahistidine-tag. The apparent mass of purified enzymes was assessed by sodium dodecyl sulphate-polyacrylamide gel electrophoresis (SDS-PAGE) and summarized in Table 1. The hydrolytic activity of the purified GH44 enzymes toward four types of polysaccharides, including tamarind xyloglucan (TXG), Avicel PH-101, galactomannan, and beechwood xylan, was validated by high-performance anion-exchange chromatography with pulsed amperometric detection (HPAEC-PAD). Both enzymes were only active toward xyloglucan and not on other tested substrates, which suggested that these enzymes were specific XEGs (further referred to as DsXegA and PoXegA). The xyloglucan-specific activity of these fungal GH44 XEGs was also observed for fungal XEGs from GH5, GH12, and GH74 (Bauer et al., 2006; Grishutin et al., 2004; Master et al., 2008; Matsuzawa et al., 2020; Sinitsyna et al., 2010; Song et al., 2013; Xian et al., 2016; Yaoi and Mitsuishi, 2002). In contrast, bacterial GH44 XEGs have been reported to exhibit a broad substrate specificity and, in addition to xyloglucan, other polysaccharides can be hydrolyzed, including barley β-glucan, birchwood xylan, wheat arabinoxylan, carob galactomannan, and synthetic soluble cellulose derivatives (Ariza et al., 2011; Hirano et al., 2013; Najmudin et al., 2006; Warner et al., 2011; Ye et al., 2012).
Table 1
Physical and biochemical properties of purified XEGs used in this study
GH family
Fungal species
Accession number
Gene name
Enzyme name
Calculated mass (kDa)
Apparent mass (kDa)
pH optimum
Temperature optimum (°C)
Positive substrate
Reference
(−) Endo H
(+) Endo H
44
Dichomitus squalens
JGI|Dicsqu464_1|932646
xegA
DsXegA
78
115
90
5.0
50
This study
This study
44
Pleurotus ostreatus
JGI|PleosPC15_2|1036707
xegA
PoXegA
80
90
80
6.0
50
TXG
This study
12
Aspergillus nidulans
XP_658056.1, AN0452
xegA
AnXegA
24
25
25
6.5
47
TXG
Bauer et al. (2006)
74
Aspergillus nidulans
XP_659146.1, AN1542
xegB
AnXegB
85
100
90
3.0
42
TXG, TXG oligosaccharides
Bauer et al. (2005); Bauer et al. (2006)
TXG, tamarind xyloglucan.
Physical and biochemical properties of purified XEGs used in this studyTXG, tamarind xyloglucan.DsXegA and PoXegA showed an optimum temperature at 50°C (Figure 2A), which was slightly lower than that of the reported fungal XEGs of families GH12 and GH74, which typically have optimal activity in the range of 55°C–65°C (Damásio et al., 2012; Hakamada et al., 2011; Sinitsyna et al., 2010; Xian et al., 2016; Yaoi and Mitsuishi, 2002), but higher than the optimum temperature of A. oryzae GH5 XEG, i.e., 45°C (Matsuzawa et al., 2020). Both XEGs were stable up to 40°C, of which PoXegA showed better stability than DsXegA at lower temperature (20°C). For pH optimum, DsXegA and PoXegA showed the highest activity toward TXG at pH 5.0 and 6.0, respectively (Figures 2B and 2C). These values are comparable with those of the reported fungal GH5, GH12, and GH74 XEGs that generally have optimal activity in an acidic pH range (3.5–6.0) (Grishutin et al., 2004; Hakamada et al., 2011; Master et al., 2008; Matsuzawa et al., 2020; Sinitsyna et al., 2010; Song et al., 2013; Yaoi and Mitsuishi, 2002). When examined for pH stability, DsXegA retained more than 75% residual activity after 1 hour of incubation at pH 5.0–7.0, whereas PoXegA was stable at the pH range from 5.0 to 8.0 and was more stable than DsXegA between pH 6.0 and 8.0 (Figure 2D).
Figure 2
The biochemical properties of DsXegA and PoXegA
(A) Temperature optimum was measured at 20°C–80°C in 100 mM phosphate buffer, pH 5.0.
(B) pH optimum was measured at 40°C in 100 mM Britton-Robinson buffer, pH 2.0–10.0.
(C) Temperature stability was measured at 37°C after pretreatment of enzymes in 100 mM phosphate buffer, pH 5.0, at 20°C–70°C for 1 h.
(D) pH stability was measured at 37°C after pretreatment of enzymes in 100 mM Britton-Robinson buffer, pH 2.0–10.0, at 40°C for 1 h. Error bars depict standard deviation of technical duplicate reactions.
The biochemical properties of DsXegA and PoXegA(A) Temperature optimum was measured at 20°C–80°C in 100 mM phosphate buffer, pH 5.0.(B) pH optimum was measured at 40°C in 100 mM Britton-Robinson buffer, pH 2.0–10.0.(C) Temperature stability was measured at 37°C after pretreatment of enzymes in 100 mM phosphate buffer, pH 5.0, at 20°C–70°C for 1 h.(D) pH stability was measured at 37°C after pretreatment of enzymes in 100 mM Britton-Robinson buffer, pH 2.0–10.0, at 40°C for 1 h. Error bars depict standard deviation of technical duplicate reactions.
To characterize the xyloglucan cleavage patterns by the two GH44 XEGs, the TXG oligosaccharides released by DsXegA and PoXegA were profiled by HPAEC-PAD (Figures 3A and 3B). For comparison and annotation of peaks, HPAEC profiles of TXG digests generated by AnXegA (GH12), by AnXegB (GH74), and by the previously well-characterized NcLPMO9C (Sun et al., 2020b) and of the reference TXG oligosaccharides (Sun et al., 2020b) and TXG oligosaccharide standards were added in Figures 3C‒3F. AnXegB has previously been studied by Bauer et al. (2005), and AnXegA has not been extensively characterized so far (Bauer et al., 2006). In general, the HPAEC profiles of the DsXegA and PoXegA digests were similar and only the intensity of several peaks was different. It should be noted that the profiles of the AnXegA- and the AnXegB-TXG digests were very different, indicating distinct cleavage patterns of TXG by three types of fungal GH44, GH12, and GH74 XEGs used in this study.
Figure 3
HPAEC elution patterns of tamarind xyloglucan (TXG) oligosaccharides released by end-point (24 h) incubation
TXG oligosaccharides generated by (A) DsXegA, (B) PoXegA, (C) AnXegA, (D) AnXegB, and (E) NcLPMO9C (in the presence of 1 mM ascorbic acid). (F) Reference TXG oligosaccharides and (G) TXG oligosaccharide standards are also shown for annotation of peaks.
HPAEC elution patterns of tamarind xyloglucan (TXG) oligosaccharides released by end-point (24 h) incubationTXG oligosaccharides generated by (A) DsXegA, (B) PoXegA, (C) AnXegA, (D) AnXegB, and (E) NcLPMO9C (in the presence of 1 mM ascorbic acid). (F) Reference TXG oligosaccharides and (G) TXG oligosaccharide standards are also shown for annotation of peaks.Based on the comparison in elution from the standards and reference digests (Figures 3G and 3F), typical XXXG-type oligosaccharides, such as XXXG, XLXG, XXLG, and XLLG, were present in all GH44-, GH12- and GH74-TXG digests (Figures 3A‒3D), although the relative abundance of these compounds differed for each of the digests. In contrast, XXX, XLX, XXL, and XLL were only found in the two GH44 digests (Figures 3C and 3D). Notably, an unknown peak, eluting between XXXG and XLXG, was identified in both GH44-TXG digests. We propose that this unknown oligomer corresponds to GXXX, based on elution behavior and hydrophilic interaction chromatography coupled with electrospray ionization-collision induced dissociation-mass spectrometry (HILIC-ESI-CID-MS/MS2) analysis described in the next section. Further unique for the GH44-TXG digests was an additional set of yet unknown compounds, eluting after XLLG. Again, taking into account their elution together with the HILIC-ESI-CID-MS/MS2 data described in the next section, it could be speculated that these unknown compounds corresponded to GXXXG, GXLX(G), GXXL(G), and GXLL(G).Furthermore, in the AnXegB-TXG digest, XG and XX were identified based on comparison with the reference TXG oligosaccharides (Figure 3), suggesting that AnXegB was able to cleave XXXG into XX + XG. By analogy with this observation, it could be concluded that XXLG was cleaved into XX and LG, substantiated by the relative low abundance of XXLG. The compound eluting at around 14.5 min was therefore annotated as LG.
Structural elucidation of xyloglucan oligosaccharide released by GH44 XEGs
To further elucidate the structures of xyloglucan oligosaccharides generated by the two GH44 XEGs, DsXegA- and PoXegA-TXG digests were analyzed by negative ion mode HILIC-ESI-CID-MS/MS2. The TXG oligosaccharide standards were also analyzed (Figure S1) for the identification and annotation of (unknown) structures. Similar to the HPAEC results, DsXegA- and PoXegA-TXG digests showed comparable profiles in their base-peak ion chromatograms and only the relative abundance of each product varied (Figure S1). Owing to separation of α/β-anomers of TXG oligosaccharides in HILIC, each TXG oligosaccharide split into two distinct peaks. Nevertheless, this α/β-anomer separation did not influence the MS/MS2 elucidation of the TXG oligosaccharide structures. MS2 fragments of TXG oligosaccharides were annotated following the principle that C-/Z-type, D-type (derived from a double C-/Z-type cleavage of three linked sugar residues), and A-type fragments were generated predominately in negative collision-induced dissociation (CID) ion mode (Domon and Costello, 1988; Quéméner et al., 2015; Sun et al., 2020a, 2020b).In both GH44-TXG digests, regular XXXG-type oligosaccharides were analyzed, including XXXG (m/z 1,061.6), XLXG (m/z 1,223.6), XXLG (m/z 1,223.6), and XLLG (m/z 1,385.6). Next, “XXX-type” oligosaccharides (XXX, m/z 899.5; XLX and XXL, m/z 1,061.6; and XLL m/z 1,223.6) and the above suggested “GXXX-type” oligosaccharides (GXXX, m/z 1,061.6, GXLX, m/z 1,223.6, GXXL, m/z 1,223.6, and GXLL, m/z 1,385.6) were unambiguously characterized (Figure S1). In addition, several “GXXXG-type” oligosaccharides were found in the two GH44-TXG digests, but to a lesser extent. As an example, Figures 4A and 4B show the annotation of the MS2 fragmentation of XXL and GXXX, respectively. Identification of these two structures was based on the elution time and considering the TXG structure composed of XXXG-type blocks. As shown in Figure S1, m/z 1,061.6 can represent multiple isomeric structures, not only XXL and GXXX but also XXXG and XLX. Based on the elution time of TXG oligosaccharide standards (31.5–32.1 min), XXXG was first identified (peak number 2 in Figure S1) in DsXegA- and PoXegA-TXG digests, which was also confirmed by the MS2 fragments. However, extra MS2 fragments m/z 689, 707, and 767 were present in peak number 2 (Figure S1), which are slightly different from the MS2 fragments of the XXXG standard and indicated the co-elution of other isomers with XXXG. These three fragments represent a C-type glycosidic bond and 0,2A-type cross-ring cleavage to generate a GXX, an XL, or an LX fragment (three hexaosyl and two pentaosyl residues) from the non-reducing end. Therefore, these two structures can only be released from XLX or GXXX, while GXXX was identified in the following peak number 4 as described later. To summarize this information, the other isomer in peak number 2 (Figure S1) was XLX. Next, we investigated the peak numbers 3 and 4 also showing oligosaccharides with m/z 1,061.6. Interestingly, the oligosaccharide in peak number 3 generated comparable fragments as XXXG standards, but with a more abundant m/z 353 and a lack of ions m/z 899, 839, and 821 (Figure 4A). The abundant former fragment indicated the presence of xylosyl-galactosyl residues, and the lack of latter ions often represented the terminal unsubstituted glucosyl units at the reducing end side (Quéméner et al., 2015; Sun et al., 2020b). Taking this information into account, peak number 3 is annotated as an XXL structure. Regarding peak number 4 (XLX or GXXX), we detected the diagnostic fragment m/z 413, which resulted from a 0,2A-type cross-ring cleavage on a glucosyl residue from either a GX or an L unit at the non-reducing end. As a non-reducing end terminal L unit with parent m/z 1,061.6 cannot be present in XLX and GXXX (not even in TXG structure), this diagnostic fragment m/z 413 can only represent a terminal GX structure at the non-reducing end, and therefore, peak number 4 was annotated as GXXX (Figure 4B).
Figure 4
Negative ion mode HILIC-ESI-CID-MS/MS2 fragmentation patterns of diagnostic TXG oligosaccharides
(A) Diagnostic XXL (m/z 1,061.6), (B) GXXX (m/z 1,061.6), (C) XLF (m/z 1,369.7), and (D) GXXF (m/z 1,369.7) both from the end-point DsXegA- and PoXegA-TXG digests. The fragments are annotated according to the nomenclature proposed by Domon and Costello (1988).
Negative ion mode HILIC-ESI-CID-MS/MS2 fragmentation patterns of diagnostic TXG oligosaccharides(A) Diagnostic XXL (m/z 1,061.6), (B) GXXX (m/z 1,061.6), (C) XLF (m/z 1,369.7), and (D) GXXF (m/z 1,369.7) both from the end-point DsXegA- and PoXegA-TXG digests. The fragments are annotated according to the nomenclature proposed by Domon and Costello (1988).This distinct presence of identified XXXG-type, XXX-type, and “GXXX(G)-type” oligosaccharides suggests that the two GH44 XEGs cleaved at both sides of unsubstituted glucosyl units in TXG. To further investigate whether the two GH44 XEGs also cleaved next to unsubstituted glucosyl units neighbored by more heavily branched F units, we searched for the m/z of 1,369.6–1,369.8 in the black currant xyloglucan (BCXG) digests. The m/z 1,369 represents oligosaccharides having five hexaosyl, three pentaosyl, and one fucosyl residues, which can be XXFG, XFGX, FGXX, GXXF, and XLF. XXFG was first identified based on the diagnostic fragments C3 (m/z 605) and 0,2A3 (‒H2O) (m/z 1,147 and 1,129), which represented an XX fragment at the non-reducing end side (Figure S2). The non-reducing end side XX fragment can only result from an XXFG unit. Other fragments such as C4 (m/z 1,207), 0,2A4 (‒H2O) (m/z 1,147 and 1,129), D2-4 (m/z 601), 0,4A4 (m/z 499), and C3α (m/z 457) further confirmed the XXFG structure. In addition to XXFG, XXX-type XLF and GXXX-type GXXF units were released (Figures 4C and 4D). In these MS2 spectra (Figures 4C and 4D), fragments 2,4A4 (m/z 809), C3 (m/z 767), and 0,2A3 (‒H2O) (m/z 707 and 689) represented fragments composed of three hexaosyl and two pentaosyl residues at the non-reducing end side, which can only be XL and GXX structures in BCXG, and thus XLF and GXXF are the only two possibilities. As shown in the MS2 spectrum in Figure 4C, the diagnostic ion m/z 455 can only result from an internal L residue (regarding XLF and GXXF), and thus XLF was identified as such, whereas Figure 4D represented a GXXF unit.For comparison, AnXegA- and AnXegB-TXG digests were also analyzed by negative ion mode HILIC-ESI-CID-MS/MS2. In brief, AnXegA (GH12) only released XXXG-type building blocks (Figure S3). AnXegB (GH74) not only generated XXXG-type building blocks but also released XG (m/z 473.4), XX (m/z 605.4), and LG (m/z 635.4) units explained by the cleavage of XXXG and XXLG (Figure S4).
Distinct xyloglucan cleavage patterns of GH44 XEGs
Based on all identified products in the GH44-TXG and -BCXG digests (series of XXXG-type, XXX-type, and “GXXX[G]-type” oligosaccharides), it was concluded that the GH44 XEGs cleave xyloglucan at both sides of unsubstituted glucosyl units and that they are not hindered by substitution of neighboring units (Figure 5A). Different from the GH44 XEGs, the GH12 XEG used in this study (AnXegA) only cleaved within xyloglucan at the reducing end side of unsubstituted glucosyl units (Figure 5B), which is comparable with other fungal GH12 XEGs (Damásio et al., 2012; Master et al., 2008; Vitcosque et al., 2016). The GH74 enzyme (AnXegB) cleaved xyloglucan similar to the GH12 enzyme, but with additional cleavages within XXXG and XXLG building blocks to release XX, XG, and LG. However, GH74 did not cleave at the reducing end side of L units in XLXG and XLLG (Figure 5C). Similar xyloglucan cleavage patterns have also been found for other fungal GH74 XEGs (Desmet et al., 2007; Yaoi and Mitsuishi, 2002). Slightly different from GH74 XEGs, although not studied in this work, the one characterized fungal GH5 XEG can also cleave next to L units to release even G, X, XG, LG, XL/LX, XXG, XXX, and LLG units suggested by authors (Matsuzawa et al., 2020). So far, no fungal GH44 XEG has been reported. For a limited number of bacterial GH44, XEG degrading activity has been reported, but the extent of product profile identification differs and does not match the detail reached in our research. For example, for one bacterial GH44 XEG (PpXG44), it has been shown that the compound XXXGXXXG could be hydrolyzed into XXX and GXXXG (Ariza et al., 2011). Similarly, for the bacterial Cel44O, identified as an endocellulase, xyloglucan degrading activity and cleavage between XXX and GXXX have been suggested (Ravachol et al., 2016).
Figure 5
Schematic representation of xyloglucan cleavage patterns by two GH44s (DsXegA and PoXegA), one GH12 (AnXegA), and one GH74 (AnXegB)
(A) GH44 XEGs cleaved xyloglucan at both sides of unsubstituted glucosyl units to release “XXXG-type,” “GXXX(G)-type,” and “XXX-type” blocks. The cleavage by GH44 XEGs was not hindered by the substitution of neighboring units.
(B) GH12 XEG cleaved in xyloglucan at the “reducing end” side of unsubstituted glucosyl units to release “XXXG-type” blocks.
(C) GH74 XEG cleaved xyloglucan at the reducing end side of unsubstituted glucosyl units to release XXXG-type blocks and also cleaved within XXXG and XXLG building blocks to release XX + XG and XX + LG. However, GH74 XEG did not cleave at the reducing end side of L units in XLXG and XLLG.
Schematic representation of xyloglucan cleavage patterns by two GH44s (DsXegA and PoXegA), one GH12 (AnXegA), and one GH74 (AnXegB)(A) GH44 XEGs cleaved xyloglucan at both sides of unsubstituted glucosyl units to release “XXXG-type,” “GXXX(G)-type,” and “XXX-type” blocks. The cleavage by GH44 XEGs was not hindered by the substitution of neighboring units.(B) GH12 XEG cleaved in xyloglucan at the “reducing end” side of unsubstituted glucosyl units to release “XXXG-type” blocks.(C) GH74 XEG cleaved xyloglucan at the reducing end side of unsubstituted glucosyl units to release XXXG-type blocks and also cleaved within XXXG and XXLG building blocks to release XX + XG and XX + LG. However, GH74 XEG did not cleave at the reducing end side of L units in XLXG and XLLG.In conclusion, detailed genome analysis led to the discovery of fungal XEGs in family GH44. Our study provided a comprehensive characterization of two of these fungal XEGs and compared their xyloglucan cleavage patterns with that of fungal XEGs from other GH families. These basidiomycete XEGs are xyloglucan-specific but exhibited distinct xyloglucan cleavage patterns compared with fungal XEGs from families GH12 and GH74. Our study also gave insights on phylogenetic diversity of fungal GH44 and understanding of their roles in biomass (xyloglucan) conversion and valorization.
Limitations of the study
We think that the data of our study demonstrated clearly that the two characterized enzymes are xyloglucan-specific with a distinctive product profile compared with other XEG-containing CAZy families. The two characterized enzymes are distantly similar within the phylogenetic tree of fungal GH44 protein sequences, but we cannot exclude that there are other fungal members of GH44 that differ in their substrate specificity or product profile. Characterization of additional fungal GH44 enzymes will shed more light on this.
STAR★Methods
Key resources table
Resource availability
Lead contact
Further information and request for resources and reagents should be directed to and will be fulfilled by Miia R. Mäkelä (miia.r.makela@helsinki.fi).
Materials availability
Materials generated in this study are available from the lead contact with a completed Materials Transfer Agreement.
Experimental model and subject details
Microbial strains and growth conditions
E. coli DH5α competent cells (Invitrogen, Thermo Fisher Scientific, Carlsbad, CA, USA) were employed to propagate pPicZαA-xegADs and pPicZαA-xegAPo plasmids and the transformed cells were grown in low-salt Luria Bertani medium supplemented with 25 μg/mL Zeocin at 37°C. P. pastoris strain X-33 (Invitrogen) was transformed with pPicZαA-xegADs and pPicZαA-xegAPo plasmids and the positive transformants were selected on agar plates containing 1% yeast extract, 2% peptone, 2% glucose, 1 M sorbitol, 2% agar, and 100 μg/mL Zeocin. For production of recombinant proteins, the selected P. pastoris transformants were grown in Buffered Glycerol Complex medium and induced in Buffered Methanol Complex medium at 30°C (Coconi Linares et al., 2020).
Method details
Bioinformatics
The amino acid sequences of 190 GH44 candidates were retrieved from the JGI MycoCosm portal (https://mycocosm.jgi.doe.gov/mycocosm/home). After removing sequences from unpublished genomes and keeping only one strain per species, the sequences were aligned using Multiple Alignment using Fast Fourier Transform (MAFFT) (Katoh and Standley, 2013). After manually filtering and correcting the sequences for gene model errors, 57 sequences from 42 basidiomycete species were included in phylogenetic analysis (Table S1). Because characterized bacterial GH44 enzymes appeared to be too distant to be used as an outgroup, unrooted Maximum Likelihood, Minimum Evolution (ME) and Neighbor Joining (NJ) trees were computed using 500 bootstraps in MEGA7 (Kumar et al., 2016) to reveal the evolutionary relationship between the fungal GH44 sequences.
Cloning, protein production and purification of the selected candidates
The selected candidate genes (Table 1) without predicted signal peptides and introns were codon optimized and synthesized into pPicZαA plasmid (Genscript Biotech, Leiden, The Netherlands), which were then chemically transformed into E. coli DH5α and propagated. The plasmids pPicZαA-xegADs and pPicZαA-xegAPo were extracted and linearized with PmeI (New England Biolabs, Ipswich, MA) and subsequently transformed into P. pastoris strain X-33 by electroporation. The positive colonies were selected by colony Western Blot using 6x-His Tag Monoclonal Antibody conjugated to alkaline phosphatase (Invitrogen, Thermo Fisher Scientific, Carlsbad, CA, USA) and detected with a chromogenic substrate 5-bromo-4-chloro-3-indolyl phosphate/nitro blue tetrazolium (BCIP/NBT) (Li et al., 2020).The selected P. pastoris transformants were cultivated for the protein production according to Coconi Linares et al. (2020). After four days of induction, culture supernatants were harvested (8000 × g, 4°C, 20 min) and purified using ÄKTA FPLC device (GE Healthcare Life Sciences, Uppsala, Sweden) as described earlier (Coconi Linares et al., 2020). The purified proteins were stored at 4°C prior to further analysis.Molecular mass and concentration of the purified enzymes were evaluated by SDS-PAGE and ImageJ program, respectively (Li et al., 2020). Deglycosylation was performed using endoglycosidase H (Endo H; New England Biolabs, Ipswich, MA) following the manufacturer's recommendation.
Enzyme substrate screening
Different substrates including TXG, Avicel® PH-101, galactomannan and beechwood xylan were suspended in 50 mM sodium acetate buffer (pH 5.0) to a final concentration of 2 mg/mL. DsXegA and PoXegA were added to a final protein concentration of 100 μg/mL. The digestion (reaction volume of 300 μL) was incubated at 40°C with shaking (Multitron shaking incubators, Infors HT, Bottmingen, Switzerland) at 110 rpm for 24 h. Control reactions were performed without the addition of enzymes. All digestions were performed in duplicate. The reactions were stopped by heating at 97°C for 10 min. Afterwards, the digests were centrifuged at 22000 × g for 20 min, and the clear supernatants were collected and analyzed by high-performance anion exchange chromatography with HPAEC-PAD after diluting ten times.
Temperature and pH profiles
The temperature and pH optima of GH44 enzymes were determined at 20–80°C in 100 mM phosphate buffer, pH 5.0, or at 40°C in 100 mM Britton-Robinson buffer from pH 2.0 to pH 10.0 (Britton and Robinson, 1931), respectively. The thermal and pH stability were determined by measuring the residual enzyme activity at 37°C after 1 h incubation at 20–70°C in 100 mM phosphate buffer (pH 5.0 for DsXegA and pH 6.0 for PoXegA), or at 40°C in 100 mM Britton-Robinson buffer from pH 2.0 to pH 10.0. All digestions were performed in duplicate using TXG as a substrate.
Enzymatic digestion of xyloglucan substrates
Xyloglucan substrates (TXG or BCXG) were dissolved in 50 mM ammonium acetate buffer (pH 5.0) to a final concentration of 2 mg/mL. DsXegA, PoXegA, AnXegA and AnXegB were dosed in a final protein concentration of 100 μg/mL. The digestion (reaction volume of 300 μL) was incubated in an Eppendorf ThermoMixer® Comfort at 40°C with shaking at 800 rpm for 24 h. Control reactions were performed without the addition of enzymes. All digestions were performed in duplicate. The reactions were stopped by incubating at 97°C for 10 min. Afterwards, the digests were centrifuged at 22000 × g for 20 min, and the clear supernatants were collected and stored at −20°C until further usage. Previous well characterized NcLPMO9C- and GH5-TXG digests were used as reference mixtures (Sun et al., 2020b). The TXG oligosaccharides released by bacterial GH5 are further referred as the “reference TXG oligosaccharides”. DsXegA-, PoXegA-, AnXegA- and AnXegB-TXG digests were analyzed by HPAEC-PAD after diluting 20 times, and by HILIC-ESI-CID-MS/MS2. The TXG oligosaccharide standards were also analyzed by HPAEC-PAD (50 μg/mL) and HILIC-ESI-CID-MS/MS2 (100 μg/mL). The NcLPMO9C-TXG digest and reference TXG oligosaccharides were analyzed by HPAEC-PAD after diluting five and ten times, respectively.
HPAEC-PAD analysis for substrate screening and for xyloglucan oligosaccharide profiling
For the substrate screening and the xyloglucan oligosaccharide profiling, TXG and other carbohydrates corresponding digests were analyzed by HPAEC-PAD on an ICS5000 (Thermo Scientific, Waltham, MA, USA) system. Instrument settings, column, mobile phases and elution program have been described previously (Sun et al., 2020b).
HILIC-ESI-CID-MS/MS2 for structural elucidation of xyloglucan oligosaccharides
The TXG and BCXG digests were analyzed by HILIC-ESI-CID-MS/MS2 on a Vanquish UHPLC system (Thermo Scientific, San Jose, CA, USA) coupled to an LTQ Velos Pro mass spectrometer (Thermo Scientific). The UHPLC settings, column, mobile phases and elution program (Sun et al., 2020b), and the MS (negative ion mode) settings (Sun et al., 2020a) have been described previously. MS2 was performed under dependent scan mode.
Quantification and statistical analysis
All enzymatic digestions were performed in technical duplicates and standard deviations were calculated for temperature and pH profiles.
REAGENT or RESOURCE
SOURCE
IDENTIFIER
Antibodies
6x-His Tag Monoclonal Antibody (clone 3D5), AP
Thermo Fisher Scientific
Cat# R932-25; RRID: AB_2556555
Bacterial and virus strains
Escherichia coli DH5α
Thermo Fisher Scientific
Cat# EC0112
Chemicals, peptides, and recombinant proteins
Sodium acetate
Sigma-Aldrich
Cat# S8750
Ammonium acetate
Sigma-Aldrich
Cat# 240192
Britton-Robinson buffer
Britton and Robinson (1931)
N/A
Endoglycosidase H
New England Biolabs
Cat# P0702S
PmeI
New England Biolabs
Cat# R0560S
Zeocin
Thermo Fisher Scientific
Cat# R25001
5-Bromo-4-chloro-3-indolyl phosphate (BCIP)
Sigma-Aldrich
CAS: 6578-06-9
Nitro blue tetrazolium (NBT)
Sigma-Aldrich
CAS: 298-83-9
Recombinant DNA
pPicZαA-xegADs (E. coli propagation and P. pastoris expression)
This paper
N/A
pPicZαA-xegAPo (E. coli propagation and P. pastoris expression)
Authors: Oksana V Berezina; Jonathan Herlet; Sergey V Rykov; Petra Kornberger; Artem Zavyalov; Dmitriy Kozlov; Liliya Sakhibgaraeva; Irina Krestyanova; Wolfgang H Schwarz; Vladimir V Zverlov; Wolfgang Liebl; Sergey V Yarotsky Journal: Appl Microbiol Biotechnol Date: 2017-05-05 Impact factor: 4.813
Authors: Peicheng Sun; Matthias Frommhagen; Maloe Kleine Haar; Gijs van Erven; Edwin J Bakx; Willem J H van Berkel; Mirjam A Kabel Journal: Carbohydr Polym Date: 2020-01-26 Impact factor: 9.381
Authors: Libin Ye; Xiaoyun Su; George E Schmitz; Young Hwan Moon; Jing Zhang; Roderick I Mackie; Isaac K O Cann Journal: Appl Environ Microbiol Date: 2012-07-27 Impact factor: 4.792
Authors: Igor V Grigoriev; Roman Nikitin; Sajeet Haridas; Alan Kuo; Robin Ohm; Robert Otillar; Robert Riley; Asaf Salamov; Xueling Zhao; Frank Korzeniewski; Tatyana Smirnova; Henrik Nordberg; Inna Dubchak; Igor Shabalov Journal: Nucleic Acids Res Date: 2013-12-01 Impact factor: 16.971
Authors: Peicheng Sun; Christophe V F P Laurent; Stefan Scheiblbrandner; Matthias Frommhagen; Dimitrios Kouzounis; Mark G Sanders; Willem J H van Berkel; Roland Ludwig; Mirjam A Kabel Journal: Biotechnol Biofuels Date: 2020-05-29 Impact factor: 7.670