Literature DB >> 29780502

The impact of O-glycan chemistry on the stability of intrinsically disordered proteins.

Erica T Prates1,2, Xiaoyang Guan3, Yaohao Li3, Xinfeng Wang3, Patrick K Chaffey3, Munir S Skaf2, Michael F Crowley4, Zhongping Tan3, Gregg T Beckham1.   

Abstract

Protein glycosylation is a diverse post-translational modification that serves myriad biological functions. O-linked glycans in particular vary widely in extent and chemistry in eukaryotes, with secreted proteins from fungi and yeast commonly exhibiting O-mannosylation in intrinsically disordered regions of proteins, likely for proteolysis protection, among other functions. However, it is not well understood why mannose is often the preferred glycan, and more generally, if the neighboring protein sequence and glycan have coevolved to protect against proteolysis in glycosylated intrinsically disordered proteins (IDPs). Here, we synthesized variants of a model IDP, specifically a natively O-mannosylated linker from a fungal enzyme, with α-O-linked mannose, glucose, and galactose moieties, along with a non-glycosylated linker. Upon exposure to thermolysin, O-mannosylation, by far, provides the highest extent of proteolysis protection. To explain this observation, extensive molecular dynamics simulations were conducted, revealing that the axial configuration of the C2-hydroxyl group (2-OH) of α-mannose adjacent to the glycan-peptide bond strongly influences the conformational features of the linker. Specifically, α-mannose restricts the torsions of the IDP main chain more than other glycans whose equatorial 2-OH groups exhibit interactions that favor perpendicular glycan-protein backbone orientation. We suggest that IDP stiffening due to O-mannosylation impairs protease action, with contributions from protein-glycan interactions, protein flexibility, and protein stability. Our results further imply that resistance to proteolysis is an important driving force for evolutionary selection of α-mannose in eukaryotic IDPs, and more broadly, that glycan motifs for proteolysis protection likely coevolve with the protein sequence to which they attach.

Entities:  

Year:  2018        PMID: 29780502      PMCID: PMC5939190          DOI: 10.1039/c7sc05016j

Source DB:  PubMed          Journal:  Chem Sci        ISSN: 2041-6520            Impact factor:   9.825


Intrinsically disordered proteins (IDPs) and intrinsically disordered regions (IDRs) of proteins are prevalent in both eukaryotes and prokaryotes.1–3 Although often poorly conserved in sequence, the amino acid content of IDPs and IDRs is actively regulated, and IDPs and IDRs serve functions such as connecting ordered domains, regulating translation, molecular recognition and signaling, and assisting in protein folding.2–4 Because of their inherent flexibility and lack of structure, IDPs and IDRs are susceptible to proteolytic cleavage in the competitive, extracellular milieu, and O-glycosylation – the attachment of a sugar moiety to the β-hydroxyl group of serine or threonine – is an important mechanism to protect against proteolysis in these regions.5 In fungi and yeasts in particular, most of the secreted IDPs and proteins exhibiting IDRs are O-mannosylated,6–9 but the evolutionary preference for this specific glycosylation pattern is not well understood. The present study uses glycopeptide synthesis and molecular dynamics (MD) simulations to reveal that O-mannosylation is the preferred glycan motif on fungal IDP sequences and reveals the biophysical reasons underpinning this observation, in turn suggesting an evolutionary selection for α-mannose as the preferred glycan for IDP/IDR stabilization in some eukaryotic systems. O-Mannosylation is strongly preferred for proteolysis protection of a model fungal IDP. To investigate how glycan identity affects IDP proteolytic stability, we employed the naturally O-mannosylated linker from the Trichoderma reesei glycoside hydrolase family 7 cellobiohydrolase, TrCel7A, as a model.10 This enzyme is one of the most important industrial cellulases and its linker is a well-studied O-mannosylated IDP.11–14 The α-anomeric configuration was chosen since it is the only type reported so far in reducing terminal mannose residues of O-mannosylated proteins from fungi and yeasts.8 We used solid-state glycopeptide synthesis12,15,16 to produce four variants (Fig. 1), including the non-glycosylated linker, and measured the half-life to thermolysin degradation with MALDI-TOF MS (Fig. S1–S4†).15–19 As shown in Table 1, all glycosylated variants improve proteolytic stability over the non-glycosylated linker, LNG, but the O-mannosylated linker (Lman) exhibits an striking 112-fold improvement over LNG, 16-fold proteolysis protection over the O-galactosylated linker (Lgal), and 3-fold over the O-glucosylated linker (Lglc). These results, obtained using a model IDP, align with our previous observation that O-mannosylation improves proteolytic stability compared to other glycans in an ordered protein domain from the same enzyme.15,16
Fig. 1

The four linker models examined experimentally and computationally (left). Chair representations of α-mannose, α-galactose, and α-glucose are also depicted (right).

Table 1

Half-life to thermolysin degradation (minutes)

VariantsTrial 1Trial 2Trial 3Average
LNG1.61.51.21.5 ± 0.2
Lman163.9196.4130.5163.6 ± 32.9
Lgal8.58.812.19.8 ± 2.0
Lglc53.362.045.953.7 ± 8.1
Glycan stereochemistry impacts protein flexibility and accessibility. To explain the results presented in Table 1, we subsequently conducted temperature replica exchange molecular dynamics (T-REMD) with explicit solvent using various linker models, including the four experimental systems. Analyses are reported on the T-REMD population from the lowest temperature replica (300 K). Two hypotheses for the increased proteolytic stability imparted by glycans are that (i) glycans increase protein rigidity20,21 and that (ii) glycans impart steric hindrance to restrict protease access.22 Both hypotheses were tested computationally by examining differences in protein flexibility and accessibility. Notably, the predicted cleavage sites to various proteases coincide with the glycosylation sites (Fig. S5†), perhaps suggesting that steric hindrance may be responsible for proteolysis resistance. However, the calculated solvent accessible surface area is similar for all glycosylated models considered (Fig. S6†), while there is a considerable difference in proteolysis susceptibility among Lman, Lgal, and Lglc, with Lgal exhibiting only slightly higher resistance to proteolysis than LNG. These results suggest that steric hindrance alone cannot fully explain proteolytic resistance, since the glycan moieties occupy roughly the same volume. We subsequently examined how glycan chemistry affects protein flexibility, glycan orientation, specific interactions, and backbone torsional preferences in an attempt to explain the high proteolysis resistance imparted by O-mannosylation. Information about protein flexibility and extension were obtained from the free energy profiles, or potential of mean force (PMF), as a function of the end-to-end distance for all linkers (Fig. 2). Unlike Lgal and Lglc, for which the PMFs are somewhat flat-bottomed and resemble that of the non-glycosylated linker LNG, the PMF for Lman is slightly narrower and shows a well-defined local minimum at larger distances (∼3.0–3.5 nm). This indicates that Lman is, on average, stiffer and adopts more extended conformations than its counterparts. Further analyses reinforce the hypothesis that α-mannosylation is able to restrict protein flexibility. That is, the relative stiffening of Lman was corroborated by its greater persistence length (Table S1†). Also, similar structures from T-REMD were clustered considering the Cα atoms with a root mean squared deviation cutoff of 1.5 Å (Fig. S7, Table S3†).23 The most populated clusters were found for Lman. Moreover, values of root-mean-square deviation relative to average structures computed for 10 ns trajectory blocks also indicate lower mobility of the Lman backbone (Table S3†). Small differences in protein backbone flexibility and concomitant large differences in resistance to proteolysis were also recently found for a structured protein with a single attached glycan, α-mannose or α-glucose.24 Chaffey et al. suggested that a chain of specific interactions between O-mannosyl and side chains of close residues may be propagating stiffening along the protein backbone. The similar behavior observed with IDPs suggests that the effects of α-mannose on protein stiffening may not be exclusive to a specific protein fold. From these observations, we further hypothesized that the observed differences in linker extension are caused by local interactions with the C2-hydroxyl group (2-OH) adjacent to the glycan–peptide bond, which is equatorial in α-glucose and α-galactose and axial in α-mannose. Fig. 3A shows the average number of hydrogen bonds (HBs) between the protein and each of the carbohydrate hydroxyl groups computed from the T-REMD simulations. The HBs between the 2-OH group and the peptide contribute significantly to the higher total number of HBs in Lgal and Lglc. Compared to Lman, this indicates that the equatorial configuration of 2-OH, the closest hydroxyl to the peptide chain, favors glycan–protein HBs.
Fig. 2

Free energy profiles as a function of the end-to-end distance of TrCel7A linkers. Error bars were computed with bootstrapping analysis.

Fig. 3

(A) Average number of HBs involving hydroxyl groups in the different positions of the glycan ring. Solid and striped bars correspond to glycan–peptide and glycan–glycan interactions, respectively. Vertical lines indicate standard deviations; (B) probability distribution of the angle between the normal to the plane of the carbohydrate ring and the vector between Cα and Cβ belonging to the threonine to which the glycan is bound. The dashed lines correspond to the distributions resulting from trajectories without the frames with HBs between 2-OH and the protein; (C) representative structures for ∼90° and ∼170° angles obtained for Lman.

Next, we show that orientation of the glycans relative to the peptide chain depends on the glycan chemistry and affects the conformational freedom of the glycosylated IDP. Fig. 3B shows the normalized distribution of the angle θ between the normal to the plane of the sugar ring and the vector formed by Cα and Cβ of the threonine residues to which the glycan is attached. Values near 180° and 90° correspond, respectively, to conformations in which the plane of the rings are nearly parallel and perpendicular to the direction of the peptide chain (Fig. 3C). The shoulder at ∼90° observed for Lgal and Lglc indicates that the glycans are more frequently oriented perpendicularly to the peptide chain than in Lman, and, therefore, exhibit smaller contact surface with the protein (Table S2†). This effect is associated to the pronounced glycan–protein HBs involving the equatorial 2-OH in Lgal and Lglc. The normalized angle distributions computed for the subset of molecular frames in which these specific interactions are absent (Fig. 3B, dashed lines) lack the characteristic shoulder in the 80–100° range, demonstrating that the C2 stereochemistry impacts the glycan conformation. Taken together, the results presented thus far demonstrate that the 2-OH position affects glycan conformation and that protein dynamics differ depending on glycan chemistry. Next, why α-mannosylation leads to more extended conformations and reduces protein flexibility requires an explanation. To this end, we examined how glycans affect the protein backbone conformational sampling at the residue level. Fig. 4A shows the Ramachandran plots for the LNG threonines, in which the protein backbone frequently visits all three major conformational regions. The R3 region corresponds to α-helix like conformations, whereas R1 and R2 correspond to more extended conformations, such as those found in β-sheets and polyproline II structures. Although no persistent secondary structures were detected during the simulations, these results reflect the structural features of the linkers. We verified that attached glycans alter torsional sampling of the nearest amino acids, as seen elsewhere.25,26 For Lgal and Lglc, the same three regions are populated as in LNG, except that the peak in the R3 region occurs only every other residue because of the excluded volume of neighboring glycans (Fig. 4C and D, S8†). In contrast, the R2 region is predominantly favored in Lman for all glycosylated residues, suggesting that the relative rigidity of the α-mannosylated linker results in part from a reduced local dihedral flexibility of the glycosylated residues imparted by α-mannosylation (Fig. 4B). We suggest that perpendicularly oriented glycan rings in Lgal and Lglc allow for improved accommodation of neighboring glycan rings, favoring more compact conformations. Conversely, the preferred orientation of α-mannose glycans hinders the mobility of the surrounding atoms in the peptide chain, thus revealing a direct relationship between glycan chemistry, orientation, and protein conformational freedom.
Fig. 4

Ramachandran plots of threonine residues in (A) LNG, (B) Lman, (C) Lgal, and (D) Lglc. R1, R2, and R3 regions are indicated on panel A. Angles are presented in degrees.

Variants decorated with O-mannobiosyl (L2man) or O-galactobiosyl (L2gal) were also simulated, as well as the linker with a putative natural decoration based on a previous experimental characterization (Lman-h) (Fig. S9 and S10†).10 Our analyses suggest that the length of the glycan only slightly changes the dynamics of the protein when the chemistry of the 2-OH groups in the immediately attached glycosyl unit is preserved, reinforcing its importance (Fig. S11†). Glycosylation pattern and protein primary sequence are correlated. Although less well studied, many secreted bacterial proteins are also O-glycosylated.27 For example, the multi-enzyme cellulosome from Clostridium thermocellum exhibits O-glycans on its linkers.28 Similarly, the thermostable enzyme CelA from Caldicellulosiruptor bescii has linkers of up to 70 amino acids rich in O-glycans.29 However, unlike the typical O-mannosylated linkers from eukaryotic proteins, these linkers exhibit mostly O-galactosylation, and are enriched in proline, relative to eukaryotic IDRs.30 Aiming to understand why O-mannosylation is not prevalent in bacterial IDP and IDRs relative to their eukaryotic counterparts, we also studied a “PT linker”, which comprises a prolinethreonine repeat sequence, and represents a fragment of glycosylated linkers found in bacterial cellulases.20,28,29,31,32 PT linker models were uniformly decorated with α-mannose (LPT-man), α-galactose (LPT-gal), and α-glucose (LPT-glc) (Fig. 5A).
Fig. 5

(A) Non-glycosylated (LPT-NG) and glycosylated variants of “PT linker” (LPT-man, LPT-gal and LPT-glc). (B) Free energy profiles as a function of the end-to-end distance of PT linkers. Error bars were computed with bootstrapping analysis. The free energy profile of Lman was computed for the distance between Cα atoms in residue 10 (G) and residue 22 (G), so that fragments of same length can be compared.

It is well known that high proline content is generally found in disordered proteins33 and favors extended conformations of IDRs.34 Accordingly, the end-to-end distance PMF shows that the non-glycosylated PT linker favors extended conformations similarly to the glycosylated TrCel7A linker Lman (Fig. 5B). Elongation and further stiffening of the linkers are observed upon glycosylation and is consistent with NMR spectroscopy data,34 which demonstrated that glycosylation of PT linkers dampens the dynamics. Interestingly, in the PT linkers, varying the glycan chemistry is not as impactful to the protein dynamics as in the eukaryotic linker cases. To understand this difference, we examined the correlation between protein dynamics and carbohydrate structuring proposed from the findings with the eukaryotic linker models. In the PT linkers, the presence of the equatorial 2-OH groups in galactosylated and glucosylated linkers does not increase the number of protein–glycan HB compared to Lman nor favor perpendicular ring orientations, unlike Lgal and Lglc (Fig. S12†). Moreover, the Ramachandran plots of threonines are remarkably similar for the three glycosylated PT linkers (Fig. 6), and show the same preference for extended conformations as Lman does (R2 region). Together, these results predict that the C2 hydroxyl stereochemistry is unlikely to impact proline-rich IDPs. That may result from the loss of one of the HB sites in the protein backbone, since the backbone nitrogen atom is part of the pyrrolidine ring of proline residues.
Fig. 6

Ramachandran plots of threonine residues in the variants of PT linker (A) LPT-NG, (B) LPT-man, (C) LPT-gal, and (D) LPT-glc.

T-REMD simulations of glycosylated tripeptides GTG were also performed to evaluate the effects of 2-OH configuration on glycan orientation and interactions without the influence of neighboring glycans and amino acids. A single glycan, α-mannose, α-galactose or α-glucose, was O-linked to the central threonine in the models Tman, Tgal and Tglc, respectively (Fig. S9†). The parallel glycan-peptide backbone orientation is favored in the small model systems with α-O-mannosylation, Tman, relative to other glycans (Fig. S13†). In the tripeptides Tgal and Tglc, the equatorial configuration of 2-OH in α-Gal and α-Glc favors HB interactions with the peptide as in the Lgal and Lglc linkers. However, an excess of perpendicularly-oriented glycans relative to Lman is not observed for these tripeptides, indicating that the local HB interactions between 2-OH and the peptide are not the only factor affecting glycan conformation. Instead, these results indicate that the glycans in Lgal and Lglc are primarily perpendicularly oriented because of the excluded volumes of neighboring glycans and amino acid side chains, and that the 2-OH—peptide HBs stabilize this glycan conformation. Thus, our results with the small tripeptides suggest that the primary sequence and the distribution of glycosylated residues along the peptide chain are important factors for carbohydrate orientation in these systems. In summary, experimental comparisons of glycosylated and non-glycosylated IDPs show that O-mannosylation enhances protection against proteolysis by two orders of magnitude relative to the non-glycosylated parent IDP, followed by O-galactosylation (10-fold improved stability). Our results suggest that the resistance to proteolysis is an important driving force for the natural selection of α-mannose as the main O-linked glycan motif decorating IDRs and IDPs in secreted eukaryotic proteins. Furthermore, these results demonstrate that the stereochemistry of C2 in the carbohydrate rings plays a key role on glycan orientation, which is correlated to protein flexibility and extension. Accordingly, the axial position of 2-OH in an α-mannose glycan is related to the observed higher rigidity and extension of the studied IDR. While associating protein elongation with resistance to proteolysis is perhaps counterintuitive, protein stiffening can explain the remarkably higher stability of the O-mannosylated linker. That is, although we have not investigated the interactions between a protease and IDPs, we conjecture, in the light of the present findings, that increasing the peptide rigidity impairs binding to the catalytic site of a protease. This hypothesis is reinforced by the observation of a similar trend of glycan chemistry impacting resistance to proteolysis of a structured protein and its thermal stability, which is often linked to protein stiffening.16 Moreover, the effect of glycosylation on the average elongation of the studied IDR, as a protein linker, may be important to provide the optimum distance between the connected domains for protein function. Therefore, O-linked α-mannose exhibits the unique ability of both extending the IDR while protecting it against proteolysis. These results also suggest that the high content of proline residues, especially found in linkers from bacterial cellulases, avoids the need for α-mannose for increased protection against proteolysis. This hypothesis will be tested in future experimental studies. We further suggest that the glycosylation pattern in eukaryotic IDRs co-evolved with the primary sequence. That is, the lower content of proline residues in IDPs and IDRs from fungi compared to bacteria is compensated by O-linked α-mannosylation to guarantee optimal linker length, flexibility, and protection against proteolysis. Given the compelling alignment of experimental and computational results, we anticipate that our findings will be useful in the burgeoning field of glycoprotein engineering.

Conflicts of interest

There are no conflicts to declare. Click here for additional data file.
  31 in total

Review 1.  Protein O-mannosylation: conserved from bacteria to humans.

Authors:  Mark Lommel; Sabine Strahl
Journal:  Glycobiology       Date:  2009-05-09       Impact factor: 4.313

2.  Glycosylated linkers in multimodular lignocellulose-degrading enzymes dynamically bind to cellulose.

Authors:  Christina M Payne; Michael G Resch; Liqun Chen; Michael F Crowley; Michael E Himmel; Larry E Taylor; Mats Sandgren; Jerry Ståhlberg; Ingeborg Stals; Zhongping Tan; Gregg T Beckham
Journal:  Proc Natl Acad Sci U S A       Date:  2013-08-19       Impact factor: 11.205

3.  Limited proteolysis of natively unfolded protein 4E-BP1 in the presence of trifluoroethanol.

Authors:  Ellen V Hackl
Journal:  Biopolymers       Date:  2014-06       Impact factor: 2.505

4.  The influence of different linker modifications on the catalytic activity and cellulose affinity of cellobiohydrolase Cel7A from Hypocrea jecorina.

Authors:  Silke Flindt Badino; Jenny Kim Bathke; Trine Holst Sørensen; Michael Skovbo Windahl; Kenneth Jensen; Günther H J Peters; Kim Borch; Peter Westh
Journal:  Protein Eng Des Sel       Date:  2017-07-01       Impact factor: 1.650

5.  The O-glycosylated linker from the Trichoderma reesei Family 7 cellulase is a flexible, disordered protein.

Authors:  Gregg T Beckham; Yannick J Bomble; James F Matthews; Courtney B Taylor; Michael G Resch; John M Yarbrough; Steve R Decker; Lintao Bu; Xiongce Zhao; Clare McCabe; Jakob Wohlert; Malin Bergenstråhle; John W Brady; William S Adney; Michael E Himmel; Michael F Crowley
Journal:  Biophys J       Date:  2010-12-01       Impact factor: 4.033

6.  The nature of the carbohydrate-peptide linkage region in glycoproteins from the cellulosomes of Clostridium thermocellum and Bacteroides cellulosolvens.

Authors:  G J Gerwig; J P Kamerling; J F Vliegenthart; E Morag; R Lamed; E A Bayer
Journal:  J Biol Chem       Date:  1993-12-25       Impact factor: 5.157

7.  Specificity of O-glycosylation in enhancing the stability and cellulose binding affinity of Family 1 carbohydrate-binding modules.

Authors:  Liqun Chen; Matthew R Drake; Michael G Resch; Eric R Greene; Michael E Himmel; Patrick K Chaffey; Gregg T Beckham; Zhongping Tan
Journal:  Proc Natl Acad Sci U S A       Date:  2014-05-12       Impact factor: 11.205

8.  Structural and functional analysis of a bacterial cellulase by proteolysis.

Authors:  N R Gilkes; D G Kilburn; R C Miller; R A Warren
Journal:  J Biol Chem       Date:  1989-10-25       Impact factor: 5.157

9.  Glycosylation of bacterial cellulases prevents proteolytic cleavage between functional domains.

Authors:  M L Langsford; N R Gilkes; B Singh; B Moser; R C Miller; R A Warren; D G Kilburn
Journal:  FEBS Lett       Date:  1987-12-10       Impact factor: 4.124

10.  Cellulase linkers are optimized based on domain type and function: insights from sequence analysis, biophysical measurements, and molecular simulation.

Authors:  Deanne W Sammond; Christina M Payne; Roman Brunecky; Michael E Himmel; Michael F Crowley; Gregg T Beckham
Journal:  PLoS One       Date:  2012-11-06       Impact factor: 3.240

View more
  7 in total

1.  Chemical biology of glycoproteins: From chemical synthesis to biological impact.

Authors:  Yaohao Li; Amy H Tran; Samuel J Danishefsky; Zhongping Tan
Journal:  Methods Enzymol       Date:  2019-03-14       Impact factor: 1.600

Review 2.  Nucleocytoplasmic O-glycosylation in protists.

Authors:  Christopher M West; Hyun W Kim
Journal:  Curr Opin Struct Biol       Date:  2019-05-22       Impact factor: 6.809

Review 3.  Pathway engineering facilitates efficient protein expression in Pichia pastoris.

Authors:  Chao Liu; Jin-Song Gong; Chang Su; Hui Li; Heng Li; Zhi-Ming Rao; Zheng-Hong Xu; Jin-Song Shi
Journal:  Appl Microbiol Biotechnol       Date:  2022-08-30       Impact factor: 5.560

Review 4.  The Genetics and Biochemistry of Cell Wall Structure and Synthesis in Neurospora crassa, a Model Filamentous Fungus.

Authors:  Pavan K Patel; Stephen J Free
Journal:  Front Microbiol       Date:  2019-10-10       Impact factor: 5.640

5.  Potentially adaptive SARS-CoV-2 mutations discovered with novel spatiotemporal and explainable AI models.

Authors:  Michael R Garvin; Erica T Prates; Mirko Pavicic; Piet Jones; B Kirtley Amos; Armin Geiger; Manesh B Shah; Jared Streich; Joao Gabriel Felipe Machado Gazolla; David Kainer; Ashley Cliff; Jonathon Romero; Nathan Keith; James B Brown; Daniel Jacobson
Journal:  Genome Biol       Date:  2020-12-23       Impact factor: 13.583

6.  Structural and functional characterization of NEMO cleavage by SARS-CoV-2 3CLpro.

Authors:  Mikhail A Hameedi; Erica T Prates; Michael R Garvin; Irimpan I Mathews; B Kirtley Amos; Omar Demerdash; Mark Bechthold; Mamta Iyer; Simin Rahighi; Daniel W Kneller; Andrey Kovalevsky; Stephan Irle; Van-Quan Vuong; Julie C Mitchell; Audrey Labbe; Stephanie Galanie; Soichi Wakatsuki; Daniel Jacobson
Journal:  Nat Commun       Date:  2022-09-08       Impact factor: 17.694

7.  Mutually exclusive locales for N-linked glycans and disorder in human glycoproteins.

Authors:  Shyamili Goutham; Indu Kumari; Dharma Pally; Alvina Singh; Sujasha Ghosh; Yusuf Akhter; Ramray Bhat
Journal:  Sci Rep       Date:  2020-04-08       Impact factor: 4.379

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.