Jie Shi1,2, Suhela Sharif1, Cyril Balsollier1, Rob Ruijtenbeek1,3, Roland J Pieters1, Seino A K Jongkees1. 1. Department of Chemical Biology and Drug Discovery Utrecht Institute for Pharmaceutical Sciences, Utrecht University, Universiteitsweg 99, Utrecht, 3584 CG, The Netherlands. 2. Key Laboratory of Carbohydrate Chemistry & Biotechnology Ministry of Education, School of Biotechnology, Jiangnan University, 214122, Wuxi, P. R. China. 3. PamGene International BV, 's-Hertogenbosch, 5211 HH, The Netherlands.
Abstract
O-GlcNAc transferase (OGT) is the only enzyme that catalyzes the post-translational modification of proteins at Ser/Thr with a single β-N-acetylglucosamine (O-GlcNAcylation). Its activity has been associated with chronic diseases such as cancer, diabetes and neurodegenerative disease. Although numerous OGT substrates have been identified, its accepted substrate scope can still be refined. We report here an attempt to better define the peptide-recognition requirements of the OGT active site by using mRNA display, taking advantage of its extremely high throughput to assess the substrate potential of a library of all possible nonamer peptides. An antibody-based selection process is described here that is able to enrich an OGT substrate peptide from such a library, but with poor absolute recovery. Following four rounds of selection for O-GlcNAcylated peptides, sequencing revealed 14 peptides containing Ser/Thr, but these were shown by luminescence-coupled assays and peptide microarray not to be OGT substrates. By contrast, subsequent testing of an N-terminal tag approach showed exemplary recovery. Our approach demonstrates the power of genetically encoded libraries for selection of peptide substrates, even from a very low initial starting abundance and under suboptimal conditions, and emphasizes the need to consider the binding biases of antibodies and both C- and N-terminal tags in profiling peptide substrates by high-throughput display.
O-GlcNAc transferase (OGT) is the only enzyme that catalyzes the post-translational modification of proteins at Ser/Thr with a single β-N-acetylglucosamine (O-GlcNAcylation). Its activity has been associated with chronic diseases such as cancer, diabetes and neurodegenerative disease. Although numerous OGT substrates have been identified, its accepted substrate scope can still be refined. We report here an attempt to better define the peptide-recognition requirements of the OGT active site by using mRNA display, taking advantage of its extremely high throughput to assess the substrate potential of a library of all possible nonamer peptides. An antibody-based selection process is described here that is able to enrich an OGT substrate peptide from such a library, but with poor absolute recovery. Following four rounds of selection for O-GlcNAcylatedpeptides, sequencing revealed 14 peptides containing Ser/Thr, but these were shown by luminescence-coupled assays and peptide microarray not to be OGT substrates. By contrast, subsequent testing of an N-terminal tag approach showed exemplary recovery. Our approach demonstrates the power of genetically encoded libraries for selection of peptide substrates, even from a very low initial starting abundance and under suboptimal conditions, and emphasizes the need to consider the binding biases of antibodies and both C- and N-terminal tags in profiling peptide substrates by high-throughput display.
O‐Linked GlcNAcylation (O‐GlcNAcylation) is a dynamic and ubiquitous post‐translational modification of proteins at Ser/Thr residues with a single β‐N‐acetylglucosamine sugar moiety, analogous to protein Ser/Thr phosphorylation.[
,
] Like the phosphorylation cycle, O‐GlcNAcylation is also reversibly controlled by “writer” and “eraser” enzymes. Remarkably, and unlike phosphorylation, O‐GlcNAc transferase (OGT) is the only enzyme that catalyzes the attachment of O‐GlcNAc directly to its substrates, using UDP‐GlcNAc as the sugar donor. Similarly, O‐GlcNAcase (OGA) is the only enzyme that removes this modification from proteins. O‐GlcNAcylation has been found on a diverse range of proteins, which are involved in many fundamental cellular processes such as gene transcription,
signal transduction,
and the cell cycle.
Aberrant regulation of O‐GlcNAcylation has been linked to chronic disease such as cancer,
diabetes,
and neurodegenerative disease.
Primary therapeutic effects have been seen in cases by modulating O‐GlcNAcylation levels,
so further investigation of OGT's catalytic properties is essential to understand disease development as well as for the design of inhibitors.To date, about 1734 O‐GlcNAcylation sites have been recorded (https://www.phosphosite.org) and recent proteomics studies estimated that this number should be even higher.[
,
] One of the fundamental questions that remains unanswered is how only one enzyme can define its substrates and modify so many of them with any specificity. Attempts to define the OGT substrate specificity through a high‐density peptide microarray showed that small peptide portions can be O‐GlcNAcylated to varying degrees,[
,
,
] suggesting that OGT has the ability to determine its substrate from hundreds of peptides by differential binding in its active site. Analysis of many OGTpeptide substrates has led to the generation of a substrate preference pattern for the amino acids around O‐GlcNAc as [TS] [PT] [VT] S/T [RLV] [ASY] (from −3 to +2 with respect to O‐GlcNAcylation site underlined), some of which were also supported by crystal structures.
Although this motif is not able to fully reflect the OGT substrate requirements, Pro at the −2 site was proposed to be necessary to access the OGT active site by inducing an extended conformation.
These data strongly suggested that OGT is able to differentiate between peptide substrates based on primary sequence. In another approach, saturation mutagenesis of a PPVSR substrate peptide was investigated using quantitative mass spectrometry on self‐assembled monolayers. In this study, variations in the motif made little difference, but addition of alanine in the +2 position or an aromatic or hydrophobic amino acid in the −4 position gave increased modification.In addition to direct recognition of primary sequence, N‐terminal domain tetratricopeptide repeats (TPR) in OGT are also believed to be responsible for recognition of physiological protein substrates by mediating a variety of protein‐protein interactions.[
,
,
] It is likely that OGT determines its physiological protein substrates following a synergistical mode of selectivity through a combination of TPR‐mediated protein‐protein interaction and active site‐constrained amino acid preference.
Despite this, short OGTpeptide substrates, which cannot make contact with the OGT TPR domain, are still ideal tools to study its inherent catalytic properties.The OGT substrate motif as currently understood can in some cases not accurately match or predict OGT substrates,[
,
] because microarray‐based peptide display is limited by sequence diversity and is unable to generate comprehensive sequence information at the required peptide length. Messenger RNA display is a very efficient in vitro technique that allows selection of proteins/peptides with functions of interest from ≥1012 unique variants in a single experiment. It has been widely used for applications such as the identification of ligands of a protein,[
,
] finding new and selective glycosidase inhibitors,
studying protein‐protein
and protein‐DNA interactions,
and enzyme‐substrate interactions.[
,
] We therefore felt that mRNA display would be an ideal method to survey short OGTpeptide substrates and comprehensively study the substrate requirement for OGT directly at the peptide level. Here we detail some challenges encountered in trying to develop such a system.To validate the utility of mRNA display to select short peptide substrates for OGT, we performed a proof of principle experiment following the selection scheme as depicted in Figure 1. However, instead of a DNA library, a positive DNA control that encodes a peptide known to be an OGT substrate was prepared and employed to validate and optimize the selection process. This template DNA control includes a region containing a T7 promoter, a region encoding an OGT substrate peptide (RESSYDIYRVPSSQS,
O‐GlcNAc attachment site underlined) and a region encoding a C‐terminal flexible peptide spacer (GAGAGA) that connects to the mRNA tag. The product of this was verified by in vitro translation to yield a product with the desired molecular weight on LC−MS (Figure S2). The control DNA template was then transcribed into mRNA using T7 RNA polymerase, attached to a puromycin (“Puro”) linker with a DNA spacer using T4 RNA ligase I, and this mRNA‐Puro conjugate was added to an in vitro translation reaction with release factors omitted, allowing the encoding mRNA to efficiently capture its translated peptide covalently through puromycin. The mRNA portion was reverse‐transcribed to form an mRNA/cDNA‐peptide fusion to avoid the degradation of mRNA and prevent any unfavorable RNA folding that might interfere with a future pull‐down step. Importantly, the resulting cDNA can be used to acquire peptide information after completion of enrichment.
Figure 1
Schematic depiction of the selection cycle for OGT substrate peptides. 1. A DNA library or positive control template (5′ and 3′ UTR in orange and cyan) was transcribed into mRNA using T7 RNA polymerase. 2. Messenger RNA was ligated to a puromycin‐containing linker by T4 RNA ligase I. 3. Peptide was translated and linked to its encoding RNA through the puromycin linker. 4. RNA was reverse transcribed to cDNA by reverse transcriptase. 5. mRNA/cDNA‐peptide fusions were subjected to OGT reaction. 6. O‐GlcNAcylated peptide‐RNA conjugates were isolated by O‐GlcNAc antibody pull‐down assay. 7. DNA was heat eluted from the fusions. 8. Eluted DNA was amplified by PCR for sequencing or the next round of selection.
Schematic depiction of the selection cycle for OGT substrate peptides. 1. A DNA library or positive control template (5′ and 3′ UTR in orange and cyan) was transcribed into mRNA using T7 RNA polymerase. 2. Messenger RNA was ligated to a puromycin‐containing linker by T4 RNA ligase I. 3. Peptide was translated and linked to its encoding RNA through the puromycin linker. 4. RNA was reverse transcribed to cDNA by reverse transcriptase. 5. mRNA/cDNA‐peptide fusions were subjected to OGT reaction. 6. O‐GlcNAcylated peptide‐RNA conjugates were isolated by O‐GlcNAc antibody pull‐down assay. 7. DNA was heat eluted from the fusions. 8. Eluted DNA was amplified by PCR for sequencing or the next round of selection.Before the OGT reaction and subsequent pull‐down steps, we assessed whether the presence of nucleotides and other components from the translation and reverse transcription reaction might affect OGT activity or antibody binding, as UDP has been reported as an inhibitor of OGT.
The crude product after translation and reverse transcription was directly mixed with a standard microarray OGT assay, wherein OGT activity is reflected by the O‐GlcNAcylation of an immobilized peptide substrate that is detected by antibody binding. Comparing with a parallel control OGT assay, the crude translation/reverse transcription product has no detectable effect on OGT activity (Figure S3), indicating that this reaction mixture is compatible with both OGT modification of the mRNA displayed library and binding of the product by RL2 antibody. The same crude reverse transcription product was then subjected to modification by OGT, while a negative control experiment was performed in identical conditions but with OGT omitted in case of non‐enzymatic modification. Because OGT itself can be O‐GlcNAcylated during the reaction and might interfere with following pull‐down of O‐GlcNAcylated peptide, ethanol precipitation of the mRNA/cDNA‐peptide conjugate was performed after completion of the OGT reaction to remove proteins and small molecules. Pull‐down of the O‐GlcNAcylated peptide was achieved by RL2 O‐GlcNAc antibody‐charged protein G Magnetic beads (Figure S4), and the cDNA was eluted and collected by transferring supernatant at 95 °C. Analysis of the eluted cDNA was performed using quantitative PCR. As shown in Figure 2, qPCR analysis indicated a higher recovery of cDNA from reaction with OGT present, as compared with the control reaction wherein OGT was absent. However, the recovery rate was much lower than expected in a positive control, as this sequence has been shown to be an efficient substrate by a microarray assay.
As the O‐GlcNAcylation site in this peptide is close to the C terminus, we speculated that the bulky puromycin linker and nucleic acid might still hinder access of the final fusions to the catalytic site of OGT, even though a flexible C‐terminal spacer (GAGAGA) was used.
Figure 2
Recovery efficiency of the proof‐of‐principle selection using ZO‐3 peptide. A) Illustration of the construct used, connecting the C terminus of the peptide (colored blue to red on carbons from N to C terminus) via puromycin to the mRNA strand of an mRNA/cDNA hybrid duplex. B) Pull‐down of O‐GlcNAcylated fraction of peptide‐Puro‐cDNA/mRNA conjugates with eluted cDNA quantified by qPCR. The recovery rate with and without OGT was calculated by dividing the absolute amount of DNA after pull‐down by that before pull‐down, using a standard curve for quantification.
Recovery efficiency of the proof‐of‐principle selection using ZO‐3 peptide. A) Illustration of the construct used, connecting the C terminus of the peptide (colored blue to red on carbons from N to C terminus) via puromycin to the mRNA strand of an mRNA/cDNA hybrid duplex. B) Pull‐down of O‐GlcNAcylated fraction of peptide‐Puro‐cDNA/mRNA conjugates with eluted cDNA quantified by qPCR. The recovery rate with and without OGT was calculated by dividing the absolute amount of DNA after pull‐down by that before pull‐down, using a standard curve for quantification.Given that the OGT substrate sequence was close to the C terminus in our test substrate sequence, we decided to proceed with selection using a relatively long peptide library in the hope that a part of the library sequence would be sufficient to act as additional spacer where required. Since crystal structures of several peptide substrates in complex with OGT have suggested that the substrate specificity of OGT seems to be mainly confined to residues from −3 to +2 relative to the O‐GlcNAcylation site,
we decided to study OGT activity among a library of peptides with 9 randomized residues. To this end, the initial DNA library for this study contains a promoter region for transcription, a translation start site, a random region of 9 NNK codons (N is any nucleotide and K is guanine or thymine, giving a less biased coverage of all 20 amino acids vs NNN) encoding a 9‐mer random peptide, a constant region encoding a flexible (GA)3 peptide spacer then an ochre stop codon, and finally a puromycin linker annealing sequence. We did not introduce a Serine/Threonine at a fixed location in this library because both Ser and Thr appear to be common surrounding the OGT modification site, and analysis would thus not be simplified. The final DNA library was expected to express 5.12×109 unique peptides with sequences of fM‐(X)9‐GAGAGA, where X indicates any of the 20 natural amino acids and fM is formyl methionine. With >1012 sequences examined in an mRNA display experiment, this library should be well covered (a feat that cannot be achieved by competing methodologies, including phage display). The assembly of this DNA library was achieved by two rounds of overlapping PCR, with the annealing temperature of each round of PCR optimized to avoid loss of library diversity through over‐cycling (Figure S6). The DNA library was then transcribed in vitro to achieve the corresponding mRNA library, purified by preparative‐scale urea PAGE, and further conjugated with a puromycin‐containing oligonucleotide linker. In vitro translation of the Puro‐mRNA library resulted in the mRNA‐peptide conjugate library. As with the positive control, the mRNA was reverse transcribed into cDNA for stability and to minimize aptamer selection.The successful enrichment of O‐GlcNAc modified peptides greatly relies on a specific and efficient pull‐down method. We have previously observed that O‐GlcNAc RL2 antibody could bind to some peptides that were not O‐GlcNAcylated.
To remove peptides binding directly to either RL2 or protein G, the mRNA/cDNA‐peptide fusion library was purified by ethanol‐precipitation and three library pre‐clear steps were performed by pull‐down using a mixture of 50 % uncharged protein G magnetic beads and 50 % RL2‐antibody charged protein G magnetic beads. While this preclear might lead to a loss of peptide diversity, it was seen as critical to prevent enrichment of peptides binding directly to these proteins without involvement of O‐GlcNAc. After washing with phosphate buffered saline containing TWEEN‐20 (PBS‐T), DNA from all three sets of pre‐clear beads was eluted by heat treatment and analyzed by qPCR. A small amount of recovery was observed from pre‐clear 1, but no further recovery was found in pre‐clear 2 and pre‐clear 3, suggesting any direct antibody‐binding peptides were completely removed (Figure 3). The supernatant after pre‐clear was then subjected to OGT reaction at room temperature overnight. After completion of the OGT reaction, the mRNA/cDNA‐peptide library was again ethanol precipitated to remove O‐GlcNAcylated OGT and then incubated with RL2 antibody‐saturated protein G magnetic beads for 1 hour at 4 °C. The supernatant was removed and the beads were stringently washed three times with ice cold PBS‐T. The DNA heat‐eluted from these positive selection beads was analyzed by qPCR as for the pre‐clears. As shown in Figure 3, a clear increase in the recovery of DNA was achieved from the selection following OGT reaction (Round 1), when it was compared with DNA from pre‐clear beads. To further confirm this selection recovery of DNA, an additional pair of parallel control selections were performed using the same mRNA‐peptide library but without OGT and without the UDP‐GlcNAcsugar donor, respectively. The DNA recoveries (Cq value) from both control experiments were the same as that from pre‐clear 3 (Figure S7), indicating the above recovery from the initial library was attributable to the O‐GlcNAcylation of the peptides by OGT. The DNA from the first round of selection was subsequently amplified by PCR to allow further rounds of selection, of which four were performed. For reasons as yet unknown, the positive to negative recovery ratio of the selection did not increase substantially across these rounds, and in fact decreased from round 3 to round 4. For this reason the selection was stopped at this point.
Figure 3
DNA recovery in each round of selection. The mRNA/cDNA‐peptide library was pre‐cleared three times using a 1 : 1 mixture of uncharged and RL2‐antibody charged beads before OGT reaction. After OGT reaction, the mRNA/cDNA‐peptide library was incubated with RL2‐antibody charged beads to bind O‐GlcNAcylated peptides. DNA eluted by heat from pre‐clear beads (“Pre‐clear 1”, “Pre‐clear 2”, and “Pre‐clear 3”) and selection beads (“Selection”) were analyzed by qPCR. The recovery ratio from each round was calculated by dividing the absolute amount of DNA after selection by that before selection (input), calibrated with a standard curve run in parallel. “S/P” values indicate the ratio of selection to pre‐clear 3.
DNA recovery in each round of selection. The mRNA/cDNA‐peptide library was pre‐cleared three times using a 1 : 1 mixture of uncharged and RL2‐antibody charged beads before OGT reaction. After OGT reaction, the mRNA/cDNA‐peptide library was incubated with RL2‐antibody charged beads to bind O‐GlcNAcylated peptides. DNA eluted by heat from pre‐clear beads (“Pre‐clear 1”, “Pre‐clear 2”, and “Pre‐clear 3”) and selection beads (“Selection”) were analyzed by qPCR. The recovery ratio from each round was calculated by dividing the absolute amount of DNA after selection by that before selection (input), calibrated with a standard curve run in parallel. “S/P” values indicate the ratio of selection to pre‐clear 3.While far from optimal, some enrichment was observed and so DNA eluted from the third round of selection (highest selection to pre‐clear recovery ratio) was prepared for sequencing by PCR amplification and subsequently cloning into a PCR2.1‐TOPO vector. This was transformed into Escherichia coli DH5α and plated, from which 35 colonies were found. Plasmids were extracted and verified by PCR using M13R and M13F primers. Among these 35 plasmids, 19 showed a band corresponding to the size of the DNA library (111 bp), indicating that DNA fragments from the in vitro selection were cloned into the vectors (Figure S8). Sequencing of these verified plasmids led to 16 DNA sequences of the correct library architecture, which were translated into the primary amino acid sequences as shown in Figure 4A. Notably, all sequences except peptides 2 and 11 bear at least one serine or threonine that could be O‐GlcNAcylated by OGT, although these sequences do not match the currently accepted sequon.
In addition, duplicate sequences were found in plasmids 8 through 10 and 13 through 14, suggesting that these two sequences have been more strongly enriched during the selection.
Figure 4
A) Peptide sequences from Sanger DNA sequencing results following cloning into the PCR2.1‐TOPO vector (repeated sequences highlighted in blue). B) Peptides corresponding to clone 13 and 8 were synthesized and printed on a microarray at various concentrations. The OGT reaction was carried out in the present of 1 mM UDP‐GlcNAc, with reaction in the absence of UDP‐GlcNAc used as control. C) All peptides were synthesized and tested as substrates for OGT by a UDP‐Glo assay, with ZO‐3 as a positive control. Bars represent the average of triplicate measurement with standard error. Significant activity above the no‐peptide control is represented as *** (p<0.001, two‐tailed t‐test).
A) Peptide sequences from Sanger DNA sequencing results following cloning into the PCR2.1‐TOPO vector (repeated sequences highlighted in blue). B) Peptides corresponding to clone 13 and 8 were synthesized and printed on a microarray at various concentrations. The OGT reaction was carried out in the present of 1 mM UDP‐GlcNAc, with reaction in the absence of UDP‐GlcNAc used as control. C) All peptides were synthesized and tested as substrates for OGT by a UDP‐Glo assay, with ZO‐3 as a positive control. Bars represent the average of triplicate measurement with standard error. Significant activity above the no‐peptide control is represented as *** (p<0.001, two‐tailed t‐test).Peptides 8 and 13 were thus synthesized and O‐GlcNAcylation of them was tested by peptide microarray. Unfortunately, this revealed that peptide 13 is a false positive and peptide 8 showed binding of the antibody to the non‐glycosylated form of the peptide, with only a slight increase in signal following OGT treatment (Figure 4B). This microarray approach is relatively labor intensive and relies on the same antibody as the selection, so rather than printing the remaining 11 sequences we sought further validation using a luminescence‐coupled assay that detects liberated UDP following GlcNAc transfer (UDP‐Glo). This removes the possibility of false positives based on antibody binding and allows a more convenient plate‐based format. The results showed that none of these peptides gave luminescence above a no‐peptide control, including sequence 8, and so we conclude that none are accepted as substrates by OGT (Figure 4C). Clone 8 can thus be concluded to have enriched through binding to the antibody, despite pre‐clearing out such sequences. An appropriate control in future would be to carry out a direct enrichment for peptide binders to the antibody and also subject these to sequencing, allowing them to be excluded in subsequent analysis.Although none of these sequences were confirmed to be substrates, we carried out high‐throughput sequencing (HTS) of the input, negative and positive selection libraries in rounds 1 through 3 by Illumina Miseq. Such a high‐throughput analysis of the input and selected libraries would yield more statistically robust information that could be used to compare the relative abundance of each sequence in the selected population,
and which we hoped would help in the identification of new OGT substrates. The results of this sequencing showed that, even after selection, the libraries remained at extremely high diversity, with few repeated sequences and no clear pattern of preferred amino acids emerged (Figure S9). In addition, none of the sequences found from Sanger sequencing were found in the HTS results. The high diversity of this dataset, coupled with no way to define the site of modification for sequence alignment, prevented clear analysis of the results. Unexpectedly, the HTS data did show that the test sequence used for method development (fMRESSYDIYRVPSSQS) was enriching during the selection more than any other sequence. This sequence is not of the correct length to derive from the nine‐amino‐acid library, and was not deliberately spiked into the selection. We presume that this sequence entered the selection through cross‐contamination from use of the same pipettes, despite the use of filter tips and other precautions. While not the result that we sought, this is nonetheless an indication that our approach is sufficiently powerful to enrich substrate sequences from a very low background starting point, even with the poor recovery seen in Figure 2B.As the crystal structure of substrate‐bound OGT shows a tunnel at the C terminus, we hypothesized that this might be interfering with the tag in our mRNA display experiment. Thus, to attempt to clarify the reason for our low enrichment, an additional pull‐down experiment was performed using a synthetic DNA‐peptide fusion bearing the same peptide sequence as in Figure 2, but in which the DNA was instead attached to the N terminus of the peptide. As described in Figure S5, this DNA‐peptide fusion was achieved using an succinimidyl 4‐(N‐maleimidomethyl)cyclohexane‐1‐carboxylate (SMCC) crosslinker to attach a 3′ amine‐modified oligo to a syntheticpeptide with an N‐terminal cysteine of the OGT substrate sequence to that used in the puromycin‐based method validation experiments.
After OGT reaction with this peptide‐oligo fusion, the same selection process was carried out as in that proof‐of‐principle experiment. A dramatically improved recovery of cDNA was observed in this case (Figure 5), corresponding to roughly ∼2400‐fold higher recovery with OGT compared to without. While not suitable for direct application in high‐throughput substrate profiling, this construct nonetheless indicates that the reason for our low recovery was indeed the placement of the oligonucleotide acid tag. This could perhaps be solved by linker optimization in this case, while combining C‐ and N‐terminal tags would likely present a more general strategy (particularly with enzymes for which no structural information is available). The location of the mRNA tag at the C terminus of the peptide is an inherent limitation of mRNA display as currently applied, but phage display does offer this possibility.
Figure 5
A) Illustration of the C‐terminal occlusion of peptide substrates bound to OGT. Gray: OGT; rainbow cartoon: peptide, with blue at the N terminus to red at the C terminus; dotted yellow circle: OGT catalytic site (using PDB ID: 4GYY). B) Illustration of the construct used to test pulldown by N‐terminal display, connecting a synthetic peptide (colored blue to red on carbons from N to C terminus) at the side chain of an N‐terminal cysteine via SMCC to an amine‐modified strand of DNA (together with an amine‐free complementary strand). C) Pull‐down of O‐GlcNAcylated fraction of dsDNA‐SMCC‐peptide conjugates, with eluted DNA quantified by qPCR as for Figure 1.
A) Illustration of the C‐terminal occlusion of peptide substrates bound to OGT. Gray: OGT; rainbow cartoon: peptide, with blue at the N terminus to red at the C terminus; dotted yellow circle: OGT catalytic site (using PDB ID: 4GYY). B) Illustration of the construct used to test pulldown by N‐terminal display, connecting a syntheticpeptide (colored blue to red on carbons from N to C terminus) at the side chain of an N‐terminal cysteine via SMCC to an amine‐modified strand of DNA (together with an amine‐free complementary strand). C) Pull‐down of O‐GlcNAcylated fraction of dsDNA‐SMCC‐peptide conjugates, with eluted DNA quantified by qPCR as for Figure 1.Although stringent pre‐clear steps were performed against direct binding of the initial library, we still found a peptide in the enriched pool that binds directly to the antibody. This suggests that the use of an antibody may not the best approach for specific O‐GlcNAcpeptide selection, even with a higher recovery from an optimized linker or N‐terminal tag. Antibodies or lectins with higher specificity may be able to overcome the limitations of the current RL2 antibody‐based selection approach, but any binding protein with sufficient affinity for an O‐GlcNAc modified peptide is likely to bring its own inherent biases. A better alternative may be found in azido‐functionalized sugar donors such as UDP‐GlcNAz and UDP‐GlcNAc6 N3, which have been shown to be accepted as substrates for OGT in vitro,[
,
] and can subsequently be further covalently modified to attach biotin. However, these azido modifications exhibit a reduced rate for conversion of substrate as compared with the natural donor UDP‐GlcNAc
and, more importantly, may also induce a bias in the selected peptide sequences. A further alternative tool to selectively introduce an azide only on O‐GlcNAc modified peptides or proteins is by means of a mutant β‐1,4‐galactosyltransferase that specifically transfers azido‐modified galactose (GalNAz) from UDP‐GalNAz to O‐GlcNAc residues on glycosylated peptides or proteins.[
,
] This tandem enzyme modification approach would avoid any bias or low turnover resulting from changing the OGT donor sugar while also overcoming the limitation of the antibody approach employed here. A final enrichment strategy that could be applicable here is through the use of GlcNAc electrophilic probes that react with a cysteine in a mutant OGT. On glycosyl transfer this gives a covalent link from peptide substrate to OGT and so allows substrate co‐enrichment by pull‐down of the enzyme.
There are thus several reported approaches that offer solutions to this problem.We have demonstrated here the feasibility of using mRNA display to enrich for an OGT substrate from a low starting ratio by use of antibody pull‐down. Our data suggest that the described enrichment process works for a known substrate even at an extremely low starting ratio (derived in this case from cross‐contamination), but with serious inherent issues from the antibody‐based enrichment process and library architecture when applied to an unbiased library. Because of these limitations, with the current version of the selection protocol it was not possible to obtain any new insights into OGT's substrate requirement. Despite this, we report promising first steps towards an approach for high‐throughput profiling of glycosyltransferase peptide substrates, as well as providing detail of problems encountered and proposed solutions.
Experimental Section
All the experimental details can be found in the Supporting Information.
Conflict of interest
The authors declare no conflict of interests.As a service to our authors and readers, this journal provides supporting information supplied by the authors. Such materials are peer reviewed and may be re‐organized for online delivery, but are not copy‐edited or typeset. Technical support issues arising from supporting information (other than missing files) should be addressed to the authors.SupplementaryClick here for additional data file.
Authors: Alain Mayer; Tracey M Gloster; Wayne K Chou; David J Vocadlo; Martin E Tanner Journal: Bioorg Med Chem Lett Date: 2010-12-23 Impact factor: 2.823
Authors: Jun Zhong; Marissa Martinez; Srona Sengupta; Albert Lee; Xinyan Wu; Raghothama Chaerkady; Aditi Chatterjee; Robert N O'Meally; Robert N Cole; Akhilesh Pandey; Natasha E Zachara Journal: Proteomics Date: 2015-01 Impact factor: 3.984
Authors: Seino A K Jongkees; Sami Caner; Christina Tysoe; Gary D Brayer; Stephen G Withers; Hiroaki Suga Journal: Cell Chem Biol Date: 2017-03-02 Impact factor: 8.116
Authors: S A Caldwell; S R Jackson; K S Shahriari; T P Lynch; G Sethi; S Walker; K Vosseller; M J Reginato Journal: Oncogene Date: 2010-03-01 Impact factor: 9.867
Authors: Jie Shi; Suhela Sharif; Cyril Balsollier; Rob Ruijtenbeek; Roland J Pieters; Seino A K Jongkees Journal: Chembiochem Date: 2020-11-06 Impact factor: 3.164