Literature DB >> 30101321

GlycanAnalyzer: software for automated interpretation of N-glycan profiles after exoglycosidase digestions.

Ian Walsh1, Terry Nguyen-Khuong1, Katherine Wongtrakul-Kish1, Shi Jie Tay1, Daniel Chew1, Tasha José2, Christopher H Taron2, Pauline M Rudd1.   

Abstract

SUMMARY: Many eukaryotic proteins are modified by N-glycans. Liquid chromatography (ultra-performance -UPLC and high-performance-HPLC) coupled with mass spectrometry (MS) is conventionally used to characterize N-glycan structures. Software can automatically assign glycan structures by matching their observed retention times and masses with standardized values in reference databases. However, more precise confirmation of N-glycan structures can be derived using exoglycosidases, enzymes that remove specific monosaccharides from glycans. Exoglycosidase removal of monosaccharides results in signature peak shifts, in both UPLC and MS1, yielding an effective way to verify N-glycan structure with high detail (down to the position and isomeric linkage of each monosaccharide). Because manual interpretation of exoglycosidase data is complex and time consuming, we developed GlycanAnalyzer, a web application that pattern matches N-glycan peak shifts following exoglycosidase digestion and automates structure assignments. GlycanAnalyzer significantly improves assignment accuracy over other auto-assignment methods on tests with a monoclonal antibody and four glycan standards (100% versus 82% for the next best software). By automating data interpretation, GlycanAnalyzer enables the easier use of exoglycosidases to precisely define N-glycan structure.
AVAILABILITY AND IMPLEMENTATION: http://glycananalyzer.neb.com. Datasets available online. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author(s) 2018. Published by Oxford University Press.

Entities:  

Mesh:

Substances:

Year:  2019        PMID: 30101321      PMCID: PMC6378934          DOI: 10.1093/bioinformatics/bty681

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


1 Introduction

N-Glycans are carbohydrates that are covalently bonded to many glycoproteins. They play critical roles in numerous physiological and pathological processes. In addition, their characterization is imperative in biotherapeutic quality by design (Dalziel ). N-Glycans have diverse branching, topology and monosaccharide linkages (Cummings, 2009). Consequently, there are significant analytical challenges in their complete characterization. In recent years, analytical techniques have advanced and are now able to detect and quantitate N-glycans in complex samples. Such techniques may involve various methodologies such as high/ultra-performance liquid chromatography (H/UPLC), capillary electrophoresis (CE), mass spectrometry (MS) or combinations thereof (Mariño ). In UPLC-MS two types of information are generated to help characterize N-glycans structures: (i) the retention time (RT) of N-glycans on an LC column, and (ii) the mass of each eluted N-glycan (Supplementary Fig. S2). Typically, software approaches to assign structures to observed peaks in UPLC-MS involve matching observed RTs and masses to reference values in glycan repositories (Royle ). As RTs may vary across different experimental conditions, software often draws upon comparisons to the mobility of a dextran ladder standard [Glucose Units (GU)] (Mariño ). For example, the UNIFI Scientific Information System (Waters) utilizes both N-glycan GU and mass matching criteria at its core. Exoglycosidases are enzymes that catalyze sequential removal of monosaccharides from the non-reducing end of glycans. They have specificity for a particular type of sugar, its stereochemistry (α/β anomer) and position of attachment to an adjacent sugar on the glycan (Supplementary Fig. S1). Consequently, monitoring the UPLC-MS peak movements after step-wise application of exoglycosidases provides a unique and effective way to annotate sugars with high detail down to the level of position and linkage isomers (Supplementary Fig. S3). This level of detail is not directly ascertained when only matching GU and/or mass values (Supplementary Fig. S2). Interpretation of exoglycosidase array data analysis is currently performed manually and can be complex and very time-consuming. Previously, the GlycoDigest tool (Gotz ) was developed to model the action of exoglycosidase(s) on particular glycans but it did not automate structure assignment. Additionally, GlycoProfileAssigner (Duffy and Rudd, 2015), sought to automate interpretation of exoglycosidase array digestion data (Supplementary Table S1 shows how it compares to GlycanAnalyzer). In our experiments, GlycoProfileAssigner handled simple scenarios well but failed on more complex samples. Here, we present GlycanAnalyzer, a web application that automates the annotation of N-glycans in UPLC-MS chromatograms after treatment with one or more exoglycosidases. It presents data visualization and links to published reference databases (Aoki-Kinoshita ; Zhao ). Comprehensive help pages/tutorials are also included to facilitate user training and support the program’s use (Supplementary Table S2). GlycanAnalyzer is the most accurate and comprehensive software for automated interpretation of N-glycan exoglycosidase digestion data described to date.

2 Usage and output

Input: GlycanAnalyzer uses GU mobility reference and mass data. All input is supplied through tab-separated text. H/UPLC peak data can be uploaded using tab-separated triplets (RT, GU, % Area) for each peak (Supplementary Fig. S4a). Three dimensional mass information can be supplied by providing the triplets (RT, m/z, intensity) in tab-separated form (Supplementary Fig. S4a). All variables (RT, m/z, intensity, GU and % area) can easily be extracted from H/UPLC-MS instruments, but users may wish to supply their own observed masses (Supplementary Fig. S4b). The software also executes when no mass information is supplied (i.e. only supplying UPLC peak GU and % area). However, accuracy is significantly diminished in this case (see ‘NA’ values in Supplementary Fig. S4b). Before input pre-processing proceeds, users must select the fluorescent label used in their experiment, or select ‘label free’ if no label was used (Supplementary Section S4.1.2). Additionally, users can select a type of glycoprotein to help refine the final glycan assignments. Output: The N-glycans assigned by the software are sorted by a score assigned to each peak (the best score is zero). All glycans are drawn in accepted structural notation (Haltiwanger, 2016), and links to the databases GlycoStore (Zhao ) and GlyTouCan (Aoki-Kinoshita ) are provided. GlycanAnalyzer outputs three pieces of supporting evidence for each structure assignment: (i) GU similarity to reference values in GlycoStore, (ii) mass matching with known structures and (iii) interpretation of peak shifts after exoglycosidase digestion. This is the first software where all three pieces of evidence are available together. When all three lines of evidence are returned, the assignment has excellent support and a low score (close to zero) will be returned. In some cases, not all three lines of evidence will be found. Example output is shown in Supplementary Figures S5 and S6.

3 Materials and methods

GlycanAnalyzer extracts each glycan’s mass from the 3-dimensional mass input (i.e. RT, m/z, intensity; Supplementary Fig. S4) or user-supplied values. Using detailed understanding of each exoglycosidase’s specificity, it searches for peak shifts anticipated by the removal of individual monosaccharide masses. Supplementary Figure S7 shows example mass and UPLC peak shifts for a simple standard. However, in more complex samples, many peak movements are possible (Supplementary Fig. S9). A score is implemented to help predict and rank the most likely peak movements that result from enzyme digestions. An assigned glycan’s score depends on a number of factors such as: optimized regression models that predict its expected GU shift, whether all masses are found in each profile of the exoglycosidase array, and the GU similarity to average values in GlycoStore (Supplementary Section S5.2). In cases where no mass is supplied, mass is estimated by linear regression and matching to GlycoStore (Supplementary Fig. S10). The algorithm and scoring technique are described in detail in the Supplementary Section S5.

4 Results

Correct assignments: Table 1 shows 100% assignment accuracy for GlycanAnalyzer compared to other software capable of automated assignment. Benchmarking was performed on N-glycans released from a monoclonal antibody and four glycan standards (one standard was used for illustration in Supplementary Fig. S7). Supplementary Table S3 shows examples where GlycanAnalyzer’s score helps remove the GU and mass annotation ambiguity. Supplementary Table S4 shows GlycanAnalyzer’s ability to detect co-elution.
Table 1.

Fraction of correctly assigned peaks measured against expert manual assignment

Anti-Her2StandardsAccuracy (%)
GlycanAnalyzer18/184/4100.0
GlycoProfileAssigner4/181/422.7
UNIFI17/181/481.8
Fraction of correctly assigned peaks measured against expert manual assignment Time savings: Depending on the complexity of a sample and the number of exoglycosidases used, manual annotation could involve interpreting hundreds of peak movements, potentially taking days/weeks to complete. Using GlycanAnalyzer all assignments were returned in 20 min when using the ‘Assign all top 5’ option for our monoclonal antibody. Click here for additional data file. Click here for additional data file.
  8 in total

1.  GlycoProfileAssigner: automated structural assignment with error estimation for glycan LC data.

Authors:  Fergal J Duffy; Pauline M Rudd
Journal:  Bioinformatics       Date:  2015-02-26       Impact factor: 6.937

2.  GlycoStore: a database of retention properties for glycan analysis.

Authors:  Sophie Zhao; Ian Walsh; Jodie L Abrahams; Louise Royle; Terry Nguyen-Khuong; Daniel Spencer; Daryl L Fernandes; Nicolle H Packer; Pauline M Rudd; Matthew P Campbell
Journal:  Bioinformatics       Date:  2018-09-15       Impact factor: 6.937

Review 3.  A systematic approach to protein glycosylation analysis: a path through the maze.

Authors:  Karina Mariño; Jonathan Bones; Jayesh J Kattla; Pauline M Rudd
Journal:  Nat Chem Biol       Date:  2010-09-17       Impact factor: 15.040

Review 4.  The repertoire of glycan determinants in the human glycome.

Authors:  Richard D Cummings
Journal:  Mol Biosyst       Date:  2009-07-28

5.  GlycoDigest: a tool for the targeted use of exoglycosidase digestions in glycan structure determination.

Authors:  Lou Gotz; Jodie L Abrahams; Julien Mariethoz; Pauline M Rudd; Niclas G Karlsson; Nicolle H Packer; Matthew P Campbell; Frederique Lisacek
Journal:  Bioinformatics       Date:  2014-07-10       Impact factor: 6.937

6.  HPLC-based analysis of serum N-glycans on a 96-well plate platform with dedicated database software.

Authors:  Louise Royle; Matthew P Campbell; Catherine M Radcliffe; Dawn M White; David J Harvey; Jodie L Abrahams; Yun-Gon Kim; George W Henry; Nancy A Shadick; Michael E Weinblatt; David M Lee; Pauline M Rudd; Raymond A Dwek
Journal:  Anal Biochem       Date:  2007-12-23       Impact factor: 3.365

Review 7.  Emerging principles for the therapeutic exploitation of glycosylation.

Authors:  Martin Dalziel; Max Crispin; Christopher N Scanlan; Nicole Zitzmann; Raymond A Dwek
Journal:  Science       Date:  2014-01-03       Impact factor: 47.728

8.  GlyTouCan 1.0--The international glycan structure repository.

Authors:  Kiyoko Aoki-Kinoshita; Sanjay Agravat; Nobuyuki P Aoki; Sena Arpinar; Richard D Cummings; Akihiro Fujita; Noriaki Fujita; Gerald M Hart; Stuart M Haslam; Toshisuke Kawasaki; Masaaki Matsubara; Kelley W Moreman; Shujiro Okuda; Michael Pierce; René Ranzinger; Toshihide Shikanai; Daisuke Shinmachi; Elena Solovieva; Yoshinori Suzuki; Shinichiro Tsuchiya; Issaku Yamada; William S York; Joseph Zaia; Hisashi Narimatsu
Journal:  Nucleic Acids Res       Date:  2015-10-17       Impact factor: 16.971

  8 in total
  2 in total

Review 1.  Advancing Solutions to the Carbohydrate Sequencing Challenge.

Authors:  Christopher J Gray; Lukasz G Migas; Perdita E Barran; Kevin Pagel; Peter H Seeberger; Claire E Eyers; Geert-Jan Boons; Nicola L B Pohl; Isabelle Compagnon; Göran Widmalm; Sabine L Flitsch
Journal:  J Am Chem Soc       Date:  2019-08-21       Impact factor: 16.383

Review 2.  Early Stage Glycosylation Biomarkers in Alzheimer's Disease.

Authors:  Patricia Regan; Paula L McClean; Thomas Smyth; Margaret Doherty
Journal:  Medicines (Basel)       Date:  2019-09-03
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.