| Literature DB >> 32576591 |
Ameera Raudah Ahmad Izaham1, Nichollas E Scott2.
Abstract
Mass spectrometry has become an indispensable tool for the characterization of glycosylation across biological systems. Our ability to generate rich fragmentation of glycopeptides has dramatically improved over the last decade yet our informatic approaches still lag behind. Although glycoproteomic informatics approaches using glycan databases have attracted considerable attention, database independent approaches have not. This has significantly limited high throughput studies of unusual or atypical glycosylation events such as those observed in bacteria. As such, computational approaches to examine bacterial glycosylation and identify chemically diverse glycans are desperately needed. Here we describe the use of wide-tolerance (up to 2000 Da) open searching as a means to rapidly examine bacterial glycoproteomes. We benchmarked this approach using N-linked glycopeptides of Campylobacter fetus subsp. fetus as well as O-linked glycopeptides of Acinetobacter baumannii and Burkholderia cenocepacia revealing glycopeptides modified with a range of glycans can be readily identified without defining the glycan masses before database searching. Using this approach, we demonstrate how wide tolerance searching can be used to compare glycan use across bacterial species by examining the glycoproteomes of eight Burkholderia species (B. pseudomallei; B. multivorans; B. dolosa; B. humptydooensis; B. ubonensis, B. anthina; B. diffusa; B. pseudomultivorans). Finally, we demonstrate how open searching enables the identification of low frequency glycoforms based on shared modified peptides sequences. Combined, these results show that open searching is a robust computational approach for the determination of glycan diversity within bacterial proteomes.Entities:
Keywords: Glycoproteomics; N-glycosylation; bacteria; bioinformatics searching; glycomics; glycoprotein structure; glycoproteins; glycosylation; open searching; post-translational modifications
Year: 2020 PMID: 32576591 PMCID: PMC8143609 DOI: 10.1074/mcp.TIR120.002100
Source DB: PubMed Journal: Mol Cell Proteomics ISSN: 1535-9476 Impact factor: 5.911
Strain list
| Strains | Source (description, country, year) | Reference | Proteome database |
|---|---|---|---|
| Campylobacter fetus subsp. fetus NCTC 10842 | Brain of sheep fetus, France, 1952 | ( | Uniprot Database: UP000001035 |
| Acinetobacter baumannii ATCC17978 | Fatal meningitis of a 4-month old infant, 1951 | ( | GenBank assembly accession: |
| Burkholderia pseudomallei K96243 | Human clinical specimen, Thailand, 1996 | ( | Uniprot Database: UP000000605 |
| Burkholderia cenocepacia (LMG 16656/J2315) | Human clinical specimen, United Kingdom, 1989 | ( | Uniprot Database: UP000001035 |
| Burkholderia multivorans MSMB2008 | Soil isolate, Australia, 2012 | ( | Burkholderia Genome Database ( |
| Burkholderia dolosa AU0158 | Human clinical specimen, USA unknown | ( | Uniprot database: UP000032886 |
| Burkholderia humptydooensis MSMB43 | Water isolate, Australia, unknown | ( | Burkholderia Genome Database ( |
| Burkholderia ubonensis MSMB22 | Soil isolate, Australia, 2001 | ( | Burkholderia Genome Database ( |
| Burkholderia anthina MSMB649 | Soil isolate, Australia, 2010 | ( | Burkholderia Genome Database ( |
| Burkholderia diffusa MSMB375 | Water isolate, Australia, 2008 | ( | Burkholderia Genome Database ( |
| Burkholderia pseudomultivorans MSMB2199 | Soil isolate, Australia, 2011 | ( | Burkholderia Genome Database ( |
Fig. 1Open searching analysis of A, C. fetus fetus glycopeptide delta mass plot of 0.001 Da increments showing the detection of PSMs modified with masses over 1000 Da. B, Zoomed view of the C. fetus fetus glycopeptide delta mass plot highlighting the most numerous observed delta masses; red masses correspond to nonformylated glycans whereas black masses correspond to formylated glycans. C, Density and zoomed delta mass plot of the C. fetus fetus glycan masses 1243.507 Da and 1202.481 Da. D, Comparison of the unique glycopeptide sequences and glycoproteins observed between open and focused searches across C. fetus fetus data sets.
Fig. 2Open searching analysis of A, A. baumannii glycopeptide delta mass plot of 0.001 Da increments showing the detection of PSMs modified with masses over 800 Da. B, Zoomed view of A. baumannii glycopeptide delta mass plot highlighting the most numerous observed modifications. C, Comparison of unique glycopeptide sequences and glycoproteins observed between open and focused searches across A. baumannii data sets D, Glycan mass plot showing the amount of glycan (in Da) observed on glycopeptides PSMs within the focused searches. E, Venn diagram showing the number of unique glycopeptide sequences grouped based on the number of glycans observed on these peptides.
Fig. 3Open searching analysis of A, B. Cenocepacia glycopeptide delta mass plot of 0.001 Da increments showing the detection of PSMs modified with masses over 500 Da. Highlighted area shown in zoomed panel. B, Comparison of the unique glycopeptide sequences and glycoproteins observed between open and focused searches across B. Cenocepacia data sets C, Glycan mass plot showing the amount of glycan (in Da) observed on glycopeptides PSMs within the focused searches. Nearly 40% of all glycopeptide PSMs are decorated with two or more glycans.
Fig. 4Comparison of Burkholderia glycoproteomes using open searching.A, Representative delta mass plots of four out of the eight Burkholderia strains examined demonstrating the 568 Da and 668 Da glycans are frequently identified delta masses in Burkholderia glycopeptide enrichments. Formylated glycans are denoted in black whereas Burkholderia O-linked glycans are in red. B, Pearson correlation and clustering analysis of delta mass plots enable the comparison and grouping of samples.
Fig. 5Identification of minor glycoforms within A, Delta mass plot, binned by 0.001 Da increments, showing delta masses observed for peptide sequences also modified with the 568 or 668D glycans. B, MS/MS analysis (FTMS-HCD, ITMS-CID and FTMS-EThcD) supporting the assignment of a linear glycan of HexNAc-Heptose-Heptose-188-215 attached to the peptide KAATAAPADAASQ. C, Glycan mass plot showing the amount of glycan (in Da) observed on glycopeptides PSMs within focused searches. Only ∼6% of all PSMs observed are modified with the 990 Da glycan.