| Literature DB >> 32743578 |
Peng Zhao1, Jeremy L Praissman1, Oliver C Grant1, Yongfei Cai2, Tianshu Xiao2, Katelyn E Rosenbalm1, Kazuhiro Aoki1, Benjamin P Kellman3, Robert Bridger1, Dan H Barouch4, Melinda A Brindley5, Nathan E Lewis3,6, Michael Tiemeyer1, Bing Chen2, Robert J Woods1, Lance Wells1.
Abstract
The currentEntities:
Keywords: 3D-modeling; ACE2; COVID-19; SARS-CoV-2; Spike protein; coronavirus; glycoprotein; glycosylation; mass spectrometry; molecular dynamics
Year: 2020 PMID: 32743578 PMCID: PMC7386495 DOI: 10.1101/2020.06.25.172403
Source DB: PubMed Journal: bioRxiv
Figure 1.Expression and Characterization of SARS-CoV-2 Spike Glycoprotein Trimer Immunogen and Soluble Human ACE2.
A) Sequences of SARS-CoV-2 S immunogen and soluble human ACE2. The N-terminal pyroglutamines for both mature protein monomers are bolded, underlined, and shown in green. The canonical N-linked glycosylation sequons are bolded, underlined, and shown in red. Negative stain electron microscopy of the purified trimer (B) and Coomassie G-250 stained reducing SDS-PAGE gels (C) confirmed purity of the SARS-CoV-2 S protein trimer and of the soluble human ACE2. MWM = molecular weight markers. D) A representative Step-HCD fragmentation spectrum from mass spectrometry analysis of a tryptic digest of S annotated manually based on search results from pGlyco 2.2. This spectrum defines the N-terminus of the mature protein monomer as (pyro-)glutamine 0014. A representative N-glycan consistent with this annotation and our glycomics data (Fig. 2) is overlaid using the Symbol Nomenclature For Glycans (SNFG) code. This complex glycan occurs at N0017. Note, that as expected, the cysteine is carbamidomethylated and the mass accuracy of the assigned peptide is 0.98 ppm. On the sequence of the N-terminal peptide and in the spectrum, the assigned b (blue) and y (red) ions are shown. In the spectrum, purple highlights glycan oxonium ions and green marks intact peptide fragment ions with various partial glycan sequences still attached. Note that the green-labeled ions allow for limited topology to be extracted including defining that the fucose is on the core and not the antennae of the glycopeptide.
Figure 2.Glycomics Informed Glycoproteomics Reveals Substantial Site-Specific Microheterogeneity of N-linked Glycosylation on SARS-CoV-2 S.
A) Glycans released from SARS-CoV-2 S protein trimer immunogen were permethylated and analyzed by MSn. Structures were assigned, grouped by type and structural features, and prevalence was determined based on ion current. The pie chart shows basic division by broad N-glycan type. The bar graph provides additional detail about the glycans detected. The most abundant structure with a unique categorization by glycomics for each N-glycan type in the pie chart, or above each feature category in the bar graph, is indicated. B – E) Glycopeptides were prepared from SARS-CoV-2 S protein trimer immunogen using multiple combinations of proteases, analyzed by LC-MSn, and the resulting data was searched using several different software packages. Four representative sites of N-linked glycosylation with specific features of interest were chosen and are presented here. N0074 (B) and N0149 (C) are shown that occur in variable insert regions of S compared to SARS-CoV and other related coronaviruses, and there are emerging variants of SARS-CoV-2 that disrupt these two sites of glycosylation in S. N0234 (D) contains the most high-mannose N-linked glycans. N0801 (D) is an example of glycosylation in the S2 region of the immunogen and displays a high degree of hybrid glycosylation compared to other sites. The abundance of each composition is graphed in terms of assigned spectral counts. Representative glycans (as determined by glycomics analysis) for several abundant compositions are shown in SNFG format. The abbreviations used here and throughout the manuscript are N for HexNAc, H for Hexose, F for Fucose, A for Neu5Ac, and S for Sulfation. Note that the graphs for the other 18 sites and other graphs grouping the microheterogeneity observed by other properties are presented in Supplemental Information.
Figure 3.SARS-CoV-2 S Immunogen N-glycan Sites are Predominantly Modified by Complex N-glycans.
N-glycan topologies were assigned to all 22 sites of the S protomer and the spectral counts for each of the 3 types of N-glycans (high-mannose, hybrid, and complex) as well as the unoccupied peptide spectral match counts at each site were summed and visualized as pie charts. Note that only N1173 and N1194 show an appreciable amount of the unoccupied amino acid.
Figure 4.3D Structural Modeling of Glycosylated SARS-CoV-2 Spike Trimer Immunogen Reveals Predictions for Antigen Accessibility and Other Key Features.
Results from glycomics and glycoproteomics experiments were combined with results from bioinformatics analyses and used to model several versions of glycosylated SARS-CoV-2 S trimer immunogen. A) Sequence of the SARS-CoV-2 S immunogen displaying computed antigen accessibility and other information. Antigen accessibility is indicated by red shading across the amino acid sequence. B) Emerging variants confirmed by independent sequencing experiments were analyzed based on the 3D structure of SARS-CoV-2 S to generate a proximity chart to the determined N-linked glycosylation sites. C) SARS-CoV-2 S trimer immunogen model from MD simulation displaying abundance glycoforms and antigen accessibility shaded in red for most accessible, white for partial, and black for inaccessible (see Supplemental movie A). D) SARS-CoV-2 S trimer immunogen model from MD simulation displaying oxford class glycoforms and sequence variants. * indicates not visible while the box represents 3 amino acid variants that are clustered together in 3D space. E) SARS-CoV-2 S trimer immunogen model from MD simulation displaying processed glycoforms plus shading of Thr-323 that has O-glycoslyation at low stoichiometry in yellow.
Figure 5:Glycomics Informed Glycoproteomics of Soluble Human ACE2 Reveals High Occupancy, Complex N-linked Glycosylation.
A) Glycans released from soluble, purified ACE2 were permethylated and analyzed by MSn. Structures were assigned, grouped by type and structural features, and prevalence was determined based on ion current. The pie chart shows basic division by broad N-glycan type. The bar graph provides additional detail about the glycans detected. The most abundant structure with a unique categorization by glycomics for each N-glycan type in the pie chart, or above each feature category in the bar graph, is indicated. B – G) Glycopeptides were prepared from soluble human ACE2 using multiple combinations of proteases, analyzed by LC-MSn, and the resulting data was searched using several different software packages. All six sites of N-linked glycosylation are presented here. Displayed in the bar graphs are the individual compositions observed graphed in terms of assigned spectral counts. Representative glycans (as determined by glycomics analysis) for several abundant compositions are shown in SNFG format. The abbreviations used here and throughout the manuscript are N for HexNAc, H for Hexose, F for Fucose, and A for Neu5Ac. The pie chart (analogous to Figure 3 for SARS-CoV-2 S) for each site is displayed in the upper corner of each panel. B) N053. C) N090. D) N103. E) N322. F) N432. G) N546, a site that does not exist in 3 in 10,000 people.
Figure 6:3D Structural Modeling of Glycosylated Soluble Human ACE2.
Results from glycomics and glycoproteomics experiments were combined with results from bioinformatics analyses and used to model several versions of glycosylated soluble human ACE2. A) Soluble human ACE2 model from MD simulations displaying abundance glycoforms, interaction surface with S, and sequence variants. N546 variant is boxed that would remove N-linked glycosylation at that site (see Supplemental movie B). B) Soluble human ACE2 model from MD simulations displaying processed glycoforms and interaction surface with S.
Figure 7:Interactions of Glycosylated Soluble Human ACE2 and Glycosylated SARS-CoV-2 S Trimer Immunogen Revealed By 3D-Structural Modeling and Molecular Dynamics Simulations.
A) Molecular dynamics simulation of glycosylated soluble human ACE2 and glycosylated SARS-CoV-2 S trimer immunogen interaction (see Supplemental simulations 1–3). ACE2 (top) is colored red with glycans in pink while S is colored white with glycans in dark grey. Highlighted are ACE2 glycans that interacts with S that are zoomed in on to the right. B) Zoom in of ACE2-S interface highlighting ACE2 glycan interactions using 3D-SNFG icons (60) with S protein (pink) as well as ACE2-S glycan-glycan interactions. C) Zoom in of dynamics trajectory of glycans at the interface of soluble human ACE2 and S (see Supplemental movies C and D).
KEY RESOURCES TABLE
| REAGENT or RESOURCE | SOURCE | IDENTIFIER |
|---|---|---|
| Chemicals, Peptides, and Recombinant Proteins | ||
| SARS-CoV-2 S protein | This Study | N/A |
| Human ACE2 protein | This Study | N/A |
| 2x Laemmli sample buffer | Bio-Rad | Cat#161–0737 |
| Invitrogen NuPAGE 4 to 12%, Bis-Tris, Mini Protein Gel | Thermo Fisher Scientific | Cat#NP0321PK2 |
| Coomassie Brilliant Blue G-250 Dye | Thermo Fisher Scientific | Cat#20279 |
| Dithiothreitol | Sigma Aldrich | Cat#43815 |
| Iodoacetamide | Sigma Aldrich | Cat#I1149 |
| Trypsin | Promega | Cat#V5111 |
| Lys-C | Promega | Cat#V1671 |
| Arg-C | Promega | Cat#V1881 |
| Glu-C | Promega | Cat#V1651 |
| Asp-N | Promega | Cat#VA1160 |
| Endoglycosidase H | Promega | Cat#V4871 |
| PNGaseF | Promega | Cat#V4831 |
| Chymotrypsin | Athens Research and Technology | Cat#16-19-030820 |
| Alpha lytic protease | New England BioLabs | Cat#P8113 |
| 18O water | Cambridge Isotope Laboratories | OLM-782-10-1 |
| O-protease OpeRATOR | Genovis | Cat#G1-OP1–020 |
| Deposited Data | ||
| MS data for site-specific N-linked glycopeptides for SARS-Cov-2 S and human ACE2 | This Study | PXD019937 |
| MS data for site-specific O-linked glycopeptides for SARS-Cov-2 S and human ACE2 | This Study | PXD019940 |
| MS data for deglycosylated N-linked glycopeptides for SARS-Cov-2 S and human ACE2 | This Study | PXD019938 |
| MS data for disulfide bond analysis for SARS-Cov-2 S | This Study | PXD019939 |
| MS data for N-linked glycomics deposited at GlycoPost | This Study | GPST000120 |
| MS data for O-linked glycomics deposited at GlycoPost | This Study | GPST000121 |
| Experimental Models: Cell Lines | ||
| 293-F Cells | Gibco | Cat#R79007 |
| Experimental Models: Organisms/Strains | ||
| VSV(G)-Pseudoviruses | This Study | |
| Software and Algorithms | ||
| pGlyco | ( | v2.2.2 |
| Proteome Discoverer | Thermo Fisher Scientific | v1.4 |
| Byonic | Protein Metrics Inc. ( | v3.8.13 |
| ProteoIQ | Premier Biosoft | v2.7 |
| GRITS Toolbox | v1.1 | |
| EMBOSS needle | ( | v6.6.0 |
| Biopython | v1.76 | |
| Rpdb | v2.3 | |
| SignalP | v5.0 | |
| LibreOFFICE Writer | The Document Foundation | V6.4.4.2 |
| GlyGen | V1.5 | |
| GNOme | V1.5.5 | |
| GlyTouCan | V3.1.0 | |
| Inkscape | Inkscape Developers | V1.0 |