Literature DB >> 35511693

Genetic and Structural Analysis of SARS-CoV-2 Spike Protein for Universal Epitope Selection.

Christopher Markosian1,2, Daniela I Staquicini1,2, Prashant Dogra3,4, Esteban Dodero-Rojas5, Joseph H Lubin6, Fenny H F Tang1,2, Tracey L Smith1,2, Vinícius G Contessoto5, Steven K Libutti7,8, Zhihui Wang3,4, Vittorio Cristini3,9,10,11, Sagar D Khare6,7,12, Paul C Whitford13, Stephen K Burley6,7,12,14,15, José N Onuchic5,16,17,18, Renata Pasqualini1,2, Wadih Arap1,19.   

Abstract

Evaluation of immunogenic epitopes for universal vaccine development in the face of ongoing SARS-CoV-2 evolution remains a challenge. Herein, we investigate the genetic and structural conservation of an immunogenically relevant epitope (C662-C671) of spike (S) protein across SARS-CoV-2 variants to determine its potential utility as a broad-spectrum vaccine candidate against coronavirus diseases. Comparative sequence analysis, structural assessment, and molecular dynamics simulations of C662-C671 epitope were performed. Mathematical tools were employed to determine its mutational cost. We found that the amino acid sequence of C662-C671 epitope is entirely conserved across the observed major variants of SARS-CoV-2 in addition to SARS-CoV. Its conformation and accessibility are predicted to be conserved, even in the highly mutated Omicron variant. Costly mutational rate in the context of energy expenditure in genome replication and translation can explain this strict conservation. These observations may herald an approach to developing vaccine candidates for universal protection against emergent variants of coronavirus.
© The Author(s) 2022. Published by Oxford University Press on behalf of Society for Molecular Biology and Evolution.

Entities:  

Keywords:  COVID-19; epitope; variants

Mesh:

Substances:

Year:  2022        PMID: 35511693      PMCID: PMC9129195          DOI: 10.1093/molbev/msac091

Source DB:  PubMed          Journal:  Mol Biol Evol        ISSN: 0737-4038            Impact factor:   8.800


Introduction

The ongoing Coronavirus disease 2019 (COVID-19) global pandemic continues to pose an unprecedented health threat to humankind and the potential for evolution of more infectious and/or virulent SARS-CoV-2 variants with resistance to vaccines remains a considerable concern. Within 1 year after SARS-CoV-2 first emerged clinically, the United States Food & Drug Administration (FDA) began issuing Emergency Use Authorization (EUA) for COVID-19 vaccines for the general population. Less than 1 year out from their initial deployment, we began experiencing two setbacks in the fight against the virus. First, the Delta and Omicron “Variants of Concern” (VOCs) have proven to be more transmissible and more virulent than other previously described variants. Second, immunological protection unfortunately appears to wane in the months following either initial infection or full vaccination. Although the FDA granted full approval for the Moderna and Pfizer-BioNTech vaccines, COVID-19 vaccines in general have been reported to have decreased effectiveness against Delta and Omicron variant infection and symptomatic illness (Fowlkes et al. 2021; Andrews et al. 2022). The spike (S) protein, which is one of the main structural proteins of SARS-CoV-2 (Cao et al. 2021), is the primary target of neutralizing antibodies generated following infection (Liu et al. 2020) and has formed the basis of nearly all first-generation COVID-19 vaccines (Dai and Gao 2021; Jalkanen et al. 2021). A recent analysis of more than 300,000 viral genomes suggested that mutations within the S protein represent one of the main pathways of adaptive evolution in SARS-CoV-2 (Rochman et al. 2021), thereby raising concerns about the potential for resistance to neutralizing antibodies acquired through either vaccination or previous COVID-19 infection (Chen et al. 2021; Wang et al. 2021; Zhou and Wang, 2021). Despite recent developments supporting the concept of “inescapable” monoclonal antibodies with broad activity against multiple variants of SARS-CoV-2 (Starr et al. 2021) and the FDA-approved antiviral drug, remdesivir, for treatment of infected patients, the ongoing genetic evolution of the virus continues to raise uncertainty regarding the possibility of new viral mutations that allow immune escape. Both the World Health Organization (WHO, 2022) and the United States Centers for Disease Control & Prevention (CDC, 2021) actively monitor new variants; by definition, the ones with greater numbers of mutations and high transmissibility have been labeled VOCs (Chen et al. 2021; Harvey et al. 2021). In the face of the highly transmissible Delta and Omicron VOCs, the FDA now recommends an additional dose (booster) of vaccine for all eligible individuals at least 6 months post full vaccination. The persistence of the COVID-19 pandemic strongly re-emphasizes the need for adaptable strategies for immunization that enhance efficacy and broaden the spectrum of protection beyond those elicited by the vaccines currently in clinical use and under investigational research and development. In previous work, we have established alternative vaccine strategies based on targeted pulmonary immunization with ligand-directed phage constructs (Staquicini, Barbu, et al. 2021). Phage particles are generally harmless to humans, but induce a potent nonspecific humoral response; they are inexpensively produced and distributed at an industrial scale with no temperature-controlled supply cold-chain requirement (Staquicini, Tang, et al. 2021), which represents one of the major impediments to moving first-generation COVID-19 vaccines into resource-limited settings. To drive the phage-based vaccine concept towards clinical translation, phage particles were engineered to display a highly immunogenic cyclic decapeptide epitope of the SARS-CoV-2 S protein (residues C662–C671, sequence CDIPIGAGIC) (hereafter referred to as the C662–C671 epitope) fused with a lung-directed motif that facilitates uptake and distribution after targeted aerosol delivery (Staquicini, Barbu, et al. 2021; Staquicini, Tang, et al. 2021). The cyclization of this epitope within the native full-length S protein is enabled by a disulfide bridge between the flanking cysteine residues. The C662–C671 epitope closely recapitulates its native cyclic conformation with the same disulfide bridge when displayed on the surface of the phage and elicits a strong and specific antibody response in mice. This experimental evidence strongly suggests that such a strategy fulfills the functional attributes for induction of a broad immune response to SARS-CoV-2 (Staquicini, Tang, et al. 2021). In order to assess the utility of our approach against existing and/or new variants of SARS-CoV-2, we herein investigated the C662–C671 epitope further by serially performing comparative sequence, structural, simulation, and computational analyses in the context of the emerging mutational impact of the five current and former VOCs.

Results

C662–C671 Sequence is Conserved across Major Variants

To examine the nature of mutational selective pressure on SARS-CoV-2, we retrieved S protein sequences from 14 major variants that have arisen throughout the pandemic (supplementary table S1, Supplementary Material online). We observed that although these variants together harbor substitutions or deletions at 86 different amino acid positions distributed throughout the 1,273-residue protein, the C662–C671 epitope is strictly conserved (supplementary fig. S1, Supplementary Material online), an attribute that could be exploited towards the development of a broad-spectrum, possibly universal, COVID-19 vaccine. Of note, mutations in the current and former VOCs (Alpha, Beta, Gamma, Delta, and Omicron) are primarily clustered in the N-terminal domain (NTD) and the receptor-binding domain (RBD) of S protein (fig. 1). Based on the antibody repertoires identified in the plasma of SARS-CoV-2-infected individuals, both RBD and NTD are the targets of various neutralizing antibodies, with RBD being immunodominant (Harvey et al. 2021). Moreover, selection pressure on the virus from neutralizing antibodies generated via natural infection, vaccines, or antibody-based therapies is also consistent with the rapid rise in mutations within the RBD, which is highly tolerant to immune-evading protein changes based on evolutionary modeling (Van Egeren et al. 2021). The C662–C671 epitope, which is located outside the NTD and the RBD, has also been shown to be targeted by neutralizing antibodies (Farrera-Soler et al. 2020; Li et al. 2020; Heffron et al. 2021). While studies with patient convalescent serum samples suggest that antigen escape from neutralizing antibodies likely drives the rise and dominance of new variants (Harvey et al. 2021), conserved epitopes across different strains, such as C662–C671, may help guide the development of a vaccine candidate against different SARS-CoV-2 variants. Moreover, such mutations within C662–C671 were only identified in 1,852 instances of 9,205,762 sequenced (a total of ∼0.02% for the 10-residue epitope) SARS-CoV-2 isolates worldwide as of March 22, 2022 according to COVID-19 virusMutation Tracker (Alam et al. 2021). It is also remarkable that the C662–C671 epitope shares 100% identity with the S protein of the previous zoonotic coronavirus (SARS-CoV) that caused the original SARS outbreak in 2003 and partial homology with the corresponding S protein of the Middle East Respiratory Syndrome coronavirus from 2012 (supplementary fig. S1, Supplementary Material online), indicating the potential role of this epitope for immunization against not only SARS-CoV-2, but also other pathogenic coronaviruses. Collectively, these empirical results suggest that C662–C671 epitope might represent a privileged motif for vaccine development and perhaps even a potential target for inescapable antibodies (Starr et al. 2021).
Fig. 1.

Sequence and structural conservation of the C662–C671 epitope of SARS-CoV-2 S protein across variants. (A) Domain representation of the residue sites affected by missense mutations or deletions present throughout the five current and former VOCs (Alpha, Beta, Gamma, Delta, and Omicron) (magenta) and the unaffected C662–C671 epitope (cyan). Amino acid insertions (i.e., ins214EPE for Omicron) are not represented. The two arrows signify cleavage sites. NTD, N-terminal domain; RBD, receptor-binding domain; FP, fusion peptide; HR, heptapeptide repeat sequence; TM, transmembrane; CT, cytoplasmic tail. (B) Atomic structural representation of one monomer (dim gray) showing all residue sites affected by VOC mutations or deletions (magenta) and the C662–C671 epitope (cyan). SARS-CoV-2 particle created with BioRender.com. (C) The closed- and open-state (one RBD up) S protein structures of each VOC were simulated to assess the impact of mutations on the conformation of the C662–C671 epitope. The structural model of the Delta VOC that was used for simulations is shown on the left as a representative example. Shown (right) are the probability distributions of the spatial RMSD values (with respect to the wild-type strain conformation) of the C662–C671 epitope backbone atoms in each variant, calculated from 500-ns explicit-solvent simulations without glycans. (D) SASA of C662–C671 of each chain in the context of the trimeric S protein structure. Top: closed conformation; bottom: one-open (chain A) conformation. Bars represent the average SASA computed for the five lowest-energy models out of 100 generated for each variant for each conformation. Error bars indicate standard deviation. (E) Mutation effects analysis of the C662–C671 epitope, RBD, and NTD presented as a heatmap of the combined average costs of single nucleotide and amino acid synthesis for the three sequences of interest. For RBD and NTD, 100 sub-sequences of 30-nucleotide length were sampled from the wild-type sequences to generate mutants. The color legend denotes the total cost of nucleotide and amino acid production for mutants normalized to the cost of wild-type production. Each bar of the heatmap represents a single mutant.

Sequence and structural conservation of the C662–C671 epitope of SARS-CoV-2 S protein across variants. (A) Domain representation of the residue sites affected by missense mutations or deletions present throughout the five current and former VOCs (Alpha, Beta, Gamma, Delta, and Omicron) (magenta) and the unaffected C662–C671 epitope (cyan). Amino acid insertions (i.e., ins214EPE for Omicron) are not represented. The two arrows signify cleavage sites. NTD, N-terminal domain; RBD, receptor-binding domain; FP, fusion peptide; HR, heptapeptide repeat sequence; TM, transmembrane; CT, cytoplasmic tail. (B) Atomic structural representation of one monomer (dim gray) showing all residue sites affected by VOC mutations or deletions (magenta) and the C662–C671 epitope (cyan). SARS-CoV-2 particle created with BioRender.com. (C) The closed- and open-state (one RBD up) S protein structures of each VOC were simulated to assess the impact of mutations on the conformation of the C662–C671 epitope. The structural model of the Delta VOC that was used for simulations is shown on the left as a representative example. Shown (right) are the probability distributions of the spatial RMSD values (with respect to the wild-type strain conformation) of the C662–C671 epitope backbone atoms in each variant, calculated from 500-ns explicit-solvent simulations without glycans. (D) SASA of C662–C671 of each chain in the context of the trimeric S protein structure. Top: closed conformation; bottom: one-open (chain A) conformation. Bars represent the average SASA computed for the five lowest-energy models out of 100 generated for each variant for each conformation. Error bars indicate standard deviation. (E) Mutation effects analysis of the C662–C671 epitope, RBD, and NTD presented as a heatmap of the combined average costs of single nucleotide and amino acid synthesis for the three sequences of interest. For RBD and NTD, 100 sub-sequences of 30-nucleotide length were sampled from the wild-type sequences to generate mutants. The color legend denotes the total cost of nucleotide and amino acid production for mutants normalized to the cost of wild-type production. Each bar of the heatmap represents a single mutant.

Epitope Conformation and Accessibility are Predicted to be Stable Despite Adjacent Mutations

To gain further insight into the structural properties of C662–C671, we performed comparative structural analysis (fig. 1) and molecular dynamics simulations (fig. 1 and supplementary figs. S2–S6, Supplementary Material online). Prior simulations suggest that the disulfide bridge-enabled cyclization acts as a structural constraint that enables the short peptide to retain a conformation with minimal variation (Staquicini, Tang, et al. 2021). In addition, all-atom explicit-solvent simulations of the S protein indicate that mutations associated with VOCs do not impact the structural properties of this epitope. For each individual variant, 500 ns simulations were performed, and the spatial root-mean-square deviation (RMSD) of the C662–C671 epitope was calculated, relative to its configuration in the closed- and open-state structures of the wild-type sequence (fig. 1). For each variant, the probability distribution as a function of RMSD (backbone atoms) was very similar and peaked around 0.35 Å. RMSD values that included the side-chain atoms were also very low and centered around 0.4–0.6 Å (supplementary fig. S2, Supplementary Material online). These observations strongly suggest that this region remains conformationally stable despite the presence of numerous mutations acquired elsewhere in the protein during the global COVID-19 pandemic. Further, this observation appears independent of the glycosylation state of the S protein, though the sampled RMSD values are slightly smaller when glycans were included (supplementary figs. S3–S5, Supplementary Material online). The C662–C671 epitope is partially located on the solvent-exposed surface in both the closed- and open-state S protein (Staquicini, Tang, et al. 2021), a central structural property for vaccine development, because antibody binding requires access to epitopes. To further evaluate this empirical observation, we performed solvent accessibility surface area (SASA) calculations of the C662–C671 epitope for the wild-type strain and major variants. We demonstrate that opening of the RBD of one protomer (chain A) results in a marked increase in epitope accessibility of a separate protomer (chain C); furthermore, S protein mutants of the current and former VOCs do not display major changes in SASA patterns for each protomer compared to the wild-type strain (fig. 1 and supplementary table S2, Supplementary Material online), revealing that epitope accessibility is likely not affected.

High Mutational Cost Supports Epitope Conservation

To understand the mechanistic basis of the relatively low mutational rate of the C662–C671 epitope in comparison to the RBD and NTD, we have reasoned that mutations within the two latter protein domains—in addition to providing a functional advantage to the virus—may also be energetically more favorable with respect to the cost of genome replication and translation of mutant sequences. Based on the observation that genome replication and translation combined are the most expensive processes of virus production for the host cells, energy limitation could induce selection pressure and genetic drift on newly incorporated elements in viral genomes (Mahmoudabadi et al. 2017). Thus, the incorporation of “lower-cost” nucleotides or amino acid residues [based on adenosine triphosphate (ATP) utilization] resulting in energy conservation could be advantageous for SARS-CoV-2 (Chen et al. 2016). To test this possibility, we used the Monte Carlo technique to generate mutants of the C662–C671 epitope, RBD, and NTD and to calculate the cost of synthesis of nucleotides and amino acids relative to the wild-type sequences of these three regions (Supplementary material online) (Dogra et al. 2020). We have found that the evolutionary costs of production of C662–C671 mutants (per nucleotide and per residue combined) are significantly higher than those of RBD and NTD mutants (P < 0.0001; one-way ANOVA and Tukey’s test) (fig. 1 and supplementary fig. S7, Supplementary Material online). At specific nucleotide positions, point mutations can cause up to a 4% increase in the relative cost of synthesis in the C662–C671 epitope, unlike in the RBD or NTD (supplementary fig. S7, Supplementary Material online). This value is based on the relative cost of synthesis of individual nucleotides in the mutants compared to the wild-type sequence. While these increments in the relative cost may seem marginal, when scaled to the infection level, the total cost of synthesis of millions of copies of such mutated sequences can be quite large. Overall, this result indicates that point mutations in the C662–C671 epitope may be evolutionarily costly for the virus compared to mutations in the RBD and NTD, thereby making the former less favorable for natural selection. Such empirical analysis, while admittedly lacking an intimate description of molecular mechanisms involved in virus production, might have captured an essential phenomenon in the natural selection of mutations.

Discussion

The rapid evolution of the SARS-CoV-2 genome (Harvey et al. 2021) highlights the need for developing a single vaccine (or a small number of vaccines) with a broad, ideally universal, spectrum of activity against viral variants. To achieve these desirable attributes, a candidate vaccine antigen should be: (1) immunogenic, (2) highly conserved (i.e., free of non-conserved missense mutations and/or single-residue deletions), (3) capable of recapitulating its endogenous conformation when displayed in heterologous contexts, and (4) surface solvent-exposed (either partially or fully) and thus suitable for recognition by antibodies and other ligands from either the humoral or cellular immune response. Several recent studies with different strategies such as artificial intelligence/machine learning (AI/ML) (Malone et al. 2020), phage display-based immunoprecipitation and sequencing technology (Shrock et al. 2020), or systematic site-directed mutagenesis (Starr et al. 2020) scanned and predicted mutational hotspot regions for the design of broad-spectrum vaccines, diagnostics, and antibody-based therapies. Notwithstanding insights into epitope mapping, it remains unclear whether these polypeptide chain segments are evolutionary stable and, therefore, suitable to control the spread of emerging COVID-19 variants worldwide. We recently reported that the C662–C671 epitope of SARS-CoV-2 elicits the generation of anti-S protein antibodies when incorporated into an aerosol-delivered targeted phage-based vaccine prototype (Staquicini, Barbu, et al. 2021; Staquicini, Tang, et al. 2021). Here, we refined and integrated structural, biophysical, and mathematical tools to present additional experimental evidence that C662–C671 meets all the desirable criteria of an epitope for a potential broad-spectrum, hopefully universal, vaccine against SARS-CoV-2 variants. Indeed, computational methods, including but not limited to AI/ML-based tools, can facilitate timely insight into the impact of novel viral mutations during the COVID-19 pandemic (Burley et al. 2021). As of March 2022, the Omicron sub-variant BA.2, which also does not contain mutations within the C662–C671 epitope, is rapidly spreading across the globe, emphasizing the importance of our integrated strategies. In the context of phage-based vaccines, such a versatile approach is highly amenable to simple cloning strategies that would allow the display of other exposed and preserved immunogenic epitopes from SARS-CoV-2 or other viral surface glycoproteins. Targeted phage-based vaccines could, therefore, be designed to include admixtures (cocktails) of distinct epitopes of S protein, which may help suppress the development of viral resistance or immune evasion (Baum et al. 2020). Finally, targeted phage-based vaccines are affordable to manufacture and robust in cold-free supply chain conditions, which would be particularly impactful in low-income countries with vaccine shortages and distribution challenges. From a broad perspective, this work provides an experimental framework and rational tools for the discovery and evaluation of SARS-CoV-2 epitopes with potentially universal features along with the possible generation of inescapable monoclonal antibodies as early therapeutics against COVID-19. Click here for additional data file.
  27 in total

1.  Energetic cost of building a virus.

Authors:  Gita Mahmoudabadi; Ron Milo; Rob Phillips
Journal:  Proc Natl Acad Sci U S A       Date:  2017-05-16       Impact factor: 11.205

2.  Potent neutralizing antibodies against multiple epitopes on SARS-CoV-2 spike.

Authors:  Lihong Liu; Pengfei Wang; Manoj S Nair; Jian Yu; Micah Rapp; Qian Wang; Yang Luo; Jasper F-W Chan; Vincent Sahi; Amir Figueroa; Xinzheng V Guo; Gabriele Cerutti; Jude Bimela; Jason Gorman; Tongqing Zhou; Zhiwei Chen; Kwok-Yung Yuen; Peter D Kwong; Joseph G Sodroski; Michael T Yin; Zizhang Sheng; Yaoxing Huang; Lawrence Shapiro; David D Ho
Journal:  Nature       Date:  2020-07-22       Impact factor: 49.962

3.  Predicting Proteome-Scale Protein Structure with Artificial Intelligence.

Authors:  Stephen K Burley; Wadih Arap; Renata Pasqualini
Journal:  N Engl J Med       Date:  2021-12-02       Impact factor: 91.245

4.  Deep Mutational Scanning of SARS-CoV-2 Receptor Binding Domain Reveals Constraints on Folding and ACE2 Binding.

Authors:  Tyler N Starr; Allison J Greaney; Sarah K Hilton; Daniel Ellis; Katharine H D Crawford; Adam S Dingens; Mary Jane Navarro; John E Bowen; M Alejandra Tortorici; Alexandra C Walls; Neil P King; David Veesler; Jesse D Bloom
Journal:  Cell       Date:  2020-08-11       Impact factor: 41.582

5.  Artificial intelligence predicts the immunogenic landscape of SARS-CoV-2 leading to universal blueprints for vaccine designs.

Authors:  Brandon Malone; Boris Simovski; Clément Moliné; Jun Cheng; Marius Gheorghe; Hugues Fontenelle; Ioannis Vardaxis; Simen Tennøe; Jenny-Ann Malmberg; Richard Stratford; Trevor Clancy
Journal:  Sci Rep       Date:  2020-12-23       Impact factor: 4.379

6.  Covid-19 Vaccine Effectiveness against the Omicron (B.1.1.529) Variant.

Authors:  Nick Andrews; Julia Stowe; Freja Kirsebom; Samuel Toffa; Tim Rickeard; Eileen Gallagher; Charlotte Gower; Meaghan Kall; Natalie Groves; Anne-Marie O'Connell; David Simons; Paula B Blomquist; Asad Zaidi; Sophie Nash; Nurin Iwani Binti Abdul Aziz; Simon Thelwall; Gavin Dabrera; Richard Myers; Gayatri Amirthalingam; Saheer Gharbia; Jeffrey C Barrett; Richard Elson; Shamez N Ladhani; Neil Ferguson; Maria Zambon; Colin N J Campbell; Kevin Brown; Susan Hopkins; Meera Chand; Mary Ramsay; Jamie Lopez Bernal
Journal:  N Engl J Med       Date:  2022-03-02       Impact factor: 91.245

Review 7.  SARS-CoV-2 variants, spike mutations and immune escape.

Authors:  William T Harvey; Alessandro M Carabelli; Ben Jackson; Ravindra K Gupta; Emma C Thomson; Ewan M Harrison; Catherine Ludden; Richard Reeve; Andrew Rambaut; Sharon J Peacock; David L Robertson
Journal:  Nat Rev Microbiol       Date:  2021-06-01       Impact factor: 78.297

8.  Antibody cocktail to SARS-CoV-2 spike protein prevents rapid mutational escape seen with individual antibodies.

Authors:  Alina Baum; Benjamin O Fulton; Elzbieta Wloga; Richard Copin; Kristen E Pascal; Vincenzo Russo; Stephanie Giordano; Kathryn Lanza; Nicole Negron; Min Ni; Yi Wei; Gurinder S Atwal; Andrew J Murphy; Neil Stahl; George D Yancopoulos; Christos A Kyratsous
Journal:  Science       Date:  2020-06-15       Impact factor: 47.728

9.  Ongoing global and regional adaptive evolution of SARS-CoV-2.

Authors:  Nash D Rochman; Yuri I Wolf; Guilhem Faure; Pascal Mutz; Feng Zhang; Eugene V Koonin
Journal:  Proc Natl Acad Sci U S A       Date:  2021-07-02       Impact factor: 11.205

10.  Design and proof of concept for targeted phage-based COVID-19 vaccination strategies with a streamlined cold-free supply chain.

Authors:  Daniela I Staquicini; Fenny H F Tang; Christopher Markosian; Virginia J Yao; Fernanda I Staquicini; Esteban Dodero-Rojas; Vinícius G Contessoto; Deodate Davis; Paul O'Brien; Nazia Habib; Tracey L Smith; Natalie Bruiners; Richard L Sidman; Maria L Gennaro; Edmund C Lattime; Steven K Libutti; Paul C Whitford; Stephen K Burley; José N Onuchic; Wadih Arap; Renata Pasqualini
Journal:  Proc Natl Acad Sci U S A       Date:  2021-07-27       Impact factor: 11.205

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.