| Literature DB >> 32835302 |
Mark Yarmarkovich1, John M Warrington1, Alvin Farrel1,2, John M Maris1,3.
Abstract
Here we propose a SARS-CoV-2 vaccine design concept based on identification of highly conserved regions of the viral genome and newly acquired adaptations, both predicted to generate epitopes presented on major histocompatibility complex (MHC) class I and II across the vast majority of the population. We further prioritize genomic regions that generate highly dissimilar peptides from the human proteome and are also predicted to produce B cell epitopes. We propose sixty-five 33-mer peptide sequences, a subset of which can be tested using DNA or mRNA delivery strategies. These include peptides that are contained within evolutionarily divergent regions of the spike protein reported to increase infectivity through increased binding to the ACE2 receptor and within a newly evolved furin cleavage site thought to increase membrane fusion. Validation and implementation of this vaccine concept could specifically target specific vulnerabilities of SARS-CoV-2 and should engage a robust adaptive immune response in the vast majority of the population.Entities:
Keywords: COVID-19; DNA vaccine; HLA; MHC; RNA vaccine; SARS-CoV-2; coronavirus; epitope; immunogenicity; vaccine
Year: 2020 PMID: 32835302 PMCID: PMC7276303 DOI: 10.1016/j.xcrm.2020.100036
Source DB: PubMed Journal: Cell Rep Med ISSN: 2666-3791
Figure 1Epitope Scoring along SARS-CoV-2 Proteome
(A) HLA presentation of 33-mers across viral proteome. Representation of MHC class I presentation (top) and MHC class II presentation (bottom) reported as frequency of the population predicted to present peptides derived from each region of the viral proteome.
(B) Scoring of each epitope derived from the 33-mers along the length of the proteome as compared with the epitopes derived from the normal human proteome presented across 84 HLA alleles, reported as normalized scores in which the highest scoring epitopes are maximally dissimilar to self-peptides derived from normal proteins (top). Scoring for genomic conservation against 15 cross-species coronaviruses and 727 human sequences, with highest scoring regions conserved across human and other mammalian coronaviruses (bottom).
(C) Combined epitope score reported as sum of four above parameters (local maximum for epitopes with 90th percentile total score in red).
(D) Scoring of B cell epitopes for each amino acid for linear epitopes for Spike, Envelope, and Matrix proteins (top) and conformational epitopes in Spike protein (bottom).
(E) Combined scoring of 33-mer epitopes as described in (D).
(F) Combined B and T cell epitope scoring in Spike, Envelope, and Matrix proteins. Receptor binding domain epitope highlighted with red arrow and epitope containing furin cleavage site highlighted with blue arrow (Figure 2).
Figure 2Proposed Vaccine Epitopes in SARS-CoV-2 Spike Protein
Crystal structure of SARS-CoV-2 Spike protein trimer (PDB: 6VYB) with two highlighted vaccine epitopes targeting newly evolved acquired viral vulnerabilities. First, SARS-CoV-2 receptor binding domain (cyan) has up to 10-fold higher affinity binding to the ACE2 receptor as compared with previous coronaviruses. Using our analysis, we identify a high-ranking vaccine epitope (red) within the receptor binding domain. Second, SARS-CoV-2 has acquired a novel furin cleavage site RRAR, along for increased infectivity due to improved membrane fusion (epitope containing the novel furin cleavage site highlighted in orange).
Sample of Highest Scoring Viral Epitopes Suggested for Vaccination Based on MHC Class I Population-Scale Presentation, MHC Class II Population Presentation, Similarity Score, and Homology Score across 15 Mammal Species and 727 Human SARS-CoV-2 Gene Sequences
| Gene Position | Epitope | HLA Class I Population Presentation | HLA Class I Alleles Bound | HLA Class I Binders | HLA Class II Population Presentation | HLA Class II Alleles Bound | HLA Class II Binders | Dissimilarity Score | Conservation Score | Combined T Cell Score | B Cell Total Score | B and T Cell Total Percentile |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| ORF1ab_3619 | IAMSAFAMMFV | 98.6% | 74 | HLA-A: | 82.1% | 24 | HLA-DRB1: | 0.82 | 0.96 | 3.59 | N/A | N/A |
| S_129 | KVCEFQFCNDP | 98.5% | 58 | HLA-A: | 39.0% | 9 | HLA-DRB1: | 0.60 | 0.83 | 2.80 | 1.14 | 91% |
| S_252 | GDSSSGWTAG | 95.7% | 53 | HLA-A: | 68.9% | 15 | HLA-DRB1: | 0.48 | 0.71 | 2.84 | 0.76 | 81% |
| S_462 | KPFERDISTEIYQ | 74.8% | 27 | A: | 18.7% | 5 | DRB1: | 0.51 | 0.77 | 2.21 | 1.29 | 75.2% |
| N_305 | AQFAPSASAFFG | 87.5% | 40 | HLA-A: | 6.4% | 1 | DRB1: | 0.46 | 0.91 | 2.31 | N/A | N/A |
Columns represent gene and position of first amino acid of 33-mer, number of HLA class I and II alleles predicted to bind at least one predicted epitope within 33-mer, list of bound alleles, the proportion of the population predicted to have at least one of these HLAs, normalized dissimilarity scores, normalized conservation scores, across the 33-mer, total T cell score, B cell score, and combined B and T cell percentile for 33-mers. Table includes S_462 in S protein containing novel receptor binding sites and N_305 containing five peptides shown to be immunogenic in IEDB. N/A, not applicable.
| REAGENT or RESOURCE | SOURCE | IDENTIFIER |
|---|---|---|
| SARS-CoV-2 Immunogenicity Map | This manuscript | |
| Vaccine constructs | This manuscript | N/A |
| netMHC 4.0 | Andreatta and Nielsen | |
| netMHCII 2.3 | Jensen et al. | |
| BepiPred 2.0 | Jespersen et al. | |
| DiscoTope 2.0 | Kringelum et al. | |
| shinyNAP | Yarmarkovich et al. | |
| Clustal Omega | Sievers et al. | |