| Literature DB >> 32717854 |
Vipin Ranga1, Erik Niemelä2,3, Mahlet Z Tamirat1, John E Eriksson2,3, Tomi T Airenne1, Mark S Johnson1.
Abstract
The emergence of the COVID-19 outbreak at the end of 2019, caused by the novel coronavirus SARS-CoV-2, has, to date, led to over 13.6 million infections and nearly 600,000 deaths. Consequently, there is an urgent need to better understand the molecular factors triggering immune defense against the virus and to develop countermeasures to hinder its spread. Using in silico analyses, we showed that human major histocompatibility complex (MHC) class I cell-surface molecules vary in their capacity for binding different SARS-CoV-2-derived epitopes, i.e., short sequences of 8-11 amino acids, and pinpointed five specific SARS-CoV-2 epitopes that are likely to be presented to cytotoxic T-cells and hence activate immune responses. The identified epitopes, each one of nine amino acids, have high sequence similarity to the equivalent epitopes of SARS-CoV virus, which are known to elicit an effective T cell response in vitro. Moreover, we give a structural explanation for the binding of SARS-CoV-2-epitopes to MHC molecules. Our data can help us to better understand the differences in outcomes of COVID-19 patients and may aid the development of vaccines against SARS-CoV-2 and possible future outbreaks of novel coronaviruses.Entities:
Keywords: COVID-19; HLA; MHC class I epitopes; SARS-CoV; SARS-CoV-2; antigen presentation; homology modeling; immunoinformatics; in silico analysis; molecular dynamics simulations; structural biology; vaccine development; viral peptides
Year: 2020 PMID: 32717854 PMCID: PMC7564651 DOI: 10.3390/vaccines8030408
Source DB: PubMed Journal: Vaccines (Basel) ISSN: 2076-393X
SARS-CoV-2 proteins analyzed to predict major histocompatibility complex (MHC) class I binding epitopes.
| Protein Name | Length (aa) | NCBI RefSeq Accession ID |
|---|---|---|
| nsp1 | 180 | YP_009725297.1 |
| nsp2 | 638 | YP_009725298.1 |
| nsp3 | 1945 | YP_009725299.1 |
| nsp4 | 500 | YP_009725300.1 |
| 3C-like proteinase (3CLpro) | 306 | YP_009725301.1 |
| nsp6 | 290 | YP_009725302.1 |
| nsp7 | 83 | YP_009725303.1 |
| nsp8 | 198 | YP_009725304.1 |
| nsp9 | 113 | YP_009725305.1 |
| nsp10 | 139 | YP_009725306.1 |
| nsp11 | 13 | YP_009725312.1 |
| RNA-dependent RNA polymerase (RdRp) | 932 | YP_009725307.1 |
| Helicase | 601 | YP_009725308.1 |
| 3′-to-5′ exonuclease (35EXO) | 527 | YP_009725309.1 |
| Endo RNAse (EndoR) | 346 | YP_009725310.1 |
| 2′-O-ribose methyltransferase | 298 | YP_009725311.1 |
| Surface glycoprotein (S) | 1273 | YP_009724390.1 |
| ORF3a | 275 | YP_009724391.1 |
| Envelope protein (E) | 75 | YP_009724392.1 |
| Membrane glycoprotein (M) | 222 | YP_009724393.1 |
| ORF6 | 61 | YP_009724394.1 |
| ORF7a | 121 | YP_009724395.1 |
| ORF7b | 43 | YP_009725318.1 |
| ORF8 | 121 | YP_009724396.1 |
| Nucleocapsid phosphoprotein (N) | 419 | YP_009724397.2 |
| ORF10 | 38 | YP_009725255.1 |
Figure 1Prediction of binding of antiviral peptides to MHC-I allotypes. (A) Distributions of the predicted affinities (IC50, log scale) to HLA-A and HLA-B supertypes of all possible linear 8- to 11-mer peptides derived from the 26 proteins of the SARS-CoV-2 proteome. The predicted MHC class I binding epitopes were classified as strong MHC binders (IC50 ≤ 50 nM), weak binders (50 nM < IC50 ≤ 500 nM) and non-binders (IC50 > 500 nM); (B) Number of MHC-I–binding 9- and 10-mers categorized as strong, weak and non-binders; (C) Number of 9-mer epitopes with an immunogenicity score ≥ 0.25 in SARS-CoV-2 proteins identified with the IEDB and NetCTL1.2 prediction methods.
Most potent SARS-CoV-2–derived MHC class I binding epitopes identified with both the IEDB (lowest IC50) and NetCTL1.2 (highest combined score) in silico prediction methods.
| Epitopes | Protein | Allotype | Supertype | Combined Score | Predicted IC50 (nM) |
|---|---|---|---|---|---|
| 738DTDFVNEFY746 | RdRp | A*01:01 | A01 | 3.619 | 2.83 |
| 1505LVAEWFLAY1513 | nsp3 | A*29:02 | A01 | 2.748 | 3.02 |
| 289SHFAIGLAL297 | Helicase | B*39:01 | B39 | 2.168 | 4.55 |
| 1507AEWFLAYIL1515 | nsp3 | B*40:01 | B44 | 2.036 | 4.88 |
| 1505LVAEWFLAY1513 | nsp3 | B*35:01 | A01 | 2.748 | 5.66 |
| 1507AEWFLAYIL1515 | nsp3 | B*40:02 | B44 | 2.036 | 7.64 |
| 217AMDEFIERY225 | EndoR | A*01:01 | A01 | 3.138 | 10.47 |
| 1505LVAEWFLAY1513 | nsp3 | B*15:01 | A01 | 2.748 | 11.16 |
| 1505LVAEWFLAY1513 | nsp3 | A*26:01 | A01 | 2.748 | 18.88 |
SARS-CoV-2–derived HLA-A*02 supertype-binding epitopes that are identical to the epitopes of SARS-CoV strains experimentally known to activate cytotoxic T-cells.
| Epitopes | Protein | Epitope Mutation | Combined Score | Allotypes | Predicted IC50 (nM) | Experimental IC50 (nM) |
|---|---|---|---|---|---|---|
| 1220FIAGLIAIV1228 | S | No | 1.212 | A*02:01 | 10.29 | 1.48 |
| 17VLLFLAFVV25 | E | No | 1.213 | A*02:01 | 21.72 | 5.62 |
| 20FLAFVVFLL28 | E | No | 1.440 | A*02:01 | 5.26 | 0.23 |
| 204VLAWLYAAV212 | 3CLpro | No | 1.173 | A*02:01 | 13.40 | 0.435 |
| 184VLWAHGFEL192 | 35EXO | No | 1.360 | A*02:01 | 5.78 | 0.40 |
| 330LLSAGIFGA338 | nsp3 | I335V | 1.217 | A*02:01 | 10.09 | 8.1 |
Predicted half-lives of complexes of the conserved SARS-CoV-2-derived most immunogenic experimentally identified epitopes and HLA-A*02 allotypes shown in Table 3. Secondary structures, localization within SARS-CoV-2 and GRAVY (grand average of hydropathicity index) scores of the epitopes.
| Epitopes | Allotypes | Half-Life (in Hours) | Secondary Structure | Localization | GRAVY Score |
|---|---|---|---|---|---|
| 1220FIAGLIAIV1228 | A*02:01 | 5.11 | helix | transmembrane | 3.056 |
| 17VLLFLAFVV25 | A*02:01 | 4.13 | helix | transmembrane | 3.489 |
| 20FLAFVVFLL28 | A*02:01 | 11 | helix | transmembrane | 3.333 |
| 204VLAWLYAAV212 | A*02:01 | 8.13 | helix | intravirion | 2.133 |
| 184VLWAHGFEL192 | A*02:01 | 6.51 | strand-coil-helix | intravirion | 0.933 |
Predicted half-lives of the novel SARS-CoV-2-derived, most immunogenic in-silico-identified epitopes in complex with the allotypes shown in Table 2. Secondary structures, localization within SARS-CoV-2, GRAVY scores and known mutations in the epitopes.
| Epitopes | Allotype | Epitope Mutation | Half-Life (in Hours) | Localization | GRAVY Score |
|---|---|---|---|---|---|
| 738DTDFVNEFY746 | A*01:01 | E744D | 2.84 | intravirion | −0.689 |
| 1505LVAEWFLAY1513 | A*29:02 | No | 3.64 | transmembrane | 1.389 |
| 289SHFAIGLAL297 | B*39:01 | H290Y | 2.02 | intravirion | 1.567 |
| 1507AEWFLAYIL1515 | B*40:01 | No | 2.04 | transmembrane | 1.422 |
| 1505LVAEWFLAY1513 | B*35:01 | No | 1.69 | transmembrane | 1.389 |
| 1507AEWFLAYIL1515 | B*40:02 | No | 3.81 | transmembrane | 1.422 |
| 217AMDEFIERY225 | A*01:01 | A217V | 1.26 | intravirion | −0.589 |
| 1505LVAEWFLAY1513 | B*15:01 | No | 7.51 | transmembrane | 1.389 |
| 1505LVAEWFLAY1513 | A*26:01 | No | 1.33 | transmembrane | 1.389 |
Figure 23D structural models of MHC-I in complex with selected epitopes (Table 4) and ternary complex of MHC-I–epitope—T cell receptor (TCR). (A) and (B) Five immunogenic epitopes (Table 4) were docked into the cleft between the α1 and α2 helices of HLA-A*02:01 (PDB ID: 5TEZ, chain A) and HLA-A*02:06 (PDB ID: 3OXR, chain A) (surface presentation). The epitopes 1220FIAGLIAIV1228, 17VLLFLAFVV25, 20FLAFVVFLL28, 204VLAWLYAAV212 and 184VLWAHGFEL192 are respectively colored cyan, yellow, blue, salmon and green, and are shown as ribbons; residues at Pos1-3 and Pos9 as sticks; (C) Structure of the epitope 1220FIAGLIAIV1228 (cyan sticks) docked into the cleft of HLA-A*02:01 (PDB ID: 5TEZ, chain A; gray cartoon and sticks). Residues G1223 to I1227 of the epitope and A69, K66, V76, T80, K146, V152 and Q155 of HLA-A*02:01 have solvent-exposed side chains; (D) Ternary complex structure of HLA-A*02:01 (gray cartoon), epitope 1220FIAGLIAIV1228 (cyan loop and sticks) and TCR. Conformations of the CDR1α, CDR2α and CDR3α loops of the TCR-α chain (orange) and CDR1β, CDR2β and CDR3β of the TCR-β chain (blue) are shown; (E) Side chains of residues of CDR3α (orange sticks) and CDR3β (blue sticks) loops making hydrophobic interactions (dotted yellow line; distances in Ångströms) with both the epitope 1220FIAGLIAIV1228 (cyan sticks) and the HLA-A*02:01 molecule (residues located at the α1 and α2 helices; gray sticks) are shown; (F) Structure of the epitope 1507AEWFLAYIL1515 (green loop and sticks) docked into the cleft of HLA-B*40:01 (PDB ID: 6IEX, chain A; gray cartoon and sticks).
Comparison of the Cα atom positions of the five most immunogenic SARS-CoV-2 epitopes. Upper and lower triangular data show root–mean–square deviations in Ångströms for the epitopes in complex with HLA-A*02:01 and HLA-A*02:06, respectively.
| A*02:01 | Fiagliaiv | Vllflafvv | Flafvvfll | Vlawlyaav | Vlwahgfel | |
|---|---|---|---|---|---|---|
| A*02:06 | ||||||
| Fiagliaiv | 0 | 0.685 | 0.566 | 0.554 | 0.852 | |
| Vllflafvv | 0.940 | 0 | 0.65 | 0.881 | 0.699 | |
| Flafvvfll | 1.002 | 0.636 | 0 | 0.518 | 0.888 | |
| Vlawlyaav | 0.599 | 0.669 | 0.501 | 0 | 1.039 | |
| Vlwahgfel | 1.296 | 1.250 | 1.470 | 1.432 | 0 | |
Figure 3Structural dynamics of the HLA-A*02:01-1220FIAGLIAIV1228 S protein epitope–T cell receptor (TCR) complex during a 100 ns simulation. (A) Cα atom root–mean–square fluctuation (RMSF) of the ternary complex. (B) Superimposed conformations of the complex sampled at 0 ns (blue), 50 ns (yellow) and 100 ns (pink) of the simulation. Results from the first of three independent MD simulations are illustrated here.