| Literature DB >> 32843695 |
Stephen N Crooke1, Inna G Ovsyannikova1, Richard B Kennedy1, Gregory A Poland2.
Abstract
A novel coronavirus (Entities:
Mesh:
Substances:
Year: 2020 PMID: 32843695 PMCID: PMC7447814 DOI: 10.1038/s41598-020-70864-8
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1(A) Diagram of SARS-CoV-2 virion structure with the major structural proteins (S, M, N, and E) highlighted. (B) Cartoon representation of the SARS-CoV-2 genome with the 10 major protein-coding regions annotated. The box diagrams are proportional to the protein size. (C) Diagram of peptide identification workflow illustrating the algorithms used[36,44–47,49–51,58,60] and filtering criterion applied to refine peptide selection. (D) Cladogram illustrating the genetic relationship of SARS-CoV-2 isolates. The original viral isolate and consensus sequence (Wuhan-Hu-1) is highlighted in red.
Figure 2Immunogenicity scoring of peptides in the SARS-CoV-2 proteome with predicted HLA class I and II coverage and binding affinities. (A) Plots illustrating the NetCTL score for each sequential peptide across the entire amino acid sequence for each SARS-CoV-2 protein. Scores presented are the highest score identified across all HLA class I supertypes for each peptide. (B) Total number of predicted peptide epitopes distributed across HLA class I alleles. (C) Average predicted binding affinities by HLA allele for the top candidate class I peptides listed in Table 1. (D) Total number of predicted peptide epitopes distributed across HLA class II alleles. (E) Average predicted binding affinities by HLA allele for the top candidate class II peptides listed in Table 1.
Top predicted HLA class I and class II T cell epitopes.
| Protein | Peptide | Residues | Antigenicity score | Predicted Alleles | Binding affinity (nM) |
|---|---|---|---|---|---|
| S | FAMQMAYRF# | 898–906 | 1.0278 | A*24:02 | 142.9 |
| B*15:01 | 123.9 | ||||
| B*58:01 | 23.4 | ||||
| ORF1ab | LSFKELLVY | 4758–4767 | 0.7234 | A*01:01 | 371.8 |
| B*15:01 | 42.6 | ||||
| B*58:01 | 35.7 | ||||
| ORF1ab | MMISAGFSL# | 6425–6434 | 1.0248 | A*02:01 | 6.9 |
| B*08:01 | 367.6 | ||||
| B*15:01 | 16.2 | ||||
| ORF1ab | MSNLGMPSY* | 2254–2262 | 0.9272 | A*01:01 | 184.2 |
| B*15:01 | 74.1 | ||||
| B*58:01 | 87.6 | ||||
| ORF1ab | STNVTIATY# | 2273–2281 | 0.7143 | A*01:01 | 241.1 |
| B*15:01 | 81.9 | ||||
| B*58:01 | 294.5 | ||||
| M | ASFRLFARTRSMWSF* | 98–112 | 0.7304 | DRB1*01:01 | 19.2 |
| DRB1*07:01 | 30.9 | ||||
| DRB1*08:02 | 53.5 | ||||
| DRB1*09:01 | 49.9 | ||||
| DRB1*11:01 | 12.2 | ||||
| DRB5*01:01 | 16.3 | ||||
| DPA1*02:01/DPB1*05:01 | 256.2 | ||||
| DPA1*02:01 DPB1*14:01 | 387.3 | ||||
| M | LLQFAYANRNRFLYI* | 34–48 | 0.7387 | DRB1*03:01 | 179.8 |
| DRB1*07:01 | 58.2 | ||||
| DRB1*08:02 | 225.6 | ||||
| DRB1*11:01 | 36.2 | ||||
| DRB1*13:02 | 27.8 | ||||
| DRB3*02:02 | 46.6 | ||||
| DRB5*01:01 | 26.3 | ||||
| S | AAEIRASANLAATKM* | 1015–1029 | 0.7125 | DRB1*08:02 | 101.3 |
| DRB1*13:02 | 23.0 | ||||
| DRB3*02:02 | 52.7 | ||||
| DQA1*01:02/DQB1*06:02 | 141.5 | ||||
| DPA1*02:01/DPB1*14:01 | 327.4 | ||||
| S | ALQIPFAMQMAYRFN* | 893–907 | 1.0112 | DRB1*09:01 | 52.9 |
| DRB1*12:01 | 159.5 | ||||
| DRB1*15:01 | 50.3 | ||||
| S | PYRVVVLSFELLHAP* | 507–521 | 0.8161 | DPA1*02:01/DPB1*01:01 | 79.6 |
| DPA1*01:03/DPB1*02:01 | 53.3 | ||||
| DPA1*01:03/DPB1*04:01 | 77.1 | ||||
| DPA1*03:01/DPB1*04:02 | 92.9 | ||||
| S | QPYRVVVLSFELLHA# | 506–520 | 0.9109 | DPA1*02:01/DPB1*01:01 | 73.2 |
| DPA1*01:03/DPB1*02:01 | 50.2 | ||||
| DPA1*01:03/DPB1*04:01 | 71.4 | ||||
| DPA1*03:01/DPB1*04:02 | 90.1 | ||||
| DPA1*02:01/DPB1*05:01 | 211.1 | ||||
| S | YQPYRVVVLSFELLH* | 505–519 | 0.9711 | DPA1*02:01/DPB1*01:01 | 102.2 |
| DPA1*01:03/DPB1*04:01 | 93.0 | ||||
| DPA1*03:01/DPB1*04:02 | 127.5 | ||||
| DPA1*02:01/DPB1*05:01 | 299.3 | ||||
| ORF1ab | ANYIFWRNTNPIQLS# | 7024–7038 | 1.0311 | DRB1*04:05 | 89.9 |
| DRB1*07:01 | 35.2 | ||||
| DRB1*13:02 | 13.5 | ||||
| ORF1ab | FKWDLTAFGLVAEWF* | 2314–2328 | 0.8059 | DQA1*05:01/DQB1*02:01 | 178.3 |
| DQA1*03:01/DQB1*03:02 | 425.3 | ||||
| DQA1*04:01/DQB1*04:02 | 349.3 | ||||
| ORF1ab | HIQWMVMFTPLVPFW* | 3125–3139 | 0.7238 | DQA1*01:01/DQB1*05:01 | 293.1 |
| DPA1*02:01/DPB1*01:01 | 116.3 | ||||
| DPA1*01:03/DPB1*04:01 | 84.6 | ||||
| DPA1*03:01/DPB1*04:02 | 135.4 | ||||
| ORF1ab | IINLVQMAPISAMVR* | 4048–4062 | 0.7682 | DRB1*01:01 | 12.8 |
| DRB1*08:02 | 118.8 | ||||
| DRB4*01:01 | 54.7 | ||||
| ORF1ab | INLVQMAPISAMVRM* | 4049–4063 | 0.9037 | DRB1*12:01 | 176.9 |
| DRB4*01:01 | 57.1 | ||||
| DQA1*01:02/DQB1*06:02 | 116.5 | ||||
| DPA1*02:01/DPB1*14:01 | 398.6 | ||||
| ORF1ab | IVFMCVEYCPIFFIT | 3758–3772 | 1.0267 | DPA1*02:01/DPB1*01:01 | 116.2 |
| DPA1*01:03/DPB1*02:01 | 53.9 | ||||
| DPA1*01:03/DPB1*04:01 | 70.9 | ||||
| DPA1*03:01/DPB1*04:02 | 144.9 | ||||
| ORF1ab | IVTALRANSAVKLQN* | 4127–4141 | 0.7692 | DRB1*08:02 | 115.9 |
| DRB1*13:02 | 9.4 | ||||
| DRB3*02:02 | 19.5 | ||||
| DPA1*02:01/DPB1*14:01 | 408.7 | ||||
| ORF1ab | KGRLIIRENNRVVIS* | 7075–7089 | 0.7821 | DRB1*12:01 | 170.9 |
| DRB1*13:02 | 9.5 | ||||
| DRB1*15:01 | 48.2 | ||||
| DRB4*01:01 | 58.8 | ||||
| ORF1ab | KSAFYILPSIISNEK* | 1350–1364 | 0.7169 | DRB1*01:01 | 9.3 |
| DRB1*04:01 | 49.3 | ||||
| DRB1*04:05 | 47.5 | ||||
| DRB1*08:02 | 96.3 | ||||
| ORF1ab | LIVTALRANSAVKLQ# | 4126–4140 | 0.7473 | DRB1*01:01 | 8.8 |
| DRB1*07:01 | 39.2 | ||||
| DRB4*01:01 | 78.6 | ||||
| DQA1*01:02/DQB1*06:02 | 142.5 | ||||
| DPA1*02:01/DPB1*14:01 | 368.3 | ||||
| ORF1ab | NLPFKLTCATTRQVV | 2737–2751 | 1.1632 | DRB1*07:01 | 35.9 |
| DRB1*09:01 | 58.6 | ||||
| DRB5*01:01 | 23.9 | ||||
| ORF1ab | PASRELKVTFFPDLN | 1950–1964 | 1.0155 | DPA1*02:01/DPB1*01:01 | 76.9 |
| DPA1*01:03/DPB1*02:01 | 48.9 | ||||
| DPA1*01:03/DPB1*04:01 | 64.3 | ||||
| DPA1*03:01/DPB1*04:02 | 149.5 | ||||
| ORF1ab | PFAMGIIAMSAFAMM* | 3613–3627 | 0.9834 | DRB1*01:01 | 12.3 |
| DRB1*09:01 | 57.6 | ||||
| DQA1*05:01/DQB1*03:01 | 45.6 | ||||
| ORF1ab | QMNLKYAISAKNRAR# | 4933–4947 | 1.5044 | DRB1*01:01 | 14.9 |
| DRB1*04:01 | 56.9 | ||||
| DRB1*08:02 | 49.1 | ||||
| DRB1*09:01 | 45.2 | ||||
| DRB1*11:01 | 22.1 | ||||
| DRB3*02:02 | 84.9 | ||||
| DPA1*02:01/DPB1*14:01 | 158.3 | ||||
| ORF1ab | QQKLALGGSVAIKIT | 6956–6970 | 1.2533 | DRB1*01:01 | 12.6 |
| DRB1*07:01 | 23.4 | ||||
| DRB1*09:01 | 32.3 | ||||
| DQA1*05:01/DQB1*03:01 | 42.9 | ||||
| ORF1ab | RFKESPFELEDFIPM# | 6709–6723 | 1.2101 | DPA1*02:01/DPB1*01:01 | 74.0 |
| DPA1*01:03/DPB1*02:01 | 65.9 | ||||
| DPA1*01:03/DPB1*04:01 | 81.9 | ||||
| DPA1*03:01/DPB1*04:02 | 130.6 | ||||
| ORF1ab | SAFAMMFVKHKHAFL | 3622–3636 | 0.7305 | DRB1*08:02 | 110.4 |
| DRB1*11:01 | 18.3 | ||||
| DRB1*15:01 | 50.9 | ||||
| DRB4*01:01 | 79.2 | ||||
| DRB5*01:01 | 15.1 | ||||
| ORF1ab | SFLAHIQWMVMFTPL# | 3121–3135 | 0.8215 | DPA1*02:01/DPB1*01:01 | 103.9 |
| DPA1*01:03/DPB1*02:01 | 47.8 | ||||
| DPA1*01:03/DPB1*04:01 | 70.7 | ||||
| DPA1*03:01/DPB1*04:02 | 140.6 | ||||
| ORF1ab | SIGFDYVYNPFMIDV* | 6155–6169 | 1.0823 | DPA1*02:01/DPB1*01:01 | 108.9 |
| DPA1*01:03/DPB1*02:01 | 47.1 | ||||
| DPA1*01:03/DPB1*04:01 | 81.9 | ||||
| DPA1*03:01/DPB1*04:02 | 137.6 | ||||
| ORF1ab | TEETFKLSYGIATVR* | 5465–5479 | 0.8859 | DRB1*01:01 | 8.7 |
| DRB1*07:01 | 21.8 | ||||
| DRB1*09:01 | 25.9 | ||||
| ORF1ab | VLVQSTQWSLFFFLY* | 3593–3607 | 0.7309 | DPA1*02:01/DPB1*01:01 | 77.0 |
| DPA1*01:03/DPB1*02:01 | 35.3 | ||||
| DPA1*01:03/DPB1*04:01 | 42.3 | ||||
| DPA1*03:01/DPB1*04:02 | 93.1 | ||||
| ORF1ab | VQSTQWSLFFFLYEN* | 3595–3609 | 0.7509 | DPA1*02:01/DPB1*01:01 | 107.1 |
| DPA1*01:03/DPB1*02:01 | 49.9 | ||||
| DPA1*03:01/DPB1*04:02 | 129.8 | ||||
| ORF1ab | WLIINLVQMAPISAM# | 2366–2380 | 0.9389 | DRB1*12:01 | 130.6 |
| DRB4*01:01 | 65.9 | ||||
| DQA1*01:02/DQB1*06:02 | 139.6 | ||||
| ORF1ab | YFNMVYMPASWVMRI* | 3649–3663 | 0.7244 | DRB1*01:01 | 8.3 |
| DRB1*04:05 | 80.2 | ||||
| DRB1*07:01 | 38.2 | ||||
| DRB1*09:01 | 37.4 | ||||
| DRB1*12:01 | 184.5 | ||||
| DRB1*15:01 | 30.1 | ||||
| ORF3 | KKRWQLALSKGVHFV# | 66–80 | 0.8172 | DRB1*01:01 | 9.2 |
| DRB1*07:01 | 11.6 | ||||
| DRB1*08:02 | 200.3 | ||||
| DRB1*09:01 | 17.9 | ||||
| DRB1*11:01 | 43.1 | ||||
| DRB1*12:01 | 119.6 | ||||
| DRB1*13:02 | 30.0 | ||||
| DRB1*15:01 | 34.2 | ||||
| DRB4*01:01 | 79.8 | ||||
| DRB5*01:01 | 18.4 | ||||
| ORF6 | MFHLVDFQVTIAEIL# | 1–15 | 1.0366 | DQA1*05:01/DQB1*02:01 | 192.0 |
| DQA1*01:01/DQB1*05:01 | 292.1 | ||||
| DPA1*02:01/DPB1*01:01 | 108.3 | ||||
| DPA1*01:03/DPB1*04:01 | 100.7 | ||||
| ORF7 | VKHVYQLRARSVSPK# | 71–85 | 1.0865 | DRB1*01:01 | 14.3 |
| DRB1*08:02 | 150.6 | ||||
| DRB1*11:01 | 38.3 | ||||
| DRB4*01:01 | 86.6 | ||||
| ORF7 | NKFALTCFSTQFAFA* | 52–66 | 1.1728 | DPA1*02:01/DPB1*01:01 | 50.9 |
| DPA1*01:03/DPB1*02:01 | 29.1 | ||||
| DPA1*01:03/DPB1*04:01 | 35.9 | ||||
| DPA1*03:01/DPB1*04:02 | 80.2 | ||||
| DPA1*02:01/DPB1*05:01 | 273.4 | ||||
| ORF8 | SKWYIRVGARKSAPL* | 43–57 | 0.8829 | DRB1*01:01 | 13.7 |
| DRB1*08:02 | 87.8 | ||||
| DRB1*09:01 | 50.7 | ||||
| DRB1*11:01 | 15.3 | ||||
| DRB5*01:01 | 8.8 | ||||
*Significant sequence overlap with peptides reported in[38,43].
#Exact peptide replicated from analyses reported in[38,43].
Figure 3Docking of top predicted HLA class I peptides with a shared HLA molecule. (A) Structural docking model for each indicated peptide with the molecular structure of HLA-B*15:01 (PDB: 3C9N). Individual panels represent top-down views of the peptide binding groove. (B) Binding motif for HLA-B*15:01. (C) Template Modeling and Interaction Similarity scores for the selected peptide docking models shown in panel A[81,82].
Top predicted B cell epitopes for the S protein.
| Peptide | Residues | Bepipred scorea | DiscoTope scorea |
|---|---|---|---|
| DEVRQIAPGQTGKIADYNYKLPDD | 405–428 | 0.715 | − 5.71 |
| NLDSKVGGNYN | 440–450 | 0.577 | − 5.77 |
| GFQPTNGVGYQP | 496–507 | 1.01 | − 5.73 |
| DIADTT | 568–573 | 0.853 | − 5.55 |
| PPIKD | 792–796 | 0.936 | − 3.28 |
| VYDPLQPELDSF | 1137–1148 | 0.747 | − 4.12 |
aReported scores represent the average calculated across all amino acids for the combined epitope sequence.
Figure 4Modeling of predicted B cell epitopes on the crystal structure of the S glycoprotein. Predicted structural epitopes in the S1 domain (A) and S2 domain (B) highlighted on the structure of the S glycoprotein monomer (PDB: 6VSB). (C) Top predicted B cell epitopes identified by both Bepipred and DiscoTope prediction algorithms highlighted on the trimeric structure of the S glycoprotein. Inset panels show the S1 domain (upper) and S2 domain (lower). Predicted epitopes are highlighted as colored atoms (green, blue, red) on the surface of the S protein (salmon).