| Literature DB >> 33091154 |
Jinlei He1, Fan Huang2, Jianhui Zhang1, Qiwei Chen1, Zhiwan Zheng1, Qi Zhou1, Dali Chen1, Jiao Li1, Jianping Chen1,3.
Abstract
The global outbreak of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) urgently requires an effective vaccine for prevention. In this study, 66 epitopes containing pentapeptides of SARS-CoV-2 spike protein in the IEDB database were compared with the amino acid sequence of SARS-CoV-2 spike protein, and 66 potentially immune-related peptides of SARS-CoV-2 spike protein were obtained. Based on the single-nucleotide polymorphisms analysis of spike protein of 1218 SARS-CoV-2 isolates, 52 easily mutated sites were identified and used for vaccine epitope screening. The best vaccine candidate epitopes in the 66 peptides of SARS-CoV-2 spike protein were screened out through mutation and immunoinformatics analysis. The best candidate epitopes were connected by different linkers in silico to obtain vaccine candidate sequences. The results showed that 16 epitopes were relatively conservative, immunological, nontoxic, and nonallergenic, could induce the secretion of cytokines, and were more likely to be exposed on the surface of the spike protein. They were both B- and T-cell epitopes, and could recognize a certain number of HLA molecules and had high coverage rates in different populations. Moreover, epitopes 897-913 were predicted to have possible cross-immunoprotection for SARS-CoV and SARS-CoV-2. The results of vaccine candidate sequences screening suggested that sequences (without linker, with linker GGGSGGG, EAAAK, GPGPG, and KK, respectively) were the best. The proteins translated by these sequences were relatively stable, with a high antigenic index and good biological activity. Our study provided vaccine candidate epitopes and sequences for the research of the SARS-CoV-2 vaccine.Entities:
Keywords: SARS-CoV-2; epitope; non-synonymous mutation; spike protein; vaccine
Mesh:
Substances:
Year: 2020 PMID: 33091154 PMCID: PMC7675516 DOI: 10.1002/jmv.26596
Source DB: PubMed Journal: J Med Virol ISSN: 0146-6615 Impact factor: 20.693
Sequence alignment of 66 epitopes in IEDB database to SARS‐CoV‐2 spike protein
| IEDB ID number | Epitope sequence | Organism | Position in spike protein | Amino acid sequence in spike protein | Easily mutated site |
|---|---|---|---|---|---|
| 307 | aalvsgtatagWTFGAg | SARS‐CoV | 875‐891 | SALLAGTITSGWTFGAG | N/A |
| 462 | aatkMSECVlgqskrvd | SARS‐CoV |
|
| N/A |
| 1460 | agclIGAEHvdtsyecd | SARS‐CoV | 647‐663 | AGCLIGAEHVNNSYECD | 653, 660 |
| 3176 | aMQMAYRF | SARS‐CoV |
|
| N/A |
| 6011 | canlllqygsFCTQLnralsgia | SARS‐CoV | 749‐771 | CSNLLLQYGSFCTQLNRALTGIA | 769 |
| 6333 | cgpklstdliknqCVNFNfngltgtgvltpsskrfqpfqqfg | SARS‐CoV | 525‐566 | CGPKKSTNLVKNKCVNFNFNGLTGTGVLTESNKKFLPFQQFG | N/A |
| 6334 | cgpklstdliknqCVNFNfngltgtgvltpsskrfqpfqqfgrdvsdftd | SARS‐CoV | 525‐574 | CGPKKSTNLVKNKCVNFNFNGLTGTGVLTESNKKFLPFQQFGRDIADTTD | 574 |
| 7066 | csqnplaelkcsvksfeidkGIYQTsnfrvvpsgd | SARS‐CoV | 291‐325 | CALDPLSETKCTLKSFTVEKGIYQTSNFRVQPTES | N/A |
| 7217 | cttfddvqapnytqhtssmRGVYYPDeifr | SARS‐CoV | 15‐44 | CVNLTTRTQLPPAYTNSFTRGVYYPDKVFR | 17, 21, 22, 29 |
| 7383 | CYGVSatklndlcfsnv | SARS‐CoV |
|
| 382 |
| 8239 | dfcgkGYHLMSfpqaap | SARS‐CoV | 1041‐1057 | DFCGKGYHLMSFPQSAP | N/A |
| 12417 | eidkGIYQTsnfrvvps | SARS‐CoV | 307‐323 | TVEKGIYQTSNFRVQPT | N/A |
| 15903 | ffSTFKCYGVSatklnd | SARS‐CoV |
|
| 382 |
| 18161 | fvfngtswfiTQRNFfs | SARS‐CoV | 1095‐1110 | FVSNGTHWFVTQRNFY | N/A |
| 18515 | gaalqipFAMQMAYRFn | SARS‐CoV |
|
| N/A |
| 21464 | gnliaprGYFKIrsgkssim | Influenza A virus | 192‐211 | FVFKNIDGYFKIYSKHTPIN | 211 |
| 22321 | gsFCTQLn | SARS‐CoV |
|
| N/A |
| 24978 | htssmRGVYYPDeifrs | SARS‐CoV | 29‐45 | TNSFTRGVYYPDKVFRS | 29 |
| 25250 | IADYNYKLpddfmgcvl | SARS‐CoV |
|
| N/A |
| 25293 | iaglIAIVMvtillccm | SARS‐CoV | 1221‐1237 | IAGLIAIVMVTIMLCCM | 1237 |
| 25378 | iapgqtgvIADYNYKLp | SARS‐CoV | 410‐426 | IAPGQTGKIADYNYKLP | N/A |
| 25382 | iaprGYFKIrngkssimrsdapigtcssecit | Influenza A virus | 195‐226 | KNIDGYFKIYSKHTPINLVRDLPQGFSALEPL | 211, 215, 220 |
| 29728 | iywtivkpgdillinstgnliaprGYFKIrn | Influenza A virus | 175‐205 | FLMDLEGKQGNFKNLREFVFKNIDGYFKIYS | 180, 181 |
| 30987 | kGIYQTsn | SARS‐CoV |
|
| N/A |
| 30988 | kGIYQTsnfrvvpsgdvvrf | SARS‐CoV | 310‐329 | KGIYQTSNFRVQPTESIVRF | N/A |
| 31581 | kkisnCVADYsvlynst | SARS‐CoV |
|
| N/A |
| 31582 | kkisnCVADYsvlynstf | SARS‐CoV |
|
| N/A |
| 33305 | ksfeidkGIYQTsnfrvv | SARS‐CoV | 304‐321 | KSFTVEKGIYQTSNFRVQ | N/A |
| 33358 | ksivAYTMSlgadssia | SARS‐CoV | 690‐706 | QSIIAYTMSLGAENSVA | 690, 691, 701 |
| 33874 | kTSVDCnMYICGDSTEC | SARS‐CoV | 733‐749 | KTSVDCTMYICGDSTEC | N/A |
| 36579 | liknqCVNFNfngltgt | SARS‐CoV | 533‐549 | LVKNKCVNFNFNGLTGT | N/A |
| 36815 | lkcsvksfeidkGIYQT | SARS‐CoV | 299‐315 | TKCTLKSFTVEKGIYQT | N/A |
| 36856 | lkgacscgsCCKFDedd | SARS‐CoV | 1244‐1260 | LKGCCSCGSCCKFDEDD | N/A |
| 37758 | llrstsqksivAYTMSl | SARS‐CoV | 683‐699 | RARSVASQSIIAYTMSL | 688, 690, 691 |
| 39023 | lqygsFCTQLnralsgi | SARS‐CoV | 754‐770 | LQYGSFCTQLNRALTGI | 769 |
| 41177 | MAYRFNGIgvtqnvlye | SARS‐CoV | 902‐918 | MAYRFNGIGVTQNVLYE | N/A |
| 42999 | mvtilLCCMTSCCsclk | SARS‐CoV | 1229‐1245 | MVTIMLCCMTSCCSCLK | 1237 |
| 43145 | nafnCTFEYisdafsld | SARS‐CoV | 162‐178 | SANNCTFEYVSQPFLMD | N/A |
| 46379 | nvfqtqagclIGAEHvd | SARS‐CoV | 641‐657 | NVFQTRAGCLIGAEHVN | 653 |
| 46822 | PAICHegkayfpregvfvfngtswfitqrnffs | SARS‐CoV | 1079‐1111 | PAICHDGKAHFPREGVFVSNGTHWFVTQRNFYE | N/A |
| 47479 | pFAMQMAYRFNGIgvtq | SARS‐CoV |
|
| N/A |
| 49968 | pvsmakTSVDCnMYICGds | SARS‐CoV | 728‐746 | PVSMTKTSVDCTMYICGDS | N/A |
| 50058 | pwyvwlgfiaglIAIVM | SARS‐CoV | 1213‐1229 | PWYIWLGFIAGLIAIVM | N/A |
| 53202 | rasanlaatkMSECVlg | SARS‐CoV | 1019‐1035 | RASANLAATKMSECVLG | N/A |
| 54989 | rnfttaPAICHegkayf | SARS‐CoV | 1073‐1089 | KNFTTAPAICHDGKAHF | 1078 |
| 58143 | sgncdvvigiinNTVYD | SARS‐CoV | 1123‐1139 | SGNCDVVIGIVNNTVYD | N/A |
| 58730 | sivAYTMSl | SARS‐CoV | 691‐699 | SIIAYTMSL | 691 |
| 61554 | stdliknqCVNFNfn | SARS‐CoV | 530‐544 | STNLVKNKCVNFNFN | N/A |
| 61598 | stffSTFKCYGVSatkl | SARS‐CoV |
|
| 382 |
| 62872 | tagWTFGAgaalqipfa | SARS‐CoV | 883‐899 | TSGWTFGAGAALQIPFA | N/A |
| 63309 | tecanlllqygsFCTQL | SARS‐CoV | 747‐763 | TECSNLLLQYGSFCTQL | N/A |
| 68971 | vigiinNTVYDplqpel | SARS‐CoV | 1129‐1145 | VIGIVNNTVYDPLQPEL | N/A |
| 72205 | VYYPDeifrsdtlyltqd | SARS‐CoV | 36‐53 | VYYPDKVFRSSVLHSTQD | N/A |
| 74173 | yicgDSTECanlllqyg | SARS‐CoV | 741‐757 | YICGDSTECSNLLLQYG | N/A |
| 75920 | ysvlynstffSTFKCYG | SARS‐CoV |
|
| N/A |
| 99918 | CTFEYisdafsld | SARS‐CoV | 166‐178 | CTFEYVSQPFLMD | N/A |
| 100048 | gaalqipFAMQMAYRF | SARS‐CoV | 891‐906 | GAALQIPFAMQMAYRF | N/A |
| 100230 | ksivAYTMSlgadssiay | SARS‐CoV | 690‐707 | QSIIAYTMSLGAENSVAY | 690, 691, 701 |
| 100300 | MAYRFNGIgvtqnvly | SARS‐CoV | 902‐917 | MAYRFNGIGVTQNVLY | N/A |
| 100316 | nafnCTFEYisdafsldv | SARS‐CoV | 162‐179 | SANNCTFEYVSQPFLMDL | N/A |
| 100537 | swfiTQRNFfspqii | SARS‐CoV | 1101‐1115 | HWFVTQRNFYEPQII | N/A |
| 100711 | agclIGAEHvdtsyecdi | SARS‐CoV | 647‐664 | AGCLIGAEHVNNSYECDI | 653, 660 |
| 129239 | liaprGYFKIrsgkssi | Influenza A virus | 194‐210 | FKNIDGYFKIYSKHTPI | N/A |
| 532052 | gtswfiTQRNFfspq | SARS‐CoV | 1099‐1113 | GTHWFVTQRNFYEPQ | N/A |
| 873061 | mmcehiyytcvrTSVDCc | Human herpes virus 6B | 722‐739 | VTTEILPVSMTKTSVDCT | N/A |
| 874104 | ytcvrTSVDCcmkgaep | Human herpes virus 6B | 729‐745 | VSMTKTSVDCTMYICGD | N/A |
Note: In the table, the capitalized amino acid sequences were the sequences in SARS‐CoV‐2 spike protein. The underlined peptides partially overlaped with CR3022 epitope published in Science by Yuan et al. The bolded peptides were completely consistent with the corresponding epitope sequences in the IEDB database.
Figure 1Mutation analysis of severe acute respiratory syndrome coronavirus 2 (SARS‐CoV‐2) spike protein and prediction of epitope population coverage. (A) The amino acid sites with nonsynonymous mutations of spike protein in 1218 clinical isolates of SARS‐CoV‐2. The amino acid sites with nonsynonymous mutations appeared twice or more in 1218 isolates, which were considered to be easily mutated and marked in the figure. We totally found 52 easily mutated sites. (B) Prediction of population coverage rates of 28 epitopes in SARS‐CoV‐2 spike protein
Prediction of protective antigen, toxicity, allergenicity, and cytokine secretion of 66 peptides in SARS‐CoV‐2 spike protein
| Position in spike protein | Amino acid sequence in spike protein | Protective antigen prediction | Toxicity prediction | Allergenicity prediction | IFN‐γ prediction | IL‐4 prediction | IL‐10 prediction |
|---|---|---|---|---|---|---|---|
| 875‐891 | SALLAGTITSGWTFGAG | Nonantigen | Nontoxin | Allergen | Inducer | Noninducer | Noninducer |
|
|
| Antigen | Nontoxin | Nonallergen | Noninducer | Inducer | Noninducer |
| 647‐663 | AGCLIGAEHVNNSYECD | Antigen | Toxin | Nonallergen | Inducer | Inducer | Inducer |
|
|
| Antigen | Nontoxin | Nonallergen | Inducer | Inducer | Noninducer |
| 749‐771 | CSNLLLQYGSFCTQLNRALTGIA | Antigen | Nontoxin | Nonallergen | Inducer | Noninducer | Inducer |
| 525‐566 | CGPKKSTNLVKNKCVNFNFNGLTGTGVLTESNKKFLPFQQFG | Antigen | Nontoxin | Nonallergen | N/A | Noninducer | Inducer |
| 525‐574 | CGPKKSTNLVKNKCVNFNFNGLTGTGVLTESNKKFLPFQQFGRDIADTTD | Antigen | Nontoxin | Nonallergen | N/A | Noninducer | Inducer |
| 291‐325 | CALDPLSETKCTLKSFTVEKGIYQTSNFRVQPTES | Antigen | Nontoxin | Nonallergen | N/A | Inducer | Inducer |
| 15‐44 | CVNLTTRTQLPPAYTNSFTRGVYYPDKVFR | Antigen | Nontoxin | Nonallergen | Inducer | Inducer | Inducer |
|
|
| Antigen | Nontoxin | Nonallergen | Non‐Inducer | Inducer | Noninducer |
| 1041‐1057 | DFCGKGYHLMSFPQSAP | Nonantigen | Nontoxin | Nonallergen | Inducer | Inducer | Noninducer |
| 307‐323 | TVEKGIYQTSNFRVQPT | Antigen | Nontoxin | Nonallergen | Inducer | Inducer | Noninducer |
|
|
| Antigen | Nontoxin | Allergen | Noninducer | Inducer | Noninducer |
| 1095‐1110 | FVSNGTHWFVTQRNFY | Nonantigen | Nontoxin | Allergen | Noninducer | Inducer | Noninducer |
|
|
| Antigen | Nontoxin | Allergen | Noninducer | Inducer | Noninducer |
| 192‐211 | FVFKNIDGYFKIYSKHTPIN | Antigen | Nontoxin | Allergen | Inducer | Inducer | Inducer |
|
|
| Antigen | Nontoxin | Allergen | Inducer | Noninducer | Noninducer |
| 29‐45 | TNSFTRGVYYPDKVFRS | Nonantigen | Nontoxin | Allergen | Inducer | Noninducer | Inducer |
|
|
| Antigen | Nontoxin | Allergen | Non‐Inducer | Inducer | Noninducer |
| 1221‐1237 | IAGLIAIVMVTIMLCCM | Antigen | Toxin | Allergen | Inducer | Inducer | Inducer |
| 410‐426 | IAPGQTGKIADYNYKLP | Antigen | Nontoxin | Nonallergen | Inducer | Inducer | Noninducer |
| 195‐226 | KNIDGYFKIYSKHTPINLVRDLPQGFSALEPL | Antigen | Nontoxin | Nonallergen | N/A | Noninducer | Inducer |
| 175‐205 | FLMDLEGKQGNFKNLREFVFKNIDGYFKIYS | Nonantigen | Nontoxin | Nonallergen | N/A | Inducer | Inducer |
|
|
| Nonantigen | Nontoxin | Allergen | Inducer | Inducer | Noninducer |
| 310‐329 | KGIYQTSNFRVQPTESIVRF | Nonantigen | Nontoxin | Nonallergen | Inducer | Inducer | Noninducer |
|
|
| Nonantigen | Nontoxin | Nonallergen | Non‐Inducer | Inducer | Inducer |
|
|
| Nonantigen | Nontoxin | Nonallergen | Non‐Inducer | Inducer | Inducer |
| 304‐321 | KSFTVEKGIYQTSNFRVQ | Nonantigen | Nontoxin | Nonallergen | Inducer | Inducer | Noninducer |
| 690‐706 | QSIIAYTMSLGAENSVA | Antigen | Nontoxin | Nonallergen | Inducer | Noninducer | Noninducer |
| 733‐749 | KTSVDCTMYICGDSTEC | Nonantigen | Toxin | Nonallergen | Non‐Inducer | Noninducer | Noninducer |
| 533‐549 | LVKNKCVNFNFNGLTGT | Antigen | Nontoxin | Nonallergen | Non‐Inducer | Inducer | Noninducer |
| 299‐315 | TKCTLKSFTVEKGIYQT | Nonantigen | Nontoxin | Allergen | Inducer | Inducer | Noninducer |
| 1244‐1260 | LKGCCSCGSCCKFDEDD | Nonantigen | Toxin | Nonallergen | Noninducer | Inducer | Noninducer |
| 683‐699 | RARSVASQSIIAYTMSL | Antigen | Nontoxin | Nonallergen | Inducer | Noninducer | Noninducer |
| 754‐770 | LQYGSFCTQLNRALTGI | Antigen | Nontoxin | Nonallergen | Inducer | Noninducer | Noninducer |
| 902‐918 | MAYRFNGIGVTQNVLYE | Antigen | Nontoxin | Nonallergen | Noninducer | Noninducer | Noninducer |
| 1229‐1245 | MVTIMLCCMTSCCSCLK | Nonantigen | Toxin | Nonallergen | Noninducer | Inducer | Inducer |
| 162‐178 | SANNCTFEYVSQPFLMD | Nonantigen | Nontoxin | Nonallergen | Noninducer | Inducer | Noninducer |
| 641‐657 | NVFQTRAGCLIGAEHVN | Antigen | Nontoxin | Allergen | Noninducer | Noninducer | Inducer |
| 1079‐1111 | PAICHDGKAHFPREGVFVSNGTHWFVTQRNFYE | Nonantigen | Nontoxin | Nonallergen | N/A | Inducer | Inducer |
|
|
| Antigen | Nontoxin | Nonallergen | Noninducer | Inducer | Noninducer |
| 728‐746 | PVSMTKTSVDCTMYICGDS | Nonantigen | Nontoxin | Allergen | Noninducer | Noninducer | Noninducer |
| 1213‐1229 | PWYIWLGFIAGLIAIVM | Antigen | Nontoxin | Nonallergen | Noninducer | Noninducer | Inducer |
| 1019‐1035 | RASANLAATKMSECVLG | Antigen | Nontoxin | Nonallergen | Inducer | Noninducer | Noninducer |
| 1073‐1089 | KNFTTAPAICHDGKAHF | Nonantigen | Nontoxin | Allergen | Inducer | Inducer | Inducer |
| 1123‐1139 | SGNCDVVIGIVNNTVYD | Antigen | Nontoxin | Allergen | Noninducer | Noninducer | Noninducer |
| 691‐699 | SIIAYTMSL | Antigen | Nontoxin | Allergen | Noninducer | Noninducer | Noninducer |
| 530‐544 | STNLVKNKCVNFNFN | Antigen | Nontoxin | Nonallergen | Noninducer | Inducer | Noninducer |
|
|
| Antigen | Nontoxin | Nonallergen | Noninducer | Inducer | Noninducer |
| 883‐899 | TSGWTFGAGAALQIPFA | Nonantigen | Nontoxin | Nonallergen | Noninducer | Inducer | Noninducer |
| 747‐763 | TECSNLLLQYGSFCTQL | Antigen | Nontoxin | Nonallergen | Inducer | Noninducer | Inducer |
| 1129‐1145 | VIGIVNNTVYDPLQPEL | Antigen | Nontoxin | Nonallergen | Noninducer | Inducer | Inducer |
| 36‐53 | VYYPDKVFRSSVLHSTQD | Nonantigen | Nontoxin | Nonallergen | Inducer | Noninducer | Inducer |
| 741‐757 | YICGDSTECSNLLLQYG | Nonantigen | Nontoxin | Allergen | Inducer | Noninducer | Noninducer |
|
|
| Nonantigen | Nontoxin | Nonallergen | Inducer | Inducer | Noninducer |
| 166‐178 | CTFEYVSQPFLMD | Nonantigen | Nontoxin | Allergen | Noninducer | Inducer | Noninducer |
| 891‐906 | GAALQIPFAMQMAYRF | Antigen | Nontoxin | Nonallergen | Noninducer | Inducer | Noninducer |
| 690‐707 | QSIIAYTMSLGAENSVAY | Antigen | Nontoxin | Allergen | Inducer | Noninducer | Noninducer |
| 902‐917 | MAYRFNGIGVTQNVLY | Antigen | Nontoxin | Nonallergen | Noninducer | Noninducer | Noninducer |
| 162‐179 | SANNCTFEYVSQPFLMDL | Nonantigen | Nontoxin | Nonallergen | Inducer | Inducer | Noninducer |
| 1101‐1115 | HWFVTQRNFYEPQII | Antigen | Nontoxin | Nonallergen | Noninducer | Inducer | Noninducer |
| 647‐664 | AGCLIGAEHVNNSYECDI | Antigen | Toxin | Nonallergen | Inducer | Inducer | Inducer |
| 194‐210 | FKNIDGYFKIYSKHTPI | Antigen | Nontoxin | Nonallergen | Noninducer | Inducer | Inducer |
| 1099‐1113 | GTHWFVTQRNFYEPQ | Nonantigen | Nontoxin | Nonallergen | Noninducer | Inducer | Noninducer |
| 722‐739 | VTTEILPVSMTKTSVDCT | Antigen | Nontoxin | Nonallergen | Noninducer | Inducer | Inducer |
| 729‐745 | VSMTKTSVDCTMYICGD | Nonantigen | Nontoxin | Nonallergen | Noninducer | Noninducer | Noninducer |
Note: In the table, underlined peptides partially overlaped with CR3022 epitope published in Science by Yuan et al. The bolded peptides were consistent with the corresponding epitope sequences in IEDB database. N/A meant undetectable because the peptide length was beyond the range of the analysis system (≤30 amino acids).
Prediction of B cell and T cell epitopes of 28 epitopes in SARS‐CoV‐2 spike protein
| Position in spike protein | Amino acid sequence in spike protein | Average SOA score | B cell epitope prediction | T cell epitope prediction | ||
|---|---|---|---|---|---|---|
| ABCpred | IEDB | HLA class I molecule | HLA class II molecule | |||
|
|
| 12.33 | 2‐17 | 7‐14 | N/A | HLA‐DRB1 0101 |
|
|
| 4.06 | N/A | 0 | HLA‐A2402 | N/A |
| 749‐771 | CSNLLLQYGSFCTQLNRALTGIA | 22.79 | 2‐17 | 17 |
HLA‐A0301 HLA‐A2402 HLA‐B0801 HLA‐B1501 HLA‐B2705 HLA‐B3901 |
HLA‐DRB1 0101 HLA‐DRB1 0301 HLA‐DRB1 0401 HLA‐DRB1 0701 HLA‐DRB1 1501 |
| 525‐566 | CGPKKSTNLVKNKCVNFNFNGLTGTGVLTESNKKFLPFQQFG | 25.59 | 1‐37 | 5‐39 |
HLA‐A0101 HLA‐A0301 HLA‐A1101 HLA‐A2601 HLA‐B0702 HLA‐B0801 HLA‐B1501 HLA‐B3901 HLA‐B3902 HLA‐B5101 |
HLA‐DRB1 0101 HLA‐DRB1 0301 HLA‐DRB1 0401 HLA‐DRB1 0404 HLA‐DRB1 0701 HLA‐DRB1 1501 |
| 525‐574 | CGPKKSTNLVKNKCVNFNFNGLTGTGVLTESNKKFLPFQQFGRDIADTTD | 24.71 | 2‐21, 24‐43 | 7‐46 |
HLA‐A0101 HLA‐A0301 HLA‐A1101 HLA‐A2601 HLA‐B0702 HLA‐B0801 HLA‐B1501 HLA‐B3901 HLA‐B3902 HLA‐B5101 |
HLA‐DRB1 0101 HLA‐DRB1 0301 HLA‐DRB1 0401 HLA‐DRB1 0404 HLA‐DRB1 0701 HLA‐DRB1 1101 HLA‐DRB1 1501 |
| 291‐325 | CALDPLSETKCTLKSFTVEKGIYQTSNFRVQPTES | 32.24 | 2‐28 | 18‐32 |
HLA‐A0301 HLA‐A1101 HLA‐A2402 HLA‐A2601 HLA‐B5801 |
HLA‐DRB1 0101 HLA‐DRB1 0401 HLA‐DRB1 0701 HLA‐DRB1 1101 HLA‐DRB1 1501 |
| 15‐44 | CVNLTTRTQLPPAYTNSFTRGVYYPDKVFR | 31.38 | 5‐24 | 5‐26 |
HLA‐A0101 HLA‐A0201 HLA‐A0301 HLA‐A2601 HLA‐B0702 HLA‐B1501 HLA‐B1516 HLA‐B2705 |
HLA‐DRB1 0101 HLA‐DRB1 0401 HLA‐DRB1 0701 |
|
|
| 15.48 | 1‐16 | 6‐13 | HLA‐A2402 | HLA‐DRB1 0301 HLA‐DRB1 0701 |
| 307‐323 | TVEKGIYQTSNFRVQPT | 36.14 | 1‐16 | 6‐14 |
HLA‐A0301 HLA‐A2402 HLA‐B5801 |
HLA‐DRB1 0701 HLA‐DRB1 1101 HLA‐DRB1 1501 |
| 410‐426 | IAPGQTGKIADYNYKLP | 8.61 | 2‐17 | 6‐13 |
HLA‐A0201 HLA‐B0702 HLA‐B1501 HLA‐B5101 | N/A |
| 195‐226 | KNIDGYFKIYSKHTPINLVRDLPQGFSALEPL | 30.24 | 3‐22 | 8‐28 |
HLA‐A0101 HLA‐A0201 HLA‐A2601 HLA‐B0702 HLA‐B0801 HLA‐B1501 HLA‐B3501 HLA‐B3901 HLA‐B5802 |
HLA‐DRB1 0101 HLA‐DRB1 0401 HLA‐DRB1 0404 HLA‐DRB1 0701 HLA‐DRB1 1101 HLA‐DRB1 1501 HLA‐DQA1 0301 HLA‐DQB1 0302 |
| 690‐706 | QSIIAYTMSLGAENSVA | 30.17 | 1‐16 | 6‐13 |
HLA‐A0201 HLA‐A2601 HLA‐B0702 HLA‐B0801 HLA‐B1501 HLA‐B1516 HLA‐B3901 HLA‐B5801 |
HLA‐DRB1 0101 HLA‐DRB1 0401 HLA‐DRB1 0402 HLA‐DRB1 0701 HLA‐DRB1 1101 HLA‐DRB1 1501 |
| 533‐549 | LVKNKCVNFNFNGLTGT | 19.67 | 4‐17 | 8‐13 | HLA‐B0801 HLA‐B1501 | HLA‐DRB1 0404 HLA‐DRB1 1501 |
| 683‐699 | RARSVASQSIIAYTMSL | N/A | 2‐15 | 6‐13 |
HLA‐A0101 HLA‐A0201 HLA‐A2601 HLA‐B0702 HLA‐B0801 HLA‐B1501 HLA‐B1516 HLA‐B2705 HLA‐B3901 HLA‐B4001 HLA‐B5801 HLA‐B5802 | HLA‐DRB1 0101 HLA‐DRB1 0301 |
| 754‐770 | LQYGSFCTQLNRALTGI | 25.03 | 2‐15 | 0 |
HLA‐A0201 HLA‐A0301 HLA‐A1101 HLA‐B1501 HLA‐A2402 HLA‐B2705 HLA‐B3901 |
HLA‐DRB1 0101 HLA‐DRB1 0301 HLA‐DRB1 0401 HLA‐DRB1 0701 |
| 902‐918 | MAYRFNGIGVTQNVLYE | 12.31 | 2‐15 | 6‐13 | HLA‐B2705 |
HLA‐DRB1 0401 HLA‐DRB1 0402 HLA‐DRB1 0404 HLA‐DRB1 1501 |
|
|
| 12.19 | 2‐17 | 10, 12‐13 |
HLA‐A2402 HLA‐A2601 HLA‐B0801 HLA‐B1501 HLA‐B2705 HLA‐B3901 HLA‐B5801 |
HLA‐DRB1 0101 HLA‐DRB1 0401 HLA‐DRB1 0402 HLA‐DRB1 1501 |
| 1213‐1229 | PWYIWLGFIAGLIAIVM | N/A | 1‐16 | 0 |
HLA‐A0201 HLA‐A2402 HLA‐A2601 HLA‐B3901 |
HLA‐DRB1 0101 HLA‐DRB1 0301 HLA‐DRB1 0401 HLA‐DRB1 0402 HLA‐DRB1 0701 HLA‐DRB1 1101 HLA‐DRB1 1501 |
| 1019‐1035 | RASANLAATKMSECVLG | 14.31 | 2‐17 | 8‐13 |
HLA‐A0301 HLA‐A1101 HLA‐B5801 | HLA‐DRB1 0101 |
| 530‐544 | STNLVKNKCVNFNFN | 28.43 | 3‐14 | 6‐11 | HLA‐B0801 HLA‐B1501 | HLA‐DRB1 0301 |
|
|
| 24.57 | 3‐16 | 8‐13 |
HLA‐A0101 HLA‐A0301 HLA‐A2402 HLA‐B1501 HLA‐B3901 HLA‐B5801 | HLA‐DRB1 0401 HLA‐DRB1 1501 |
| 747‐763 | TECSNLLLQYGSFCTQL | 26.54 | 1‐16 | 7‐13 |
HLA‐A0101 HLA‐A2402 HLA‐B0801 HLA‐B1501 HLA‐B3901 | HLA‐DRB1 0101 HLA‐DRB1 1501 |
| 1129‐1145 | VIGIVNNTVYDPLQPEL | 37.57 | 1‐16 | 5‐13 | HLA‐A0201 HLA‐A2402 | HLA‐DRB1 0701 HLA‐DRB1 0801 |
| 891‐906 | GAALQIPFAMQMAYRF | 9.67 | 1‐14 | 0 |
HLA‐A2402 HLA‐A2601 HLA‐B0801 HLA‐B1501 HLA‐B3501 HLA‐B3901 HLA‐B4001 HLA‐B5301 HLA‐B5801 |
HLA‐DRB1 0101 HLA‐DRB1 0401 HLA‐DRB1 0404 HLA‐DRB1 0701 HLA‐DRB1 1501 |
| 902‐917 | MAYRFNGIGVTQNVLY | 11.38 | 2‐15 | 5‐12 | HLA‐B2705 |
HLA‐DRB1 0401 HLA‐DRB1 0402 HLA‐DRB1 0404 HLA‐DRB1 1501 |
| 1101‐1115 | HWFVTQRNFYEPQII | 23.55 | 2‐13 | 5‐11 |
HLA‐A2402 HLA‐A2601 HLA‐B2705 | HLA‐DRB1 0801 |
| 194‐210 | FKNIDGYFKIYSKHTPI | 22.72 | 1‐16 | 9‐13 |
HLA‐A0101 HLA‐A0201 HLA‐B0702 HLA‐B0801 HLA‐B3901 |
HLA‐DRB1 1501 HLA‐DQA1 0301 HLA‐DQB1 0302 |
| 722‐739 | VTTEILPVSMTKTSVDCT | 13.03 | 1‐16 | 9‐14 |
HLA‐A0101 HLA‐A0301 HLA‐B1516 HLA‐B3901 | HLA‐DRB1 0101 HLA‐DRB1 0701 |
Note: In the table, the underlined epitopes overlaped with CR3022 epitope published in Science by Yuan et al. The bolded epitopes were consistent with the corresponding epitope sequences in IEDB database. The prediction results of T‐cell epitope and MHC binding combined the results of three analysis tools NetMHC 4.0 Sever, Rankpep and SYFPEITHI. N/A meant undetectable. HLA, human lymphocyte antigen; SOA, solvent accessibility.
Figure 2Schematic diagram of the tandem sequence of 11 vaccine candidate peptides and sequence analysis after connecting with different linkers. (A) Schematic diagram of tandem sequences of 11 vaccine candidate peptides. Peptides 897‐913 were predicted possible cross‐immunoprotection for SARS‐CoV and SARS‐CoV‐2. Peptides 371‐387 and 747‐771 contained easily mutated sites. (B) Amino acid composition, secondary structure composition, and solvent accessibility analysis of different vaccine candidate sequences
Figure 3Analysis of flexibility and antigenic index of different vaccine candidate sequences