| Literature DB >> 34960205 |
Marsia Gustiananda1, Bobby Prabowo Sulistyo1, David Agustriawan2, Sita Andarini3.
Abstract
SARS-CoV-2 and its variants caused the COVID-19 pandemic. Vaccines that target conserved regions of SARS-CoV-2 and stimulate protective T-cell responses are important for reducing symptoms and limiting the infection. Seven cytotoxic (CTL) and five helper T-cells (HTL) epitopes from ORF1ab were identified using NetCTLpan and NetMHCIIpan algorithms, respectively. These epitopes were generated from ORF1ab regions that are evolutionary stable as reflected by zero Shannon's entropy and are presented by 56 human leukocyte antigen (HLA) Class I and 22 HLA Class II, ensuring good coverage for the Indonesian and world population. Having fulfilled other criteria such as immunogenicity, IFNγ inducing ability, and non-homology to human and microbiome peptides, the epitopes were assembled into a vaccine construct (VC) together with β-defensin as adjuvant and appropriate linkers. The VC was shown to have good physicochemical characteristics and capability of inducing CTL as well as HTL responses, which stem from the engagement of the vaccine with toll-like receptor 4 (TLR4) as revealed by docking simulations. The most promiscuous peptide 899WSMATYYLF907 was shown via docking simulation to interact well with HLA-A*24:07, the most predominant allele in Indonesia. The data presented here will contribute to the in vitro study of T-cell epitope mapping and vaccine design in Indonesia.Entities:
Keywords: HLA-A*24:07; SARS-CoV-2; T-cell epitopes; cytotoxic T-cells; helper T-cells; human leukocyte antigen; immunoinformatics; multi-epitope peptide-based vaccine
Year: 2021 PMID: 34960205 PMCID: PMC8704007 DOI: 10.3390/vaccines9121459
Source DB: PubMed Journal: Vaccines (Basel) ISSN: 2076-393X
Figure 1Flowchart indicating the immunoinformatics methods used in this study to identify the CTL and HTL epitopes from SARS-CoV-2 ORF1ab polyprotein.
The number of ORF1ab from SARS-CoV-2 variants included in the study. The sequences were retrieved from NCBI Virus SARS-CoV-2 Data Hub on 22 September 2021 in FASTA format. All sequences were of complete length between 7093 and 7096 amino acids and contained no ambiguous amino acid characters. The delta sequences were selected only from isolates collected between 31 July and 22 September 2021.
| SARS-CoV-2 Variants | Number of Isolates |
|---|---|
| Alpha (B.1.1.7) | 158 |
| Beta (B.1.351) | 374 |
| Delta (B.1.617.2) | 1157 |
| Eta (B.1.525) | 436 |
| Gamma (P.1) | 9 |
| Iota (B.1.526) | 24 |
| Kappa (B.1.617.1) | 148 |
| Lambda (C.7) | 286 |
| Mu (B.1.621) | 18 |
Figure 2Entropy values of SARS-CoV-2 ORF1ab protein 9-mers. Blue line denotes the entropy values for 9-mer in the corresponding center position. Yellow vertical lines in the background depicts the occurrence of 0 entropy.
The number of SARS-CoV-2 immunogenic epitopes (T-cell assay positive) as reported in IEDB. The percentage of immunogenic epitopes over the possible number of 9-mer peptides generated by a protein was calculated. The number in bold indicates the percentage of the immunogenic epitopes over the total number of epitopes. IEDB is accessed on 16 September 2021.
| Protein | Size (aa) | Number of Immunogenic Epitopes | ||
|---|---|---|---|---|
| Reported in IEDB | % Immunogenic Epitopes Per Protein | % Immunogenic Epitopes Per Total Reported in IEDB | ||
| ORF1ab | 7096 | 678 | 9.6 | 38.4 |
| Spike | 1273 | 578 | 4.5 | 32.7 |
| ORF3a | 275 | 88 | 32.0 | 5 |
| Envelope | 75 | 13 | 17.3 | 0.7 |
| Membrane | 222 | 131 | 59.0 | 7.4 |
| ORF6 | 61 | 18 | 29.5 | 1.0 |
| ORF7a | 121 | 28 | 23.1 | 1.6 |
| ORF7b | 43 | 3 | 7.0 | 0.2 |
| ORF8 | 121 | 37 | 30.6 | 2.1 |
| Nucleocapsid | 419 | 185 | 44.2 | 0.5 |
| ORF10 | 38 | 8 | 21.0 | 0.5 |
| Total epitopes | 1767 | |||
Figure 3Indonesian HLA alleles and frequency. There were 56 HLA Class I and 22 HLA Class II alleles included in the study. The allele frequency in the Indonesian population (red bar) was compared to those in Thailand (Green bar) and Germany (Blue bar).
The number of experimentally known T-cell epitopes associated with HLA alleles of Indonesia (INA), Thailand (THA), and Germany (GER). The HLA alleles listed are predominant in the Indonesian and Thai populations (allele frequency ≥ 5%). The HLA alleles of the German population are the ones included in a study of T-cell responses to SARS-CoV-2 [25]. The number of immunogenic T-cell epitopes associated with the alleles were extracted from IEDB (accessed on 16 September 2021).
| HLA alleles | Populations | ORF1ab T-Cell Epitopes | SARS-CoV-2 T-Cell Epitopes | % ORF1ab/SARS-CoV-2 Epitopes in T-Cell Assay | ||||
|---|---|---|---|---|---|---|---|---|
| Total | T-Cell Assay | HLA Assay | Total | T-Cell Assay | HLA Assay | |||
| A*01:01 | GER | 54 | 48 | 12 | 96 | 85 | 21 | 56.47 |
| A*02:01 | GER INA | 138 | 82 | 86 | 224 | 156 | 126 | 52.56 |
| A*02:03 | INA THA | 0 | 0 | 0 | 0 | 0 | 0 | |
| A*02:07 | THA | 0 | 0 | 0 | 0 | 0 | 0 | |
| A*03:01 | GER | 42 | 17 | 33 | 69 | 37 | 45 | 45.95 |
| A*11:01 | GER INA THA | 49 | 19 | 39 | 69 | 33 | 48 | 57.58 |
| A*24:02 | GER INA THA | 64 | 45 | 29 | 129 | 100 | 47 | 45.00 |
| A*24:07 | INA THA | 0 | 0 | 0 | 0 | 0 | 0 | |
| A*33:03 | INA THA | 0 | 0 | 0 | 0 | 0 | 0 | |
| A*34:01 | INA | 0 | 0 | 0 | 0 | 0 | 0 | |
| B*07:02 | GER | 38 | 34 | 4 | 81 | 72 | 13 | 47.22 |
| B*08:01 | GER | 26 | 25 | 1 | 56 | 52 | 4 | 48.08 |
| B*13:01 | THA | 0 | 0 | 0 | 1 | 1 | 0 | 0.00 |
| B*15:01 | GER | 34 | 29 | 5 | 56 | 44 | 12 | 65.91 |
| B*15:02 | INA THA | 0 | 0 | 0 | 0 | 0 | 0 | |
| B*15:13 | INA | 0 | 0 | 0 | 0 | 0 | 0 | |
| B*15:21 | INA | 0 | 0 | 0 | 0 | 0 | 0 | |
| B*18:01 | INA THA | 1 | 0 | 1 | 3 | 0 | 3 | |
| B*35:05 | INA | n.a. | n.a. | n.a. | n.a. | n.a. | n.a. | |
| B*38:02 | INA | 0 | 0 | 0 | 0 | 0 | 0 | |
| B*40:01 | GER THA | 41 | 18 | 28 | 67 | 33 | 41 | 54.55 |
| B*44:03 | INA THA | 11 | 11 | 0 | 25 | 25 | 0 | 44.00 |
| B*46:01 | THA | 0 | 0 | 0 | 0 | 0 | 0 | |
| B*58:01 | INA THA | 6 | 0 | 6 | 14 | 0 | 14 | |
| DRB1*01:01 | GER | 8 | 4 | 7 | 61 | 8 | 59 | 50.00 |
| DRB1*03:01 | GER THA | 1 | 0 | 1 | 28 | 20 | 15 | 0.00 |
| DRB1*04:01 | GER | 24 | 2 | 24 | 156 | 5 | 156 | 40.00 |
| DRB1*04:05 | THA | 1 | 0 | 1 | 39 | 0 | 39 | |
| DRB1*07:01 | GER INA THA | 9 | 9 | 1 | 63 | 34 | 40 | 26.47 |
| DRB1*09:01 | THA | 1 | 0 | 1 | 38 | 0 | 38 | |
| DRB1*11:01 | GER INA | 1 | 0 | 1 | 42 | 12 | 32 | 0.00 |
| DRB1*12:02 | INA THA | 0 | 0 | 0 | 3 | 3 | 0 | 0.00 |
| DRB1*14:54 | THA | 0 | 0 | 0 | 0 | 0 | 0 | |
| DRB1*15:01 | GER INA THA | 17 | 14 | 7 | 83 | 50 | 58 | 28.00 |
| DRB1*15:02 | INA THA | 2 | 2 | 0 | 10 | 10 | 0 | 20.00 |
| DRB1*16:02 | INA THA | 0 | 0 | 0 | 8 | 8 | 0 | 0.00 |
Figure 4The number of predicted CTL epitopes that are presented by the HLA Class I allele. NetCTLpan 1.1 predicted 1132 9-mer peptides that bind to at least 1 HLA Class I alleles with the percentile rank of less than 1%.
List of 9-mer peptides from ORF1ab predicted as promiscuous CTL epitopes.
| Start Residue | Peptide | HLA Class I Alleles | Immunogenicity Score |
|---|---|---|---|
| 295 | FMGRIRSVY | HLA-A*01:01, HLA-A*29:01, HLA-B*15:01, HLA-B*15:02, HLA-B*15:12, HLA-B*15:13, HLA-B*15:21, HLA-B*15:25, HLA-B*15:32, HLA-B*35:01, HLA-B*35:05, HLA-B*35:30, HLA-B*46:01 | 0.1259 |
| 541 | RVVRSIFSR | HLA-A*03:01, HLA-A*11:01, HLA-A*11:04, HLA-A*33:03, HLA-A*74:01 | 0.0318 |
| 611 | WLTNIFGTV | HLA-A*02:01, HLA-A*02:03 | 0.2972 |
| 806 | MVTNNTFTL | HLA-A*02:06, HLA-A*34:01, HLA-B*35:02, HLA-B*35:30, HLA-B*56:01, HLA-B*56:02, HLA-B*46:01 | 0.1578 |
| 899 | WSMATYYLF b | HLA-A*01:01, HLA-A*24:02, HLA-A*24:07, HLA-A*24:10, HLA-A*29:01, HLA-A*32:01, HLA-B*13:01, HLA-B*15:02, HLA-B*15:12, HLA-B*15:13, HLA-B*15:17, HLA-B*15:21, HLA-B*15:25, HLA-B*15:32, HLA-B*18:01, HLA-B*18:02, HLA-B*35:01, HLA-B*35:05, HLA-B*35:30, HLA-B*52:01, HLA-B*56:07, HLA-B*57:01, HLA-B*58:01, HLA-B*46:01 | 0.0071 |
| 1055 | VVVNAANVY a | HLA-A*26:01, HLA-B*15:01, HLA-B*15:02, HLA-B*15:12, HLA-B*15:21, HLA-B*15:25, HLA-B*15:32, HLA-B*35:01, HLA-B*46:01 | 0.1005 |
| 1140 | HEVLLAPLL c | HLA-B*13:01, HLA-B*18:01, HLA-B*18:02, HLA-B*37:01, HLA-B*38:02, HLA-B*40:01, HLA-B*40:02, HLA-B*40:06, HLA-B*41:01, HLA-B*44:03 | 0.0124 |
| 1247 | FLTENLLLY b | HLA-A*01:01, HLA-A*26:01, HLA-A*29:01 | 0.0808 |
| 1254 | LYIDINGNL | HLA-A*24:02, HLA-A*24:07, HLA-A*24:10 | 0.2138 |
| 1269 | LVSDIDITF a | HLA-B*15:02, HLA-B*15:13, HLA-B*15:17, HLA-B*15:21, HLA-B*35:01, HLA-B*35:02, HLA-B*35:05, HLA-B*35:30, HLA-B*57:01, HLA-B*58:01, HLA-B*46:01 | 0.2541 |
| 1366 | ILGTVSWNL b | HLA-A*02:01, HLA-A*02:07 | 0.1177 |
| 1674 | YLATALLTL a,b | HLA-A*02:01, HLA-A*02:03, HLA-A*02:06, HLA-A*02:07, HLA-B*46:01 | 0.0927 |
| 2175 | LLQLCTFTR | HLA-A*33:03, HLA-A*74:01 | 0.0568 |
| 2327 | FLAYILFTR | HLA-A*33:03, HLA-A*74:01 | 0.2496 |
| 2331 | ILFTRFFYV a,b | HLA-A*02:01, HLA-A*02:03, HLA-A*02:06, HLA-A*74:01, HLA-B*08:01, HLA-A*02:07 | 0.3343 |
| 2350 | FSYFAVHFI | HLA-B*51:01, HLA-B*51:02, HLA-B*52:01 | 0.2893 |
| 2597 | FSSTFNVPM | HLA-B*15:10, HLA-B*15:21, HLA-B*35:01, HLA-B*35:05, HLA-B*35:30, HLA-B*56:02, HLA-B*46:01 | 0.1216 |
| 2629 | LSTFISAAR | HLA-A*33:03, HLA-A*34:01, HLA-A*74:01 | 0.1602 |
| 2784 | AIFYLITPV b,c | HLA-A*02:01, HLA-A*02:03, HLA-A*02:06, HLA-A*34:01, HLA-A*02:07 | 0.1750 |
| 2786 | FYLITPVHV a | HLA-A*24:02, HLA-A*24:07, HLA-A*24:10 | 0.2114 |
| 2787 | YLITPVHVM a | HLA-A*02:01, HLA-A*02:03, HLA-A*02:06, HLA-A*26:01, HLA-B*15:01, HLA-B*15:02, HLA-B*15:10, HLA-B*15:12, HLA-B*15:21, HLA-B*15:25, HLA-B*15:32, HLA-B*35:01, HLA-A*02:07, HLA-B*46:01 | 0.1617 |
| 2883 | FLPRVFSAV a,b | HLA-A*02:01, HLA-A*02:03, HLA-A*02:06, HLA-B*08:01, HLA-A*02:07 | 0.0821 |
| 3059 | LAYYFMRFR a | HLA-A*33:03, HLA-A*74:01 | 0.0559 |
| 3060 | AYYFMRFRR | HLA-A*33:03, HLA-A*74:01 | 0.1234 |
| 3076 | VVAFNTLLF | HLA-A*24:07, HLA-A*29:01 | 0.1449 |
| 3121 | FLAHIQWMV a,b | HLA-A*02:01, HLA-A*02:03, HLA-A*02:06, HLA-A*02:07 | 0.1502 |
| 3137 | FWITIAYII d | HLA-A*24:02, HLA-A*24:07, HLA-A*24:10 | 0.3233 |
| 3152 | FYWFFSNYL | HLA-A*24:02, HLA-A*24:07, HLA-A*24:10 | 0.1404 |
| 3466 | VLAWLYAAV a,b | HLA-A*02:01, HLA-A*02:03, HLA-A*02:06, HLA-A*02:07 | 0.2772 |
| 3481 | FLNRFTTTL a,b | HLA-A*02:01, HLA-A*02:03, HLA-A*02:06, HLA-B*08:01, HLA-A*02:07, HLA-B*46:01 | 0.2560 |
| 3582 | LLLTILTSL b,c | HLA-A*02:01, HLA-A*02:03, HLA-A*02:06, HLA-B*08:01, HLA-A*02:07 | 0.0907 |
| 3605 | LYENAFLPF | HLA-A*24:02, HLA-A*24:07, HLA-A*24:10 | 0.1584 |
| 3652 | VYMPASWVM a,b | HLA-A*24:02, HLA-A*24:07, HLA-A*24:10 | 0.0253 |
| 3684 | YASAVVLLI a,c | HLA-B*51:01, HLA-B*51:02, HLA-B*52:01, HLA-B*56:07, HLA-B*58:01 | 0.0489 |
| 3692 | ILMTARTVY a | HLA-A*29:01, HLA-B*15:01, HLA-B*15:02, HLA-B*15:12, HLA-B*15:21, HLA-B*15:25, HLA-B*15:32, HLA-B*35:05, HLA-B*35:30, HLA-B*46:01 | 0.1258 |
| 3752 | FLARGIVFM a,b,c | HLA-A*02:01, HLA-A*02:03, HLA-A*02:06, HLA-A*02:07 | 0.3263 |
| 4030 | TMLFTMLRK b | HLA-A*03:01, HLA-A*11:01, HLA-A*11:04, HLA-A*74:01 | 0.0076 |
| 4265 | VLSFCAFAV b | HLA-A*02:01, HLA-A*02:07 | 0.1701 |
| 4513 | YTMADLVYA b | HLA-A*02:01, HLA-A*02:06, HLA-A*02:07 | 0.0262 |
| 4656 | YIKWDLLKY | HLA-A*01:01, HLA-A*26:01, HLA-A*29:01, HLA-B*15:02, HLA-B*15:12, HLA-B*15:21, HLA-B*46:01 | 0.0287 |
| 4698 | ILHCANFNV a | HLA-A*02:01, HLA-A*02:03, HLA-A*02:06, HLA-A*02:07 | 0.0833 |
| 4723 | KIFVDGVPF | HLA-A*32:01, HLA-B*15:01, HLA-B*15:02, HLA-B*15:25, HLA-B*15:32 | 0.1614 |
| 4846 | YYRYNLPTM | HLA-A*24:02, HLA-A*24:10 | 0.0097 |
| 4862 | FVVEVVDKY a | HLA-A*26:01, HLA-A*29:01, HLA-A*34:01, HLA-B*15:21, HLA-B*35:01, HLA-B*35:30, HLA-B*46:01 | 0.0859 |
| 5024 | MASLVLARK a | HLA-A*03:01, HLA-A*11:01, HLA-A*11:04, HLA-A*30:01, HLA-A*33:03, HLA-A*34:01, HLA-A*74:01 | 0.0282 |
| 5132 | FVNEFYAYL a | HLA-A*02:01, HLA-A*02:03, HLA-A*02:06, HLA-A*26:01, HLA-A*34:01, HLA-A*02:07, HLA-B*46:01 | 0.2400 |
| 5245 | LMIERFVSL a | HLA-A*02:01, HLA-A*02:03, HLA-A*02:06, HLA-A*32:01, HLA-B*08:01, HLA-B*15:01, HLA-B*15:02, HLA-B*15:10, HLA-B*15:12, HLA-B*15:21, HLA-B*15:25, HLA-B*15:32, HLA-B*35:02, HLA-B*37:01, HLA-B*38:02, HLA-B*48:01, HLA-A*02:07, HLA-B*46:01 | 0.2427 |
| 5247 | IERFVSLAI | HLA-B*13:01, HLA-B*37:01, HLA-B*40:01, HLA-B*40:02, HLA-B*40:06, HLA-B*41:01, HLA-B*44:03, HLA-B*52:01 | 0.0326 |
| 5250 | FVSLAIDAY | HLA-A*01:01, HLA-A*26:01, HLA-A*29:01, HLA-A*34:01, HLA-B*15:02, HLA-B*15:21, HLA-B*35:01, HLA-B*35:05, HLA-B*35:30, HLA-B*46:01 | 0.1401 |
| 5273 | HLYLQYIRK b | HLA-A*03:01, HLA-A*11:01, HLA-A*11:04, HLA-A*74:01 | 0.0139 |
| 5614 | FAIGLALYY a,c | HLA-A*01:01, HLA-A*26:01, HLA-A*29:01, HLA-B*15:13, HLA-B*15:21, HLA-B*35:01, HLA-B*35:05, HLA-B*35:30, HLA-B*58:01, HLA-B*46:01 | 0.0918 |
| 5678 | YVFCTVNAL a | HLA-A*02:01, HLA-A*02:03, HLA-A*02:06, HLA-A*26:01, HLA-A*34:01, HLA-B*07:02, HLA-B*07:05, HLA-B*15:02, HLA-B*15:10, HLA-B*15:21, HLA-B*35:01, HLA-B*35:02, HLA-B*35:05, HLA-B*35:30, HLA-B*38:02, HLA-B*48:01, HLA-B*56:01, HLA-B*56:02, HLA-A*02:07, HLA-B*46:01 | 0.0778 |
| 6070 | FKHLIPLMY | HLA-A*29:01, HLA-B*18:02 | 0.0065 |
| 6108 | VLWAHGFEL a | HLA-A*02:01, HLA-A*02:06, HLA-A*02:07 | 0.3320 |
| 6506 | FELWAKRNI | HLA-B*40:01, HLA-B*40:02, HLA-B*40:06, HLA-B*41:01 | 0.0943 |
| 6585 | FRNARNGVL | HLA-B*15:10, HLA-B*27:06 | 0.1343 |
| 6700 | HLLIGLAKR | HLA-A*33:03, HLA-A*74:01 | 0.0599 |
| 6714 | FELEDFIPM b | HLA-B*13:01, HLA-B*15:10, HLA-B*18:01, HLA-B*18:02, HLA-B*37:01, HLA-B*38:02, HLA-B*40:01, HLA-B*40:02, HLA-B*40:06, HLA-B*41:01, HLA-B*44:03, HLA-B*48:01 | 0.3348 |
| 6748 | LLLDDFVEI a,b,c | HLA-A*02:01, HLA-A*02:03, HLA-A*02:06, HLA-B*52:01, HLA-A*02:07 | 0.2439 |
| 6848 | CQYLNTLTL | HLA-B*13:01, HLA-B*15:10, HLA-B*27:06, HLA-B*37:01, HLA-B*38:02, HLA-B*48:01, HLA-B*52:01 | 0.0312 |
| 6850 | YLNTLTLAV a,b | HLA-A*02:01, HLA-A*02:03, HLA-A*02:06, HLA-A*02:07 | 0.0762 |
| 6885 | WLPTGTLLV | HLA-A*02:01, HLA-A*02:03, HLA-A*02:06, HLA-A*02:07 | 0.0892 |
| 6978 | YKLMGHFAW | HLA-B*18:01, HLA-B*18:02 | 0.0048 |
| 7019 | YVMHANYIF a | HLA-A*24:02, HLA-A*24:07, HLA-B*15:02, HLA-B*15:13, HLA-B*15:21, HLA-B*35:01, HLA-B*35:05, HLA-B*35:30, HLA-B*56:02, HLA-B*46:01 | 0.0822 |
| 7026 | IFWRNTNPI | HLA-A*24:02, HLA-A*24:07, HLA-A*24:10 | 0.1423 |
a The peptide has been experimentally proven by T-cell assay and reported in IEDB; b The peptide has been experimentally proven by HLA binding and reported in IEDB; c Peptide has some degree of homology with human self-peptide; d The peptide existed only in 9.61% of delta variant isolates, whereas the rest had the mutant peptide 3137SWITIAYII3145.
Figure 5The number of predicted HTL epitopes that are presented by the HLA-DRB1 allele. NetHLAIIpan4.0 predicted 792 peptides bind to at least 1 HLA Class II with a strong binding affinity (≤1% percentile rank).
List of 15-mer peptides predicted as promiscuous HTL epitopes and their IFNγ score.
| Start Residue | Epitope Sequence | HLA DRB1 Alleles | IFNγ Score |
|---|---|---|---|
| 447 | NDNLLEILQKEKVNI | DRB1*12:02, DRB1*14:01, DRB1*14:04, DRB1*14:05, DRB1*14:54, | 0.1311 |
| 448 | DNLLEILQKEKVNIN | DRB1*12:02, DRB1*14:01, DRB1*14:04, DRB1*14:05, DRB1*14:54, | 0.0556 |
| 554 | TAQNSVRVLQKAAIT | DRB1*12:02, DRB1*14:01, DRB1*14:04, DRB1*14:05, DRB1*14:54, | 0.0684 |
| 736 | PKEIIFLEGETLPTE | DRB1*01:01, DRB1*12:02, DRB1*15:01, DRB1*15:02, DRB1*16:02, | 0.0771 |
| 1054 | PTVVVNAANVYLKHG | DRB1*13:02, DRB1*14:01, DRB1*14:04, DRB1*14:54, DRB1*15:01, DRB1*15:02, DRB1*16:02 | 0.0917 |
| 1187 | VSSFLEMKSEKQVEQ | DRB1*04:01, DRB1*04:03, DRB1*04:05, DRB1*04:06, DRB1*10:01, | 0.0899 |
| 1211 | VKPFITESKPSVEQR | DRB1*08:03, DRB1*11:01, DRB1*13:02, DRB1*14:05, DRB1*14:07, | 0.3157 |
| 1349 | CKSAFYILPSIISNE | DRB1*01:01, DRB1*04:01, DRB1*04:05, DRB1*08:03, DRB1*10:01, DRB1*11:01, DRB1*15:02, DRB1*16:02, | 0.2898 |
| 1350 | KSAFYILPSIISNEK a | DRB1*01:01, DRB1*04:01, DRB1*04:03, DRB1*04:05, DRB1*04:06, DRB1*08:03, DRB1*10:01, DRB1*11:01, DRB1*12:02, DRB1*15:02, DRB1*16:02, | 0.3378 |
| 1355 | ILPSIISNEKQEILG | DRB1*13:02, DRB1*14:01, DRB1*14:04, DRB1*14:05, DRB1*14:07, DRB1*14:54, | 0.4244 |
| 1356 | LPSIISNEKQEILGT | DRB1*13:02, DRB1*14:01, DRB1*14:04, DRB1*14:05, DRB1*14:07, DRB1*14:54, | 0.3025 |
| 1357 | PSIISNEKQEILGTV | DRB1*13:02, DRB1*14:01, DRB1*14:04, DRB1*14:05, DRB1*14:54, | 0.5074 |
| 2944 | AYESLRPDTRYVLMD | DRB1*03:01, DRB1*14:01, DRB1*14:04, DRB1*14:05, DRB1*14:54, | 0.3078 |
| 2945 | YESLRPDTRYVLMDG | DRB1*03:01, DRB1*13:02, DRB1*14:01, DRB1*14:04, DRB1*14:05, DRB1*14:07, DRB1*14:54, | 0.1649 |
| 2958 | DGSIIQFPNTYLEGS | DRB1*04:02, DRB1*13:02, DRB1*15:01, DRB1*15:02, DRB1*16:02, | 0.2103 |
| 3815 | VSTQEFRYMNSQGLL | DRB1*01:01, DRB1*07:01, DRB1*09:01, DRB1*15:02, DRB1*16:02, | 0.0976 |
| 3944 | IASEFSSLPSYAAFA | DRB1*01:01, DRB1*04:01, DRB1*10:01, DRB1*15:02, DRB1*16:02, | 0.0754 |
| 3945 | ASEFSSLPSYAAFAT | DRB1*01:01, DRB1*04:01, DRB1*10:01, DRB1*15:02, DRB1*16:02, | 0.3973 |
| 3951 | LPSYAAFATAQEAYE | DRB1*04:01, DRB1*04:03, DRB1*04:05, DRB1*04:06, DRB1*08:03, | 0.0518 |
| 4457 | LIDSYFVVKRHTFSN | DRB1*08:03, DRB1*11:01, DRB1*13:02, DRB1*14:01, DRB1*14:04, DRB1*14:07, DRB1*14:54, | 0.1304 |
| 4458 | IDSYFVVKRHTFSNY | DRB1*08:03, DRB1*11:01, DRB1*13:02, DRB1*14:01, DRB1*14:04, DRB1*14:07, DRB1*14:54, | 0.1870 |
| 4560 | NPDILRVYANLGERV | DRB1*04:02, DRB1*08:03, DRB1*12:02, DRB1*15:01, DRB1*15:02, DRB1*16:02, | 0.2299 |
| 4561 | PDILRVYANLGERVR a | DRB1*04:02, DRB1*08:03, DRB1*13:02, DRB1*15:01, DRB1*15:02, DRB1*16:02, | 0.2616 |
| 4761 | KELLVYAADPAMHAA | DRB1*04:01, DRB1*04:02, DRB1*15:01, DRB1*15:02, DRB1*16:02, | 0.2258 |
| 4830 | KHFFFAQDGNAAISD | DRB1*01:01, DRB1*04:01, DRB1*10:01, DRB1*14:07, DRB1*16:02, | 0.4401 |
| 4933 | QMNLKYAISAKNRAR | DRB1*08:03, DRB1*10:01, DRB1*11:01, DRB1*13:02, DRB1*14:05, DRB1*14:07, | 0.4044 |
| 4934 | MNLKYAISAKNRART | DRB1*08:03, DRB1*10:01, DRB1*11:01, DRB1*13:02, DRB1*14:05, DRB1*14:07, | 0.4019 |
| 4935 | NLKYAISAKNRARTV | DRB1*08:03, DRB1*11:01, DRB1*13:02, DRB1*14:05, DRB1*14:07, | 0.5938 |
| 5019 | PNMLRIMASLVLARK a | DRB1*01:01, DRB1*12:02, DRB1*14:04, DRB1*15:01, DRB1*15:02, DRB1*16:02, | 0.3914 |
| 5717 | AKHYVYIGDPAQLPA | DRB1*04:01, DRB1*04:03, DRB1*04:05, DRB1*04:06, DRB1*08:03, DRB1*10:01, DRB1*16:02, | 0.1673 |
| 5775 | TVSALVYDNKLKAHK | DRB1*03:01, DRB1*11:01, DRB1*13:02, DRB1*14:01, DRB1*14:04, DRB1*14:05, DRB1*14:07, DRB1*14:54, | 0.3517 |
| 5776 | VSALVYDNKLKAHKD a | DRB1*03:01, DRB1*08:03, DRB1*11:01, DRB1*13:02, DRB1*14:01, DRB1*14:04, DRB1*14:05, DRB1*14:07, DRB1*14:54, | 0.2560 |
| 5777 | SALVYDNKLKAHKDK | DRB1*03:01, DRB1*13:02, DRB1*14:01, DRB1*14:04, DRB1*14:05, DRB1*14:07, DRB1*14:54, | 0.4910 |
| 5834 | VFISPYNSQNAVASK | DRB1*01:01, DRB1*04:01, DRB1*04:02, DRB1*10:01, DRB1*15:01, DRB1*15:02, | 0.2236 |
| 6046 | PTGYVDTPNNTDFSR | DRB1*04:01, DRB1*04:03, DRB1*04:05, DRB1*04:06, DRB1*08:03, | 0.0690 |
| 6454 | LENVAFNVVNKGHFD | DRB1*13:02, DRB1*14:01, DRB1*14:04, DRB1*14:05, DRB1*14:07, DRB1*14:54, | 0.0787 |
| 6726 | TVKNYFITDAQTGSS | DRB1*01:01, DRB1*04:01, DRB1*07:01, DRB1*09:01, DRB1*10:01, DRB1*16:02, | 0.0871 |
| 7075 | KGRLIIRENNRVVIS | DRB1*04:02, DRB1*13:02, DRB1*14:01, DRB1*14:04, DRB1*14:05, DRB1*14:54, DRB1*15:01, DRB1*15:02 | 0.7895 |
| 7076 | GRLIIRENNRVVISS | DRB1*04:02, DRB1*13:02, DRB1*14:01, DRB1*14:04, DRB1*14:05, DRB1*14:54, DRB1*15:01, DRB1*15:02 | 0.7985 |
| 7077 | RLIIRENNRVVISSD | DRB1*13:02, DRB1*14:01, DRB1*14:05, DRB1*14:07, DRB1*14:54, | 0.8026 |
a The peptide has been experimentally proven by T-cell assay and reported in IEDB.
Comparison of binding affinity between the ancestral (FWITIAYII) and mutant (SWITIAYII) peptide. The calculation was made using NetCTLpan 1.1., which also predicted the peptide processing (proteasomal cleavage and TAP transport efficiency) inside the cell.
| Peptide | Allele | HLA | TAP | Cle | Comb | Aff(nM) | %Rank |
|---|---|---|---|---|---|---|---|
| FWITIAYII | HLA-A*24:02 | 0.604 | 0.566 | 0.463 | 0.722 | 250.54 | 0.8 |
| SWITIAYII | HLA-A*24:02 | 0.65 | 0.884 | 0.617 | 0.811 | 35.49 | 0.3 |
| FWITIAYII | HLA-A*24:07 | 0.497 | 0.566 | 0.463 | 0.615 | 220.88 | 0.8 |
| SWITIAYII | HLA-A*24:07 | 0.591 | 0.884 | 0.617 | 0.752 | 40.82 | 0.15 |
| FWITIAYII | HLA-A*24:10 | 0.8 | 0.566 | 0.463 | 0.918 | 61.69 | 0.8 |
| SWITIAYII | HLA-A*24:10 | 0.848 | 0.884 | 0.617 | 1.009 | 14.15 | 0.4 |
SARS-CoV-2 peptides with sequence homologous with human peptides (underlined). The HLA alleles presenting the human peptides as revealed by NetCTLpan analysis are indicated. Some of these peptides were curated in IEDB and had been confirmed by T-cell assay, HLA assay, or both.
| Start | SARS-CoV-2 Peptide | Human Peptides | Human Proteins | HLA Allele Presenting the Human Peptide | IEDB Confirmation Assay |
|---|---|---|---|---|---|
| 1140 | HEVLLAPLL | AEVLLAPLL | HSVI binding protein (AAF76892.1) | HLA-B*37:01, HLA-B*38:02, HLA-B*40:01, HLA-B*40:02, HLA-B*40:06, HLA-B*41:01, HLA-B*44:03, HLA-B*13:01 | n.a. |
| 2784 | AIFYLITPV | AIFYLITLV | olfactory receptor, family 2, subfamily W, member 1, isoform CRA_b (EAX03180.1) | HLA-A*02:01, HLA-A*02:03, HLA-A*02:06 | T-cell assay and HLA assay |
| 3582 | LLLTILTSL | LLLTILTRP | hCG2023968 (EAW49626.1) | non binder | HLA assay |
| 3684 | YASAVVLLI | VASAVVLLG | molybdenum cofactor biosynthesis protein 1 isoform 7 (NP_001345459.1) | non-binder | T-cell assay |
| 3752 | FLARGIVFM | XCARGIVFM | immunoglobulin heavy chain junction region (MOL38621.1) | cannot generate a similar peptide, since the sequence is at the N-terminal end of the protein. | T-cell assay and HLA assay |
| 5614 | FAIGLALYY | SYIGLALYY | immunoglobulin heavy chain junction region (MOJ91547.1) | HLA-A*29:01 | T cell assay |
| 6748 | LLLDDFVEI | IALDDFVEI | Wolfram syndrome 1 (wolframin), isoform CRA_a (EAW82396.1) | HLA-A*02:06, HLA-B*52:01 | T-cell assay and HLA assay |
Seven CTL and five HTL epitopes chosen for the VC and the population coverage. Epitopes fulfilled the criteria such as highest percentile rank, high promiscuity, high immunogenicity, high IFNγ induction ability, conservancy across all variants, low entropy value, and the absence of homology with human peptides.
| Start Residue | Peptide and | HLA Alleles Bind to the Peptides | Population Coverage | |||
|---|---|---|---|---|---|---|
| Indonesia | Thailand | Germany | World | |||
| 899 | WSMATYYLF | HLA-A*01:01, HLA-A*24:02, HLA-A*24:07, HLA-A*24:10, HLA-A*29:01, HLA-A*32:01, HLA-B*13:02, HLA-B*15:02, HLA-B*15:12, HLA-B*15:13, HLA-B*15:17, HLA-B*15:21, HLA-B*15:25, HLA-B*15:32, HLA-B*18:01, HLA-B*18:02, HLA-B*35:01, HLA-B*35:05, HLA-B*35:30, HLA-B*52:01, HLA-B*56:07, HLA-B*57:01, HLA-B*58:01, HLA-B*46:01 | 94.80 | 77.44; | 66.25; | 64.13 |
| 5678 | YVFCTVNAL | HLA-A*02:01, HLA-A*02:03, HLA-A*02:06, HLA-A*26:01, HLA-A*34:01, HLA-B*07:02, HLA-B*07:05, HLA-B*15:02, HLA-B*15:10, HLA-B*15:21, HLA-B*35:01, HLA-B*35:02, HLA-B*35:05, HLA-B*35:30, HLA-B*38:02, HLA-B*48:01, HLA-B*56:01, HLA-B*56:02, HLA-A*02:07, HLA-B*46:01 | 77.39 | 75.05 | 72.07 | 65.66 |
| 5245 | LMIERFVSL | HLA-A*02:01, HLA-A*02:03, HLA-A*02:06, HLA-A*32:01, HLA-B*08:01, HLA-B*15:01, HLA-B*15:02, HLA-B*15:10, HLA-B*15:12, HLA-B*15:21, HLA-B*15:25, HLA-B*15:32, HLA-B*35:02, HLA-B*37:01, HLA-B*38:02, HLA-B*48:01, HLA-A*02:07, HLA-B*46:01 | 63.65 | 74.26 | 71.42 | 63.19 |
| 6714 | FELEDFIPM | HLA-B*13:01, HLA-B*13:02, HLA-B*15:10, HLA-B*18:01, HLA-B*18:02, HLA-B*37:01, HLA-B*38:02, HLA-B*40:01, HLA-B*40:02, HLA-B*40:06, HLA-B*41:01, HLA-B*44:03, HLA-B*48:01 | 51.64 | 46.46 | 35.87 | 35.59 |
| 5024 | MASLVLARK | HLA-A*03:01, HLA-A*11:01, HLA-A*11:04, HLA-A*30:01, HLA-A*33:03, HLA-A*34:01, HLA-A*74:01 | 67.42 | 55.82 | 40.12 | 40.42 |
| 6848 | CQYLNTLTL | HLA-B*13:02, HLA-B*15:10, HLA-B*27:06, HLA-B*37:01, HLA-B*38:02, HLA-B*48:01, HLA-B*52:01 | 21.20 | 20.90 | 10.15 | 13.16 |
| 2350 | FSYFAVHFI | HLA-B*51:01, HLA-B*52:01 | 8.29 | 13.51 | 12.13 | 10.26 |
| 1350 | KSAFYILPSIISNEK | DRB1*01:01, DRB1*04:01, DRB1*04:03, DRB1*04:05, DRB1*04:06, DRB1*08:03, DRB1*10:01, DRB1*11:01, DRB1*12:02, DRB1*15:02, DRB1*16:02 | 91.13 | 74.99 | 47.87 | 47.60 |
| 7076 | GRLIIRENNRVVISS | DRB1*04:02, DRB1*13:02, DRB1*14:01, DRB1*14:04, DRB1*14:05, DRB1*14:54, DRB1*15:01, DRB1*15:02 | 53.28 | 50.97 | 40.57 | 37.72 |
| 7077 | RLIIRENNRVVISSD (0.000; 0.000; 0.000; 0.000; 0.000; 0.000; 0.000) | DRB1*13:02, DRB1*14:01, DRB1*14:05, DRB1*14:07, DRB1*14:54 | 4.18 | 11.27 | 13.43 | 16.78 |
| 2944 | AYESLRPDTRYVLMD (0.068; 0.045; 0.039; 0.0505; 0.027; 0.028; 0.026) | DRB1*03:01, DRB1*14:01, DRB1*14:04, DRB1*14:05, DRB1*14:54 | 10.59 | 23.69 | 25.46 | 27.58 |
| 3815 | VSTQEFRYMNSQGLL (0.000; 0.000; 0.000; 0.000; 0.000; 0.000; 0.129) | DRB1*01:01, DRB1*07:01, DRB1*09:01, DRB1*15:02, DRB1*16:02 | 63.57 | 63.45 | 41.63 | 38.08 |
| Epitope set | 100.00 | 100.00 | 99.98 | 99.88 | ||
* The entropy value in the bracket. For HTL epitopes the entropy values are for seven possible 9-mer core-peptides.
Population coverage of the 12 chosen epitopes. A projected population coverage, average number of epitope hits/HLA combinations recognized by the population, and minimum number of epitope hits/HLA combinations recognized by 90% of the population.
| Population/Area | Class I | Class II | Class Combined | ||||||
|---|---|---|---|---|---|---|---|---|---|
| Coverage a | Average_Hit b | pc90 c | Coverage a | Average_Hit b | pc90 c | Coverage a | Average_Hit b | pc90 c | |
| Germany | 99.75% | 3.89 | 2.54 | 93.25% | 1.81 | 1.09 | 99.98% | 5.7 | 4.03 |
| Indonesia | 100.0% | 5.66 | 4.1 | 99.68% | 2.68 | 1.46 | 100.0% | 8.35 | 6.19 |
| Thailand | 99.76% | 4.82 | 2.9 | 98.69% | 2.63 | 1.48 | 100.0% | 7.45 | 5.08 |
| World | 98.77% | 3.65 | 2.08 | 90.66% | 1.82 | 1.02 | 99.88% | 5.47 | 3.38 |
| Average | 99.57 | 4.5 | 2.9 | 95.57 | 2.23 | 1.26 | 99.97 | 6.74 | 4.67 |
| Standard deviation | 0.47 | 0.8 | 0.75 | 3.75 | 0.42 | 0.21 | 0.05 | 1.2 | 1.07 |
a Projected population coverage; b average number of epitope hits/HLA combinations recognized by the population; c minimum number of epitope hits/HLA combinations recognized by 90% of the population.
Figure 6Amino acid sequences of the VC. Beta defensin (was used as adjuvant, HTL epitopes, and CTL epitopes were connected using linkers EAAAK, GPGPG, and AAG.
Figure 7The immune simulation analysis of the VC: (A) Population of HTL. (B) The population of different states of HTL. (C) population of CTL. (D) Population of different states of CTL. (E) Population of NK cells. (F) Population of different states of dendritic cells. (G) Population of different states of macrophages. (H) Level of cytokines and interleukins produced in responses to vaccine.
Figure 8The secondary and tertiary structure of the VC: (A) Amino acid sequence and position of the secondary structure. (B) The global composition of the secondary structure of the VC. (C) Position of secondary structure within the protein sequence. The secondary structure is color-coded with blue—α-helix, red—extended strand, green—β-turn, and purple—random coil. (D) The propensity of each residue in adopting the secondary structure. (E) Tertiary structure as predicted by RAPTOR X (3D model) of the VC. (F) z-score value of the 3D model of the VC as calculated by ProSAweb is −7.25 (indicated by a black dot), which falls within the range of the z-score for the native proteins of similar size (212 aa).
Figure 9Interaction analysis of the VC and TLR4. (A) The tertiary structure model of the complex between TLR4 (purple) and the VC (red). (B) Diagram of interaction between the VC and TLR4 (red = salt bridges, blue = H-bonds, striped line = non-bonded contacts). (C) Residues involved in forming the complex. (D) Ramachandran plot of the interaction model showing the number of residues in the most favored region and less favored region.
Figure 10The best CABS-dock Modeling with default settings for peptide WSMATYYLF (stick structure in red color) with HLA-A*24:07 (A) and HLA-A*24:02 (B). The contact map between peptide and receptor for HLA-A*24:02 (C) and HLA-A*24:07 (D). CABS-dock server returns 10 top scored models of the protein-peptide complex. The best model prediction had high accuracy with average RMSD of 0.94 Å and 1.64 Å for HLA-A*24:07 and HLA-A*24:02, respectively. Moreover, the best model also had the highest cluster density score. HLA-A*24:07–WSMATYYLF complex had a higher accuracy compare to the HLA-A*24:02–WSMATYYLF complex.