| Literature DB >> 33422682 |
Clayton Fernando Rencilin1, Joseph Christina Rosy1, Manikandan Mohan2, Richard Coico3, Krishnan Sundar4.
Abstract
An immunoinformatics-based approach was used to identify potential multivalent subunit CTL vaccine candidates for SARS-CoV-2. Criteria for computational screening included antigen processing, antigenicity, allergenicity, and toxicity. A total of 2604 epitopes were found to be strong binders to MHC class I molecules when analyzed using IEDB tools. Further testing for antigen processing yielded 826 peptides of which 451 were 9-mers that were analyzed for potential antigenicity. Antigenic properties were predicted for 102 of the 451 peptides. Further assessment for potential allergenicity and toxicity narrowed the number of candidate CTL epitopes to 50 peptide sequences, 45 of which were present in all strains of SARS-CoV-2 that were tested. The predicted CTL epitopes were then tested to eliminate those with MHC class II binding potential, a property that could induce hyperinflammatory responses mediated by TH2 cells in immunized hosts. Eighteen of the 50 epitopes did not show class II binding potential. To our knowledge this is the first comprehensive analysis on the proteome of SARS-CoV-2 for prediction of CTL epitopes lacking binding properties that could stimulate unwanted TH2 responses. Future studies will be needed to assess these epitopes as multivalent subunit vaccine candidates which stimulate protective CTL responses against SARS-COV-2.Entities:
Keywords: CTL epitopes; Epitope mapping; SARS-CoV-2
Year: 2021 PMID: 33422682 PMCID: PMC7836868 DOI: 10.1016/j.meegid.2021.104712
Source DB: PubMed Journal: Infect Genet Evol ISSN: 1567-1348 Impact factor: 3.342
Fig. 1Illustration of systematic reverse vaccinology work flow for prediction of SARS-CoV-2 CTL epitopes.
Prediction of potential CTL epitopes for nine HLA alleles.
| HLA A*0101 | 2 | 103 | 112 | 34 | 251 |
| HLA A*0201 | 3 | 124 | 104 | 14 | 245 |
| HLA A*0206 | 0 | 145 | 102 | 3 | 250 |
| HLA A*0301 | 1 | 126 | 127 | 17 | 271 |
| HLA A*1101 | 5 | 164 | 133 | 55 | 357 |
| HLA A*2402 | 1 | 136 | 122 | 21 | 280 |
| HLA A*2601 | 1 | 146 | 96 | 1 | 244 |
| HLA A*3101 | 21 | 94 | 73 | 3 | 191 |
| HLA A*3303 | 42 | 288 | 156 | 29 | 515 |
| Total number of peptides | |||||
Fig. 2MHC binding analysis of peptides of SARS-CoV-2.
Fig. 3MHC allele specific distribution of SARS-CoV-2 predicted peptides.
Immunogenicity and antigenicity prediction of putative CTL epitopes of SARS-CoV-2.
| Envelope protein | 8 | 0 | |||
| 9 | 9 | 81.82 | 7 | 3 | |
| 10 | 2 | ||||
| 11 | 0 | ||||
| Membrane glycoprotein | 8 | 2 | |||
| 9 | 27 | 58.70 | 23 | 7 | |
| 10 | 16 | ||||
| 11 | 1 | ||||
| Nucleocapsid phosphoprotein | 8 | 0 | |||
| 9 | 9 | 52.94 | 6 | 2 | |
| 10 | 7 | ||||
| 11 | 1 | ||||
| ORF10 protein | 8 | 0 | |||
| 9 | 2 | 40 | 0 | 0 | |
| 10 | 3 | ||||
| 11 | 0 | ||||
| ORF1ab polyprotein | 8 | 5 | |||
| 9 | 328 | 55.03 | 182 | 72 | |
| 10 | 248 | ||||
| 11 | 15 | ||||
| ORF3a protein | 8 | 0 | |||
| 9 | 16 | 50.00 | 10 | 3 | |
| 10 | 14 | ||||
| 11 | 2 | ||||
| ORF6 protein | 8 | 0 | |||
| 9 | 1 | 50.00 | 1 | 1 | |
| 10 | 1 | ||||
| 11 | 0 | ||||
| ORF7a protein | 8 | 0 | |||
| 9 | 7 | 70.00 | 2 | 1 | |
| 10 | 3 | ||||
| 11 | 0 | ||||
| ORF7b | 8 | 0 | |||
| 9 | 3 | 42.86 | 3 | 2 | |
| 10 | 4 | ||||
| 11 | 0 | ||||
| ORF8 protein | 8 | 0 | |||
| 9 | 7 | 63.63 | 3 | 2 | |
| 10 | 4 | ||||
| 11 | 0 | ||||
| Surface glycoprotein | 8 | 0 | |||
| 9 | 42 | 47.19 | 23 | 9 | |
| 10 | 41 | ||||
| 11 | 6 |
Fig. 4Radar graph indicating the presence of predicted epitopes in 10 selected strains of SARS-CoV-2. The graph illustrates the ten SARS-CoV-2 strains as concentric rings in the following order from the center to outer circles: MT123291.1, MT066176.1, MT152824.1, MT019533.1, MT007544.1, MN985325.1, MT039873.1, MN997409.1, MT126808.1, LC528233.1 respectively.” First four-letter amino acid sequences are shown for each of the 9-mer peptides listed in Supplementary Table 2.
Validation of predicted epitopes using molecular docking analysis.
| 1 | FLAFVVFLL | −7.5 | −7.1 | −7.9 | −7.2 | −9.3 |
| 2 | FVVFLLVTL | −8 | −7.1 | −7.5 | −8.8 | −9.4 |
| 3 | SVLLFLAFV | −8.1 | −7.9 | −7.4 | −7.8 | −8.2 |
| 4 | FVLAAVYRI | −7.7 | −7.7 | −6.5 | −6.6 | −8.2 |
| 5 | SFRLFARTR | −7.4 | −7 | −7.5 | −7 | −8 |
| 6 | FLFLTWICL | −8 | −7.4 | −8 | NB | −9.4 |
| 7 | LWPVTLACF | −7.9 | −7.1 | −7.5 | −8 | −8.6 |
| 8 | SYFIASFRL | −7.6 | −7.5 | −8.2 | −7.3 | −9.9 |
| 9 | LSPRWYFYY | −7.4 | −7.2 | −7.1 | −8.4 | −9.8 |
| 10 | FLARGIVFM | −5.5 | −6.6 | −6.9 | −6.2 | −7.1 |
| 11 | LAYYFMRFR | −6.7 | −6.7 | −7.3 | −7.9 | −8.8 |
| 12 | LSYGIATVR | −6.4 | −6.7 | −6.3 | −6.9 | −7.9 |
| 13 | TQWSLFFFL | −8.2 | −8.8 | −8.2 | −8.4 | −10 |
| 14 | VMVELVAEL | −6 | −7.4 | −6.4 | −6.9 | −6.9 |
| 15 | WLMWLIINL | −7.2 | −5.6 | −7.2 | −6.9 | −8.7 |
| 16 | EFTPFDVVR | −8.1 | −6.6 | −8.5 | −8.8 | −7.9 |
| 17 | EIAIILASF | −7.3 | −5.8 | −7 | −7.5 | −8.7 |
| 18 | FAIGLALYY | −6 | −7.8 | −8 | −8.9 | −9.8 |
| 19 | FLAYILFTR | −7.5 | −6.8 | −7 | −8.5 | −8.4 |
| 20 | FVVEVVDKY | −7 | −7.3 | −7.4 | −7.3 | −8.7 |
| 21 | HLYLQYIRK | −7.7 | −7.5 | −7.6 | −7.8 | −9.1 |
| 22 | HSIGFDYVY | −9.2 | −7 | −8.3 | −7.7 | −9.4 |
| 23 | ILHCANFNV | −7.2 | −7.6 | −7.4 | −7.4 | −8.8 |
| 24 | IYLYLTFYL | −7.8 | −7.1 | −7.9 | −7.4 | −9.3 |
| 25 | KLNVGDYFV | NB | NB | NB | NB | NB |
| 26 | KSAGFPFNK | −8.1 | −7.3 | −8.2 | −7.7 | −8.6 |
| 27 | LQLGFSTGV | −7 | −6 | −6.9 | −7 | −8.4 |
| 28 | LYENAFLPF | −7.4 | −8.1 | −7.1 | −8.6 | −8.4 |
| 29 | MFLARGIVF | −6.2 | −7.3 | −7.3 | −6.8 | −9.2 |
| 30 | NLSDRVVFV | −7.2 | −6.6 | −7.7 | −6.9 | −8.5 |
| 31 | NTVIWDYKR | −6.7 | −6.5 | −8 | −7.8 | −7.7 |
| 32 | RLIIRENNR | NB | NB | NB | NB | NB |
| 33 | RNYVFTGYR | −8 | −7 | −7.8 | −8.1 | −8.3 |
| 34 | RQLLFVVEV | −7.4 | −7.3 | −7.1 | −8 | −7.5 |
| 35 | SQLGGLHLL | −7 | −6.7 | −6.6 | −6.7 | −7.7 |
| 36 | STQWSLFFF | −9.2 | −7.1 | −7.6 | −6.6 | −9.6 |
| 37 | SVIYLYLTF | −8 | −6.9 | −6.8 | −7.6 | −8.3 |
| 38 | VIYLYLTFY | −7.4 | −7.4 | −8 | −8 | −9.9 |
| 39 | FLQSINFVR | −7.5 | −6.3 | −7.5 | −7.6 | −9.5 |
| 40 | WLIVGVALL | −7.1 | −7.3 | −7.1 | −6.7 | −8.5 |
| 41 | FQVTIAEIL | −6.8 | −6.9 | −7.3 | −7.9 | −8 |
| 42 | ILFLALITL | −6.9 | −6.7 | −7.2 | −8 | −8 |
| 43 | FLAFLLFLV | −7.1 | −6.4 | −7.3 | −6.8 | −8.7 |
| 44 | LIDFYLCFL | −8.1 | −6.8 | −8.3 | −7.7 | −7.8 |
| 45 | YIDIGNYTV | −7.2 | −7.2 | −7.2 | −7.9 | −7.8 |
| 46 | KWYIRVGAR | −7 | −7.3 | −7.1 | −8 | −7.9 |
| 47 | WTAGAAAYY | −6.1 | −7.3 | −7.3 | −6.9 | −6.6 |
| 48 | FTISVTTEI | −6.9 | −6.8 | −7.3 | −7.5 | −8.7 |
| 49 | PYRVVVLSF | NB | NB | NB | NB | NB |
| 50 | VVFLHVTYV | −6.9 | −7.2 | −7.3 | −8.1 | −8.1 |
NB - Ligand not binding to the target MHC allele at the peptide binding groove.
Fig. 5Molecular and 2D ligand interaction maps of predicted epitopes TQWS (a, c) and SYFI (b, d) with HLA allele A*2402.
Predicted non-binders to MHC Class II.
| 1 | LSPRWYFYY | Nucleocapsid phosphoprotein | HLA-A*01:01 |
| 2 | FLARGIVFM | nsp6 | HLA-A*02:01; HLA-A*02:06 |
| 3 | MFLARGIVF | nsp6 | HLA-A*24:02 |
| 4 | EFTPFDVVR | 3C-like proteinase | HLA-A*33:03 |
| 5 | FAIGLALYY | nsp13 | HLA-A*26:01 |
| 6 | RNYVFTGYR | nsp13 | HLA-A*31:01 |
| 7 | FVVEVVDKY | nsp12 | HLA-A*26:01 |
| 8 | ILHCANFNV | nsp12 | HLA-A*02:01 |
| 9 | KSAGFPFNK | nsp12 | HLA-A*11:01 |
| 10 | RQLLFVVEV | nsp12 | HLA-A*02:06 |
| 11 | HSIGFDYVY | nsp14 | HLA-A*26:01 |
| 12 | LQLGFSTGV | nsp14 | HLA-A*02:06 |
| 13 | NLSDRVVFV | nsp14 | HLA-A*02:01 |
| 14 | NTVIWDYKR | nsp15 | HLA-A*33:03 |
| 15 | SQLGGLHLL | nsp15 | HLA-A*02:06 |
| 16 | WLIVGVALL | ORF3a protein | HLA-A*02:01 |
| 17 | YIDIGNYTV | ORF8 protein | HLA-A*02:01; HLA-A*02:06 |
| 18 | VVFLHVTYV | Surface glycoprotein | HLA-A*02:06 |