| Literature DB >> 28584450 |
Sathish Sankar1, Mageshbabu Ramamurthy1, Balaji Nandagopal1, Gopalan Sridharan1.
Abstract
Hantavirus cardiopulmonary syndrome in North America is caused by Sin Nombre virus (SNV) and poses a public health problem. We identified T-cell epitopes restricted to HLA alleles commonly seen in the N. American population. Nucleocapsid (N) protein is 428 aminoacid in length and binds to RNA and functions also as a key molecule between virus and host cell processes. The predicted epitopes from N protein that bind to class I MHC were analyzed for human proteasomes cleavage, TAP efficiency, immunogenicity and antigenicity. We identified 8 epitopes through MHC binding prediction, proteasomal cleavage prediction and TAP efficiency. Epitope VMGVIGFSF had highest Vaxijen score and the epitope, TNRAYFITR had highest immunogenicity score. Epitope AAVSALETK and TIACGLFPA had 100% homology to many HCPS causing viruses. Our study focused on T-cell epitope prediction specific to restricted HLA haplotypes of racial groups in North America for the potential vaccine development. Among the candidate epitopes, FLAARCPFL was conserved in SNV, which is suitable for vaccine specific to the virus genotype. Peptide-based vaccines can be designed to include multiple determinants from several hantavirus genotypes, or multiple epitopes from the same genotype. Thereby, immune response will focus solely on relevant epitopes, avoiding non-protective responses or immune evasion. The other advantages include absence of infectious material unlike in live or attenuated vaccines. There is no risk of reversion or formation of adverse reassortants leading to virulence and no risk of genetic integration or recombination forming a rationale for vaccine design including for distinct geographical regions.Entities:
Keywords: Hantaviruses; MHC; Sin Nombre;; T cell epitopes; nucleocapsid
Year: 2017 PMID: 28584450 PMCID: PMC5450251 DOI: 10.6026/97320630013094
Source DB: PubMed Journal: Bioinformation ISSN: 0973-2063
List of T-cell epitopes with strong binding affinity to MHC Class I alleles (SB: strong binders)
| HLA types | Epitopes (SB) | |||||||||
| HLA-A | ||||||||||
| A*01:01 | HLKEKSSLRY | TADWKSIGLY | QLDQKIIILY | |||||||
| A*02:01 | YILSFALPII | MGVIGFSFFV | ALYVAGMPEL | GLYILSFAL | YILSFALPI | ILSFALPII | TIACGLFPA | GVIGFSFFV | FLAARCPFL | |
| A*02:03 | ALYVAGMPEL | YMSHWGREAV | ILSFALPII | HLYVSMPTA | TIACGLFPA | NIISPVMGV | FLAARCPFL | YMSHWGREA | ||
| A*02:06 | YILSFALPII | RTIACGLFPA | MGVIGFSFFV | YILSFALPI | IILKALYML | IACGLFPAQ | NIISPVMGV | GVIGFSFFV | FLAARCPFL | |
| A*02:07 | IGLYILSFAL | YILSFALPII | ILSFALPIIL | MGVIGFSFFV | DFLAARCPFL | ALYVAGMPEL | GLYILSFAL | YILSFALPI | ILSFALPII | IILKALYML |
| GVIGFSFFV | FLAARCPFL | |||||||||
| A*03:01 | LIAAQKLASK | LSFALPIILK | SMPTAQSTMK | GVIGFSFFVK | VIGFSFFVK | KLKKKSAFY | ||||
| A*11:01 | LSFALPIILK | GVIGFSFFVK | ATNRAYFITR | KSAFYQSYLR | QSMGIQLDQK | AAVSALETK | VIGFSFFVK | SAFYQSYLR | ||
| A*23:01 | LYILSFALPI | RFRTIACGLF | VMGVIGFSF | LYVAGMPEL | SYLRRTQSM | |||||
| A*24:02 | LYILSFALPI | RFRTIACGLF | KDWMERIDDF | VMGVIGFSF | ATPHSVWVF | LYVAGMPEL | SYLRRTQSM | DAALATNRAY | ||
| A*25:01 | DAALATNRAY | NIISPVMGV | FVKDWMERI | ESATIFADI | DIATPHSVW | NTIMASKSV | ||||
| A*26:01 | EVQDNITLH | FVKDWMERI | ||||||||
| A*29:02 | - | - | - | - | - | - | - | - | - | - |
| A*30:01 | GIRKPRHLYV | RTIACGLFPA | VKARNIISPV | KARNIISPVM | QSRRAAVSA | KSSLRYGNV | STRGRQTIK | RFRTIACGL | KARNIISPV | |
| A*30:02 | TADWKSIGLY | RIRFKDDSSY | GIRKPRHLY | AALATNRAY | KLKKKSAFY | |||||
| A*31:01 | SFFVKDWMER | ATNRAYFITR | AFFAILQDMR | KSAFYQSYLR | SAFYQSYLRR | IILYMSHWGR | HLKEKSSLR | KALYMLSTR | LYMLSTRGR | RIDDFLAAR |
| TNRAYFITR | SAFYQSYLR | AFYQSYLRR | ILYMSHWGR | |||||||
| A*32:01 | KSIGLYILSF | GLYILSFAL | YILSFALPI | IILKALYML | VMGVIGFSF | RAYFITRQL | KSAFYQSYL | RTQSMGIQL | ||
| A*33:03 | SFFVKDWMER | KSAFYQSYLR | SAFYQSYLRR | IILYMSHWGR | HLKEKSSLR | LYMLSTRGR | EVNGIRKPR | FFVKDWMER | TNRAYFITR | FFAILQDMR |
| SAFYQSYLR | ILYMSHWGR | |||||||||
| A*34:02 | LIAAQKLASK | LSFALPIILK | GVIGFSFFVK | DMRNTIMASK | SAFYQSYLRR | QSMGIQLDQK | IAAQKLASK | SAFYQSYLR | ||
| A*68:01 | ELADLIAAQK | GVIGFSFFVK | KSAFYQSYLR | SAFYQSYLRR | EVNGIRKPR | WVFACAPDR | SAFYQSYLR | |||
| A*68:02 | QTADWKSIGL | MGVIGFSFFV | ESATIFADIA | TIACGLFPA | NIISPVMGV | GVIGFSFFV | ESATIFADI | HSVWVFACA | NTIMASKSV | MSHWGREAV |
| A*74:01 | ALYMLSTRGR | GLFPAQVKAR | GVIGFSFFVK | ATNRAYFITR | KSAFYQSYLR | SAFYQSYLRR | IILYMSHWGR | KALYMLSTR | VIGFSFFVK | RIDDFLAAR |
| SAFYQSYLR | ILYMSHWGR | |||||||||
| HLA-B | ||||||||||
| B*07:02 | RKPRHLYVSM | KPRHLYVSMP | TPGRFRTIAC | KARNIISPVM | KDPRDAALAT | APDRCPPTAL | MPELGAFFAI | KPRHLYVSM | TPGRFRTIA | KARNIISPV |
| B*08:01 | MLSTRGRQTI | ILQDMRNTIM | TLQSRRAAV | KPRHLYVSM | FLAARCPFL | SYLRRTQSM | ||||
| B*13:01 | AESATIFADI | MPELGAFFAI | RELAQTLVDI | KEVQDNITL | YILSFALPI | PELGAFFAI | REAVNHFHL | REISNQEPL | ||
| B*14:02 | NRAYFITRQL | SRRAAVSAL | DHLKEKSSL | SYLRRTQSM | ||||||
| B*15:01 | KSIGLYILSF | VMGVIGFSFF | ALATNRAYF | |||||||
| B*15:02 | AALATNRAY | |||||||||
| B*15:03 | IRFKDDSSY | VMGVIGFSF | AALATNRAY | ARAESATIF | LQDMRNTIM | KKSAFYQSY | ||||
| B*18:01 | DDFLAARCPF | MPELGAFFAI | HEQQLVTAR | |||||||
| B*35:01 | DAALATNRAY | MPELGAFFAI | LPIILKALY | SPVMGVIGF | AALATNRAY | MPELGAFFA | ||||
| B*38:02 | MPELGAFFAI | WKSIGLYIL | NHFHLGDDM | |||||||
| B*40:01 | LKEVQDNITL | AESATIFADI | MPELGAFFAI | GREAVNHFHL | RELAQTLVDI | VREISNQEPL | KEVQDNITL | PELGAFFAI | REAVNHFHL | REISNQEPL |
| B*40:02 | AESATIFADI | MPELGAFFAI | RELAQTLVDI | KEVQDNITL | RELADLIAA | PELGAFFAI | REAVNHFHL | REISNQEPL | ||
| B*42:01 | LPIILKALYM | RKPRHLYVSM | TPGRFRTIAC | FPAQVKARNI | APDRCPPTAL | MPELGAFFAI | KPRHLYVSM | TPGRFRTIA | ||
| B*44:02 | EEPSGQTADW | KENKGTRIRF | AESATIFADI | ADIATPHSVW | MPELGAFFAI | |||||
| B*44:03 | EEPSGQTADW | AESATIFADI | ADIATPHSVW | MPELGAFFAI | ||||||
| B*45:01 | KRELADLIAA | MERIDDFLAA | AESATIFADI | MPELGAFFAI | RELADLIAA | MERIDDFLA | AESATIFAD | |||
| B*46:01 | FALPIILKAL | VSMPTAQSTM | YILSFALPI | MGVIGFSFF | FSFFVKDWM | |||||
| B*49:01 | AESATIFADI | MPELGAFFAI | RELAQTLVDI | REAVNHFHL | ||||||
| B*51:01 | FPAQVKARNI | MPELGAFFAI | YILSFALPI | LATNRAYFI | IATPHSVWV | |||||
| B*52:01 | MPELGAFFAI | RELAQTLVDI | YILSFALPI | LSFALPIIL | RAYFITRQL | IQLDQKIII | ||||
| B*53:01 | LPIILKALYM | FPAQVKARNI | MPELGAFFAI | EPSGQTADW | LPIILKALY | SPVMGVIGF | ||||
| B*54:01 | MPTAQSTMKA | SPVMGVIGFS | ATPHSVWVFA | TPHSVWVFAC | MPELGAFFAI | FALPIILKA | TPHSVWVFA | CPPTALYVA | MPELGAFFA | |
| B*55:02 | MPTAQSTMKA | ATPHSVWVFA | TPHSVWVFAC | MPELGAFFAI | TPGRFRTIA | TPHSVWVFA | CPPTALYVA | MPELGAFFA | ||
| B*57:01 | KSIGLYILSF | IGFSFFVKDW | IATPHSVWVF | KIIILYMSHW | KSAFYQSYL | IIILYMSHW | ||||
| B*58:01 | KSIGLYILSF | IATPHSVWVF | KIIILYMSHW | LSFALPIIL | KSAFYQSYL | |||||
| HLA-C | ||||||||||
| C*01:02 | SSLRYGNVL | VLDVNSIDL | TADWKSIGL | YILSFALPI | SMPTAQSTM | ITPGRFRTI | FLAARCPFL | RAYFITRQL | KSAFYQSYL | RTQSMGIQL |
| C*02:02 | FADIATPHSV | LSFALPIIL | IISPVMGVI | FSFFVKDWM | FVKDWMERI | RAYFITRQL | IATPHSVWV | KSAFYQSYL | MSHWGREAV | |
| C*03:02 | FALPIILKAL | YILSFALPI | LSFALPIIL | FSFFVKDWM | AALATNRAY | RAYFITRQL | MSHWGREAV | |||
| C*03:03 | FALPIILKAL | FADIATPHSV | SSLRYGNVL | YILSFALPI | LSFALPIIL | RAYFITRQL | IATPHSVWV | MSHWGREAV | ||
| C*03:04 | FALPIILKAL | FADIATPHSV | SSLRYGNVL | YILSFALPI | LSFALPIIL | RAYFITRQL | IATPHSVWV | MSHWGREAV | ||
| C*04:01 | VLDVNSIDL | FRTIACGLF | WMERIDDFL | FLAARCPFL | GMPELGAFF | LQDMRNTIM | RTQSMGIQL | |||
| C*05:01 | LADLIAAQKL | KADEITPGRF | FADIATPHSV | VLDVNSIDL | TADWKSIGL | WMERIDDFL | LQDMRNTIM | KSAFYQSYL | RTQSMGIQL | LGDDMDPEL |
| C*06:02 | NRAYFITRQL | YLRRTQSMGI | SRRAAVSAL | LRYGNVLDV | IRKPRHLYV | FRTIACGLF | ARNIISPVM | FVKDWMERI | RAYFITRQL | YFITRQLQV |
| ARAESATIF | SYLRRTQSM | LRRTQSMGI | ||||||||
| C*07:01 | NRAYFITRQL | YLRRTQSMGI | SRRAAVSAL | LRYGNVLDV | IRKPRHLYV | FRTIACGLF | ARNIISPVM | RAYFITRQL | YFITRQLQV | ARAESATIF |
| SYLRRTQSM | LRRTQSMGI | |||||||||
| C*07:02 | SRRAAVSAL | LRYGNVLDV | IRKPRHLYV | FRTIACGLF | ARNIISPVM | RAYFITRQL | YFITRQLQV | ARAESATIF | LYVAGMPEL | SYLRRTQSM |
| C*08:01 | FALPIILKAL | FADIATPHSV | TADWKSIGL | YILSFALPI | LSFALPIIL | FALPIILKA | FSFFVKDWM | WMERIDDFL | RAYFITRQL | IATPHSVWV |
| MSHWGREAV | ||||||||||
| C*08:02 | LADLIAAQKL | FADIATPHSV | VLDVNSIDL | TADWKSIGL | WMERIDDFL | LQDMRNTIM | MSHWGREAV | LGDDMDPEL | ||
| C*12:03 | FADIATPHSV | YILSFALPI | LSFALPIIL | FALPIILKA | SSYEEVNGI | KARNIISPV | FSFFVKDWM | FVKDWMERI | LATNRAYFI | RAYFITRQL |
| IATPHSVWV | MSHWGREAV | |||||||||
| C*14:02 | SRRAAVSAL | SMPTAQSTM | RFRTIACGL | RAYFITRQL | YFITRQLQV | LYVAGMPEL | AFFAILQDM | SYLRRTQSM | ||
| C*16:01 | YILSFALPI | LSFALPIIL | FSFFVKDWM | RAYFITRQL | KSAFYQSYL | MSHWGREAV | ||||
| C*17:01 | FADIATPHSV | TADWKSIGL | LSFALPIIL | IISPVMGVI | WMERIDDFL | FLAARCPFL | LATNRAYFI | RAYFITRQL | KVSDIEDLI | IATPHSVWV |
| KSAFYQSYL | RTQSMGIQL | MSHWGREAV | ||||||||
| C*18:01 | SRRAAVSAL | IRKPRHLYV | FRTIACGLF | ARNIISPVM | FLAARCPFL | YFITRQLQV | ARAESATIF | LQDMRNTIM | SYLRRTQSM | |
List of Class I MHC T-cell epitopes with their predicted proteasome cleavage, TAP efficiency, antigenicity and immunogenicity
| List of epitopes | Peptide Rank | Start Position | Proteasome score* | TAP Score and its predicted affinity* | Vaxijen score* | Immunogenicity score* |
| AAVSALETK | 286 | 49 | 0.5009 | 3.907 (Intermediate) | 1.5281 (Probable antigen) | -0.01823 |
| FLAARCPFL | 218 | 239 | 0.9323 | 4.846 (Intermediate) | 1.2043 (Probable antigen) | 0.11076 |
| KARNIISPV | 291 | 211 | 0.6363 | 3.89 (Intermediate) | 0.0035 (Probable non-antigen) | 0.11907 |
| LYVAGMPEL | 236 | 319 | 1 | 4.602 (Intermediate) | 0.3162 (Probable non-antigen) | -0.03039 |
| QSRRAAVSA | 174 | 45 | 0.5548 | 5.81 (Intermediate) | 0.8992 (Probable antigen) | 0.08199 |
| TIACGLFPA | 52 | 200 | 0.5077 | 8.202 (High) | 0.5097 (Probable antigen) | 0.07333 |
| TNRAYFITR | 299 | 260 | 0.5741 | 3.863 (Intermediate) | 0.6425 (Probable antigen) | 0.29777 |
| VMGVIGFSF | 240 | 219 | 0.5015 | 4.523 (Intermediate) | 1.8515 (Probable antigen) | 0.21618 |
Figure 23D structure of SNV N protein generated by I-TASSER program