Literature DB >> 25380768

Prediction of T-cell epitopes of hepatitis C virus genotype 5a.

Maemu P Gededzha, M Jeffrey Mphahlele, Selokela G Selabe1.   

Abstract

BACKGROUND: Hepatitis C virus (HCV) is a public health problem with almost 185 million people estimated to be infected worldwide and is one of the leading causes of hepatocellular carcinoma. Currently, there is no vaccine for HCV infection and the current treatment does not clear the infection in all patients. Because of the high diversity of HCV, protective vaccines will have to overcome significant viral antigenic diversities. The objective of this study was to predict T-cell epitopes from HCV genotype 5a sequences.
METHODS: HCV near full-length protein sequences were analyzed to predict T-cell epitopes that bind human leukocyte antigen (HLA) class I and HLA class II in HCV genotype 5a using Propred I and Propred, respectively. The Antigenicity score of all the predicted epitopes were analysed using VaxiJen v2.0. All antigenic predicted epitopes were analysed for conservation using the IEDB database in comparison with 406, 221, 98, 33, 45, 45 randomly selected sequences from each of the HCV genotypes 1a, 1b, 2, 3, 4 and 6 respectively, downloaded from the GenBank. For epitope prediction binding to common HLA alleles found in South Africa, the IEDB epitope analysis tool was used.
RESULTS: A total of 24 and 77 antigenic epitopes that bind HLA class I and HLA class II respectively were predicted. The highest number of HLA class I binding epitopes were predicted within the NS3 (63%), followed by NS5B (21%). For the HLA class II, the highest number of epitopes were predicted in the NS3 (30%) followed by the NS4B (23%) proteins. For conservation analysis, 8 and 31 predicted epitopes were conserved in different genotypes for HLA class I and HLA class II alleles respectively. Several epitopes bind with high affinity for both HLA class I alleles and HLA class II common in South Africa.
CONCLUSION: The predicted conserved T-cell epitopes analysed in this study will contribute towards the future design of HCV vaccine candidates which will avoid variation in genotypes, which in turn will be capable of inducing broad HCV specific immune responses.

Entities:  

Mesh:

Substances:

Year:  2014        PMID: 25380768      PMCID: PMC4289306          DOI: 10.1186/1743-422X-11-187

Source DB:  PubMed          Journal:  Virol J        ISSN: 1743-422X            Impact factor:   4.099


Introduction

Hepatitis C virus (HCV) is estimated to infect 185 million people worldwide [1]. Chronic HCV infection leads to progressive liver disease, being one of the major causes of hepatocellular carcinoma and one of the most common indications for liver transplantation [2]. The World Health Organization (WHO) strongly recommends combination therapy with pegylated interferon and ribavirin for chronically infected patients who qualify for treatment [1]. Recently, two NS3 protease inhibitors (boceprevir and telaprevir) have been approved by the US Food and Drug Administration, and the WHO conditionally (until more evidence has accumulated) recommends that these drugs should be given in combination with pegylated interferon and ribavirin for the treatment of chronic HCV genotype 1 infections. Also, the WHO strongly recommends that sofosbuvir be given in combination with ribavirin alone in patients who cannot tolerate interferon and are chronically infected with genotypes 1, 2, 3 and 4 [1]. However, these therapies are still not affordable in most developing countries. As a result, the development of an effective HCV vaccine is undoubtedly the best solution for the ultimate control of HCV infections, and is a public health priority. Prophylactic vaccines against viral infections are generally aimed at inducing a humoral (B-cell) immune response, while therapeutic vaccines preferably activate both humoral and cellular (T-cell) immune responses [3]. A successful HCV vaccine will need to stimulate both arms of the adaptive immune response, since while both cellular and humoral .immune responses occur in a naturally infected host, the current consensus is that a strong cellular response is vital for viral clearance and protection [4]. The development of an effective HCV vaccine requires an understanding of the host’s adaptive immune response to natural infection. As with other viral infections, viral antigens are presented to CD4+ and CD8+ T-cells via human leukocyte antigen (HLA) class II and class I molecules, respectively [5]. Different HLA class alleles have been found to be associated with HCV infection. For example, HLA-A*11, HLA-Cw*04 and HLA-B*53 have been associated with HCV persistence [6, 7], while HLA-B*27, HLA-A*11:01, HLA-B*57, HLA-Cw*01:02 and HLA-A*03 have been associated with spontaneous HCV clearance [8, 9]. Also, HLA-DRB1*11 and HLA-DQB1*03:01 have been associated with decreased disease severity of HCV infection globally, suggesting they may present HCV-derived epitopes more efficiently to CD4+ T-cells than others and thus capable of viral clearance [10]. During acute HCV infection, development and persistence of strong specific responses by CD8+ and CD4+ T-cells [11, 12] and neutralizing antibodies [13] are associated with viral clearance, with HCV-specific CD8+ and CD4+ T-cells usually being transient or absent in patients who develop persistent infections. The rate of chronic liver disease progression has been shown to be determined by the magnitude of HCV-specific CD4+ T-cell responses, since these cells are essential for both the cellular and humoral responses [14]. CD8+ T-cells are essential for long-term protection against chronic HCV [15], while CD4+ T-cells play a role in viral clearance [16]. HCV infection evades the host’s immune system by generating immune escape variants through alteration of the virus HLA-restricted epitopes to avoid being recognized by T-cells and neutralizing antibodies [14]. Thus effective HCV vaccines will need to target protective epitopes that display minimal cross-genotype amino acid variability as they will provide broad potency [17]. Peptides corresponding to protective epitopes are desirable vaccine candidates because they are easy to construct and produce, and they do not contain infectious materials [18]. The first step in the process of epitope-based vaccine design and development is the in-silico prediction of peptide binding affinities to HLA proteins [19]. Genotype 5 accounts for over 50% of HCV infections in South Africa [20], and is becoming more prevalent in Europe and North America [21]. It is the most conserved of HCV genotypes, being classified into only one subtype (5a) [22-24]. The growing prevalence of HCV genotype 5a in different parts of the world necessitates its molecular characterization in order to improve the formulations of vaccine candidates that are in development. Thus the aim of this study was to assess immunological determinants by predicting conserved epitopes in near-full length HCV genotype 5a sequences using a suite of online programmes to help in the designing of new vaccine candidates.

Results

Prediction of T-cell epitopes

For HLA class I, a total of 24 antigenic epitopes were predicted in the consensus near full-length of genotype 5a (Table 1). Epitope NS31325–1333 covered 30 of the 47 HLA class I alleles that were analysed assuring high binding affinity to different alleles. For conservation analysis with other genotypes, 8 of 24 epitopes were 100% conserved for specific genotypes. Epitopes NS31332–1340 and NS5B2557–2565 were highly conserved in all genotypes analysed, and epitope E2684–692 was conserved in all genotypes except for genotype 2, while 5 other epitopes were conserved in either 2 or 3 genotypes analysed. In addition epitope NS4B1832–1840 was conserved in genotype 1a, 1b, 2 and 4 epitope variants at anchor residues position 2 and 9 while E2677–685, NS31325–1333 and NS31357–1365 were conserved in at least 3 genotypes epitope variants each (Table 1). For HLA class II, 77 epitopes were predicted (Table 2). Epitope NS4B1879–1887 and NS4B1880–1888 covered 51 of 51 HLA class II alleles analysed. For conservation analysis with other genotypes, 31 of 71 epitopes were conserved for specific genotypes. Some epitopes were highly conserved in all genotypes (E2507–515, E2509–517, NS31253–1261, NS31254–1262, NS31327–1335, NS31392–1400, NS4B1916–1924 and NS4B1919–1927), while other epitopes were conserved in at least 1 to 4 of the genotypes analysed. Epitope NS4B1886–1894 was conserved at anchor residues position 1, 4, 6 and 9 in all 6 genotypes epitope variants while E2692–700, NS2964–972, NS31418–1426 and NS41561–1569 were conserved in at least 5 genotypes epitope variants each (Table 2). Epitopes NS31585–1593, NS5A2285–2293 and NS5B2889–2897 were predicted to cover both HLA class I and HLA class II alleles. The highest number of HLA class I binding epitopes were predicted within the NS3 (63%), followed by NS5B (21%), and for the HLA class II, the highest number of epitopes were predicted in the NS3 (30%) followed by the NS4B (23%) proteins (Table 3).
Table 1

HLA class I predicted epitopes of HCV genotype 5a and their antigenicity prediction score, number of allele and conservation (in percentages) in different genotypes

PositionEpitope sequenceNo of alleleAntigenecity scoreGenotype 1aGenotype 1bGenotype 2Genotype 3Genotype 4Genotype 6
E2
651 RCDLEDRDR 92.7428
677 CSFTPTPAL 270.8772CSFTTLPAL (94)CSFTTLPAL (92)CSFTPMPAL (73)
684 ALSTGLIHL # [2527]290.9005 ALSTGLIHL (70) ALSTGLIHL (99) ALSTGLIHL (97) ALSTGLIHL (97) ALSTGLIHL (84) ALSTGLIHL (91)
NS2
861 VPPLQVRGG 61.4312VPPLNVRGG (85)
NS3
1032 YAQQTRGVL 260.7638YAQQTRGLL (99)
1033 AQQTRGVLG 50.7751AQQTRGLLG (99)
1128 ADLYLVTRH 51.0277 SDLYLVTRH (99) SDLYLVTRH (99) VDLYLVTRN (92)
1325 TILGIGTVL 300.7555 SILGIGTVL (93) TILGIGTVL (88) SILGIGTVL (94) TILGIGTVL (82)
1332 VLDQAETAG 50.5004 VLDQAETAG (96) VLDQAETAG (98) VLDQAETAG (96) VLDQAETAG (100) VLDQAETAG (93) VLDQAETAG (71)
1357 TPHPNIEEV 240.8423 VPHPNIEEV (89) VPHPNIEEV (80) VPHSNIEEV (94)
1359 HPNIEEVAL # [2831]241.1654 HPNIEEVAL (88) HPNIEEVAL (76) HSNIEEVAL (94)
1370 EGEIPFYGR 91.0560 TGEIPFYGK (97)
1374 PFYGRAIPL 70.9479PFYGKAIPL (99)PFYGKAIPI (85)PFYGKAIPL (82)PFYGKAIPL (73)
1642 TKYIMACMS 60.7008PFYGKAIPL (99) TKYIMACMS (70) TKYIMACMS (70) TKYIMACMS (93) TKYIMTCMS (87)
NS4B
1712 SASLPYMDE 50.5488
1819 QIAPPTAAT 50.5293QLAAPGAAT (99)QLAPPSAAS (94)QIAPPAGAT (91)QIATPTAST (89)
1832 SGMAGAAVG 50.8750 AGLAGAAIG (81) AGIAGAAVG (93)SGLVGAAVG (98)SGLAGAAIG (76)
1848 LIDILAGYG 50.8431LVDILAGYG (94)LVDILAGYG (95)LLDILAGYG (79)LVDILAGYG (84)
1854 GYGAGVAGA 90.5562 GYGAGVAGA (99) GYGAGVAGA (99) GYGAGVSGA (94) GYGAGVAGA (91) GYGAGVSGA (84)
NS5B
2522 GYGAKEVRS 71.3540GYGAKDVRC (95)GYGAKDVRN (84)GFGAKEVRS (93)GYSAKDVRS (79)
2557 TIMAKNEVF # [32]120.6303 TIMAKNEVF (99) TIMAKNEVF (80) TIMAKNEVF (83) TIMAKNEVF (100) TIMAKNEVF (84) TIMAKNEVF (93)
2568 EPSKGGKKP 61.4218 QPEKGGRKP (93)
2720 LASCRAAKL 280.5264
2886 LHGLSAFSL 110.6732 LHGLSAFSL (99) LHGLSAFSL (98) LHGLSAFTL(82)LHGMAAFSL (98)

#-indicates that the epitope has been experimentally proven to be a true positive.

Bold- indicates percentage of epitope that is 100% conserved in more than 70% of the sequences analysed in each genotype.

Italic- indicates amino acid(s) variation in epitope in comparison to the predicted epitope.

Table 2

HLA class II predicted epitopes of HCV genotype 5a and their antigenicity prediction score, number of allele and conservation (in percentages) in different genotypes

PositionEpitope sequenceNo of alleleAntigenecity scoreGenotype 1aGenotype 1bGenotype 2Genotype 3Genotype 4Genotype 6
E1
320 WDMMMNWSP 101.3610 WDMMMNWSP (99) WDMMMNWSP (97) WDMMMNWSP (97) WDMMMNWSP (93) WDMMMNWSP (93)
E2
507 YCFTPSPVV 61.2112 YCFTPSPVV (93) YCFTPSPVV (98) YCFTPSPVV (88) YCFTPSPVV (100) YCFTPSPVV (91) YCFTPSPVV (91)
509 FTPSPVVVG 171.3553 FTPSPVVVG (94) FTPSPVVVG (98) FTPSPVVVG (88) FTPSPVVVG (100) FTPSPVVVG (89) FTPSPVVVG (73)
665 LLHTTTQWA 50.6616LLLSTTQWQ (100)LLHSTTELA (73)LLLSTTQWQ (73)
666 LHTTTQWAI 250.5863LLTTTQWQI (100)LHSTTELAI (73)
692 LHQNIVDTQ 60.8564LHQNIVDVQ (100)LHQNIVDVQ (73)LHQNIVDVQ (95)LHQNIVDVQ (100)LHQNIVDVQ (87)
739 LLVCQAEAA 220.5506LLISQAEAA (100)LLIAQAEAA (72)LMISQAEAA (73)
P7
803 LPHRALALD 70.6966LPQRAYALD (100)LPPRAYAMD (86)
NS2
860 WVPPLQVRG 71.1998WVPPLNVRG (100)WVPPLLARG (82)
861 VPPLQVRGG 251.4312VPPLNVRGG (100)
864 LQVRGGRDA 200.6368LNVRGGRDA (100)LNVRGGRDA (86)
880 FHPALGFEI 191.4374 VHPALVFDI (99)
892 LLGILGPLY 140.5940LLAVLGPLW (100)LIAVLGPLY (82)
895 ILGPLYLLQ 50.8264 VLGPLWVLQ (99) VLGPLYLIQ (88)
938 LLHLGRLTG 250.5554
964 LRDLAVATE 91.0497LRDLAVAVE (100)LRDLAVAVE (93)LRDLAVAVE (85)LKDLAVATE (73)LRDLAVAVE (93)
997 LAGLPVSAR 110.9120 INGLPVSAR (100)LCGLPVSAR (100)
1025 LLAPITAYA 220.5253 LLAPITAYA (99) LLAPITAYA (79) LLAPITAYA (89)
NS3
1047 LTGRDKNEA 221.5633LTGRDKNQV (100)LTGRDKNQV (93)LTGRDKNVV (90)
1131 YLVTRHADV # [3335]150.7569 YLVTRHADV (99) YLVTRHADV (99) YLVTRNADV (97)
1152 LLSPRPISY 52.2999 LLSPRPISY (99) LLSPRPVSY (73)LLSPRPLST (89)
1153 LSPRPISYL 51.3109 LSPRPISYL (99) LSPRPVSYL (73)LSPRPLSTL (89)
1253 VLNPSVAAT 141.2489 VLNPSVAAT (100) VLNPSVAAT (100) VLNPSVAAT (97) VLNPSVAAT (100) VLNPSVAAT (96) VLNPSVAAT (89)
1254 LNPSVAATL 60.9169 LNPSVAATL (100) LNPSVAATL (100) LNPSVAATL (98) LNPSVAATL (100) LNPSVAATL (95) LNPSVAATL (89)
1258 VAATLGFGA 111.2617 VAATLGFGA (90) VAATLGFGA (85) VAATLGFGA(98)
1262 LGFGAYMSK # [3638]140.8784 LGFGAYMSK (87) LGFGAYMSK (82) LGFGAYMSK(78)
1264 FGAYMSKAY 140.6561FGAYMSKAH (100)FGAYMSKAH (82)
1327 LGIGTVLDQ 420.5607 LGIGTVLDQ (97) LGIGTVLDQ (88) LGIGTVLDQ (98) LGIGTVLDQ (100) LGIGTVLDQ (93) LGIGTVLDQ (96)
1373 IPFYGRAIP 160.5679IPFYGKAIP (100)IPFYGKAIP (93)IPFYGKAIP (81)
1375 FYGRAIPLA 221.0280FYGKAIPLE (100)FYGKAIPIE (81)FYGKAIPLE (71)
1392 IFCHSKKKC 221.5437 IFCHSKKKC (93) IFCHSKKKC (94) IFCHSKKKC (95) IFCHSKKKC (88) IFCHSKKKC (80) IFCHSKKKC (93)
1415 VAYYRGLDV 210.8250 VAYYRGLDV (98) VAYYRGLDV (96) VAYYRGLDV (92) VAYYRGLDV (79) VAYYRGLDV (91) VAFYRGVDV (89)
1418 YRGLDVAVI 511.0522YRGLDVSVI (100)YRGLDVSVI (97)YRGLDVSVI (84)YRGLDVSVI (85)YRGLDVSVI (91)YRGVDVSVI (67)
1482 VSRSQRRGR 122.0649VSRTQRRGR (100) VSRSQRRGR (93) VSRSQRRGR (96) VSRSQRRGR (97) VSRSQRRGR (98) VSRSQRRGR (73)
1561 VFTGLTNID 91.1689VFTGLTHID (100)VFTGLTHID (96)VFTGLTHID (98)VFTGLTHID (97)VFTGLTHID (89)
1583 FPYLVAYQA 240.6584 FPYLVAYQA (85) FAYLVAYQA (78)
1585 YLVAYQATV # [25, 39]170.5020 YLVAYQATV (99) YLVAYQATV (85) YLTAYQATV (70) YLVAYQATV (87) YLVAYQATV (91)
1586 LVAYQATVC 280.6052 LVAYQATVC (99) LVAYQATVC (85) LVAYQATVC (82) LVAYQATVC (91)
1631 VQNEITLTH 351.1979VQNEVTLTH (100)VQNEVTLTH (80)
1641 ITKYIMACM 80.6964 VTKYIMTCM (100) VTKYIATCM (94) ITKYIMACM (75) ITKYIMTCM (87)
1645 IMACMSADL 220.8406IMTCMSADL (100) IMACMSADL (82) IATCMQADL (99) IMACMSADL (70) IMACMSADL (93) IMTCMSADL (87)
NS4B
1733 LGLIGTAGQ 431.2450
1765 MWNFVSGIQ 181.2300MWNFISGIQ (100)MWNFISGIQ (95)MWNFISGIQ (96) MWNFVSGIQ (100) MWNFISGIQ (87)
1766 WNFVSGIQY 220.6246WNFISGIQY (100)WNFISGIQY (95)WNFISGIQY (96) WNFVSGIQY (100) WNFISGIQY (87) WNFVSGIQY (91)
1791 MSFTAAVTS 80.7414MAFTAAVTS (100)MAFTASITS (88)MAFSAALTS (93)MAFTASVTS (92) MSFTAAVTS (93)
1812 LGGWVASQI 160.608LGGWVAAQL (100)LGGWVAAQL (95)LGGWVATHL (88) LGGWVASQI (96)
1849 IDILAGYGA 90.7525 VDILAGYGA (100) VDILAGYGA (97) LDILAGYGA (79) VDILAGYGA (84)
1863 LVAFKIMCG 471.0160LVAFKIMSG (100)LVAFKIMSG (95)LVAFKIMGG (100) VVTFKIMSG (89)LVAFKIMSG (84)
1879 LVNLLPSIL 510.7592LVNLLPAIL (100)LVNLLPAIL (90) MVNLLPAIL (92)LVNLLPAIL (73)
1880 VNLLPSILC 510.9874VNLLPAILS (100)VNLLPAILS (92)VNLLPAILS (94)VNLLPAILS (89)VNLLPALLS (82)
1883 LPSILCPGA 60.7104LPAILSPGA (100)LPAILSPGA (99)LPAILSPGA (74)LPAILSPGA (100)LPAILSPGA (89)LPALLSPGA (84)
1886 ILCPGALVV 90.9365ILSPGALVV (100)ILSPGALVV (99)ILSPGALVV (98)ILSPGALVV (100)ILSPGALVV (91) LLSPGALVV (89)
1892 LVVGVICAA 300.7097LVVGVVCAA (100)LVVGVVCAA (98) LVVGVICAA (97) LVVGVICAA (94) LVVGVVCAA (89)LVVGVVCAA (93)
1893 VVGVICAAV 500.9530VVGVVCAAI (100)VVGVVCAAI (96)VVGVICAAI (98)VVGVICAAI (85)VVGVVCAAI (91)VVGVVCAAI (76)
1896 VICAAVLRR 350.9887VVCAAILRR (100)VVCAAILRR (95)VICAAILRR (97)VICAAILRR (85)VVCAAILRR (91)VVCAAILRR (76)
1897 ICAAVLRRH 121.2316 VCAAILRRH (100) VCAAILRRH (97)ICAAILRRH (97)ICAAILRRH (85) VCAAILRRH (89)
1916 MNRLIAFAS 480.5291 MNRLIAFAS (100) MNRLIAFAS (99) MNRLIAFAS (99) MNRLIAFAS (100) MNRLIAFAS (93) MNRLIAFAS (96)
1919 LIAFASRGN 242.0235 LIAFASRGN (100) LIAFASRGN (99) LIAFASRGN (98) LIAFASRGN (100) LIAFASRGN (93) LIAFASRGN (96)
1922 FASRGNHVS 181.2938 FASRGNHVS (100) FASRGNHVS (98) FASRGNHVA (97) FASRGNHVS (94) FASRGNHVS (82) FASRGNHVS (93)
NS5A
1994 WLQAKLLPQ 331.1603WLKAKLMPQ (100)WLQSKLLPR (84)
2101 YITGVTQDN 61.1358YVTGMTTDN (100)
2105 VTQDNLKCP 61.0178 MTTDNLKCP (100)
2237 MGGNITRVE 201.2656 MGGNITRVE (92) MGGNITRVE (96) MGSNITRVE (91)
2285 LPVWARPGY 90.6143LPIWARPDY (100)LPIWARPDY (78)
NS5B
2422 MSYSWTGAL 90.5749 MSYSWTGAL (76) MSYTWTGAL (94) MSYSWTGAL (97) MSYSWTGAL (85) MSYSWTGAL (91)
2451 LRHHNLVYS 401.2321 LRHHNLVYS (89) LRHHNMVYA (86) LRHHNLVYS (97)
2523 YGAKEVRSL 241.1739YGAKDVRCH (100)YGAKDVRNL (85) FGAKEVRSL (92)YSAKDVRSL (76)
2559 MAKNEVFAV 180.8466MAKNEVFCV (100)MAKNEVFCV (81)MAKNEVFCV (65)MAKNEVFCV (85)MAKNEVFCV (96)
2564 VFAVEPSKG 100.7925VFCVQPEKG (100)VFCVQPEKG (89)
2637 FSYDTRCFD 1115808 FSYDTRCFD (99) FSYDTRCFD (95) FSYDTRCFD (100) FSYDTRCFD (93) FSYDTRCFD (87)
2660 YQSCDLQPE 111.3802YQCCDLDPQ (100)YQCCDLAPE (94)
2696 YRRCRASGV 411.2642 YRRCRASGV (99) YRRCRASGV (99) YRRCRASGV (98) YRRCRASGV (97)
2778 YDLELVTSC 61.0923YDLELITSC (100)YDLELITSC (96)YDLELITSC (98)YDLELITSC (91)YDLELITSC (96)
2850 FSVLQSQEQ 210.8451FSILLAQEQ (84)FSILQSQEA (78)
2871 VYSVTPLDL 292.0113 CYSIEPLDL (100) CYSIEPLDL (75) TYSVTPLDL (91) TYSITPLDL (87)
2883 IQRLHGLSA 500.4322 IQRLHGLSA (98) IQRLHGLSA (72) IQYLAGLST (93)IERLHGLSA (79) IQRLHGLSA (75) IQRLHGMAA (96)
2889 LSAFSLHSY # [40]60.6007 LSAFSLHSY (99) LSAFSLHSY (97) LSAFTLHGY (80) MAAFSLHGY (89)

#-indicates that the epitope has been experimentally proven to be a true positive.

Bold- indicates percentage of epitope that is 100% conserved in more than 70% of the sequences analysed in each genotype.

Italic- indicates amino acid(s) variation in epitope in comparison to the predicted epitope.

Table 3

Distribution of genotype 5a HLA class I and II predicted epitopes in each of the HCV gene

HLA classNo of predicted epitopesDistribution of the predicted epitopes in each of the HCV gene a
CE1E2P7NS2NS3 b NS4ANS4BNS5ANS5B
I240 (0)0 (0)3 (12)0 (0)1 (4) 15 (63) 0 (0)0 (0)0 (0)5 (21)
II770 (0)1 (1)6 (8)1 (1)10 (13) 23 (30) 0 (0)18 (23)5 (7)13 (7)

aNumber of epitopes and their percentage in brackets.

bHCV gene with highest number of binding epitopes (in bold).

HLA class I predicted epitopes of HCV genotype 5a and their antigenicity prediction score, number of allele and conservation (in percentages) in different genotypes #-indicates that the epitope has been experimentally proven to be a true positive. Bold- indicates percentage of epitope that is 100% conserved in more than 70% of the sequences analysed in each genotype. Italic- indicates amino acid(s) variation in epitope in comparison to the predicted epitope. HLA class II predicted epitopes of HCV genotype 5a and their antigenicity prediction score, number of allele and conservation (in percentages) in different genotypes #-indicates that the epitope has been experimentally proven to be a true positive. Bold- indicates percentage of epitope that is 100% conserved in more than 70% of the sequences analysed in each genotype. Italic- indicates amino acid(s) variation in epitope in comparison to the predicted epitope. Distribution of genotype 5a HLA class I and II predicted epitopes in each of the HCV gene aNumber of epitopes and their percentage in brackets. bHCV gene with highest number of binding epitopes (in bold).

Epitope binding affinity to common South African HLA class I and HLA class II alleles

Genotype 5 epitopes and their genotypic variants were analysed for their binding affinity to HLA class I and HLA class II alleles most common in South Africa, where genotype 5a is predominating. For HLA class I, 11 of the most common South African HLA-A alleles (HLA-A*01:01, HLA-A*02:01, HLA-A*30:01 and HLA-A*30:02), HLA-B (HLA-B*0702, HLA-B*08:01 and HLA-B*3501), and HLA-C (HLA-C*04:01, HLA-C*06:01, HLA-C*07:01 and HLA-C*07:02) were analysed. The limitation of Propred 1 is that it does not cover most of the main HLA class I alleles: HLA-A (HLA-A*01:01, HLA-A*30:01 and HLA-A*30:02), HLA-B (HLA-B*08:01) and HLA-C (HLA-C*04:01, HLA-C*06:01, HLA-C*07:01 and HLA-C*07:02) that are observed in South Africa. As a results the IEDB epitope analysis tool was used to predict epitopes of all the 11 most common HLA-A, HLA-B, HLA-C covering HLA class I alleles found in South Africa with the ANN prediction server. Thirteen antigenic epitopes with high binding affinity score of <50 IC50nM were predicted. Most epitopes bind with high affinity to single HLA class I alleles with exception of epitope NS5B2889–2297 LSAFSLHSY that bind with high affinity to 2 HLA-A alleles (HLA-A*01:01 and HLA-A*30:02). Four of the epitopes binding to HLA-B*35:01 followed by HLA-A*02:01 with 3 epitopes binding to it, however, none of the epitopes bind to HLA-C*04:01, HLA-C*06:01 and HLA-C*07:02 alleles. The NS31359–1367 shows a level of promiscuity to HLA-A and HLA-B alleles. NS31359–1367 HPNIEEVAL bind with intermediate affinity to HLA-B*07:02, while its genotype 2, 3 and 4 variant HSNIEEVAL bind with poor affinity and genotype 6 variant HPNITETAL bind with high affinity. For HLA-B*35:01, the NS31359–1367 HPNIEEVAL and genotype 6 variant HPNITETAL bind with high affinity while genotype 2, 3 and 4 variant HSNIEEVAL bind with poor affinity. Three (E2684–692, NS31032–1040 and NS31359–1367) of the thirteen epitopes were predicted by both Propred 1 and IEDB analysis tool (Table 4).
Table 4

Binding affinity scores of predicted epitopes and their variants to common HLA I allele types prevalent in South Africa

GeneEpitope sequence a Genotype of epitopeHLA I allele types b
HLA-AHLA-BHLA-C
HLA-A*01:01HLA-A*02:01HLA-A*30:01HLA-A*30:02HLA-B*07:02HLA-B*08:01HLA-B*35:01HLA-C*04:01HLA-C*06:02HLA-C*07:01HLA-C*07:02
E2
684ALSTGLIHL# [2527]1a, 1b, 3, 4,5a, 6+++++++++++++
ALSTGLLHL2+++++++++++++
NS2
1025LLAPITAYA1a, 3, 4, 5a++++++++++++++
LLAPITAYT 2++++++++++++++
NS3
1032YAQQTRGVL5a+++++++++++++
YAQQTRGLL1a, 2++++++++++++
YSQQTRGLL1b+++++++++++
YAQQTRGLV 6+++++++++++
1242AAYAAQGYK1a, 1b, 4, 5a+++++++++++++
AAYASQGYK2+++++++++++++
1264FGAYMSKAY5a+++++++++++++
FGAYMSKAH 1a, 1b, 2+++++++++++
1359HPNIEEVAL# [2831]1a, 1b, 5a++++++++++++++
HSNIEEVAL2, 3, 4+++++++++++
HPNITETAL6+++++++++++++++
1367LPSEGEIPF5a+++++++++++++
LGHEGEIPF2++++++++++++
LGSEGEIPF3++++++++++++
LPTTGEIPF4, 6+++++++++++++
1585YLVAYQATV# [25, 39]1a, 1b, 4, 5a, 6+++++++++++++
YLTAYQATV2, 3+++++++++++++
NS5A
1927NHVSPTHYV1a, 1b, 3, 4,5a, 6++++++++++++++
NHVAPTHYV2++++++++++++
2285LPVWARPGY5a+++++++++++++
LPIWARPDY1a, 3, 4, 6+++++++++++++
LPAWARPDY2+++++++++++++
NS5B
2696YRRCRASGV1a, 1b, 2, 3, 5a++++++++++++++++
2763MTRYSAPPG1a, 1b, 2, 3, 4, 5a, 6+++++++++++++
2889LSAFSLHSY# [40]1a, 1b, 5a++++++++++++++++
LSAFTLHSY3++++++++++++++++
LSAFTLHGY4+++++++++++++++

#-indicates that the epitope has been experimentally proven to be a true positive.

a–italics indicates amino acid(s) variation in epitope in comparison to the predicted epitope.

b+++ indicates high binding affinity (<50 IC50nm), ++ indicates medium binding affinity (>50 IC50nm, <500 IC50nm), + indicates poor binding affinity (>500 IC50nm).

Binding affinity scores of predicted epitopes and their variants to common HLA I allele types prevalent in South Africa #-indicates that the epitope has been experimentally proven to be a true positive. a–italics indicates amino acid(s) variation in epitope in comparison to the predicted epitope. b+++ indicates high binding affinity (<50 IC50nm), ++ indicates medium binding affinity (>50 IC50nm, <500 IC50nm), + indicates poor binding affinity (>500 IC50nm). For HLA class II alleles, 4 most common HLA-DRB alleles (HLA-DRB1*03:01, HLA-DRB1*04:01, HLA-DRB1*11:01 and HLA-DRB1*15:01) were analysed. Nineteen antigenic epitopes with high binding affinity score of <50 IC50nM were predicted. The HLA-DRB1*11:01 has the highest number of binding epitopes (10) followed by HLA-DRB1*04:01 and HLA-DRB1*15:01 (7 each). Epitopes NS4B1919–1927 LIAFASRGN was predicted to be the most promiscuous epitope binding to HLA-DRB1*04:01, HLA-DRB1*11:01 and HLA-DRB1*15:01 with high affinity and HLA-DRB1*03:01 with intermediate affinity. This epitope is conserved in all genotypes. Epitopes E2507-515, NS4B1774–1782, NS4B1919–1927, and NS4B1920–1928 were highly conserved in all genotypes (Table 5).
Table 5

Binding affinity scores of predicted epitopes and their variants to common HLA class II allele types prevalent in South Africa

PositionPredicted epitopesGenotype of epitopeHLA II- DRB1 allele types a
HLA-DRB1*03:01HLA-DRB1*04:01HLA-DRB1*11:01HLA-DRB1*15:01
E2
507YCFTPSPVV1a, 1b, 2, 3, 4, 5a, 6++++++++
695NIVDTQYLY5a++++++++
NS2
938LLHLGRLTG5a+++++++
1025LLAPITAYA1a, 3, 4, 5a,++++++++
NS3
1129DLYLVTRHA1a, 1b, 5a++++++
1131YLVTRHADV1a, 1b, 5a++++++
1391LIFCHSKKK1b, 2, 3, 4, 5a, 6++++++
1417YYRGLDVAV5a+++++++
1464FSLDPTFTI1a, 1b, 2, 5a++++++++
1535TTVRLRAYL3, 5a, 6++++++
1562FTGLTNIDA5a+++++++
1666VVAALAAYC5a++++++
NS4B
1774YLAGLSTLP1a, 1b, 2, 3, 4, 5a, 6++++++
1919LIAFASRGN1a, 1b, 2, 3, 4, 5a, 6+++++++++++
1920IAFASRGNH1a, 1b, 2, 3, 4, 5a, 6++++++
NS5A
1994WLQAKLLPQ5a+++++++
2099YHYITGVTQ5a++++++++
NS5B
2450LLRHHNLVY1a, 3, 5a++++++++
2579LIVYPDLGV2, 3, 4, 5a, 6++++++

a+++ indicates high binding affinity (<50 IC50nm), ++ indicates medium binding affinity (>50 IC50nm, <500 IC50nm), + indicates poor binding affinity (>500 IC50nm).

Binding affinity scores of predicted epitopes and their variants to common HLA class II allele types prevalent in South Africa a+++ indicates high binding affinity (<50 IC50nm), ++ indicates medium binding affinity (>50 IC50nm, <500 IC50nm), + indicates poor binding affinity (>500 IC50nm).

Validation of epitopes

Seven of the predicted epitopes were previously confirmed experimentally by other studies as true positives in comparison with the epitopes analysed in the IEDB resource database. Majority of the epitopes predicted in this study have not been previously tested experimentally. The ‘true epitopes’ are highlighted by (#) in Tables 1, 2 and 4.

Discussion

Several studies that have published HCV epitopes focused mainly on genotype 1 [41, 42], but most of these studies do not take into account the diversity in other genotypes that are common in developing countries like most African countries. In the present study, predicted antigenic epitopes of HCV genotype 5a proteins from South Africa were analysed followed by conservation with randomly selected genotypes 1–6 references from GenBank. Several studies have confirmed the importance of using immunoinformatics as good predictors for selecting HLA ligands, T-cell epitopes and immunogenicity [43]. As a result, several immunoinformatics methods have been developed to assist in the identification of HLA binding peptides [44, 45]. For this analysis, near full-length sequences covering all HCV proteins with the exclusion of the 3′end of the NS5B were included to maximize number of epitopes predicted. The use of the whole viral genome for developing epitope vaccines has a potential control over the immune response and eliminating the side effects [43], and it also increases the chance of detecting a virus at any developmental stage [46]. It has been shown that multiple epitopes from different parts of the HCV genome are important to produce a vaccine that can elicit strong humoral immune responses and multiple specific cellular immune responses [47]. A polyepitope-based strategy with multiple components combining core, E1, and E2 proteins; and conserved T-cell epitopes in the NS proteins has been suggested to be a good vaccine candidate for HCV [48]. High number of epitopes was predicted for HLA class II as compared to class I. The findings of this study are consistent with a study by Shehzadi et al. that predicted epitopes in genotype 3 from Pakistan. The study showed that majority of predicted epitopes were found in the NS3 protein for both HLA class I and HLA class II alleles and most of the epitopes were conserved among different genotypes [49]. Although the NS3 region is one of the conserved regions in HCV, variability in the nucleotide and amino acids has been reported by several studies in the same genotype and also in different genotypes [50, 51]. A recent study that analysed 1568 NS3-protease sequences from genotypes 1–6 reported that the protease amino acids sequence was moderately conserved and majority of the amino acids clustered in small regions. Of the 181 amino acids analysed 47% showed <1% variability among all HCV genotypes, and 17.1% amino acid positions showing >25.1% variability [51]. The NS3 is considered to be a good cellular target candidate for a therapeutic vaccine [52] since majority of the HCV viral epitopes recognized by CD8+ and CD4+ T-cells are located in the NS3 region [53-56]. The NS3 specific CD4+ and CD8+ T-cell responses were reported in patient responders to interferon therapy [57] and in spontaneous clearance of HCV [58]. Most of the predicted epitopes in the study sequence were found to be conserved across different HCV genotypes with a higher number of epitopes conserved at the anchor residues. The anchor residues are important for epitope high binding affinity to HLA [59]. Conserved epitopes might influence the immunogenic potential since mutations within the epitopes can increase the chance of immune escape [60]. For a vaccine to be effective globally the selected epitopes must cover HLAs of different populations and it must also be conserved among different genotypes. The high mutation rates of viral epitopes and HLA polymorphisms are some of the challenges that are associated with the development of peptide vaccines [61]. Successful epitope vaccine design requires a broad knowledge of HCV genotype diversity. This will help in the proper selection of conserved HCV-specific T-cell epitopes that will help in avoiding HCV immune evasion [62]. This study attempted to ensure maximal coverage of HLA polymorphism and different genotypes by analyzing conserved epitopes considering different HLA alleles. Majority of the epitopes predicted from HCV proteins isolated from South African genotype 5a were good binders against HLA alleles that are found worldwide. HLA is both polygenic and polymorphic, and the pool of HLA molecules differs for every individual. Different HLA alleles bind peptides with a particular sequence pattern [63]. For an HLA allele to be covered by a set of epitopes, at least one of the epitopes should be capable of inducing an immune response when bound to the corresponding HLA molecule [46]. The epitopes predicted in this study bind to many HLA alleles including the ones common in South Africa and can be used for designing good vaccine candidates that will eventually work in genetically diverse populations. In-vitro and in-silico studies have showed that HLA alleles preferentially bind to conserved regions of viral proteins in human viruses [64]. Very few epitopes were found to be experimentally true positive, however this can be due to the fact that most of the previous studies focused on genotype 1. A limitation of the study was a lack of in-vivo and in-vitro studies to confirm the predicted immunogenic epitopes, which will be the focus of future studies. However in-silico studies still provide the basis for designing good vaccine candidates. In conclusion, the results of this study demonstrated antigenic T-cell epitopes that are conserved among genotypes and good HLA binders derived from genotype 5a sequences that can be good candidates for vaccine development. Predicted epitopes analysed in this study will contribute to the future design of an efficient vaccine with the use of conserved epitopes to avoid variation in genotypes and as such, it will be able to induce broad HCV specific immune responses. Conserved epitopes among different genotypes will be experimentally tested in the future to determine their involvement in immune response.

Methods

Ethical statement

The study was approved by the Medunsa Research and Ethics Committee of the Faculty of Health Sciences at the University of Limpopo as project no MREC/p/142/2009:PG. The MREC is registered as an Independent Review Board with a reference no (IRB00005122). Genotype 5a full-length sequences available in the GenBank and 6 of the near full length sequences generated from a previous study conducted by our group [24] were aligned and consensus sequences created using BioEdit [65] for the prediction of T-cell epitopes. For HLA class I, prediction for binding alleles was performed using ProPred I (http://www.imtech.res.in/raghava/propred1/) at a 4% default threshold by keeping the proteosome and immunoproteosome filters on at 5% threshold. ProPred 1 predicts antigenic epitopes for 47 HLA class I alleles [44]. For HLA class II, prediction was performed using ProPred (http://www.imtech.res.in/raghava/propred/) at a 3% default threshold. ProPred predicts antigenic epitopes for 51 HLA class II alleles [45].

Antigenicity of the epitopes

The Antigenicity score of all the predicted epitopes were analysed using VaxiJen v2.0 online antigen prediction (http://www.ddg-pharmfac.net/vaxijen/). Epitopes having antigenic score >0.5 were selected as antigenic. Vaxijen server performed well with 87% accuracy at a threshold of 0.5 antigenic score for viruses. VaxiJen v2.0 allows antigen classification based on the physicochemical properties of proteins without recourse to sequence alignment.

Epitope conservation analysis

All predicted epitopes were analyzed for conservation using the IEDB database (http://tools.immuneepitope.org/tools/conservancy/iedb_input) at a threshold of 100% conservation in comparison with 406, 221, 98, 33, 45, 45 randomly selected sequences from each of the HCV genotypes 1a, 1b, 2, 3, 4 and 6 respectively. The epitopes were considered conserved in another genotype if it shows 100% identity across the epitope in at least 70% of sequences in that genotype in the randomly selected sequences used in this study, downloaded from the public database. In addition, epitope variants that were conserved in at least 70% of the sequences were analysed for conservancy for anchor residues at positions 2 and 9 for HLA class I and positions 1, 4, 6 and 9 for HLA class II.

Validation of predicted epitopes

All the predicted epitopes were submitted to IEDB database (http://www.immuneepitope.org/) to confirm if they had been tested previously by other studies. The immuneepitope database contains experimentally confirmed data about antibody, T-cell epitopes, HLA binding, HLA restriction and HLA class.

Common South African HLA alleles

For epitope prediction binding to common HLA alleles found in South Africa, the IEDB epitope analysis tool (http://tools.immuneepitope.org/tools/conservancy/iedb_input) was used for HLA class I using the artificial neural network (ANN) algorithm [66] on the IEDB server, while for Class II ProPred (http://www.imtech.res.in/raghava/propred/) at a 3% default threshold was used. ProPred predicts antigenic epitopes for 51 HLA class II alleles [54]. The most common South African alleles were found in published literature [67].
  63 in total

1.  Successful vaccination induces multifunctional memory T-cell precursors associated with early control of hepatitis C virus.

Authors:  Su-Hyung Park; Eui-Cheol Shin; Stefania Capone; Laura Caggiari; Valli De Re; Alfredo Nicosia; Antonella Folgori; Barbara Rehermann
Journal:  Gastroenterology       Date:  2012-06-13       Impact factor: 22.682

2.  Novel CD4+ and CD8+ T-cell determinants within the NS3 protein in subjects with spontaneously resolved HCV infection.

Authors:  Anne M Wertheimer; Camette Miner; David M Lewinsohn; Anna W Sasaki; Ezra Kaufman; Hugo R Rosen
Journal:  Hepatology       Date:  2003-03       Impact factor: 17.425

Review 3.  The neglected hepatitis C virus genotypes 4, 5 and 6: an international consensus report.

Authors:  Nabil Antaki; Antonio Craxi; Sanaa Kamal; Rami Moucari; Schalk Van der Merwe; Samir Haffar; Adrian Gadano; Nizar Zein; Ching Lung Lai; Jean-Michel Pawlotsky; E Jenny Heathcote; Geoffrey Dusheiko; Patrick Marcellin
Journal:  Liver Int       Date:  2009-12-10       Impact factor: 5.828

Review 4.  Therapeutic vaccination against chronic hepatitis C virus infection.

Authors:  Peng Peng Ip; Hans W Nijman; Jan Wilschut; Toos Daemen
Journal:  Antiviral Res       Date:  2012-07-25       Impact factor: 5.970

5.  HLA-A*01:03, HLA-A*24:02, HLA-B*08:01, HLA-B*27:05, HLA-B*35:01, HLA-B*44:02, and HLA-C*07:01 monochain transgenic/H-2 class I null mice: novel versatile preclinical models of human T cell responses.

Authors:  Rachid Boucherma; Hédia Kridane-Miledi; Romain Bouziat; Michael Rasmussen; Tanja Gatard; Francina Langa-Vives; Brigitte Lemercier; Annick Lim; Marion Bérard; Lbachir Benmohamed; Søren Buus; Ronald Rooke; François A Lemonnier
Journal:  J Immunol       Date:  2013-06-17       Impact factor: 5.422

Review 6.  The nature of interferon-alpha resistance in hepatitis C virus infection.

Authors:  Jean-Michel Pawlotsky
Journal:  Curr Opin Infect Dis       Date:  2003-12       Impact factor: 4.915

7.  Identification of HLA-A3 and -B7-restricted CTL response to hepatitis C virus in patients with acute and chronic hepatitis C.

Authors:  K M Chang; N H Gruener; S Southwood; J Sidney; G R Pape; F V Chisari; A Sette
Journal:  J Immunol       Date:  1999-01-15       Impact factor: 5.422

8.  Promiscuous prediction and conservancy analysis of CTL binding epitopes of HCV 3a viral proteome from Punjab Pakistan: an in silico approach.

Authors:  Abida Shehzadi; Shahid Ur Rehman; Muhammad Idrees
Journal:  Virol J       Date:  2011-02-08       Impact factor: 4.099

9.  Molecular footprints reveal the impact of the protective HLA-A*03 allele in hepatitis C virus infection.

Authors:  Karen Fitzmaurice; Danijela Petrovic; Narayan Ramamurthy; Ruth Simmons; Shazma Merani; Silvana Gaudieri; Stuart Sims; Eugene Dempsey; Elizabeth Freitas; Susan Lea; Susan McKiernan; Suzanne Norris; Aideen Long; Dermot Kelleher; Paul Klenerman
Journal:  Gut       Date:  2011-05-06       Impact factor: 23.059

10.  Analysis of successful immune responses in persons infected with hepatitis C virus.

Authors:  F Lechner; D K Wong; P R Dunbar; R Chapman; R T Chung; P Dohrenwend; G Robbins; R Phillips; P Klenerman; B D Walker
Journal:  J Exp Med       Date:  2000-05-01       Impact factor: 14.307

View more
  3 in total

1.  Towards the knowledge-based design of universal influenza epitope ensemble vaccines.

Authors:  Qamar M Sheikh; Derek Gatherer; Pedro A Reche; Darren R Flower
Journal:  Bioinformatics       Date:  2016-07-10       Impact factor: 6.937

2.  Bioinformatics Analysis of Domain 1 of HCV-Core Protein: Iran.

Authors:  Behzad Dehghani; Tayebeh Hashempour; Zahra Hasanshahi; Javad Moayedi
Journal:  Int J Pept Res Ther       Date:  2019-04-20       Impact factor: 1.931

3.  Inferring Protective CD8+ T-Cell Epitopes for NS5 Protein of Four Serotypes of Dengue Virus Chinese Isolates Based on HLA-A, -B and -C Allelic Distribution: Implications for Epitope-Based Universal Vaccine Design.

Authors:  Jiandong Shi; Jing Sun; Meini Wu; Ningzhu Hu; Jianfan Li; Yanhan Li; Haixuan Wang; Yunzhang Hu
Journal:  PLoS One       Date:  2015-09-18       Impact factor: 3.240

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.