| Literature DB >> 23227878 |
Nishi Prabdial-Sing1, Adrian J Puren, Sheila M Bowyer.
Abstract
BACKGROUND: Host genetics influence the outcome of HCV disease. HCV is also highly mutable and escapes host immunity. HCV genotypes are geographically distributed and HCV subtypes have been shown to have distinct repertoires of HLA-restricted viral epitopes which explains the lack of cross protection across genotypes observed in some studies. Despite this, immune databases and putative epitope vaccines concentrate almost exclusively on HCV genotype 1 class I-epitopes restricted by the HLA-A*02 allele. While both genotype and allele predominate in developed countries, we hypothesise that HCV variation and population genetics will affect the efficacy of proposed epitope vaccines in South Africa. This in silico study investigates HCV viral variability within well-studied epitopes identified in genotype 1 and uses algorithms to predict the immunogenicity of their variants from other less studied genotypes and thus rate the most promising vaccine candidates for the South African population. Six class I- and seven class II- restricted epitope sequences within the core, NS3, NS4B and NS5B regions were compared across the six HCV genotypes using local genotype 5a sequence data together with global data. Common HLA alleles in the South African population are A30:01, A02:01, B58:02, B07:02; DRB1*13:01 and DRB1*03:01. Epitope binding to 13 class I- and 8 class -II alleles were described using web-based prediction servers, Immune Epitope Database, (IEDB) and Propred. Online population coverage tools were used to assess vaccine efficacy.Entities:
Mesh:
Substances:
Year: 2012 PMID: 23227878 PMCID: PMC3552980 DOI: 10.1186/1471-2172-13-67
Source DB: PubMed Journal: BMC Immunol ISSN: 1471-2172 Impact factor: 3.615
Six well studied HLA class I- and seven class II- restricted HCV immunodominant epitope sequences were chosen from previous publications for this study
| NS3 1073-1081 | CINGVCWTV (1a) | A02 | [ | 78 |
| NS3 1406-1415 | KLVALGINAV (1a) | A02 | [ | 70 |
| NS4 1807-1816 | LLFNILGGWV (1a) | A02 | [ | 39 |
| NS4 1851-1859 | ILAGYGAGV (1) | A02 | [ | 29 |
| NS5B 2422-2433 | MSYSWTGALVTP (1) | B15 | [ | 1 |
| NS5B 2727-2735 | GLQDCTMLV (1) | A02 | [ | 22 |
| | | | | |
| Core 17-35^ | RRPQDVKFPGGGQIVGGVY (1) | Undetermined Class II allele | [ | 1 |
| Core 21-40^ | DVKFPGGGQIVGGVYLLPRR (1) | HLA-DRB1*1501 | [ | 13 |
| NS3 1248-1261 | GYKVLVLNPSVAAT (1) | HLA-DRB1*1201; 1101; 1301; 0401 | [ | 5 |
| NS4A 1781-1800 | LPGNPAIASLMAFTAAVTSP (1a) | Undetermined Class II allele | [ | 3 |
| NS4A 1801-1820 | LTTSQTLLFNILGGWVAAQL (1a) | Undetermined Class II allele | [ | 4 |
| NS5 2571-2590 | KGGRKPARLIVFPDLGVRVC (1a) | Undetermined Class II allele | [ | 4 |
| NS5 2661-2680 | QCCDLDPQARVAIKSLTERL (1a) | Undetermined Class II allele | [ | 4 |
^Class II- restricted epitopes in the core region are overlapping sequences.
The sequences of the chosen epitopes were compared to the consensus sequence and conservation scores (as percentages) were calculated
| NS31073-1081 | CINGVMWTV | 78 | 67 | 67 | 67 | 78 | 67 | 70.67 | 67 | 78 | 5.680 | 0.3062 |
| NS31406-1415 | LTSLGLNAV | 67 | 56 | 67 | 78 | 67 | 56 | 65.17 | 56 | 78 | 8.280 | 0.1645 |
| NS41807-1816 | LLFNILGGW | 100 | 78 | 78 | 100 | 100 | 78 | 89.00 | 78 | 100 | 12.049 | 0.6513 |
| NS41851-1859 | ILAGYGAGV | 89 | 67 | 89 | 78 | 89 | 67 | 79.83 | 67 | 89 | 10.815 | 0.2231 |
| NS5B2422-2433 | MSYSWTGAL | 89 | 89 | 89 | 100 | 89 | 67 | 87.17 | 67 | 100 | 10.815 | 0.406 |
| NS5B2727-2735 | GLRDCTMLV | 78 | 56 | 44 | 78 | 78 | 33 | 61.17 | 33 | 78 | 19.823 | 0.4142 |
| | Mean within genotypes | 83.50 | 68.83 | 72.33 | 83.50 | 83.50 | 61.33 | | | | | |
| CORE17-40 | RRPQDVKFPGGGQIVGGVYLLPRR | 100 | 96 | 66 | 96 | 96 | 96 | 91.67 | 67 | 78 | 5.680 | 0.3062 |
| NS31248-1261 | GYKVLVLNPSVAAT | 100 | 93 | 93 | 100 | 100 | 93 | 96.50 | 93 | 100 | 3.834 | 0.32 |
| NS31781-1800 | LPGNPAVASLMATAAVTSP | 85 | 80 | 95 | 85 | 90 | 65 | 83.33 | 65 | 95 | 10.327 | 0.4142 |
| NS41801-1820 | LTTSQTLLFNILGGWVASQL | 85 | 65 | 80 | 90 | 85 | 70 | 79.17 | 65 | 90 | 9.703 | 0.962 |
| NS5B2571-2590 | KGGRKPALIVYPDLGVRVC | 80 | 80 | 90 | 95 | 95 | 80 | 86.67 | 80 | 95 | 7.527 | 0.2231 |
| NS5B2661-2680 | QCCDLEPEARVAIKSLTERL | 85 | 55 | 70 | 80 | 60 | 50 | 66.67 | 50 | 85 | 14.023 | 0.4159 |
| Mean within genotypes | 89.17 | 78.17 | 82.33 | 91.00 | 87.67 | 75.67 | ||||||
Binding affinity scores of published epitopes and their variants were determined by the IEDB prediction program to relevant supertypes in South Africa
| | ||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| | | | ||||||||||||||
| CINGVCWTV | 1a | | 17802 | 14908 | 15501 | 12611 | 23637 | 20927 | 25523 | 19827 | 13679 | 19257 | 23485 | |||
| 1073-1081 | CVNGVCWTV | 1b | | 16997 | 12228 | 13122 | 11766 | 21885 | 15696 | 13382 | 18288 | 12132 | 20367 | 23007 | ||
| | SISGVLWTV | 2a variant | | 18961 | 21483 | 11417 | 11417 | 22455 | 22186 | 29702 | 18590 | 15055 | 15691 | 20667 | ||
| | TVGGVMWTV | 3a | | 19940 | 12677 | 14750 | 9776 | 20729 | 21877 | 24623 | 16182 | 18054 | 26500 | 24303 | ||
| | AVNGVMWTV | 4a variant | | 17734 | 24001 | 4015# | 12036 | 10753 | 20258 | 20595 | 17093 | 12996 | 13641 | 18882 | ||
| | CINGVLWTV | 5a | | 15172 | 17548 | 13613 | 13865 | 23524 | 21854 | 15854 | 18628 | 11203 | 17516 | 21090 | ||
| | CINGVMWTL | 5a variant | | 17922 | 10449 | 14413 | 11435 | 18947 | 13165 | 11237 | 2239 | 13165 | 13572 | 19956 | ||
| KLVALGINA | 1a | | 22719 | 15048 | 32261 | 1830 | 18800 | 24242 | 25216 | 37253 | 23529 | 20557 | 4839 | 19019 | ||
| 1406-1415 | KLSGLGLNA | 1b | | 19133 | 21824 | 33559 | 2557 | 13152 | 20740 | 27147 | 37083 | 23891 | 19220 | 8973 | 18099 | |
| | QLTSLGLNA | 4a | | 20013 | 7051 | 15292 | 33674 | 12859 | 12517 | 26454 | 24440 | 37244 | 22168 | 26218 | 7165 | 19904 |
| | KLVALGINAV | 1a | | 37929 | 8564 | 39134 | NO VALUE | 31977 | 19547 | 42247 | 34339 | NO VALUE | NO VALUE | NO VALUE | 26021 | |
| | LTGLGINAV | 5a | | 12100 | 5692 | 32426 | 10980 | 20519 | 21309 | 20981 | 33652 | 25012 | 21599 | 12577 | 26332 | |
| | QLTGLGINA | 5a variant | | 22408 | 6972 | 7419 | 34672 | 13389 | 17488 | 26117 | 23541 | 36968 | 25569 | 22283 | 15466 | 20054 |
| LLFNILGGW | 1a, 1b, 4, 5a | | 22942 | 14359 | 17095 | 18086 | 17906 | 9175 | 24903 | 19854 | 17154 | 956# | 962# | 5918 | 23118 | |
| 1807-1816 | MFFNILGGWV | 3a | | 24613 | 23482 | 19706 | 15640 | 1707# | 21757 | 11817 | 8151 | 10769 | 1251 | 13832 | 26621 | |
| | LLFNILGGWV | 1a, 1b, 4, 5a | | 32231 | 1159# | 38969 | NO VALUE | 19453 | 32445 | 40287 | 25767 | NO VALUE | NO VALUE | NO VALUE | 25868 | |
| ILAGYGAGV | 1a, 1b, 5a | | 20500 | 530# | 30882 | 15492 | 10120 | 11883 | 21134 | 37213 | 22934 | 20702 | 3735 | 20143 | ||
| 1851-1859 | ILAGYGTGV | 5a variant | | 20351 | 32028 | 17493 | 12563 | 11272 | 21994 | 36657 | 23555 | 20603 | 2196 | 19849 | ||
| MSYSWTGAL | 1a, 1b, 4 | | 12612 | 1522 | 2924 | 2372 | 5457 | 1530# | 8456 | 10166 | 523# | 16876 | ||||
| 2422-2433 | MSYTWTGAL | 5a | | 12133 | 2640 | 8602 | 2141# | 7606 | 2515# | 9150 | 10680 | 787# | 17267 | |||
| | YTWTGALIT | 5a variant | | 15779 | 3000 | 13286 | 33166 | 13737 | 1561 | 18979 | 3920 | 27619 | 22480 | 17360 | 6553 | 18765 |
| GLQDCTMLV | 1a | | 18371 | 5733 | 11972 | 13187 | 6275 | 20996 | 27015 | 35681 | 25282 | 22002 | 10687 | 17601 | ||
| 2727-2735 | KLQDCTMLV | 1b | | 17735 | 3878 | 6160 | 2071# | 9527 | 17308 | 26776 | 35038 | 23310 | 18296 | 3587 | 16634 | |
| | KLRDCTLLV | 5a | | 19744 | 14912 | 15150 | 5150 | 2800 | 27145 | 36627 | 21481 | 20362 | 1720# | 18071 | ||
| ALRDCTMLV | 4a | 19976 | 4673 | 19836 | 9982 | 5384 | 26302 | 36740 | 24190 | 22343 | 1206# | 20027 | ||||
<50 IC50nm, bold, high affinity.
>50 IC50nm, <500 IC50nm, italic, intermediate affinity.
>500 IC50nm, #, poor affinity.
No value indicates server produced no binding score.
Binding affinity scores of “newly predicted” epitopes and their variants were determined by the IEDB prediction program to relevant supertypes in South Africa
| | ||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| | | | ||||||||||||||
| NS3 | LTGPTPLLY | 5a, 1b | | 23679 | 24474 | 24873 | 4551 | 24599 | 6188 | 7688 | 1558# | 22842 | ||||
| | LHGPTPLLY | 1a | | 10396 | 24884 | 27469 | 21381 | 17350 | 26731 | 12561 | 6443 | 21175 | 9987 | 23420 | ||
| | FLSTATQTF | 5a | | 15329 | 17845 | 3634 | 1663 | 1886 | 15839 | 17977 | 16320 | 4231 | 18662 | |||
| | IVSTAAQTF | 1a | | 20409 | 23323 | 22013 | 4758 | 11756 | 5496 | 11246 | 13372 | 814# | 22273 | |||
| | VLSTVTQSF | 1b, 2a | | 18550 | 13712 | 17004 | 4838 | 16940 | 5785 | 15666 | 988# | 29052 | 11492 | 1654 | 21745 | |
| | IVSTDTQSF | 4a | | 19885 | 22289 | 20020 | 12440 | 14943 | 5300 | 6080 | 8229 | 4394 | 973# | 24757 | ||
| | TLAGPKGPV | 5a, 6a | | 23444 | 2081# | 33957 | 16907 | 18949 | 6657 | 21854 | 39095 | 25027 | 22108 | 20499 | 22034 | |
| | TLASPRGPV | 1b | | 22044 | 1451# | 32375 | 13790 | 18855 | 2453 | 21660 | 39379 | 25346 | 22237 | 6235 | 22501 | |
| | TLASSRGPV | 2a | | 22034 | 857# | 29481 | 9965 | 20464 | 2095 | 19022 | 38571 | 24511 | 22225 | 3353 | 22267 | |
| | TLASAKHPA | 3a | | 21914 | 29038 | 10681 | 19694 | 16284 | 12935 | 39038 | 24637 | 21967 | 13015 | 23364 | ||
| | TIASPKGPV | 1a | | 22885 | 7397 | 34010 | 15054 | 20663 | 7437 | 19533 | 38620 | 25493 | 22070 | 16303 | 24145 | |
| | SVIDCNSAV | 5a | | 21948 | 24435 | 8789 | 12923 | 1702 | 4571 | 35486 | 21627 | 21514 | 3381 | 25586 | ||
| | SVTDCNTCV | 1b | | 21476 | 30991 | 15202 | 21169 | 19345 | 15846 | 19349 | 26045 | 22021 | 11521 | 22609 | ||
| | SVIDCNVAV | 1b, 2a, 6a | | 21855 | 22019 | 7833 | 13308 | 3218 | 4376 | 31399 | 24317 | 21463 | 3412 | 24232 | ||
| | SVIDCNTCV | 1a | | 22281 | 23478 | 14812 | 17452 | 17390 | 13879 | 20769 | 25334 | 21666 | 7032 | 23918 | ||
| | SVIDCNTSV | 4a | | 22543 | 25166 | 10636 | 15522 | 3402 | 11164 | 17097 | 24124 | 20942 | 4646 | 24512 | ||
| | ITYSTYGKF | 1b, 5a, 2a, 2b, 1a, 4a | | 16829 | 22979 | 16133 | 9722 | 21954 | 6132 | 16141 | 20982 | |||||
| | LTYSTYGKF | 3a | | 14296 | 22834 | 13829 | 10036 | 22076 | 3345 | 11660 | 860# | 20046 | ||||
| | KVLVLNPSV | 1a, 1b, 2a, 2b, 4a, 5a, 6a | | 23587 | 6303 | 27046 | 18669 | 14670 | 21450 | 31145 | 20648 | 8842 | 3129# | 18558 | ||
| | RAKAPPPSW | 5a, 1b, 2a, 6a | | 25817 | 25080 | 27568 | 8387 | 25791 | 7172 | 18126 | 8580 | 596# | 22382 | |||
| | RAQAPPPSW | 1b, 3a, 1a | | 24980 | 24747 | 27454 | 22992 | 6443 | 24136 | 6017 | 14212 | 3253 | 1675# | 17482 | ||
| | KVWLAPPPSW | 4a | | 24000 | 4927 | 22172 | 26746 | 16220 | 18770 | 9620 | 39029 | 21580 | 20215 | 12296 | 20633 | |
| | LTSLGVNAV | 5a | | 5815 | 3795 | 33629 | 6533 | 20008 | 16663 | 13886 | 27357 | 24243 | 18277 | 3860# | 24767 | |
| LTSLGLNAV | 5a variant | 5305 | 3082 | 32917 | 6065 | 18186 | 16615 | 16431 | 29952 | 24579 | 19519 | 7004 | 23118 | |||
<50 IC50nm, bold, high affinity.
>50 IC50nm, <500 IC50nm, italic, intermediate affinity.
>500 IC50nm, #, poor affinity.
Binding affinity scores (as percentages) of Class II published epitopes and their variants were determined by the ProPred prediction program to common DRB1* alleles prevalent in the South African population
| | | | | | | | | |||
| | VYLLPRRGP | 1, 2, 4, 5, 6 | 0.0% | 0.0% | 18.0% | 0.0% | 0.0% | 16.0% | 18.0% | |
| | VGGVYLLPR | 1, 2, 4, 5, 6 | 0.0% | 0.0% | 17.0% | 0.0% | 9.0% | 9.0% | 10.0% | 20.0% |
| | | | | | | | | |||
| | LVLNPSVAA | 1, 2, 3, 4, 5, 6 | 17.0% | |||||||
| | YKVLVLNPS | 1, 2, 4, 5, 6 | 5.0% | 0.0% | 0.0% | 9.0% | 17.0% | |||
| | | | | | | | | |||
| | LPGNPAVAS | 2,3, 5, 6 | 0.0% | 2.0% | 0.0% | 4.0% | 0.0% | 0.0% | 9.0% | 0.0% |
| | LPGNPAIAS | 1, 4 | 0.0% | 0.7% | 15.0% | 4.0% | 0.0% | 2.4% | 0.0% | 7.0% |
| | IASLMAFTA | 1 | 7.0% | 0.0% | 0.0% | 4.0% | 0.0% | 14.0% | ||
| | | | | | | | | |||
| | LFNILGGWV | 1, 4, 5 | 0.0% | 0.0% | 16.0% | 0.0% | 0.0% | 16.0% | ||
| | FNILGGWVA | 1, 4, 5 | 0.0% | 2.0% | 16.0% | 16.0% | ||||
| | ILGGWVASQ | 4, 5 | 0.0% | 0.0% | 0.0% | 0.0% | 2.4% | 8.0% | 0.0% | |
| | LGGWVASQI | 4, 5 | 0.0% | 0.0% | 0.0% | 0.0% | 0.0% | 13.0% | ||
| | | | | | | | | |||
| | VFPDLGVRV | 1 | 0.0% | 0.0% | 0.0% | 0.0% | 0.0% | 0.0% | 0.0% | |
| | VYPDLGVRV | 3, 5 | 0.0% | 0.0% | 0.0% | 14.0% | 0.0% | 0.0% | 19.0% | |
| | IVYPDLGVR | 3, 5 | 0.0% | 0.0% | 0.0% | 0.0% | 0.0% | 7.0% | 0.0% | |
| | LIVYPDLGV | 3, 5 | 0.0% | 0.0% | 0.0% | 0.0% | 12.0% | 0.0% | 3.0% | |
| | | | | | | | | |||
| | LAPEARQAI | 1b | 0.0% | 0.0% | 8.0% | 0.0% | 11.0% | 0.0% | 4.5% | 11.0% |
| | LDPQARVAI | 5 | 0.0% | 0.0% | 8.0% | 0.0% | 0.0% | 0.0% | 0.0% | 0.0% |
| LQPEARAAI | 5var | 0.0% | 0.0% | 0.0% | 12.0% | 1.0% |