Literature DB >> 32640609

In Silico Prediction of Human Leukocytes Antigen (HLA) Class II Binding Hepatitis B Virus (HBV) Peptides in Botswana.

Wonderful Tatenda Choga1,2, Motswedi Anderson1, Edward Zumbika3, Bonolo B Phinius1, Tshepiso Mbangiwa1,2, Lynnette N Bhebhe1, Kabo Baruti1,4, Peter Opiyo Kimathi5, Kaelo K Seatla1,6, Rosemary M Musonda1,7, Trevor Graham Bell8, Sikhulile Moyo1,7, Jason T Blackard9, Simani Gaseitsiwe1,7.   

Abstract

Hepatitis B virus (HBV) is the primary cause of liver-related malignancies worldwide, and there is no effective cure for chronic HBV infection (CHB) currently. Strong immunological responses induced by T cells are associated with HBV clearance during acute infection; however, the repertoire of epitopes (epi) presented by major histocompatibility complexes (MHCs) to elicit these responses in various African populations is not well understood. In silico approaches were used to map and investigate 15-mers HBV peptides restricted to 9 HLA class II alleles with high population coverage in Botswana. Sequences from 44 HBV genotype A and 48 genotype D surface genes (PreS/S) from Botswana were used. Of the 1819 epi bindings predicted, 20.2% were strong binders (SB), and none of the putative epi bind to all the 9 alleles suggesting that multi-epitope, genotype-based, population-based vaccines will be more effective against HBV infections as opposed to previously proposed broad potency epitope-vaccines which were assumed to work for all alleles. In total, there were 297 unique epi predicted from the 3 proteins and amongst, S regions had the highest number of epi (n = 186). Epitope-densities (Depi) between genotypes A and D were similar. A number of mutations that hindered HLA-peptide binding were observed. We also identified antigenic and genotype-specific peptides with characteristics that are well suited for the development of sensitive diagnostic kits. This study identified candidate peptides that can be used for developing multi-epitope vaccines and highly sensitive diagnostic kits against HBV infection in an African population. Our results suggest that viral variability may hinder HBV peptide-MHC binding, required to initiate a cascade of immunological responses against infection.

Entities:  

Keywords:  Africa; Botswana; HLA class II alleles; T-cell epitopes; candidate multi-epitope vaccines (MEV); escape mutation; hepatitis B virus (HBV); immunoinformatics; in silico

Mesh:

Substances:

Year:  2020        PMID: 32640609      PMCID: PMC7412261          DOI: 10.3390/v12070731

Source DB:  PubMed          Journal:  Viruses        ISSN: 1999-4915            Impact factor:   5.818


1. Background

Hepatitis B virus (HBV), a member of the Hepadnaviradae family, is the major etiology of end stage liver diseases (ESLD), liver cirrhosis (LC) and hepatocellular carcinoma (HCC), and causes up to 887,000 deaths per year [1]. Although more than 90% of healthy adults resolve acute HBV infection within 6 months, there remain over 287 million people who test seropositive for hepatitis B surface antigen (HBsAg) [2] and have chronic HBV infection (CHB). Viral clearance is mediated by cytokines, lymphocytes, and the ability to mount a multi-specific polyclonal and vigorous T cell-mediated response against HBV antigens for a protective immunity [3,4,5]. The quality of these responses is influenced by host genetics, as well as the ability of certain viral variants to escape immune recognition [6,7,8]. The major histocompatibility complexes (MHCs)—known as human leukocytes antigens (HLAs) in humans—are integral components of host genes located at chromosome 6p21. These highly polymorphic proteins serve as mediators of adaptive immune responses by presenting processed antigenic peptides to T cells. The two compatible types of MHCs—class I and class II—present exogenous and endogenous epitopes to CD8+ cytolytic T cells and CD4+ T helper (Th) cells, respectively [9]. The MHC class II alleles (HLA-DR, -DQ and -DP) present epitopes to CD4+ T cells [epi-HLA class II → CD4+ T-cells] that in turn elicit adaptive immune responses against viral infections by facilitating the induction of CD8+ cytotoxic T-lymphocytes (CTLs), production of cytokines crucial for survival, and maturation of B cells [10,11,12,13]. The link between HBV pathogenesis and host immunological profiles is still poorly understood [2]. HBV exhibits a high mutation rate, although only a small number of amino acid substitutions have been characterized functionally due to the costly and time-consuming nature of in vitro assays. Recent approaches have utilized in silico approaches such as machine learning techniques to prioritize candidate peptides for in vitro assays [14,15]. Thus far, the HBV mutations characterized have been associated with sensitivity of immunologic and molecular-based assays and viral escape leading to poor prognosis [16,17,18,19]. However, amino acid variations that influence HBV-MHC binding are poorly understood. In silico mapping of HLA class II binding peptides can be used to identify candidate peptides for in vitro assays to confirm CD4+ T cell epitopes which are crucial for the design of epitope-based vaccines and highly sensitive diagnostic tools that can detect low HBV DNA levels which are frequently missed by diagnostic kits [20,21,22,23]. Sub-Sahara Africa (SSA) and the Western Pacific regions are highly endemic for CHB [24], where the circulating genotypes include A, D, and E for SSA, and B and C for the Western Pacific. HBV genotypes A (subgenotype A1) and D (subgenotype D3) have been reported in Botswana, with genotype E occurring rarely [25,26,27,28,29]. Not only do genotypes show unique geographic distribution, they also differ in treatment response, pathogenesis potential, and prognosis [30,31]. Studies conducted in China have mapped different T cell epitopes that may be eligible for epitope-based vaccines and some were evaluated in vitro [20,32]. However, these findings may be less applicable to African populations whose host genetic pool, circulating genotypes, and immune profiles for HBV (e.g., hepatitis B e antigen [HBeAg] and HBsAg positivity) differ considerably with those of the Chinese population. A prerequisite to determine the epitope(s) for inclusion in epitope-based vaccines include (1) identification of conserved regions of the genome and (2) characterization of those regions that elicit protective immune responses [33]. Although there are hepatitis B vaccines in use currently, vaccine escape does occur; thus, more optimized vaccine candidates may be needed to avoid vaccine failure. In this study, we utilized HLA class II alleles that occur at the highest frequency in Botswana and locally derived HBV strains to identify HLA class II binding peptides which are good candidates for confirmatory in vitro tests of immunogenicity. The present study had three major aims: (1) to determine the repertoires of HLA class II epitopes within HBV envelope sequences of genotypes A and D isolated from different risk groups in Botswana (described in our earlier papers); (2) to compare if the predicted epitopes in genotypes A and D may vary across other HBV genotypes, suggesting that genotype-based multi-epitope vaccines would be more successful than the broad potency vaccines currently in use; (3) to investigate amino acid variations within these epitopes to determine if they may lead to immune escape (i.e., candidate escape mutations).

2. Materials and Methods

2.1. Mapping Peptides from HBV Surface Gene Restricted to HLA-class II Alleles

Three sequence datasets were included in the current analysis. The first database (N1) was used to map epitopes (epi) that bind predominantly to HLA class II alleles in Botswana and consisted of 92 non-recombinant full-length S gene (PreS/S) sequences [25,26,28] retrieved from GenBank (accession numbers MF979142—MF979176, KR139743—KR139748, and MH464807—MH464854). The aligned sequences were sorted by genotype and included A (n = 44) and D (n = 48). The three domains of HBV surface proteins—PreS1, PreS2, and S—were manually extracted from an overlapping Pol/S fragment and were divided into genotypes whose amino acid (aa) sequence alignments were sorted according to column similarities. Nucleotide alignments and sorting were performed using AliView 1.21 software [34]. Each region was then used to create a consensus sequence with the threshold set at 90% for all positions. Variants that did not meet this threshold were investigated independently in post-analyses. To assess if the aa composition of consensus sequences was representative of existing HBV strains, BLAST searches were conducted using the NCBI database, and strains exhibiting 100% similarity and coverage were evaluated further (Supplementary Table S1). The 15-mer HBV peptides overlapping by 14 aa were tested for binding to 9 HLA class II alleles—HLA-DRB1*0101, DRB1*0301, DRB1*0401, DRB1*0701, DRB1*0802, DRB1*1101, DRB1*1302, DRB1*1501, and DRB5*0101—that have high population coverage in Botswana [35]. The NetMHCIIPan version 3.2 online server (http://www.cbs.dtu.dk/services/NetMHCIIpan/) [36] was used to predict binding peptides, and their binding-affinity scores were categorized based on the Log-transformed binding affinity [1-Log50k (aff)]. The settings were adjusted starting with default [1-Log50k (aff)] of 0.426; 500nM and logAff = 0.638; 50nM for weak and strong binding respectively. Ten previously characterized HBV envelope proteins (PreS1, PreS2, S) epitopes—S: 18–37 epitope ID 51310; S: 70–84 (3966); S: 83–98 (46959); S: 200–211 (66307); S: 201–215 (59353); S: 211–244 (6574); S: 230–247 (47877); S: 363–378 (76458); S: 376–389 (17331); S: 378–386 (37664)—available at (www.iedb.org) [37] were included to calibrate the tool’s settings, and the thresholds showing highest specificity were utilized in the present study. Using percentile rank of eluted ligand prediction score (%Rank_EL), the strong binders (SB) were determined between 2–10% Rank_EL and 10–50% Rank_EL for weak binders (WB). Peptides with binding affinity less than that of WB were deemed pseudo binders (NB). Figure 1 outlines the various analytical steps included in this study, and the NetMHCIIPan results are provided in the Supplementary Table S2.
Figure 1

Schema illustrating the flow of data analysis used in this study. N = sample size; SB = strong binding peptides; WB = weak biding peptides; Tepi = total predicted epitopes; PreS/S = HBV surface gene; Depi = epitope densities. Sequences were derived from patients with different clinical outcomes: −(HBV/HIV; CHB; OBI)—HIV = human immunodeficiency virus; OBI = occult hepatitis B infection, CHB = chronic hepatitis B infection, HBV/HIV = coinfection. The blue colored segment shows the pipeline used to evaluate the diversity of epi. The grey segment is the pipeline used to determine epi and measure of promiscuity and conservativeness. The pink segment is the pipeline used to determine the best candidate vaccine.

The relationship between the length of the protein and the frequency of binding-peptides were compared using epitope density score (Depi) for all protein domains of S gene (PreS1, PreS2, S) and genotypes (A versus D). Depi score was defined as the proportion of binding peptides epi (where; epi = WB + SB) to total predicted epitopes (Tepi) relative to protein size. Tepi represent the total count of all predicted proteins.

2.2. Determining Prevalence of Putative Epitopes in HBV Genotypes (A–I) Except A and D

Putative or promiscuous epi were defined as peptides that exhibit similar binding affinity to 2 or more alleles. The prevalence of predicted putative epi were determined in a second dataset (N2) that included 10,308 PreS/S sequences (genotype B = 2905; C = 5575; E = 1118; F = 477; G = 86; H = 69; I = 78) retrieved from HBV database available at http://hvdr.bioinf.wits.ac.za/alignments/index.html [38]. N2 was also used to determine the overall prevalence of the predicted escape mutations. The sequences used in this analysis are included in the supplementary file provided; Supplementary-Table S3.

2.3. Variations Causing Escape to HLA Class II Binding

Dataset N3 consisted of 7743 HBV sequences (genotype A = 3115; genotype D = 4628) used to determine the frequency of aa variations which were termed HLA escape mutations in other HBV sequences. Sequences used were curated from http://hvdr.bioinf.wits.ac.za/alignments/index.html, partitioned by proteins (PreS1, PreS, S) [38]. Escape mutations were defined as those aa variations within the 15-mer core aa sequence that cause the binding affinity to change from either strong to pseudo binding (SB → NB) or from weak to pseudo binding (WB → NB). Several in-house customized Python pipelines were used to accurately investigate the frequency of escape mutations. The sequences used in this analysis are in the supplementary file provided; Supplementary Table S4.

2.4. Screening Putative Epi and Reconstruction of Tertiary Structure of the Modelled Vaccine

Since the predicted putative epi can be also homologous to human peptides that may (1) cause either autoimmune responses when used as a vaccine or (2) give false results when used as a diagnostic marker, a BLAST search was conducted with the NCBI protein database for all immunogenic epi. Afterwards, the predicted putative epi were catenated using a previously described method [39], and different combinations were used to construct candidate multi-epitope vaccines (MEVs). Physiological and biochemistry proteins such as thermal stability, desirable shelf-life, and pH among other properties are prerequisites during development of an ideal vaccine. The biochemical properties of generated candidate proteins were evaluated using online ProtoParam tool [40]. The proteins exhibiting properties similarly to those of vaccines currently in use were deemed the best candidate. The properties of 3 current HepB vaccines—including VO_0011094, VO_0011095, and VO_0011093—are curated in the DNA vaccine database [41]. Properties predicted include; immunogenicity, antigenicity, instability index, estimated half-life in humans’ molecular weight (mw), aliphatic index (AI), grand average of hydrophobicity (GRAVY), and theoretical (pH).

2.5. Determining the HLA-HBV Association Using Tepi

To test the hypothesis that Tepi could serve as a useful predictor of HLA-HBV associations, Tepi of S genes (A and D) were used to rank the 9 HLA class II alleles in the post epi prediction analyses. The available literature was used to corroborate the analyses.

3. Results

3.1. Predicting T-cell Epitopes Using Consensus from N1 Dataset

We first generated 6 different consensus sequences from PreS/S sequences of genotypes A and D. Consensus sequences were validated by comparison to the NCBI database and identified 16 pre-existing sequences which exhibit 100% identity to: PreS1 genotype A (PreS1A) consensus sequence; 39 to PreS1D, 6 to PreS2A, 12 to PreS2D, 19 to SA, and 20 to SD, respectively. The consensus sequences, identical sequences, and their country of origin are provided in supplementary file; Supplementary Table S1. Epitope is defined as a peptide that binds either weakly or strongly bind to HLA-DR alleles used, while HLA class II escape mutations were defined as aa variations within the 9-mer core aa sequence that changed the epitope-HLA-DR binding from SB/WB to pseudo binding. In total, there were 1819 total binding predicted (Tepi) from 297 unique epitopes restricted to 9 HLA class II including 20.2% SB. The number of signature aa differentiating HBV genotypes A and D were 31, 13, and 9 for the S, PreS1, and PreS1 regions, respectively (Table 1). S protein had the highest binding peptides constituting 79.9% of the sum of all Tepi (epi), PreS1 constituted 11%, while PreS2 had the least (9.1%) epi. The Depi of the SB and WB among genotypes (A and D) and proteins (PreS1, PreS2, S) are summarized in Figure 2.
Table 1

Distribution of T cell epitopes restricted to 9 HLA class II alleles with high population coverage in Botswana.

PreS1PreS2S
A (120 aa) D (108 aa)A (55 aa)D (55 aa)A (226 aa)D (226 aa)
Total bindings (n = 1819) SB (367); WB (1452) (%)6; 79 (4.7)7; 107 (6.3)4; 81 (4.7)3; 78 (4.5)122; 619 (40.7)136; 577 (39.1%)
Unique epi (n = 297)25 ¥38 ¥29 ¥24 ¥125 ¥126 ¥
Ratio = Tepi: Nepi3.43.03.42.95.95.7
Most active HLA: DRB * (SB; WB)*0802; *0101 *0401; (*0401, *0101)*0301; (*0401, *0101)*0301; 5*0101 *0702; *0401*0401; *1501
Genotype variation: AD
(A|D epi; p-value)A|D > 0.05A|D > 0.05A|D > 0.05

¥ indicates existence of epi that are common in both genotypes A and D. SB; strong binding peptides. WB; weak binding peptides. aa; amino acids. Web-logo diagrams represent signature aa between consensus sequences of genotype A and D set at a threshold of 100%. HBV; hepatitis B virus. * 0101 means HLA class II allele DRB1*0101 etc. 5*0101 means HLA class II allele DRB5*0101.

Figure 2

Epitope densities (Depi) of different PreS/S proteins stratified by genotype (A or D) and protein (PreS1, PreS2, S). where can be any protein (PreS/SA versus PreS/SD). PreS1A represent genotype A large Hepatitis B surface antigen (HBsAg); PreS1D represent genotype D large HBsAg; PreS2A represent genotype A middle HBsAg; PreS2D represent genotype D middle HBsAg; SA represent genotype A small HBsAg; SD represent genotype D small HBsAg. Tepi = Total binding peptides (WB + SB). Nepi unique = count of unique binding peptides per each protein.

3.2. Prevalence of Putative Epitopes Across all HBV Genotypes

To assess if the predicted epi may be suitable for vaccine inclusion, the most common epi were compared to sequences other than genotypes A and D. Table 2 shows the number of aa sequences containing the indicted epi. Eight out of 53 epi were found in semi-conserved regions ranked as “++++” and had prevalence > 85% among sequences in N2 dataset. These results suggest that multi-epitope genotype-based vaccines may be better to avoid vaccine escape.
Table 2

Showing the prevalence of S protein epi in other HBV genotypes (B–I).

Epitope Sites in S ProteinAA SequenceB (n = 2905)C (n = 5575)E (n = 1118)F (n = 477)G (n = 86)H (n = 69)I (n = 78)Prevalence (%) = 1 − [Count of Seq10308] *%Degree of Conservation ↓ (+: Variable) epi Previously Discussed
17–31AFGKFLWEWASARFSE = 998C = 32F = 1 10,0+[42]
180–194AGFFLLTRILTIPQSB = 62C = 4627E = 5F = 2G = 75 46,3++[43]
90–104CLIFLLVLLDYQGMLB = 2605C = 4661E = 1034F = 442G = 84H = 65I = 7086,9++++[44]
69–83CPGYRWMCLRRFIIFB = 2384C = 5064E = 1027F = 465G = 84H = 67I = 6588,8++++[42]
19–33FFLLTRILTIPQSLDB = 63C = 4717E = 5F = 2G = 76 47,2++[45,46,47]
158–172FGKFLWEWASARFSWE = 996C = 31F = 1 10,0+[48]
20–34FLLTRILTIPQSLDSB = 63C = 4709E = 5F = 5G = 78 47,1++[49]
93–107FLLVLLDYQGMLPVCB = 2616C = 4720E = 1031F = 446G = 84H = 66I = 7287,7++++[50]
161–175FLWEWASARFSWLSLB = 1E = 997C = 81F = 3I = 16 10,7+[42]
179–193FVQWFVGLSPTVWLSB = 2630C = 4698E = 76G = 85I = 75 73,4+++[51]
18–32GFFLLTRILTIPQSLB = 63C = 4618E = 5F = 2G = 76 46,2++-
159–173GKFLWEWASARFSWLE = 1002C = 31F = 3 10,1+-
202–216GPSLYSILSPFLPLLC = 19E = 2 0,2+[48]
71–85GYRWMCLRRFIIFLFB = 82C = 5065E = 1032F = 464G = 84H = 67I = 6566,5+++-
92–106IFLLVLLDYQGMLPVB = 2606C = 4659E = 1030F = 443G = 84H = 66I = 7086,9++++-
195–209IWMMWYWGPSLYSILB = 3C = 24E = 2 0,3+-
160–174KFLWEWASARFSWLSB = 1E = 1012C = 34F = 3I = 16 10,3+-
91–105LIFLLVLLDYQGMLPB = 2611C = 4656E = 1031F = 443G = 84H = 66I = 7086,9++++[52]
21–35LLTRILTIPQSLDSWB = 63C = 4754E = 5F = 5G = 77 47,6++[53]
94–108LLVLLDYQGMLPVCPB = 2653C = 4729E = 1032F = 445G = 84H = 66I = 7288,1++++[54]
15–29LQAGFFLLTRILTIPB = 62C = 4735E = 5F = 2G = 76 47,3++[54]
192–206LSVIWMMWYWGPSLYB = 772C = 4249E = 889G = 1I = 50 57,8++[44]
95–109LVLLDYQGMLPVCPLB = 2619C = 4708E = 1022F = 444G = 84H = 66I = 7387,5++++[55,56]
162–176LWEWASARFSWLSLLB = 7E = 1001C = 88F = 445G = 2H = 65I = 7516,3+[54]
205–219LYSILSPFLPLLPIFC = 21E = 2 0,2+-
178–192PFVQWFVGLSPTVWLB = 2649C = 4723E = 76G = 85I = 74 73,8+++-
70–84PGYRWMCLRRFIIFLB = 2375C = 5117E = 1033F = 466G = 85H = 67I = 6589,3++++[49]
66–80PPTCPGYRWMCLRRFB = 60C = 751E = 14F = 464G = 77H = 68 13,9+[44]
203–217PSLYSILSPFLPLLPC = 19E = 2 0,2+-
67–81PTCPGYRWMCLRRFIB = 60C = 750E = 14F = 464G = 77H = 68 13,9+[57]
16–30QAGFFLLTRILTIPQB = 62C = 4645E = 5F = 2G = 76 46,5++[58]
181–195QWFVGLSPTVWLSVIB = 2588C = 4310E = 73G = 6I = 71 68,4+++[58]
38–52SLNFLGGTTVCLGQNB = 5C = 2E = 2 0,1+[55,56,57]
204–218SLYSILSPFLPLLPIC = 19E = 2 0,2+[44]
193–207SVIWMMWYWGPSLYSB = 3C = 24E = 2 0,3+-
155–169SWAFGKFLWEWASARE = 998C = 31F = 1 10,0+-
68–82TCPGYRWMCLRRFIIB = 63C = 757E = 17F = 466G = 77H = 68 14,0+-
37–51TSLNFLGGTTVCLGQB = 5C = 2E = 2 0,1+[59,60]
194–208VIWMMWYWGPSLYSIB = 3C = 24E = 2 0,3+-
14–28VLQAGFFLLTRILTIB = 61C = 4676E = 5F = 2G = 76 46,8++[61]*
180–194VQWFVGLSPTVWLSVB = 2636C = 4584E = 77G = 6I = 75 71,6+++[62]
156–170WAFGKFLWEWASARFE = 1000C = 32F = 1 10,0+[44]
163–177WEWASARFSWLSLLVB = 7E = 1002C = 87F = 445G = 2H = 64I = 7516,3+-
182–196WFVGLSPTVWLSVIWB = 2506C = 4125E = 73G = 6I = 71 65,8+++[63]
201–215WGPSLYSILSPFLPLC = 19E = 2 0,2+[52]
196–210WMMWYWGPSLYSILSB = 3C = 25E = 2 0,3+-
36–50WTSLNFLGGTTVCLGB = 5C = 2E = 2 0,1+-
35–49WWTSLNFLGGTTVCLB = 5C = 2E = 2 0,1+-
199–213WYWGPSLYSILSPFLC = 18E = 2 0,2+-
206–220YSILSPFLPLLPIFFC = 19E = 2 0,2+-
200–214YWGPSLYSILSPFLPC = 18E = 2 0,2+-

+ The degree of conservation. The scale used: if score > = 100, then highly conserved and will be denoted by ‘+++++’. elif score > = 85: then semi conserved = ‘++++’. elif score > = 60: region of mutation and is denoted by ‘+++’. elif score > = 20, then highly variable region = ‘++’. else: high escape mutation = ‘+’. n represents the number of sequences used in the analysis. B represents full-length genotype B sequences, C represents full-length genotype C sequences, etc. 17–31 representing the beginning the position occupied by the 14-mer epitope predicts (e.g., 17 is the first amino acid of the epitope, while 31 is the last amino acid of the epitope).

3.3. Profiles of Strong Binding Epitopes

SB epi of the 3 proteins were sorted by the core aa sequence and analyzed based on genotype. Those found in both genotypes were considered putative when binding to alleles. There were no SB epitopes that were common between sequences of genotypes A and D for PreS1 and PreS2 regions. S protein had 89 out of 230 epi that satisfied the above criteria and were promiscuous for at least 5 alleles. There were 5 unique epi whose core aa were at S protein residues 41–49 (FLGGSPVCL), 14 epi with S protein residues 20–28 (FLLTRILTI), 10 epi with S residues 183–191 (FVGLSPTVW), 9 epi with S residues 22–30 (LTRILTIPQ), 6 epi with S residues 162–170 (LWEWASARF), 6 epi with S residues 184–192 (VGLSPTVWL), 12 epi with S residues 96–104 (VLLDYQGML), 2 epi with S residues 180–188 (VQWFVGLSP), 9 epi with S residues 163–171 (WEWASARFS), 4 epi with S residues 182–190 (WFVGLSPTV), and 12 epi with S residues 72–80 (YRWMCLRRF). Table 3 shows the full profiles of SB epi mapped for the S proteins for genotypes A and D.
Table 3

List of most promiscuous SB epi mapped from S protein.

S: ResiduesAA SequenceGenoCore aaHLA Class II Alleles S: Residues AA SequenceGenoCore aaHLA Class II Alleles
69–83CPGYRWMCLRRFIIFAYRWMCLRRF*0101*0301*0401*0701*0802*1302 68–82TCPGYRWMCLRRFIIDYRWMCLRRF*0101*0301*0401*0701*0802 *1302*1501
CPGYRWMCLRRFIIFD168–182VRFSWLSLLVPFVQWAWLSLLVPFV*0101 *0401*0701*0802*1101*1302*15015*0101
68–82 ICPGYRWMCLRRFIIA14–28VLQAGFFLLTRILTIAFLLTRILTI *0301*0401 *0802 *1302*15015*0101
70–84 PGYRWMCLRRFIIFLDVLQAGFFLLTRILTID
PGYRWMCLRRFIIFLA*0101*0301*1501165–179WASARFSWLSLLVPFDFSWLSLLVP*0101 *0401*0802*1101 5*0101
71–85 GYRWMCLRRFIIFLFA 174–188SLLVPFVQWFVGLSPAFVQWFVGLS*0101*0802*1101*1501
GYRWMCLRRFIIFLFDSLLVPFVQWFVGLSPD
67–85 PICPGYRWMCLRRFIA6–20SGFLGPLLVLQAGFFALGPLLVLQA*0101*0701*0802*1101*15015*0101
66–84 PPICPGYRWMCLRRFA*0101 *15015*0101SGFLGPLLVLQAGFFD*0701
67–81 PTCPGYRWMCLRRFID*1501 7–21GFLGPLLVLQAGFFLD
147–161 CTCIPIPSSWAFAKYACIPIPSSWA*0101*0401*0701*0802*1501 GFLGPLLVLQAGFFLA
CTCIPIPSSWAFGKFD5–19TSGFLGPLLVLQAGFD
145–159 GNCTCIPIPSSWAFAATSGFLGPLLVLQAGFA
146–160 NCTCIPIPSSWAFAKA93–107FLLVLLDYQGMLPVCDLVLLDYQGM*0101*0401*0802*1101*1501
NCTCIPIPSSWAFGKD5*0101FLLVLLDYQGMLPVCA*0802*1101
177–191 VPFVQWFVGLSPTVWDWFVGLSPTV*0101*0401*0701*0802*11015*010192–106IFLLVLLDYQGMLPVA*0802*1101
VPFVQWFVGLSPTVWAIFLLVLLDYQGMLPVD*0802*1101
168–182 ARFSWLSLLVPFVQWDWLSLLVPFV*0101*0401*0701*0802*1101*15015*0101182–196WFVGLSPTVWLSVIWDVGLSPTVWL *0301*0401*0802*1302*1501
169–183 RFSWLSLLVPFVQWFDWFVGLSPTVWLSVIWA*0301*0401*0802*1302
RFSWLSLLVPFVQWFA97–111LLDYQGMLPVCPLIPAYQGMLPVCP*0101 *0401*0701 *1101*15015*0101
167–181 SARFSWLSLLVPFVQDLLDYQGMLPVCPLIPD*0401*0701*1101
176–189 LVPFVQWFVGLSPTVAWFVGLSPTV*0101*0401*0701*0802 5*010196–110VLLDYQGMLPVCPLIA*0401*0701*1101
LVPFVQWFVGLSPTVDVLLDYQGMLPVCPLID*0401*0701*1101
166–180 ASARFSWLSLLVPFVD*0101*0401*0701*08025*010172–86YRWMCLRRFIIFLFIAWMCLRRFII *0701*0802*1101*15015*0101
170–184 FSWLSLLVPFVQWFVA*1101 YRWMCLRRFIIFLFIDWMCLRRFII
FSWLSLLVPFVQWFVD194–208VIWMMWYWGPSLYNIAWYWGPSLYN*0101*0401*0701*0802*1101 5*0101
167–181 SVRFSWLSLLVPFVQA 5*0101

PreS1A represent genotype A epi derived from sequences of large Hepatitis B surface antigen (HBsAg); PreS1D represent genotype D epi derived from sequences of large HBsAg; PreS2A represent genotype A epi derived from sequences of middle HBsAg; PreS2D represent genotype D epi derived from sequences of middle HBsAg; SA represent genotype A epi derived from sequences of small HBsAg; SD represent genotype D epi derived from sequences of small HBsAg. *0101 means HLA class II allele DRB1*0101 e.tc. 5*0101 means HLA class II allele DRB5*0101.

3.4. Profiles of Most Promiscuous Epitopes

Since the majority of predicted epi were WB, a strict threshold was applied to select the most the promiscuous epi. Thus, epi were selected if they bind to least 6 alleles or more as shown in Table 4.
Table 4

Highlighting most promiscuous T cell epitopes restricted to 9 HLA class II alleles.

Epitope SiteAA SequenceHBV ProteinCore AAHLA Class II AllelesPreviously Discussed epi
34–48PVPNIASHISSISSRPreS2AIASHISSIS1*0101, 1*0401, 1*0701, 1*0802, 1*1101, 1*1302, 1*1501, 5*0101-
39–53ASHISSISSRTGDPAPreS2AISSISSRTG1*0101, 1*0401, 1*0701, 1*0802, 1*1101, 1*1501, 5*0101-
38–52IASHISSISSRTGDPPreS2AISSISSRTG1*0101, 1*0401, 1*0701, 1*0802, 1*1101, 1*1501, 5*0101-
37–51TTASPLSSIFSRIGDPreS2DLSSIFSRIG1*0101, 1*0401, 1*0701, 1*0802, 1*1101, 1*1501, 5*0101-
38–52TASPLSSIFSRIGDPPreS2DLSSIFSRIG1*0101, 1*0401, 1*0701, 1*0802, 1*1101, 1*1501, 5*0101-
39–53ASPLSSIFSRIGDPAPreS2DLSSIFSRIG1*0101, 1*0401, 1*0701, 1*0802, 1*1101, 1*1501, 5*0101-
208–222ILSPFIPLLPIFFCLSAFIPLLPIFF1*0101, 1*0401, 1*0701, 1*0802, 1*1101, 1*1501, 5*0101-
209–223LSPFIPLLPIFFCLWSAFIPLLPIFF1*0101, 1*0401, 1*0701, 1*0802, 1*1101, 1*1501, 5*0101-
207–221NILSPFIPLLPIFFCSAFIPLLPIFF1*0101, 1*0401, 1*0701, 1*0802, 1*1101, 1*1302, 5*0101-
149–163CIPIPSSWAFAKYLWSAIPSSWAFAK1*0101, 1*0401, 1*0701, 1*0802, 1*1101, 1*1501, 5*0101-
6–20SGFLGPLLVLQAGFFSALGPLLVLQA1*0101, 1*0401, 1*0701, 1*0802, 1*1101, 1*1501, 5*0101-
200–214YWGPSLYNILSPFIPSALYNILSPFI1*0101, 1*0401, 1*0701, 1*0802, 1*1101, 1*1302, 1*1501-
199–213WYWGPSLYNILSPFISALYNILSPFI1*0101, 1*0401, 1*0701, 1*0802, 1*1101, 1*1501, 5*0101-
164–178EWASVRFSWLSLLVPSAVRFSWLSLL1*0101, 1*0401, 1*0802, 1*1101, 1*1302, 1*1501, 5*0101[64]
168–182VRFSWLSLLVPFVQWSAWLSLLVPFV1*0101, 1*0401, 1*0701, 1*0802, 1*1101, 1*1302, 1*1501, 5*0101[44]
169–183RFSWLSLLVPFVQWFSAWLSLLVPFV1*0101, 1*0401, 1*0701, 1*0802, 1*1101, 1*1501, 5*0101[65]
156–170WAFAKYLWEWASVRFSAYLWEWASVR1*0101, 1*0301, 1*0401, 1*0701, 1*0802, 1*1101, 1*1501, 5*0101[49]
208–222ILSPFLPLLPIFFCLSDFLPLLPIFF1*0101, 1*0401, 1*0701, 1*0802, 1*1101, 1*1501, 5*0101-
209–223LSPFLPLLPIFFCLWSDFLPLLPIFF1*0101, 1*0401, 1*0701, 1*0802, 1*1101, 1*1501, 5*0101-
155–169SWAFGKFLWEWASARSDFLWEWASAR1*0101, 1*0301, 1*0401, 1*0701, 1*0802, 1*1101, 1*1501-
6–20SGFLGPLLVLQAGFFSDLGPLLVLQA1*0101, 1*0401, 1*0701, 1*0802, 1*1101, 1*1501, 5*0101[44]
199–212WYWGPSLYSILSPFLSDLYSILSPFL1*0101, 1*0401, 1*0802, 1*1101, 1*1302, 1*1501, 5*0101-
167–181SARFSWLSLLVPFVQSDWLSLLVPFV1*0101, 1*0401, 1*0701, 1*0802, 1*1101, 1*1501, 5*0101-
168–182ARFSWLSLLVPFVQWSDWLSLLVPFV1*0101, 1*0401, 1*0701, 1*0802, 1*1101, 1*1501, 5*0101[44]
169–183RFSWLSLLVPFVQWFSDWLSLLVPFV1*0101, 1*0401, 1*0701, 1*0802, 1*1101, 1*1501, 5*0101[42,49]
197–211MMWYWGPSLYSILSPSDWYWGPSLYS1*0101, 1*0401, 1*0701, 1*0802, 1*1101, 1*1501, 5*0101-
68–82TCPGYRWMCLRRFIISDYRWMCLRRF1*0101, 1*0301, 1*0401, 1*0701, 1*0802, 1*1101, 1*1501, 5*0101-

*0101 means HLA class II allele DRB1*0101 e.tc. 5*0101 means HLA class II allele DRB5*0101.

3.5. Designing and Predicting Structure of Candidate Multi-Epitope Vaccine

Table 4 shows 27 putative epi binding to at least 6 alleles and were selected to model the tertiary structure of candidate vaccines. The overlapping S region epi (SA = 11, SD = 10) were catenated to form proteins which were used to model a tertiary structure as shown in Figure 3. The overlapping SA epi at S protein residues 6–20 with aa sequence SGFLGPLLVLQAGFF, aa sequence CIPIPSSWAFAKYLWEWASVRFSWLSLLVPFVQWF at S protein residues 155–183, and aa sequence WYWGPSLYNILSPFIPLLPIFFCLW at S protein residues 199–223 yields a 75 amino acid protein of mw = 8878.62. Of the 6 proteins predicted, the most stable SA protein was vacci-SA with amino acid sequence: 5′-SGFLGPLLVLQAGFFWYWGPSLYNILSPFIPLLPIFFCLWCIPIPSSWAFAKYLWEWASVRFSWLSLLVPFVQWF-3′. The overlapping epi in SD occupied residues S: 6–20, S: 68–82, S: 197–223 and S: 155–183. The resulting protein was 85 aa long and had mw = 10,749.99. Of the 24-proteins predicted, the one selected for constructing tertiary structure was vacci-SD with aa sequence: 5′-MMWYWGPSLYSILSPFLPLLPIFFCLWSGFLGPLLVLQAGFFSWAFGKFLWEWASARFSWLSLLVPFVQWFTCPGYRWMCLRRFIIFLF-3′. When a BLAST search was conducted with the NCBI protein database for the 2 protein sequences, results show that both sequences were similar to 2 domains of major surface antigen (vMSA) from hepadnavirus superfamily; accession number pfam00695. A similar approach was used to select proteins from the epi in PreS2 region to determine proteins that can be used to model the 3D tertiary structures of epi in the PreS2 region. In total, there were 6 epi selected 3 for each genotype. The overlapping epi were 20 aa long with mw of 2006.20 and occupying residues PreS2: 34–53 in genotype A sequences. vacci-PreSD was made from overlapping epi occupying residues PreS2: 37–53 and had a mw of 1719.91. All proteins generated using the different epi ordering (permutations and combinations) have been provided under the supplementary file Supplementary Table S5.
Figure 3

Showing a tertiary structure of candidate vaccines: (a) Tertiary structures of candidate epi modelled using 3Dpro webtool. The SA protein in (a) has the aa composition: 5′-SGFLGPLLVLQAGFFWYWGPSLYNILSPFIPLLPIFFCLWCIPIPSSWAFAKYLWEWASVRFSWLSLLVPFVQWF-3′, and had following theoretical properties: antigenicity (0.04), instability index (II = 47.07), estimated half-life in vitro = 1.9 h, molecular weight (mw) = 8878.62, aliphatic index (AI = 114.40) and grand average of hydrophobicity (GRAVY = 0.995), and theoretical alkalinity (pI = 7,76). Using VaxiJen ver2.0 server set at threshold of 0.4, the overall prediction for the protective antigen was 0.53, displaying it as a plausible antigen [66]. (b) (5′-MMWYWGPSLYSILSPFLPLLPIFFCLWSGFLGPLLVLQAGFFSWAFGKFLWEWASARFSWLSLLVPFVQWFTCPGYRWMCLRRFIIFLF-3′) protein modelled using from SD epi. The protein had following theoretical properties: antigenicity (0.11), instability index (II = 53), estimated half-life in vitro 30 h, molecular weight (mw) = 10749.99, aliphatic index (AI = 101.91) and grand average of hydrophobicity (GRAVY = 0.965), and theoretical alkalinity (pI = 9.42),). The antigenicity score predicted in both candidate vaccine suggests that they are plausible antigens [66].

3.6. Mutations Associated with Escape from Class II HLA Binding

For amino acid variants present in more than 5% of sequences of N1 database, we evaluated the in silico impact on immune recognition. Mutations were labelled relative to the HBV surface gene proteins—PreS11–115, PreS21–42, and S1–226. Mutations that hindered HBV peptide-HLA binding were 86T, 90T, and 94P in PreS1A; 54P, 79E, 84S, and 85Q in PreS1D; 12I, 31I, and 54P in PreS2A; 5F, 22H, 22L, 22P, 32H, 36L, and 42S in PreS2D. The list of escape mutations and their prevalence in 7743 HBV genotype A and D sequences are shown in Table 5 and supplementary file Supplementary Table S5. There were coordinated variations among positions in aa sequence alignments that had an impact on HBV-HLA binding but showed no impact when analyzed individually. These were termed covariance mutations. The combinations of the covariance mutations and their impact on binding potential are shown in Table 5, Table 6 and Table 7 below.
Table 5

The prevalence of S gene escape mutations.

ProteinAA SequenceS Protein AA Residues (epi)Escape MutationsCount in A (n = 3115)Count in D (n = 4628)Count in Other Genotypes
SAMENITSGFLGPQLV1–14L12Q1313
1–14I4T132591
1–14I4Stop10--
SAGPLLVLQAGFFLLTR10–24L15Stop2-17
SALNFPGGSPVCLGQNS39–53L42P13653
SATRILTIPQ *LDSWWT23 -37S31Stop612-
SADLWWTSLNFLGDPPV33–47G44D4-16
DSWWTSLNFLGESPVG44E10587826
IPIPSSWGFAKYLWE150–164A157G1053
SAIPIPSSWAFVKYLWEA159V203234
SAPPICPGYRWMCQRRF66–80L77R7243
L77Q6-18
SAPPICPGYRWMCLR *FR79Stop91020
R79H135849
SALIFLLVLLDYQDMLP91–105G102D123
SACLIFLLVLLDYQGML90–104D99Stop1558
Y100C1961217
M103I313129
SASLLVPFVQWFVGLTP LLVPFVQWFEGLSPT174–188S187T1--
G185E61519
V184E122
L186P10-5
SASPTVWLLAIWMMWYW187–193S193L104257612
A194V494 ∆ *
SAGPSLYNISSPFIPLL202–216L209S4912
SDENITSGCLGPLLVLQAGF2–16F8C-1-
SAYLWEWASVRFSWPSL161–175L173P1311
SASPFIPLLLIFFCLWV210–224P214L114123
SDSSWAFGKFLWEWASA154–168 208–222K160N179
SDILSPYILLLPIFFCIF212Y + L213I + P214L-; -; 1140; 202; 413; ∆ *;23
NITSGFLGLLLVLQA4 -18P11L243
SDMENITSGFLGPLLVL1–15T5P; N3S + I4T + T5A2; -4; 24; -29
SDSWWTSLNFLGETTVC34–48G44E10587826
SDSWWTSLNFRGGTTVCL42R14814
SDLSVIWMMWYWGPNLY192–206S204N136194848
List of aa Variations within Genotype A and D epi M1E, M1L, M1K, M1I, M1V, M1T, M1R, E2H, E2Q, E2*, E2V, E2K, E2A, E2G, E2D, N3H, N3E, N3P, N3A, N3C, N3D, N3I, N3R, N3Y, N3K, N3T, N3G, N3S, I4L, I4R, I4Y, I4Q, I4F, I4M, I4H, I4K, I4A, I4P, I4S, I4N, I4V, I4T, T5Y, T5Q, T5L, T5R, T5E, T5K, T5V, T5P, T5I, T5S, T5A, S6G, S6F, S6A, S6P, S6T, S6L, G7C, G7D, G7*, G7A, G7V, G7K, G7E, G7R, F8C, F8A, F8G, F8V, F8Y, F8I, F8H, F8P, F8S, F8L, L9R, L9K, L9V, L9I, L9H, L9Q, L9P, G10D, G10*, G10V, G10Q, G10T, G10A, G10K, G10E, G10R, P11A, P11T, P11H, P11L, L12V, L12E, L12M, L12R, L12P, L12Q, L13Q, L13V, L13I, L13F, L13R, L13H, L13P, V14R, V14L, V14E, V14I, V14M, V14G, V14A, L15E, L15K, L15T, L15*, L15I, L15F, L15V, L15S, Q16K, Q16E, Q16H, Q16L, Q16R, Q16P, A17R, A17T, A17S, A17P, A17V, A17G, A17E, G18W, G18K, G18E, G18A, G18R, G18V, F19I, F19V, F19L, F19S, F19Y, F19C, F20I, F20Y, F20L, F20S, L21V, L21M, L21F, L21*, L21W, L21S, L22Q, L22M, L22V, L22F, L22S, L22*, L22W, T23F, T23R, T23Q, T23S, T23P, T23A, T23I, R24N, R24T, R24Q, R24E, R24I, R24G, R24S, R24K, I25R, I25S, I25F, I25N, I25A, I25T, I25V, L26Y, L26I, L26F, L26Q, L26P, L26H, L26R, T27R, T27P, T27S, T27A, T27K, T27I, I28K, I28L, I28V, I28T, I28M, P29E, P29A, P29F, P29S, P29Q, P29T, P29L, Q30M, Q30S, Q30P, Q30A, Q30L, Q30H, Q30R, Q30K, S31K, S31I, S31D, S31T, S31C, S31G, S31R, S31N, L32V, L32G, L32Q, L32R, L32I, L32P, D33V, D33H, D33Y, D33E, D33N, D33G, S34*, S34A, S34T, S34W, S34P, S34L, W35P, W35C, W35*, W35G, W35R, W35L, W36E, W36G, W36R, W36*, W36L, T37D, T37P, T37L, T37I, T37S, T37N, T37A, S38E, S38A, S38Y, S38F, S38P, L39I, L39V, L39R, L39H, L39F, L39P, N40Q, N40H, N40K, N40I, N40D, N40S, F41Q, F41I, F41Y, F41C, F41L, F41S, L42*, L42S, L42I, L42V, L42Q, L42R, L42P, G43V, G43W, G43K, G43R, G43E, G44Q, G44K, G44R, G44A, G44D, G44V, G44E, S45M, S45D, S45G, S45E, S45H, S45Q, S45R, S45I, S45K, S45N, S45V, S45L, S45P, S45A, S45T, P46N, P46R, P46S, P46I, P46A, P46H, P46L, P46T, V47Q, V47N, V47L, V47P, V47M, V47E, V47R, V47K, V47A, V47G, V47T, C48W, C48L, C48F, C48R, C48G, C48S, C48Y, L49A, L49T, L49C, L49S, L49I, L49V, L49F, L49H, L49R, L49P, G50C, G50W, G50P, G50R, G50D, G50V, G50S, G50A, Q51E, Q51K, Q51H, Q51P, Q51R, Q51L, N52T, N52G, N52H, N52K, N52I, N52D, N52S, S53G, S53M, S53*, S53T, S53P, S53W, S53L, Q54K, Q54*, Q54H, Q54L, Q54P, Q54R, S55T, S55Y, S55A, S55P, S55F, S55C, P56A, P56*, P56S, P56R, P56H, P56L, P56Q, T57S, T57A, T57I, S58Y, S58A, S58L, S58T, S58P, S58F, S58C, N59L, N59I, N59R, N59H, N59D, N59K, N59S, H60D, H60S, H60N, H60Y, H60Q, H60R, H60P, S61T, S61P, S61*, S61L, P62S, P62Q, P62L, T63V, T63F, T63N, T63S, T63A, T63I, S64W, S64P, S64F, S64Y, S64C, C65G, C65F, C65S, C65Y, C65R, P66S, P66L, P66T, P66A, P66Q, P66H, P67T, P67A, P67R, P67S, P67L, P67Q, I68D, I68P, I68V, I68F, I68S, I68N, I68A, I68T, C69F, C69G, C69S, C69R, C69Y, C69W, C69*, P70S, P70H, P70R, P70A, P70T, P70L, G71S, G71D, G71W, G71V, Y72S, Y72D, Y72N, Y72H, Y72F, Y72C, R73L, R73C, R73P, R73H, W74C, W74G, W74R, W74*, W74S, W74L, M75K, M75L, M75R, M75S, M75V, M75T, M75I, C76G, C76*, C76R, C76W, C76S, C76F, C76Y, L77P, L77G, L77M, L77V, L77Q, L77R, R78I, R78G, R78W, R78P, R78L, R78Q, R79P, R79G, R79L, R79C, R79S, R79H, F80G, F80N, F80Y, F80I, F80L, F80S, I81S, I81Y, I81N, I81F, I81M, I81V, I81T, I82F, I82N, I82V, I82M, I82T, I82L, F83Y, F83I, F83L, F83C, F83S, L84F, L84H, L84P, F85Q, F85R, F85P, F85L, F85Y, F85S, F85C, I86M, I86K, I86L, I86F, I86V, I86T, L87M, L87V, L87R, L87Q, L87P, L88R, L88V, L88M, L88Q, L88P, L89V, L89Y, L89T, L89R, L89I, L89Q, L89P, C90G, C90W, C90N, C90I, C90*, C90Y, C90F, C90S, L91G, L91F, L91P, L91R, L91H, I92H, I92N, I92V, I92M, I92L, I92S, I92T, F93R, F93W, F93A, F93Y, F93I, F93L, F93C, F93S, L94G, L94*, L94V, L94F, L94M, L94W, L94S, L95P, L95F, L95M, L95V, L95S, L95*, L95W, V96I, V96E, V96N, V96F, V96L, V96C, V96D, V96A, V96G, L97C, L97R, L97V, L97H, L97F, L97P, V98M, V98W, V98A, V98Q, V98P, V98R, V98L, D99H, D99K, D99V, D99A, D99E, D99N, D99G, Y100P, Y100*, Y100K, Y100D, Y100H, Y100N, Y100L, Y100W, Y100S, Y100F, Y100C, Q101P, Q101L, Q101*, Q101K, Q101H, Q101R, G102N, G102R, G102A, G102D, G102C, G102S, G102V, M103K, M103R, M103L, M103T, M103V, M103I, L104Q, L104M, L104G, L104V, L104W, L104S, L104F, P105T, P105S, P105H, P105A, P105L, P105R, V106L, V106R, V106Q, V106D, V106E, V106G, V106I, V106A, C107A, C107W, C107*, C107G, C107S, C107Y, C107R, P108A, P108T, P108S, P108L, P108H, L109R, L109V, L109I, L109Q, L109M, L109P, I110R, I110H, I110V, I110N, I110F, I110S, I110P, I110T, I110M, I110L, A111N, A111R, A111T, A111Q, A111S, A111L, A111P, G112V, G112K, G112N, G112E, G112A, G112R, S113C, S113R, S113F, S113L, S113Y, S113K, S113P, S113N, S113A, S113T, T114V, T114*, T114L, T114R, T114N, T114M, T114K, T114A, T114P, T114S, T115P, T115S, T115A, T115I, T115N, T116S, T116P, T116A, T116I, T116N, S117Y, S117A, S117V, S117R, S117I, S117G, S117N, S117T, T118W, T118N, T118G, T118I, T118E, T118P, T118S, T118K, T118R, T118M, T118A, T118V, G119S, G119W, G119T, G119*, G119C, G119V, G119E, G119R, P120H, P120N, P120R, P120L, P120A, P120Q, P120S, P120T, C121T, C121G, C121F, C121L, C121Y, C121S, C121W, C121R, K122G, K122V, K122E, K122N, K122T, K122I, K122S, K122Q, K122R, T123G, T123V, T123P, T123S, T123N, T123I, T123A, C124G, C124R, C124W, C124Y, C124S, C124F, T125P, T125I, T125A, T125M, T126P, T126Q, T126K, T126G, T126M, T126L, T126V, T126N, T126A, T126S, T126I, P127F, P127R, P127V, P127H, P127S, P127A, P127I, P127T, P127L, A128T, A128R, A128G, A128V, Q129E, Q129K, Q129L, Q129P, Q129N, Q129H, Q129R, G130V, G130*, G130D, G130K, G130A, G130E, G130S, G130R, G130N, N131Q, N131G, N131D, N131H, N131K, N131S, N131I, N131A, N131P, N131T, S132H, S132Y, S132P, S132C, S132F, M133R, M133G, M133K, M133V, M133Q, M133S, M133I, M133L, M133T, F134K, F134D, F134W, F134T, F134Q, F134C, F134R, F134V, F134H, F134S, F134L, F134N, F134I, F134Y, P135S, P135R, P135T, P135H, P135L, S136T, S136C, S136A, S136L, S136P, S136F, S136Y, C137S, C137R, C137*, C137Y, C137W, C138S, C138*, C138W, C138G, C138R, C138Y, C139S, C139G, C139*, C139Y, C139W, C139R, T140P, T140K, T140M, T140A, T140L, T140I, T140S, K141N, K141Q, K141T, K141I, K141R, K141E, P142T, P142A, P142H, P142R, P142I, P142S, P142L, T143W, T143F, T143A, T143P, T143M, T143L, T143S, D144Y, D144N, D144V, D144A, D144G, D144E, G145D, G145V, G145I, G145S, G145K, G145E, G145A, G145R, N146I, N146T, N146K, N146S, N146D, C147W, C147S, C147R, C147G, C147Y, T148S, T148I, T148A, C149S, C149G, C149W, C149Y, C149R, I150S, I150N, I150M, I150V, I150T, P151S, P151R, P151T, P151L, P151H, I152H, I152F, I152L, I152T, I152V, P153A, P153T, P153L, P153Q, P153S, S154Q, S154A, S154H, S154V, S154T, S154P, S154L, S155T, S155A, S155Y, S155F, S155P, W156S, W156C, W156G, W156R, W156*, W156L, A157S, A157N, A157D, A157P, A157V, A157T, A157G, F158I, F158Y, F158S, F158L, A159P, A159Q, A159L, A159S, A159T, A159R, A159E, A159V, A159G, K160Q, K160T, K160E, K160G, K160S, K160N, K160R, Y161K, Y161C, Y161V, Y161H, Y161I, Y161L, Y161S, Y161F, L162S, L162*, L162R, L162V, L162I, L162P, L162Q, W163G, W163C, W163R, E164*, E164K, E164A, E164V, E164D, E164G, W165Q, W165*, W165G, W165C, W165S, W165R, W165L, A166S, A166D, A166P, A166T, A166V, A166G, S167Y, S167E, S167T, S167G, S167P, S167L, V168L, V168I, V168F, V168G, V168D, V168S, V16887T, V168P, V168A, R169G, R169L, R169S, R169C, R169P, R169H, F170V, F170L, F170S, S171N, S171Y, S171C, S171L, S171P, S171F, W172M, W172F, W172G, W172R, W172S, W172L, W172C, W172*, P173I, P173Y, P173S, P173V, P173F, P173L, S174G, S174K, S174C, S174R, S174T, S174I, S174N, L175I, L175*, L175F, L175S, L176I, L176V, L176Q, L176P, V177P, V177I, V177S, V177G, V177E, V177M, V177L, V177A, P178G, P178R, P178H, P178A, P178L, P178S, P178Q, F179I, F179C, F179L, F179Y, F179S, V180P, V180L, V180E, V180F, V180I, V180A, Q181K, Q181E, Q181L, Q181H, Q181R, W182F, W182G, W182S, W182R, W182C, W182L, W182*, F183V, F183Y, F183I, F183L, F183S, F183C, V184L, V184P, V184D, V184F, V184I, V184E, V184G, V184A, G185K, G185L, G185W, G185R, G185A, G185E, L186M, L186A, L186R, L186C, L186S, L186I, L186V, L186F, L186P, L186H, S187A, S187T, S187C, S187P, S187F, S187L, P188A, P188S, P188R, P188T, P188H, P188L, T189L, T189N, T189A, T189S, T189P, T189I, V190S, V190E, V190P, V190D, V190G, V190I, V190F, V190A, W191K, W191S, W191*, W191G, W191L, W191R, W191C, L192R, L192H, L192V, L192F, L192P, S193*, S193T, S193P, S193F, S193L, A194G, A194D, A194S, A194T, A194F, A194L, A194I, A194V, I195K, I195V, I195L, I195T, I195M, W196G, W196V, W196K, W196R, W196*, W196S, W196L, M197L, M197K, M197R, M197V, M197I, M197T, M198L, M198V, M198R, M198K, M198T, M198I, W199P, W199C, W199R, W199G, W199S, W199*, W199L, Y200H, Y200*, Y200L, Y200N, Y200S, Y200C, Y200F, W201C, W201L, W201G, W201S, W201R, W201*, G202V, G202E, G202R, G202A, P203H, P203T, P203G, P203S, P203A, P203L, P203Q, P203R, S204Q, S204H, S204C, S204D, S204I, S204T, S204G, S204K, S204R, S204N, L205Q, L205R, L205V, L205P, L205M, Y206G, Y206P, Y206Q, Y206*, Y206I, Y206W, Y206V, Y206D, Y206R, Y206L, Y206N, Y206S, Y206F, Y206H, Y206C, N207G, N207K, N207C, N207D, N207P, N207H, N207I, N207T, N207R, N207S, I208L, I208C, I208F, I208S, I208V, I208N, I208T, L209K, L209A, L209F, L209*, L209G, L209M, L209S, L209W, L209V, S210M, S210G, S210I, S210T, S210K, S210R, S210N, P211A, P211T, P211S, P211R, P211L, P211H, F212C, F212S, F212L, F212Y, I213A, I213V, I213S, I213F, I213T, I213M, I213L, P214G, P214R, P214T, P214H, P214S, P214Q, P214L, L215D, L215I, L215V, L215T, L215M, L215Q, L215R, L215P, L216T, L216G, L216C, L216R, L216Y, L216V, L216I, L216S, L216F, L216*, P217A, P217T, P217Q, P217S, P217L, I218P, I218K, I218S, I218F, I218N, I218T, I218L, F219C, F219I, F219L, F219S, F220V, F220I, F220S, F220W, F220Y, F220L, F220C, C221V, C221R, C221W, C221L, C221S, C221G, C221F, C221Y, H222Q, H222V, H222S, H222R, H222F, H222I, H222P, H222L, W223F, W223G, W223C, W223R, W223*, W223L, V224L, V224I, V224E, V224G, V224A, Y225A, Y225*, Y225N, Y225D, Y225L, Y225I, Y225H, Y225C, Y225F, Y225S, I226R, I226H, I226L, I226F, I226N, I226V, I226T, I226M, I226S,

1R represents the position of the mutation ‘1’ and R is the change in amino acid “candidate escape mutations”. ∆ Genotype C sequences were excluded in the analysis. ∆* genotype B, C, D, E, F, G, H and I sequences were excluded in this analysis. * = Stop codon.

Table 6

Summary of PreS1 epitopes binding to respective alleles, and mutations that exists in the Botswana HBV sequences.

Epitope PositionCore AA SequenceHLA_ DRB*1/5_ Genotype AEpitopes PositionCore AA SequenceHLA_ DRB*1/5_ Genotype DMutations Relative to Protein: (WT, aa#, Mut)
AD
84–92ILATVPAVP ¥*080273–81ILQTLPANP I84V, I84T, A86V, A86T, V88M, V88L, A90P, A90V, P92LI73M, L75H, L77M, L77V, A79E
85–93LATVPAVPP ¥*˜080274–82LQTLPANPP I84V, I84T, A86V, A86T, V88M, V88L, A90P, A90V, P92LI73M, L75H, L77M, L77V, A79E
34–42FGANSNNPD 23–30FRANTANP ¥*0401-R24K
16–24LSVPNPLGF 5–13LSTSNPLGF ¥*0401, *1302--
24–32FFPDHQLDP 13–21FFPDHQLDP*0401,F25L-
63–71FGPGFTPPH 52–60FGLGFTPPH*0101, *0401, *0701, 5*0101F67LL54M, L54P, F56L
34–42FGANSNNPD 23–30FRANTANPD*0101, *0401, *0802, *1302-R24K
67–75FTPPHGGVL 56–64FTPPHGGLL*0101, *0701,F67L, G73R-
28–36HQLDPAFGA 17–25HQLDPAFRA*0301, *0401,-R24K
84–92ILATVPAVP*0101, *0301, *0401, *0701, *0802, *1101, *1302, *1501, 5*010173–81ILQTLPANP*0101, *0301, *0401, *0701, *0802,*1302,I84V, I84T, A86V, A86T, V88M, V88L, A90P, A90V, P92LI73M, L75H, L77M, L77V, A79E
85–93LATVPAVPP*0401, *0802, 5*010174–82LQTLPANPP*0802, 5*0101I84V, I84T, A86V, A86T, V88M, V88L, A90P, A90V, P92LI73M, L75H, L77M, L77V, A79E
74–82LLGWSPQAQ*150163–71 *0101, *0401, *0802, *1501W77stop, S78R, P79S, A81S, A81PS67N, P68S
16–24LSTSNPLGF 5–13 *0301, *0701, *1302--
12–21MGTNLSVPN*0401, *0802, *13021–10MGQNLSTSNP -G2E
15–22NLSVPNPLG*0401,4–12NLSTSNPLGF --
83–90QGILATVPA*0101, *080271–79QGILQTLPA*0101,G83D, I84V, I84K, A86T, A86V, V88L, V88M, A90T, A90VI73M, L75H, L77M, L77V, A79E
14–22TNLSVPNPL*0101, *0701, *13023–11QNLSTSNPL*0101, *0401,--
4–12WSAKPRKGM*1101, 5*0101- S5P, A6S, A6T, K10N, K10I-
77–85WSPQAQGIL*0101, *0701,66–75WSPQAQGIL*0101, *0701,W77stop, S78R, P79S, A81P, A81S, G83D, I84V, I84KP68S,

*0101 means HLA class II allele DRB1*0101 e.tc. 5*0101 means HLA class II allele DRB5*0101. ¥ indicates SB.

Table 7

Summary of PreS2 epitopes binding to respective alleles, and mutations that exist within the Botswana HBV sequences.

GenotypeAA Sequence Core AA Sequence of epiPres2 epi ResiduesCount of epi with Core AA SequenceHLA Class II Alleles Mutations in the Core Sequence
AALQDPRVRGAFHQALQDPRVRGLYFPA7–247*0301R16K, V17I
AASHISSISSVPNTASHISSISSRT35–49 *0401A39T, I45S
DASPLSSIFSVPTTASPLSSIFSRIG35–502*0101, *0401L42I, L42S
AFHQALQDPRNSTAFHQALQDPRVRG4–192*0301, 5*0101A11T, R16K
DFHQTLQDPRNSTTFHQTLQDPRVR4–181*0301-
DFSRIGDPALSPLSSIFSRIGDPALN40–551*0101, *0401, *0701, 5*0101R48T, P52H, A53V, L54P
AHISSISSRTVPNTASHISSISSRTG35–504*0101, *0401, *0701I45S, R48T
DIFSRIGDPASPLSSIFSRIGDPALN40–552*0802, *1302, *1501L42I, L42S
AISSISSRTGPNTASHISSISSRTGDPALN36–556*0101, *0401, *0701, *0802, *1101, *1501, 5*0101I45S, R48T
ALNPVPNTASSSSGTLNPVPNTASHISSI27–455*0401, *0802L32H, L32R, P34L, N37H, N37T, I38T, A38T
DLSSIFSRIGPTTASPLSSIFSRIGDPALN36–556*0101, *0401, *0701, *0802, *1101, *1501, 5*0101P52L, A53V
AMQWNSTAFHMQWNSTAFHQALQDP1–151*1302Q2I, A7T
MQWNSTTFHQTLQDP--S5F, S5Y, T6A, T7I, T7N
DPLSSIFSRIVPTTASPLSSIFSRI35–491*0701L42I, L42S
APVPNTASHISSGTLNPVPNTASHISSI27–456*1302P34L, N37H, N37T
DPVPTTASPLVNPVPTTASPLSSIF32–462*0701P34H, P36L, L42I, L42S
AQALQDPRVRTAFHQALQDPRVRGLYF6–2235*0101-
DQTLQDPRVRSTTFHQTLQDPRVRGLYF5–2245*0101R22K
DTLQDPRVRGSTTFHQTLQDPRVRGLYFPA6*0301R18K, G19D, G19A
AVRGLYFPAGDPRVRGLYFPAGGSSSG14–303*0802, *1101, *1501V17I, Y21N, Y21S, F22L, F22P, F22T, P23A
D3*0802, *1101, *1501R18K, G19D, G19A, F2222Q, F22H, F22P, F22L
AWNSTAFHQAMQWNSTAFHQALQDP1–151*0401, *0701A7T, A11T
DWNSTTFHQTMQWNSTTFHQTLQDP1*0701S5F, S5Y, T6A, T7I, T7N
AYFPAGGSSSPRVRGLYFPAGGSSSGTLNP15–346*0101, *0401, *1101, 5*0101Y21N, Y21S, F22L, F22P, F22T, P23A, S29L
DYFPAGGSSSPRVRGLYFPAGGSSSGTVNP6*0101, *0401, 5*0101, *1101F22Q, F22H, F22P, F22L

*0101 means HLA class II allele DRB1*0101 e.tc. 5*0101 means HLA class II allele DRB5*0101.

3.7. Assessing the Distribution of HBV-HLA (epi) as a Predictor of HLA Protective Effective

We used existing information on immunological studies for meta-analysis to estimate Tepi as a predictor of the protective effect of MHC class II alleles. Figure 4 shows a Pareto analysis and the 20% threshold indicates the 3 HLA class II alleles—DRB1*0301, 1302, *1101—that are highly likely to be less protective against HBV. We therefore speculate, with caution, that there is a relationship between Tepi and protectiveness and this should be further investigated to establish correlations (p-value < 0.05) with high statistical confidence.
Figure 4

Pareto Analysis applied to rank the Tepi of alleles against their percentage frequency.

4. Discussion

This detailed HBV immunoinformatics approach outlines the candidate peptides that can be used to develop biologicals against CHB. However, mutations within the T cell epitopes may impair HBV-HLA complex formation, a crucial component responsible for initiating cascade of responses for viral clearance [67]. In this study, we observed that there were no epi that bind to all alleles, and there was a large difference between the epi profiles of genotype A compared to genotype D. This phenomenon may explain previous failures during preclinical trials of candidate vaccines against CHB generated thus far, which were designed for broad potency. [68,69,70,71,72,73,74,75]. This may suggest that a genotype- and population-based, multi-epitope vaccine would be the best candidate to combat HBV. Using the cut-off set in this analysis, 21 epi from S regions and 6 from PreS2 regions were identified and used to construct tertiary structure of candidate vaccine. The candidate vaccines showed high binding in all alleles except for DRB1*0301. Populations with high coverage of DRB1*0301 have been closely associated with high susceptibility to CHB infection and nonresponse vaccination with envelope proteins [53,76,77]. Among the 3 types of proteins (PreS/S) analyzed in this study, S had the highest binding peptides constituting 79.9% of the sum of all Tepi (epi), and genotype D had the most epitopes. However, when Depi were compared between genotypes (A and D) and proteins (PreS/S), it was clear that the length of proteins where independent of the epi but dependent on the aa compositions and the position of the epi within the proteins. While S region was the longest protein (213 aa), it had similar Depi to the smallest protein PreS2D (PreS2 -54 aa). Previous studies that investigated all 7 proteins of HBV and showed that S protein had the most T cell antigenic epitopes [78,79]. However, further investigation should be conducted to determine any relationships between antigenicity and PreS1 protein with least epi. We also compared the frequency of epi between 2 genotypes, A and D, and observed that genotype D had generally more immunogenic epi than A with exception for PreS2A whose epi were 12% more than those of PreS2D, (Figure 1). Clinically, these trends correspond to data reported from studies that investigated the prognosis of patients infected with HBV genotype A1 strains compared to genotype D3. Others have reported a 10-fold increased progression to HCC among HBV patients infected with genotype A compared to genotype D and that patients infected with genotype A strains were likely to progress to CHB compared to genotype D [8,80]. Most countries in SSA, including Botswana, have low prevalence of HBV genotype D3 among different risk groups than genotype A1 [25,26,27,81]. Using existing information from studies that investigated the impact of alleles on different HBV outcomes to validate our statistical associations [53,82,83,84,85], we observed that out of 9 alleles, HLA-DRB*1301/2 and *0401 alleles—which had most epi—have been associated with spontaneous clearance of HBV infection [18,24,33,68,78,82,85,86,87], and HLA DRB1*0301 that had the least epi in our study has been previously associated with susceptibility to HBV infection, autoimmune hepatitis, chronicity, and non-responsiveness to HBV vaccination across different ethnic groups [9,88,89]. This strongly suggests that Tepi of PreS/S should be explored further as a predictor of the protective effect of HLA class II alleles. A host immune system can recognize foreign antigens (epi) and clear the infection in some cases [90,91,92]; however, most pathogens including HBV can mutate within epitopes, and this may result in an escape from host immune surveillance leading to persistence of infection [93]. This characteristic is regarded as one of the major hindrances in developing high potency therapeutic drugs. This mechanism of aa changes within epitopes (escape mutations) interferes with both peptide processing reducing the intracellular antigen load and downregulation of MHC expression hence increased risk of developing liver malignancies (HCC, LC) among CHB patients [4,5,94,95,96,97,98]. Studies investigating the role of escape mutations within the T cell epitopes are relatively rare. In this study, we observed coordinated aa variations, which reveal genetic dependencies (i.e., epi that escaped HLA binding when there were two or more mutations); however, some single aa mutations altered the binding potential. These mutations were termed covariance mutations. For instance, in the proportion of PreS1 binding peptides from both genotype A and D against alleles shown in Figure 3, 15-mers with core aa sequence ILATVPAVP84–92 in PreS1A—corresponding to ILQTLPANP73–81 in PreS1D—weakly binds to alleles: *0101; 0401; 0701; 0802; *1101; 1302; *1501; 5*0101 → PreS1A, and *0101; 0401; 0701; 0802; 1302→ PreS1D respectively. The aa Ala in genotype A is replaced by aa Gln for genotype D [A(A) → Q (D)] and [Val(A) → Lue(D)]91 causing PreS1D to pseudo bind to 3 alleles (*1101, 1302, and 5*0101). Additionally, the epitope strongly binds to allele *0802, but the changes in genotype D result in pseudo binding. Table 3, Table 4, Table 5, Table 6 and Table 7 summarize all the core aa of PreS/S epitopes that are restricted to 9 HLAs. We observed that the HBV epitope-HLA is greatly influenced by the position of core aa. For instance PreS1D epi AFGLGFTPPHGGLLG51–65 is a WB to alleles: *0101, *0401 and 5*0101 when using core-aa: FGLGFTPPH52–60 but it can only bind to *0701 when the core aa is FTPPHGGLL57–64. Furthermore, post epi analyses show that there were mutations outside the core aa sequences but had impact on the HBV-HLA blinding. For instance, 2 epi with aa residues S: 41–55—FLGGPPVCLGQNSQS and FLGGSPVCLGQNSQS—and all with core aa sequences FLGGPPVCL show different binding affinities thus NB and WB respectively. The escape mutations defined in the present study were those found within the core aa sequence. Overall, the present computational study facilitates the development of experimental epitope and escape mutation mapping studies.

5. Conclusions

Vaccines act by inducing strong immunity to counteract viral antigens presented by MHC-epitopes. However, their success is affected by virus evolution (e.g., escape mutations) within known protective epitopes; hence, multi-epitope, population-based vaccine constructs are preferred in order to generate a potent immunologic response against HBV. We demonstrate the quality of T cell epitopes among different HBV genotypes and reconstructed a candidate multi-epitope population-based vaccine. Our results suggest that among aa variations classified as polymorphisms do exit T-cell escape mutations and. In silico studies should be followed up with preclinical assays to validate the novelty of their findings. Viral hepatitis (VH) is a global burden, and the WHO has put forth an ambitious goal to eliminate VH as a public threat by 2030. HBV contributes a vast majority (77%) of VH cases and there are no therapeutic cures for chronic hepatitis B infections (CHB). We hypothesized that epitope vaccines are a potential CHB treatment because they can induce strong immune systems with ability to achieve hepatitis B virus surface antigen (HBsAg) loss. While several trials have failed to produce effective vaccines against CHB from T cell epitopes, we aimed to investigate the repertoire of T cell epitopes from different HBV genotypes (A and D), MHC class II alleles with high population coverage in Botswana. In silico analyses were used to map promiscuous epitopes (15-mers) using alleles -9 MHC class II alleles-, and PreS/S sequences -genotype A and D- with high population coverage in Botswana. Some epitopes mapped within PreS/S conserved regions, and none were promiscuous to all alleles suggesting that multi-epitope, population-based vaccines (MEPBV) may be more effective candidate vaccines against CHB compared to previously reported broad potency epitope-based candidate vaccines. Highly promiscuous peptides may also be considered as candidate peptides for designing highly sensitive diagnostic chips since current serological kits may fail to detect other HBV clinical phenotypes. The mapped T epitopes exhibited high mean diversity among genotypes and others had coordinated amino acid variations that were genetically dependent on each other in order to escape epitope-HLA binding.
  88 in total

1.  Combined GS-4774 and Tenofovir Therapy Can Improve HBV-Specific T-Cell Responses in Patients With Chronic Hepatitis.

Authors:  Carolina Boni; Harry L A Janssen; Marzia Rossi; Seung Kew Yoon; Andrea Vecchi; Valeria Barili; Eric M Yoshida; Huy Trinh; Timothy C Rodell; Diletta Laccabue; Arianna Alfieri; Federica Brillo; Paola Fisicaro; Greta Acerbi; Giuseppe Pedrazzi; Pietro Andreone; Carmela Cursaro; Marzia Margotti; Rosanna Santoro; Valeria Piazzolla; Maurizia R Brunetto; Barbara Coco; Daniela Cavallone; Yang Zhao; Adarsh Joshi; Jacky Woo; Audrey H Lau; Anuj Gaggar; G Mani Subramanian; Benedetta Massetto; Scott Fung; Sang Hoon Ahn; Xiaoli Ma; Alessandra Mangia; Carlo Ferrari
Journal:  Gastroenterology       Date:  2019-03-28       Impact factor: 22.682

Review 2.  Genetic variation of hepatitis B virus and its significance for pathogenesis.

Authors:  Zhen-Hua Zhang; Chun-Chen Wu; Xin-Wen Chen; Xu Li; Jun Li; Meng-Ji Lu
Journal:  World J Gastroenterol       Date:  2016-01-07       Impact factor: 5.742

3.  The Immune System: Basis of so much Health and Disease: 3. Adaptive Immunity.

Authors:  Eleni A Georgakopoulou; Yazan Hassona
Journal:  Dent Update       Date:  2017-04

4.  The effect of HLA on immunological response to hepatitis B vaccine in healthy people: a meta-analysis.

Authors:  Zheng-Kang Li; Jing-Jing Nie; Jie Li; Hui Zhuang
Journal:  Vaccine       Date:  2013-07-23       Impact factor: 3.641

Review 5.  Hepatitis B: 50 years after the discovery of Australia antigen.

Authors:  A Suk-Fong Lok
Journal:  J Viral Hepat       Date:  2015-08-17       Impact factor: 3.728

6.  Association between an MHC class II allele and clearance of hepatitis B virus in the Gambia.

Authors:  M R Thursz; D Kwiatkowski; C E Allsopp; B M Greenwood; H C Thomas; A V Hill
Journal:  N Engl J Med       Date:  1995-04-20       Impact factor: 91.245

Review 7.  Estimations of worldwide prevalence of chronic hepatitis B virus infection: a systematic review of data published between 1965 and 2013.

Authors:  Aparna Schweitzer; Johannes Horn; Rafael T Mikolajczyk; Gérard Krause; Jördis J Ott
Journal:  Lancet       Date:  2015-07-28       Impact factor: 79.321

8.  Results of a phase III clinical trial with an HBsAg-HBIG immunogenic complex therapeutic vaccine for chronic hepatitis B patients: experiences and findings.

Authors:  Dao-Zhen Xu; Xuan-Yi Wang; Xin-Liang Shen; Guo-Zhong Gong; Hong Ren; Li-Min Guo; Ai-Min Sun; Min Xu; Lan-Juan Li; Xin-Hui Guo; Zhen Zhen; Hui-Fen Wang; Huan-Yu Gong; Cheng Xu; Nan Jiang; Chen Pan; Zuo-Jiong Gong; Ji-Ming Zhang; Jia Shang; Jie Xu; Qing Xie; Tie-Feng Wu; Wen-Xiang Huang; Yong-Guo Li; Jing Xu; Zheng-Hong Yuan; Bin Wang; Kai Zhao; Yu-Mei Wen
Journal:  J Hepatol       Date:  2013-05-11       Impact factor: 25.083

Review 9.  T-cell epitope vaccine design by immunoinformatics.

Authors:  Atanas Patronov; Irini Doytchinova
Journal:  Open Biol       Date:  2013-01-08       Impact factor: 6.411

10.  DNAVaxDB: the first web-based DNA vaccine database and its data analysis.

Authors:  Rebecca Racz; Xinna Li; Mukti Patel; Zuoshuang Xiang; Yongqun He
Journal:  BMC Bioinformatics       Date:  2014-03-19       Impact factor: 3.169

View more
  3 in total

Review 1.  Peptides for Vaccine Development.

Authors:  Ian W Hamley
Journal:  ACS Appl Bio Mater       Date:  2022-02-23

Review 2.  Peptides to combat viral infectious diseases.

Authors:  Shams Al-Azzam; Yun Ding; Jinsha Liu; Priyanka Pandya; Joey Paolo Ting; Sepideh Afshar
Journal:  Peptides       Date:  2020-09-01       Impact factor: 3.750

Review 3.  Innate and adaptive immune escape mechanisms of hepatitis B virus.

Authors:  Hua-Jun Zhao; Yi-Fei Hu; Qiu-Ju Han; Jian Zhang
Journal:  World J Gastroenterol       Date:  2022-03-07       Impact factor: 5.742

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.