Literature DB >> 25598792

Characterization of HCV genotype 5a envelope proteins: implications for vaccine development and therapeutic entry target.

Maemu Petronella Gededzha1, Maphahlanganye Jeffrey Mphahlele1, Selokela Gloria Selabe1.   

Abstract

BACKGROUND: Hepatitis C virus (HCV) is one of the major causes of cirrhosis and hepatocellular carcinoma with an estimation of 185 million people with infection. The E2 is the main target for neutralizing antibody responses and the variation of this region is related to maintenance of persistent infection by emerging escape variants and subsequent development of chronic infection. While both E1 and E2 are hypervariable in nature, it is difficult to design vaccines or therapeutic drugs against them.
OBJECTIVES: The objective of this study was to characterize genotype 5a E1 and E2 sequences to determine possible glycosylation sites, conserved B-cell epitopes and peptides in HCV that could be useful targets in design of vaccine and entry inhibitors. PATIENTS AND METHODS: This study was conducted through PCR amplification of E1 and E2 regions, sequencing, prediction of B-cell epitopes, analysis of N-linked glycosylation and peptide design in 18 samples of HCV genotype 5a from South African.
RESULTS: Differences in the probability of glycosylation in E1 and E2 regions were observed in this study. Three conserved antigenic B-cell epitopes were predicted in the E2 regions and also 11 short peptides were designed from the highly conserved residues.
CONCLUSIONS: This study provided conserved B-cell epitopes and peptides that can be useful for designing entry inhibitors and vaccines able to cover a global population, especially where genotype 5a is common.

Entities:  

Keywords:  Epitopes; Genotype; Hepatitis C Virus; Peptides

Year:  2014        PMID: 25598792      PMCID: PMC4286708          DOI: 10.5812/hepatmon.23660

Source DB:  PubMed          Journal:  Hepat Mon        ISSN: 1735-143X            Impact factor:   0.660


1. Background

Globally, an estimated 185 million people have been infected with hepatitis C virus (HCV) as one of the major causes of cirrhosis and hepatocellular carcinoma (1). HCV genome consists of approximately 9.6 kilobases, positive-sense single-stranded RNA, which encodes three structural (C, E1 and E2) and 7 non-structural (p7, NS2, NS3, NS4A, NS4B, NS5A and NS5B) proteins flanked by 5’ and 3’ untranslated regions (UTR) (2). E1 and E2 proteins are type I transmembrane proteins with both N-terminal ectodomain and a C-terminal domain (3) and contain 6 and 11 glycosylation sites, respectively (4, 5). These proteins are involved in viral entry by interacting with CD81 and Scavenger receptor class B member 1 (SRB1) (6-8). HCV glycosylation sites play an essential role in envelope proteins to ensure correct conformation for virus entry (5, 9) and antigenic variation (10). HCV E2 glycosylation sites interact with cell surface receptors directly allowing the virus to enter the cell (11, 12). Glycosylation sites may mask important epitopes from host antibody responses (13, 14). B-cell epitopes are essential in increasing the preferred immune responses (15, 16) and number of epitopes and modulation of immune recognition of antigens can be influenced by deglycosylation of E1 proteins (17). The E1 derived peptide p35 (amino acid (aa) 315–323) (18), E2-conserved synthetic peptides p37 (aa 517–531) and p38 (aa 412–419) have been reported to neutralize HCV particles, as important components of a candidate peptide vaccine (19). The molecular targets for current HCV Direct-acting antiviral (DAA) in development are mainly focused on non-structural proteins such as the NS3 protease, NS5A and the NS5B RdRp (20). Recently, considerable progress has been made to understand HCV entry (21, 22) and development of entry inhibitors (20, 21, 23, 24). Many patients do not respond to the current available therapy, therefore, there is an urgent need to develop effective HCV vaccines and specific therapeutic drugs. While both E1 and E2 are hypervariable in nature, it is difficult to design vaccines or therapeutic drugs against them. Genotype 5a accounts for over 50% of HCV infections in South Africa (25).

2. Objectives

This study aimed to characterize genotype 5a E1 and E2 sequences to determine possible glycosylation sites, conserved B-cell epitopes and peptides in HCV that could be useful targets in the design of vaccine and entry inhibitors.

3. Patients and Methods

3.1. Study Population

This study included 18 genotype 5a samples collected from treatment-naive HCV infected patients at Dr. George Mukhari Academic Hospital (DGMAH), north-west of Pretoria, South Africa, from 2007 to 2011. Patients’ demographics and genotyping based on 5’UTR were previously described in detail (25). Six of 18 samples were sequenced as part of the genotype 5a near-full length analysis previously described (26). DGMAH is an academic hospital serving a population of around 4 million from both rural and urban areas. It is a referral hospital for patients from the North West, Mpumalanga, Limpopo and the northwest part of Pretoria, Gauteng. The Medunsa Research and Ethics Committee approved the study.

3.2. PCR and Sequencing

Viral RNA was extracted from 140 μL of serum using the QIAamp Viral RNA Mini Kit (Qiagen, Hilden, Germany) according to the manufacturer’s instructions. HCV RNA was converted into cDNA using the enzyme RevertAid TM RT-PCR (Fermentas, Vilnius, Lithuiana). The cDNA was amplified in three overlapping fragments (Table 1) covering complete E1 and E2 regions. Direct sequencing was performed with ABI 3500XL (Inqaba Biotechnological Industry, PTY, Ltd, Pretoria, South Africa) using second round PCR primers. Sequence fragment assembly was performed using Chromas Pro1.5 (www.technelysium.com.au/chromas.html). All sequences were aligned by Mafft (mafft.cbrc.jp/alignment/server/) and translated into amino acids using BioEdit (27).
Table 1.

Sequences of the HCV Primers Used in This Study

SequencesPrimersReference
A this study
F1A1088GAC CAT TTC ATC ATC ATG TCC CA
R1A1425TGT ATG CGG CGG CGA ACA AGA CC
F2A1113CTT CGG AGG GCC GTT GAC TAC TTA GCG
R2A1413CGA ACA AGA CCC CCC AGT GGG
B
M1051292ATG GCA TGG GAC ATG ATG ATG(27)
R1B2061TAG GCC CTA AGT TGC AGG GTG GAthis study
M1061298TGG GAC ATG ATG ATG AAT TGG(27)
R2B2022CAA ACC CTG TGG AAT TCA TCC AGthis study
C this study
F1C1743GGC TGG GGA ACT ATC AGC TAT
R1C2636AAA CCC ATG AGT CCC CGC AGC C
F2C1773TCG GGC CCC AGT GAT GAC AAG
R2C2612AGC CGC GTT TAG GAC AAT GAC GTT CT

3.3. Analysis of N-Linked Glycosylation Sites

The N glycosylation sites were predicted using the online prediction server NetNGlyc version 1.0 (http://www.cbs.dtu.dk/services/NetNGlyc/), which predicts N glycosylation sites in proteins by artificial neural networks that examine the sequence context of Asn-Xaa-Ser/Thr sequins. The networks can identify 86% of the glycosylated and 61% of the non-glycosylated sequins, with an overall accuracy of 76%.

3.4. Prediction of B-Cell Epitopes

For identification of B-cell epitopes, 16-mer B-cell epitopes was predicted using the program ABCpred (http://www.imtech.res.in/raghava/abcpred/) at a 0.51 default threshold using a consensus sequence from 18 genotype 5a sequences created using Bioedit. ABCpred server predicts B-cell epitopes using artificial neural network using fixed length patterns (28). Antigenicity of all predicted epitopes was analyzed using VaxiJen v2.0 online antigen prediction (www.ddg-pharmfac.net/vaxijen/). Proteins having antigenic score more than 0.4 were selected as antigenic. VaxiJen v2.0 allows antigen classification based on physicochemical properties of proteins without recourse to sequence alignment. All predicted epitopes were analyzed for conservation using the IEDB database (http://tools.immuneepitope.org/tools/conservancy/iedb_input) at a threshold of 100% conservation compared to 406, 221, 98, 33, 45, 45 randomly selected sequences from each of the HCV genotypes 1a, 1b, 2, 3, 4 and 6, respectively.

3.5. Peptide Design

Structure analysis of sequence was performed using the Protparam online tool (29). Protparam computed different parameters including the molecular weight, theoretical pI, AA composition, atomic composition, extinction coefficient, instability index, aliphatic index and grand average of hydropathicity (GRAVY). To check post-translational modifications, predicted peptides were predicted for N-linked glycosylation as described above and for N-linked phosphorylation using the NetPhos 2.0 (30) program. The NetPhos 2.0 produces neural network predictions for serine, threonine and tyrosine phosphorylation sites in sequences. Only those motifs with NetPhos score of 0.7 or greater were considered.

3.6. GenBank Accession Numbers

Sequences were submitted to GenBank under the accession numbers KC7678835 - KC767846.

4. Results

4.1. Sequence Alignment and Genetic Distances

Sequence alignment of 18 genotype 5a sequences with a reference sequence from the GenBank showed that most regions in the genotype 5a E1 and E2 proteins were conserved except hypervariable 1 (HVR1), which was highly variable as expected. Comparison of genetic distances between sequences in this study showed intragroup genetic distances ranging from 8% to 17%, with an average distance of 13% (Table 2).
Table 2.

Genetic Distances in E1 and E2 Sequences of Genotype 5a in This Study

Genetic Distances[a,b]
Sequence123456789101112131415161718
1 ZADGM7890
2 ZADGM65440.12
3 ZADGM42270.130.12
4 ZADGM19080.140.120.13
5 ZADGM17070.100.120.110.12
6 ZADGM6510.130.130.140.130.12
7 ZADGM3080.120.120.120.120.100.11
8 ZADGM64850.130.100.130.120.120.120.10
9 ZADGM41240.130.130.130.130.120.140.130.12
10 ZADGM24390.140.150.150.150.120.140.120.130.14
11 ZADGM23520.130.140.140.140.130.120.110.120.140.13
12 ZADGM525gp0.140.140.150.140.120.150.140.140.140.160.16
13 ZADGM8690.140.140.150.130.130.100.130.130.140.140.140.17
14 ZADGM30130.150.140.140.150.120.150.130.120.150.150.150.170.15
15 ZADGM05180.90.120.110.120.080.130.100.110.110.120.130.140.130.11
16 ZADGM25820.140.140.130.140.130.130.120.130.140.130.130.160.150.150.13
17 ZADGM20880.140.130.130.130.120.120.110.120.130.140.120.130.140.140.120.12
18 ZADGM11040.130.140.130.140.120.130.090.120.140.140.140.130.150.150.120.140.13

a The values range between 0 (0%) and 1 (100%) substitutions per nucleotide site.

b The numbers 1-18 corresponds to the sequence number on the vertical side.

a The values range between 0 (0%) and 1 (100%) substitutions per nucleotide site. b The numbers 1-18 corresponds to the sequence number on the vertical side.

4.2. Analysis of E1 and E2 N-Linked Glycosylation

E1 and E2 proteins of 18 sequences were analyzed for possible glycosylation sites. Differences in the probability of glycosylation in E1 and E2 were observed in most sequences. Whereas other studies reported five N-linked glycosylation sites in the E1 region, all strains in the current study showed three or four glycosylation sites, except for ZADGM2088, which showed 2 glycosylation sites, with N325 site not predicted as glycosylation sites from all sequences. In the E2 region, three sequences (ZADGM1104, ZADGM1707 and ZADGM3013) showed nine glycosylation sites, while the remaining had variations in the number of glycosylation sites. In ZADGM308, position N430 was replaced by H, while in ZADGM6544, N448 was replaced by D. Site N476 was found in only 6 of analyzed 18 sequences. The E2 sites N423 and N576 were not predicted as glycosylation sites in all genotype 5a sequences in this study (Table 3).
Table 3.

Probability of Glycosylation in E1 and E2 Sequences

Probability at Glycosylation Site [a,b]
SequenceE1No of SitesE2No of Sites
196 209 234 305 325 417 430 448 476 533 541 557 623 645
1 ZADGM7890++++++-4+++--++++++7
2 ZADGM6544++++--3+++---+++++6
3 ZADGM4227++++++-4++++--+-+++6
4 ZADGM1908++++--3+++-++++++-7
5 ZADGM1707+++++-4+++++++++++9
6 ZADGM651++++++-4+++--++++-6
7 ZADGM308+++++--3++-+-+++++-6
8 ZADGM6485+++++--3++++-++++++-7
9 ZADGM4124+++++-4+++--+++++7
10 ZADGM2439+++++-4+++-+++++-7
11 ZADGM2352+++++--3++++--+++++-6
12 ZADGM525gp-+++++-3+++---++++6
13 ZADGM869++++--3+++--+++++7
14 ZADGM3013+++++-4+++++++++++++9
15 ZADGM0518++++--3++++++--++++-6
16 ZADGM2582++++--3+++-+++++++8
17 ZADGM2088-++++--2+++--++++-6
18 ZADGM1104+++++--3+++++++++++9

a Numbering is based on the M62321 full-length sequence.

b Glycosylation probability is shown by +++ (probability > 70%), ++ (probability between 60 and 70%), + (probability between 50 and 60%), and - (not predicted).

a Numbering is based on the M62321 full-length sequence. b Glycosylation probability is shown by +++ (probability > 70%), ++ (probability between 60 and 70%), + (probability between 50 and 60%), and - (not predicted).

4.3. B-Cell Epitopes Prediction

Three conserved antigenic B-cell epitopes were predicted for genotype 5a sequences in the E2 region. Epitope E2504-609 (GPVYCFTPSPVVVGTT) had the highest antigenic score of 1.1613, while E2675-690 (LPCSFTPTPALSTGLI) and E2685-700 (LSTGLIHLHQNIVDTQ) had antigenic scores of 0.5340 and 0.6639, respectively. For conservancy analysis, epitope E2504-609 was highly conserved among other genotypes, while epitope E2675-690 and E2685-700 were variable (Table 4).
Table 4.

Predicted B-Cell Epitopes of HCV Genotype 5a and Their Antigenicity Score, Number of Allele and Conservancy (Percentage) in Different Genotypes

PositionPredicted EpitopesAntigen scoreGenotype 1aGenotype 1bGenotype 2Genotype 3Genotype 4Genotype 6
504 GPVYCFTPSPVVVGTT1.1613929788948973
675 LPCSFTPTPALSTGLI0.5340000000
685 LSTGLIHLHQNIVDTQ0.6639000002

4.4. Peptide Design

From the consensus sequences of genotype 5a E1 and E2, eleven short peptides of 8-28 amino acids were designed from the highly conserved residues. Five peptides of 9-16 amino acids in length were derived in the E1 region, while six peptides of 8-26 amino acids were derived in the E2. Three of the peptides had post-translation modification, which is the N-linked glycosylation, although at a low probability. None of the peptides has either serine, threonine and tyrosine phosphorylation sites predicted. Most peptides were found to be the best predicted peptides useful for designing entry inhibitors (Table 5).
Table 5.

Predicted Peptides for HCV E1 and E2 Conserved in Genotype 5a

Position [a]PeptidesLengthMolecular WeightTheoretical PIExtinction Coefficient (/cm M)Instability IndexAlphatic indexGRAVYComposition of Hydrophobic AA’s, %[b]N-linked Glycosylation C [c]N-linked Phosphorylation
201 YHTNDCPNSSI141611.75.08298017.3669.29-0.34321.4+-
262 VDYLAGGAA9835.93.801490-3.53108.890.86722.2--
304 CNCSIYSGH99836.7216155.6943.33-0.05611.1++-
314 TGHRMAWDMMMNWSPT161952.26.411100029.16-0.7066.2537.5--
352 HWGVLFAAAY101134.36.746990-4.25981.04040--
562 VKTCGAPPC98758.0312530.6843.330.31111.1%--
585 TDCFRKHP81003.17.9205.150-1.51212.5--
645 ACNWTRGERCDL121423.56.1562527.3140.83-0.90816.7+-
664 LSPLLHTTTQ101110.26.74-37.861170.02030%--
675 AILPCSFTPTPALSTGLIHLHQNIVDTQ282988.45.97028.191150.42132.1--
725 FLLLADAR8918.15.84--1.86171.251.22550--

a Numbering is based on the M62321 full-length sequence.

b list of hydrophobic amino acids (Leu, Val, Ile, Met, Phe and Trp).

c Glycosylation probability is shown by +++ (probability > 70%), ++ (probability between 60 and 70%), + (probability between 50 and 60%), and - (not present).

a Numbering is based on the M62321 full-length sequence. b list of hydrophobic amino acids (Leu, Val, Ile, Met, Phe and Trp). c Glycosylation probability is shown by +++ (probability > 70%), ++ (probability between 60 and 70%), + (probability between 50 and 60%), and - (not present).

5. Discussion

Genotype 5 is the most conserved HCV genotype classified into only one subtype (5a) (26). This study was designed to identify conserved sequences of these proteins to predict antigenic epitopes and peptides that could serve as best targets for vaccine design and potential entry inhibitors. Using different structural and sequence analyses tools helped with in-silico analysis for E1 and E2 regions. HCV genotype 5a sequences were found to be conserved in most regions of E1 and E2 proteins. The most variable region within the study sequences was the HVR1 and these HVR1 differed by up to 80% between HCV genotypes and subtypes (31). Although highly variable, the HVR1 is the only region that contains neutralization determinant, which is the target for immune response (32). As expected due to HVR variability, comparison of genetic distances between sequences in this study showed high genetic distances ranging from 8% to 17%, with an average distance of 13%. Variability within the HVR1 is one of the reasons describing why human antibodies raised against HCV E2 epitopes do not provide protection against multiple viral infections (19). In this study, analysis of N-linked glycosylation sites revealed that genotype 5a sequences were not conserved at glycosylation sites as compared to other genotypes. Site N476 with a level of 75% conservation among different genotypes was absent from the sequences of genotype 5a (5) and was found in six of the 18 analyzed sequences. As reported previously, E2 sites N423 and N576 were absent in all genotype 5a sequences including the 18 sequences from this study, which is notable because these two sites were reported to be 99-100% conserved across all genotypes (5). The glycosylation sites were reported to be highly conserved among different genotypes (9). These sequence variations in genotype 5a glycosylation sites could be useful to design efficient vaccine to help host to produce good antibody response. E2 is the main target for neutralizing antibody responses and variation of this region is thought to be related to maintenance of persistent infection by emerging escape variants and subsequent development of chronic infection (33, 34). Recently, a linear region of E2 encompassing amino acids 434 to 446 has been reported to elicit non-neutralizing antibodies that can inhibit neutralizing activity of antibodies targeting amino acids 412 to 423 (35). However, a study by Tarr et al. reported conflicting results showing that human antibodies that target the region encompassing amino acids 434 to 446, are not inhibitory but capable of neutralizing HCVpp and HCVcc entry (36). All B-cell epitopes included in this study were found to be antigenic ally effective, and it can be implied that these epitopes may be important for inducing the desired immune response. The E2504-609 epitope was found to be the most conserved among other genotypes. Recently a study by Ikram et al. reported conserved epitopes among genotype 3a that was also conserved among other genotypes (37). Highly conserved epitopes might influence the immunogenic potential since variability within the epitopes can increase the chance of immune escape (38). Short polypeptides derived from viral envelope sequences of other viruses have been used to investigate protein interactions involved in viral entry and some antiviral agents have been successfully developed (39). Envelope protein peptide inhibitors for other viruses in the same family with HCV like Dengue and West Nile were shown to inhibit viral entry (40, 41). In HCV, the post-binding entry step was prevented using peptides derived from the C terminal region of E2, which plays an important role in the HCV entry process (42). For this study, conserved peptides were derived that can be used as targets for therapeutic purposes. In this study, only three peptides had glycosylation sites at low probability and no phosphorylation sites were predicted. Post translational modifications such as glycosylation and phosphorylation affect the stability of therapeutic peptides (43). Using HCV glycoproteins in therapeutic strategies may offer protection against HCV infection (44). In conclusion, genotype 5a sequences are conserved and can be used to design epitopes and peptides. The results showed that antigenic conserved predicted B-cell epitopes and stable peptides with few post-translational modifications. These epitopes and peptides are potential candidates to design entry inhibitors and vaccines able to cover a global population, especially where genotype 5a is common. Further investigations would analyze these peptides to better understand their involvement in blocking HCV entry.
  40 in total

1.  Sequence and structure-based prediction of eukaryotic protein phosphorylation sites.

Authors:  N Blom; S Gammeltoft; S Brunak
Journal:  J Mol Biol       Date:  1999-12-17       Impact factor: 5.469

Review 2.  Resistance to enfuvirtide, the first HIV fusion inhibitor.

Authors:  Michael L Greenberg; Nick Cammack
Journal:  J Antimicrob Chemother       Date:  2004-07-01       Impact factor: 5.790

3.  A peptide derived from hepatitis C virus E2 envelope protein inhibits a post-binding step in HCV entry.

Authors:  R Liu; M Tewari; R Kong; R Zhang; P Ingravallo; R Ralston
Journal:  Antiviral Res       Date:  2010-02-13       Impact factor: 5.970

4.  Antibody to E1 peptide of hepatitis C virus genotype 4 inhibits virus binding and entry to HepG2 cells in vitro.

Authors:  Mostafa K El-Awady; Ashraf A Tabll; Khaled Atef; Samar S Yousef; Moataza H Omran; Yasmin El-Abd; Noha G Bader-Eldin; Ahmad M Salem; Samir F Zohny; Wael T El-Garf
Journal:  World J Gastroenterol       Date:  2006-04-28       Impact factor: 5.742

5.  Identification of amino acid residues in CD81 critical for interaction with hepatitis C virus envelope glycoprotein E2.

Authors:  A Higginbottom; E R Quinn; C C Kuo; M Flint; L H Wilson; E Bianchi; A Nicosia; P N Monk; J A McKeating; S Levy
Journal:  J Virol       Date:  2000-04       Impact factor: 5.103

6.  Antibody neutralization and escape by HIV-1.

Authors:  Xiping Wei; Julie M Decker; Shuyi Wang; Huxiong Hui; John C Kappes; Xiaoyun Wu; Jesus F Salazar-Gonzalez; Maria G Salazar; J Michael Kilby; Michael S Saag; Natalia L Komarova; Martin A Nowak; Beatrice H Hahn; Peter D Kwong; George M Shaw
Journal:  Nature       Date:  2003-03-20       Impact factor: 49.962

7.  Cell surface expression of functional hepatitis C virus E1 and E2 glycoproteins.

Authors:  Heidi E Drummer; Anne Maerz; Pantelis Poumbourios
Journal:  FEBS Lett       Date:  2003-07-10       Impact factor: 4.124

8.  A genetically humanized mouse model for hepatitis C virus infection.

Authors:  Marcus Dorner; Joshua A Horwitz; Justin B Robbins; Walter T Barry; Qian Feng; Kathy Mu; Christopher T Jones; John W Schoggins; Maria Teresa Catanese; Dennis R Burton; Mansun Law; Charles M Rice; Alexander Ploss
Journal:  Nature       Date:  2011-06-08       Impact factor: 49.962

9.  Depletion of interfering antibodies in chronic hepatitis C patients and vaccinated chimpanzees reveals broad cross-genotype neutralizing activity.

Authors:  Pei Zhang; Lilin Zhong; Evi Budo Struble; Hisayoshi Watanabe; Alla Kachko; Kathleen Mihalik; Maria Luisa Virata; Harvey J Alter; Stephen Feinstone; Marian Major
Journal:  Proc Natl Acad Sci U S A       Date:  2009-04-20       Impact factor: 11.205

10.  In Silico Identification and Conservation Analysis of B-cell and T-Cell Epitopes of Hepatitis C Virus 3a Genotype Enveloped Glycoprotein 2 From Pakistan: A Step Towards Heterologous Vaccine Design.

Authors:  Aqsa Ikram; Sadia Anjum; Muhammad Tahir
Journal:  Hepat Mon       Date:  2014-06-01       Impact factor: 0.660

View more
  1 in total

1.  TCR gene-modified T cells can efficiently treat established hepatitis C-associated hepatocellular carcinoma tumors.

Authors:  Timothy T Spear; Glenda G Callender; Jeffrey J Roszkowski; Kelly M Moxley; Patricia E Simms; Kendra C Foley; David C Murray; Gina M Scurti; Mingli Li; Justin T Thomas; Alexander Langerman; Elizabeth Garrett-Mayer; Yi Zhang; Michael I Nishimura
Journal:  Cancer Immunol Immunother       Date:  2016-02-03       Impact factor: 6.968

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.