Literature DB >> 26681161

A systematic analysis of a broadly neutralizing antibody AR3C epitopes on Hepatitis C virus E2 envelope glycoprotein and their cross-reactivity.

Jing Sun, Vladimir Brusic.   

Abstract

BACKGROUND: Hepatitis C virus (HCV) belongs to Flaviviridae family of viruses. HCV represents a major challenge to public health since its estimated global prevalence is 2.8% of the world's human population. The design and development of HCV vaccine has been hampered by rapid evolution of viral quasispecies resulting in antibody escape variants. HCV envelope glycoprotein E1 and E2 that mediate fusion and entry of the virus into host cells are primary targets of the host immune responses.
RESULTS: Structural characterization of E2 core protein and a broadly neutralizing antibody AR3C together with E1E2 sequence information enabled the analysis of B-cell epitope variability. The E2 binding site by AR3C and its surrounding area were identified from the crystal structure of E2c-AR3C complex. We clustered HCV strains using the concept of "discontinuous motif/peptide" and classified B-cell epitopes based on their similarity.
CONCLUSIONS: The assessment of antibody neutralizing coverage provides insights into potential cross-reactivity of the AR3C neutralizing antibody across a large number of HCV variants.

Entities:  

Mesh:

Substances:

Year:  2015        PMID: 26681161      PMCID: PMC4682370          DOI: 10.1186/1755-8794-8-S4-S6

Source DB:  PubMed          Journal:  BMC Med Genomics        ISSN: 1755-8794            Impact factor:   3.063


Background

Hepatitis C virus (HCV) is a major cause of viral hepatitis, liver cirrhosis, and liver cancer. It was discovered in 1989 as a novel causative agent of hepatitis [1]. HCV is a growing health concern since it affects about 2.8% of the world population and its prevalence is rising [2,3]. Each year, there are more than 500,000 new HCV infections in Egypt, the country with the highest HCV prevalence [4]. In the United States, more people die from HCV than from human immunodeficiency virus 1 (HIV-1) related disease [5]. Six genotypes and multiple subtypes of HCV have been identified to date. Approximately 75% of Americans with HCV have genotype 1 of the virus (subtypes 1a or 1b), and 20-25% have genotypes 2 or 3, with small numbers of patients being infected with genotypes 4, 5, or 6 [6]. Effective vaccination would provide protection against this global disease. However, the development of HCV vaccine and identification of broadly neutralizing antibodies has been hampered because HCV sequences mutate rapidly generating escape variants [7], the non-neutralizing antibodies to HCV envelope proteins interfere with neutralizing antibodies [8], and there is lack of 3D structural information needed for vaccine development [9]. The first crystal structure of broadly neutralizing antibody against HCV has been published in 2013 [10]. The HCV envelope glycoproteins E1 and E2 form a heterodimer E1E2 that facilitates virus attachment and entry into host cells and are targets for neutralizing antibodies [11]. Recent progress in isolating and characterizing HCV-neutralizing antibodies are instrumental for vaccine discovery and design [12]. These HCV-neutralizing antibodies were isolated from immunized mice [13-15], or from patients chronically infected with HCV [16-20]. Giang et al. [4], using an exhaustive panning strategy, identified five distinct antigenic regions on the HCV E1E2, that were recognized by 73 human monoclonal antibodies (mAbs) from an HCV immune phage-display antibody library. Many of these antibodies showed broadly neutralizing ability. Structural characterization of HCV envelope glycoproteins is challenging because of the difficulty in obtaining homogenous protein preparations [10,21-23]. Recently, the crystal structure of E2 core bound to neutralizing antibody AR3C has been crystalized [10], The antibody AR3C belongs to a group of broadly neutralizing antibodies that recognize antigenic region 3 (AR3) of E2 protein and cross-neutralizes HCV genotypes by blocking CD81 receptor binding site [14]. In this study, we characterized the B-cell epitope from the E2c-AR3C structure. By mapping this B-cell epitope to HCV E2 protein sequences, all strains available in the HCV database have been catalogued and compared with the known neutralized HCV strains. We examined the B-cell epitope diversity among the HCV variants, assessed potential cross-neutralization of the broadly neutralizing antibody across all sequences, and provided suggestions for selection of representative strains for future analysis of diversity and cross-recognition of HCV neutralizing B-cell epitopes.

Materials and methods

Structures of neutralizing antibody-E2 core protein complex

HCV envelope glycoproteins E1 and E2 mediate fusion and entry into host cells and are the primary targets of the humoral immune responses. The structure of the E2 core bound to a broadly neutralizing antibody was first crystalized at 2.65 angstroms [10], and deposited in PDB [19] database (PDB ID: 4MWF).

Sequences of E2 protein from Hepatitis C virus

All E2 envelope protein sequences of HCV strains were retrieved from HCV database [24] (http://hcv.lanl.gov/content/index), a database that provides annotated data about HCV sequences. We retrieved 5589 E2 sequences from the HCV database. Of these, 5340 sequences with translated protein sequences were retained in E2 protein dataset, with 3723, 275, 995, 70, 22 and 87 sequences labeled as genotype 1-6, respectively. Among these, 168 sequences were genotype-unclassified isolates or representatives of recombinant strains. Five of the seven neutralizing motifs studied in [18] were represented in this E2 data set.

Neutralizing activity of monoclonal antibody AR3C

The comparison of mAbs binding to the antigenic regions 1(AR1), 2(AR2), and 3(AR3) showed that 3(AR3)-specific mAbs reacted not only with genotype 1, but also genotype 2a, suggesting the presence of highly conserved epitopes in AR3 [18]. Table 1 shows the neutralizing activity data from this study. The mAb AR3C neutralized multiple genotypes: 1a, 1b, 2a, 2b, 4 and 5. We retrieved the E2 sequences of these isolates from GenBank [25].
Table 1

Discontinuous motifs on B-cell epitope and its surrounding area.

IsolateGeno-typeIC50 (µg/ml)aGenbank accession no.Discontinuous motif
B-cell epitopeSurrounding area

4mwfCILNCNESLGLALFYKCWNTWGQSAY
H771a1AF009606.1ILNCNESLGLALFYKCWNTWGQSAY
H1M67463.1ILNCNESLGLALFYKCWNTWGQSAY
OH81b5AY545951.1ILNCNDSLGLALFYRCWQTFAANDY
UKN1 B12.161AY734974.1ILNCNDSLGLALFYNCWNTFAVTEY
JFH-12a1AB047639.1ILNCNDSLGLALFYRCWNTFATTEY
J610AF177036.1ILNCNDSLGIALFYSCWHTFSTTEY
UKN2 A1.210AY734977.1ILNCNDSLGIALFYSCWQTFSTTEY
UNK2 B1.12b10AY734982.1ILNCNDSLGLALFYNCWQTFSVSEY
UKN4 21.1641AY734987.2ILNCNDSLGLALFYSCWNTFGHNEY
UKN5 15.751EF427672.1ILNCQDSLGIALLYKCWQTFGFNSY

a Neutralization activity of E2-specific neutralizing antibody AR3C (IC50 data are from reference [18]).

Comparing the discontinuous motifs on B-cell epitope, residues differing from the consensus are underlined, and the highly variable position lacking a consensus residue is underlined and in italics.

Discontinuous motifs on B-cell epitope and its surrounding area. a Neutralization activity of E2-specific neutralizing antibody AR3C (IC50 data are from reference [18]). Comparing the discontinuous motifs on B-cell epitope, residues differing from the consensus are underlined, and the highly variable position lacking a consensus residue is underlined and in italics.

Consistency of strain sequence numbering

All sequences in E2 protein dataset were aligned using MAFFT multiple alignment server [26]. The multiple sequence alignment (MSA) results provided a consistent sequence numbering scheme for further analysis of all sequences. For each validated strain (Table 1), sequence similarity to all sequences in E2 protein dataset was assessed using BLAST [27] search. The sequence from E2 protein dataset with the highest identity score was used as the reference sequence. This step also provided a consistent sequence numbering scheme of positions within the MSA results for validated strains.

Identification of B-cell epitope and surrounding area

Usintg crystal structures of the antigen-antibody complex, we defined antigen-binding sites (B-cell epitopes) as described previously [28,29]. This was done using both the measurements of residue Accessible Surface Area (ASA) and the minimum atom distance to the antibody. a) For each residue on antigen protein, the ASA value was calculated using Naccess [28] software for free antigen and for antigen coupled with the corresponding antibody. Residues rwith ASA loss more than 20% were selected as designated epitope residues, b) The majority of contacts between two interacting atoms occur at <5Å separation. Euclidean distance was calculated between atom aand awith their coordinates aand ain the PDB structure data, Antigen residues rwhose minimum atom distance to the binding antibody is less than 4Å were also incorporated in epitope. The least atom distance was defined as The residues that satisfy either of these two conditions (ASA loss or the minimum distance thresholds) were considered to constitute a B-cell epitope. For the definition of surrounding area, we continued to use distance-based method: antigen residues with minimum atom distance to binding antibody less than 6Å, that are not B-cell epitope residues, were incorporated as components of the surrounding area.

Extraction of discontinuous motifs (functional motifs)

Based on the BLAST result, residue positions of a B-cell epitope and its surrounding area identified on the crystal structure were mapped onto its reference sequence, and further transferred to map onto all validated strain sequences (Figure 1). For structure sequence or each of the validated strain sequences, a residue string from these epitope positions was recognized as a discontinuous motif. Since we do not have negative data (escape variants), all discontinuous motifs extracted from these strains were classified as neutralized motifs, which were recognized as functional in neutralizing assays.
Figure 1

The workflow used in this study. The steps included: identification of a B-cell epitope and its surrounding area(key residues) from crystal structure, extraction of discontinuous motifs and peptides by key positions, cataloging and neutralization assessment of strains in E2 protein dataset by discontinuous peptides.

The workflow used in this study. The steps included: identification of a B-cell epitope and its surrounding area(key residues) from crystal structure, extraction of discontinuous motifs and peptides by key positions, cataloging and neutralization assessment of strains in E2 protein dataset by discontinuous peptides.

Extraction of discontinuous peptides

The concept of discontinuous peptide [31] describes a virtual linear residue string generated from sequences that combines residues that form B-cell epitope that are not continuous in the protein sequence. Discontinuous peptides were extracted from the E2 protein dataset. Based on the BLAST and MSA results, the residue positions of B-cell epitope and its surrounding area were mapped onto its reference strain sequence, and then mapped onto all sequences in E2 protein dataset (Figure 1). Patterns of discontinuous peptides were used to catalog all strains in the dataset, and they were compared to the functional neutralized motifs. Each discontinuous peptide that has unique sequence was termed a discontinuous motif.

Results

Neutralizing antibody against HCV E2c protein

The mAb AR3C was known to neutralize HCV genotype 1, 2, 4 and 5. We performed the analysis of the structure of mAb AR3C complexed with HCV E2c. The B-cell epitope and its surrounding area in structure 4MWF were identified (Figure 2) as described in the MATERIAL AND METHODS section.
Figure 2

The B-cell epitope and surrounding area recognized by neutralizing antibody AR3C. (A) The heavy and light chains of mAb AR3C are shown in red and yellow respectively and the E2 chain in light blue; (B) The B-cell epitope on E2c is highlighted in pink (4MWF, chain C: 422, 427, 428, 429, 430, 431, 432, 433, 436, 438, 439, 441, 442, 443, 446, 503 and 529), and the ring area surrounding B-cell epitope is green (4MWF, chain C: 434, 435, 437, 440, 444, 528, 531 and 613); (C) The variable residues which are different from mAb AR3C-neutralized are highlighted in yellow (4MWF, chain C: 422, 430, 431, 432, 433, 438 and 442).

The B-cell epitope and surrounding area recognized by neutralizing antibody AR3C. (A) The heavy and light chains of mAb AR3C are shown in red and yellow respectively and the E2 chain in light blue; (B) The B-cell epitope on E2c is highlighted in pink (4MWF, chain C: 422, 427, 428, 429, 430, 431, 432, 433, 436, 438, 439, 441, 442, 443, 446, 503 and 529), and the ring area surrounding B-cell epitope is green (4MWF, chain C: 434, 435, 437, 440, 444, 528, 531 and 613); (C) The variable residues which are different from mAb AR3C-neutralized are highlighted in yellow (4MWF, chain C: 422, 430, 431, 432, 433, 438 and 442).

Functional motifs on B-cell epitopes and its surrounding area

The positions of B-cell epitope residues were extracted and mapped to all validated strain sequences. Functional motifs were retrieved with corresponding neutralizing information. Seven distinct discontinuous motifs (identical motifs were present across different strains) were extracted from the sequences of E2 protein structure and 10 validated strains.

Discontinuous peptides derived from B-cell epitopes

The positions of epitope residues were mapped onto all sequences in the E2 protein dataset. Amino acid string representing discontinuous peptide was extracted from each E2 protein sequence. Among all 5340 sequences in E2 protein dataset, there were 402 different combinations of discontinuous peptides (patterns), which reflect the high sequential variability of HCV virus. Five discontinuous peptides identical to discontinuous motifs from neutralized strains covered 14.06% strains population (Figure 3A). The discontinuous peptides were further sorted according to their frequencies in the E2 protein dataset. Viewed by ranked frequencies, the top 10 most frequent discontinuous peptides covered more than 50% strains in the dataset, and top 25 discontinuous peptides covered nearly 80% of the total strain population (Figures 3B and 3C).
Figure 3

An overview of discontinuous peptides in the E2 protein dataset. (A) The number of discontinuous peptides and the number of discontinuous motifs generated from E2 protein dataset; (B) The distribution of all discontinuous peptide patterns frequencies. The yellow and grey bars represent discontinuous peptides identical to the neutralized motifs and the ones without validation data yet, while the red line is their accumulative frequency; (C) The zoom-up view of top ranked discontinuous peptides frequencies, from (B).

An overview of discontinuous peptides in the E2 protein dataset. (A) The number of discontinuous peptides and the number of discontinuous motifs generated from E2 protein dataset; (B) The distribution of all discontinuous peptide patterns frequencies. The yellow and grey bars represent discontinuous peptides identical to the neutralized motifs and the ones without validation data yet, while the red line is their accumulative frequency; (C) The zoom-up view of top ranked discontinuous peptides frequencies, from (B). Top ranked discontinuous peptides and those identical to the discontinuous motifs extracted from the E2 protein dataset are listed in Table 2 along with their frequencies. The most frequent discontinuous peptide has coverage of 754 strains, while the second most frequent peptide covers 320 strains. There is no validation data for the 3 most frequent discontinuous peptides, while discontinuous motifs ranked 4th, 6th, 11th, 12th, and 26th in the list were shown to be neutralizing. The neutralization potential of these un-tested discontinuous motifs could be estimated by comparing the composing amino acids to the validated motifs. The 1st ranked discontinuous peptide (ILNCNDSLGIALFYKCW) is different from the 4th ranked discontinuous peptide (ILNCNDSLGLALFYRCW, which is a neutralized motifs) in two positions: 10th residue L->I, and 15th residue R->K. Since both residues share similar chemical features, it is possible that the HCV strains with 1st discontinuous peptides could be neutralized by the mAb AR3C. Also, the two different residues have been shown in other validated neutralized motifs: the 26th ranked (ILNCNDSLGIALFYSCW) and 6th ranked (ILNCNESLGLALFYKCW) discontinuous peptides. From the reported neutralizing data, we derived the consensus sequence for B-cell epitope ILNCNDSLGIALFYKCW and experimentally verified E2 neutralizing motif I-L-N-C-[NQ]-[DE]-S-L-G-[IL]-A-L-F-Y-[KNRS]-C-W. Potentially neutralizing motif that should be validated experimentally is [IVL]-L-[NS]-C-[NQ]-[DEA]-[ST]-[LI]-G-[ILVM]-[ATV]-L-[FILM]-Y-X-[WF] (see Additional file 1). Targeted experimentation will identify B-cell epitope changes that would abolish AR3C binding as well as changes that do not have detrimental effects.
Table 2

Top ranked discontinuous peptides in the E2 protein dataset.

RankDiscontinuous peptidesFrequencyAccumulative percentageValidation status
1ILNCNDSLGIALFYKCW75414.12%Missing
2ILNCNDSLGLALFYKCW32020.11%Missing
3ILNCNASLGIALFYKCW25624.91%Missing
4ILNCNDSLGLALFYRCW24029.40%Neutralized
5ILNCNASLGVALFYKCW23733.84%Missing
6ILNCNESLGLALFYKCW22137.98%Neutralized
7ILNCNDSLGIALIYKCW21341.97%Missing
8ILNCNASLGLALFYRCW20945.88%Missing
9ILNCNESIGIALFYKCW17349.12%Missing
10ILNCNDSLGIALFYRCW15251.97%Missing
11ILNCNDSLGLALFYNCW14154.61%Neutralized
12ILNCNDSLGLALFYSCW12756.99%Neutralized
13ILNCNDTIGIALFYRCW12759.36%Missing
14ILNCNDSIGIALFYRCW12361.67%Missing
15ILNCNDSLGIALLYKCW10263.58%Missing
16VLNCNESIGLALFYKCW9265.30%Missing
17ILNCNDSLGVALFYKCW8566.89%Missing
18ILNCNASLGLALFYKCW8568.48%Missing
19ILNCNASLGVALLYKCW8069.98%Missing
20ILNCNDSLGIALFYNCW7271.33%Missing
21ILNCDESIGIALFYKCW7172.66%Missing
22ILNCNDSIGIALFYKCW7173.99%Missing
23ILNCNESIGLALFYKCW6675.22%Missing
24ILNCNDSLGVALLYKCW6376.40%Missing
25LLNCNDSLGLALFYKCW5577.43%Missing
26ILNCNDSLGIALFYSCW4878.33%Neutralized

The top 26 most frequent (including discontinuous peptides identical to discontinuous neutralized motifs) among 402 different patterns of discontinuous peptides are listed. This table lists discontinuous peptide, their frequencies, accumulative frequency and neutralization validation status to mAb AR3C. The neutralized discontinuous motifs are underlined. The residues in italics and underlined, indicate the amino acids that are not presented in the specific position from these known neutralized discontinuous motifs (see Figure 1(c)).

Top ranked discontinuous peptides in the E2 protein dataset. The top 26 most frequent (including discontinuous peptides identical to discontinuous neutralized motifs) among 402 different patterns of discontinuous peptides are listed. This table lists discontinuous peptide, their frequencies, accumulative frequency and neutralization validation status to mAb AR3C. The neutralized discontinuous motifs are underlined. The residues in italics and underlined, indicate the amino acids that are not presented in the specific position from these known neutralized discontinuous motifs (see Figure 1(c)). WebLogo [29] and BlockLogo [30] were generated for all the discontinuous peptides extracted from E2 protein dataset. Among the 17-residue B-cell epitope, most of the positions are quite conserved, as shown in WebLogo figure (Figure 4A). However, the BlockLogo figure shows a large number of different combinations and the high diversity of this binding site (Figure 4B).
Figure 4

Discontinuous peptides generated from the B-cell epitope of mAb AR3C. Discontinuous peptides were extracted from 5340 E2 protein sequences according to relevant positions from the PDB structure 4MWF, chain C: 422, 427, 428, 429, 430, 431, 432, 433, 436, 438, 439, 441, 442, 443, 446, 503 and 529. The corresponding positions in the reference strain sequence (Genbank accession number ACA53572.1) are identical to the PDB structure. (A) WebLogo figure [29,34] and (B) BlockLogo figure [30] of discontinuous peptides from the E2 protein dataset.

Discontinuous peptides generated from the B-cell epitope of mAb AR3C. Discontinuous peptides were extracted from 5340 E2 protein sequences according to relevant positions from the PDB structure 4MWF, chain C: 422, 427, 428, 429, 430, 431, 432, 433, 436, 438, 439, 441, 442, 443, 446, 503 and 529. The corresponding positions in the reference strain sequence (Genbank accession number ACA53572.1) are identical to the PDB structure. (A) WebLogo figure [29,34] and (B) BlockLogo figure [30] of discontinuous peptides from the E2 protein dataset. The neutralized motifs cover 14.06% of strain sequences in the E2 protein dataset, while the other discontinuous peptides that cover 85.94% of the strains lack validated data (Figure 5). Viewed by the genotype, the neutralizing coverage of genotypes 1, 2 and 4 are approximately 20% (18.48%, 22.18% and 17.14% respectively), higher than those of genotype 3, 5 and 6. The overall known neutralized coverage on the dataset is low. Of 402 discontinuous peptides, 379 had a complete B-cell epitope and 15 had ambiguities in sequence (residue X). Eight sequences had disrupted B-cell epitope (patterns 38, 65, 93, 180, 214, 285, 385, and 387, Additional file 2) most likely representing non-viable viruses.
Figure 5

MAb AR3C neutralization assessment sorted by HCV genotype. The virus population coverage for each genotype and unclassified isolates, the number in bracket after each genotype indicate the number of strain sequences for specific genotype (based on data from E2 protein dataset). The proportion of discontinuous peptides identical to neutralized motifs is colored in pink, while non-validated in grey. For each genotype from left to right, the numbers of different motif patterns among these sequences are 280, 51, 70, 25, 12, 45 and 30.

MAb AR3C neutralization assessment sorted by HCV genotype. The virus population coverage for each genotype and unclassified isolates, the number in bracket after each genotype indicate the number of strain sequences for specific genotype (based on data from E2 protein dataset). The proportion of discontinuous peptides identical to neutralized motifs is colored in pink, while non-validated in grey. For each genotype from left to right, the numbers of different motif patterns among these sequences are 280, 51, 70, 25, 12, 45 and 30.

Discontinuous peptides on B-cell epitope surrounding area

The antibody binding and neutralization ability can possibly be affected by the B-cell epitope surrounding area. Identical discontinuous peptides on B-cell epitope alone cannot fully guarantee the same neutralization result. The analysis of surrounding area aims to provide a more detailed assessment about the potential neutralizing properties of the AR3C. The frequency distribution of different discontinuous peptides on surrounding area showed similarity to the results of B-cell epitope comparisons (Figure 6). For the strains share identical discontinuous peptides on B-cell epitope, the discontinuous peptides on surrounding area have dominant patterns: the top 5 patterns cover as much as 50% of the strains. The result indicates that the residues that define AR3C epitope surrounding area do not affect B-cell epitope/antibody interaction independently of the actual B-cell epitope.
Figure 6

Distribution of discontinuous peptides frequency on surrounding area of strains with discontinuous peptides identical to neutralized motifs on B-cell epitope. (A) - (E), corresponds to five groups of strains share same discontinuous peptides on B-cell epitope identical to five neutralized motifs. The discontinuous peptides on B-cell epitope (neutralized motifs) of each group are listed on each plot.

Distribution of discontinuous peptides frequency on surrounding area of strains with discontinuous peptides identical to neutralized motifs on B-cell epitope. (A) - (E), corresponds to five groups of strains share same discontinuous peptides on B-cell epitope identical to five neutralized motifs. The discontinuous peptides on B-cell epitope (neutralized motifs) of each group are listed on each plot.

Conclusions and discussion

Hepatitis C virus, with its extreme variability of sequence repertoire, is a difficult target for vaccine design. Compared to envelope glycoproteins in other virus, such as hemagglutinin protein from influenza virus and E protein from dengue (DENV) virus, the B-cell epitopes on HCV E2 protein are much less conserved in composing residues. The MAb F10 [31] is a broadly neutralizing antibody against HA protein of influenza A virus. A total of 589 different patterns of discontinuous peptides on F10 B-cell epitope were extracted from 45,812 HA sequences. The mAb 2H12 [32] is a broadly neutralizing antibody shown to neutralize serotypes DENV1, 3 and 4, has 57 different patterns of discontinuous peptides on B-cell epitope that cover 4,659 dengue E protein sequences in dengue dataset [33]. In the current study, 5340 E protein sequences from HCV, which is a similar sequence set size as in dengue viruses, generated almost an order of magnitude larger diversity: 402 different discontinuous peptides at the mAb AR3C binding site have been identified. We assembled a HCV strains cataloguing method in this study. Strains with identical discontinuous peptides on B-cell epitope site were grouped and estimated to have similar neutralizing activity. For mAb AR3C, the discontinuous peptides on B-cell epitope from validated strains ranked 4th, 6th, 11th, 12th and 26th, covered 14.06% of all 5,340 strains in the E2 protein dataset. The discontinuous peptide and frequency list could be used as guidance for the selection of representative strains for future systematic neutralizing antibody tests. For example, the most dominant discontinuous peptides among population should be tested for neutralization assay in priority. For mAbs generated in the future, the neutralization coverage among the strains with top dominant discontinuous peptide could be used as a guidance of how broadly neutralized the mAb could reach. The neutralizing motif indicates that conservative replacements at positions 430 (N→Q), 431 (D→E) and 438 (L→I) will likely not affect binding affinities sufficiently to abolish neutralization. In addition, position 446 has multiple residues observed in neutralized variants (K,N,S,R) and it appears not to affect antibody binding. By observation of common discontinuous peptides we argue that conserved positions 427 (L), 428 (N), 429 (C), 436 (G), 439 (A), 441 (L), 443 (Y), 503 (C), and 529 (W) have structural or functional significance. The positions 422, 431, 432, 433, 438, and 442 are key for the study of the diversity of B-cell epitopes and targeting the design of broadly-protective vaccines. This results presented here are based on the existing data. More comprehensive conclusions will be generated as additional neutralizing antibody structures are crystallized and more neutralization assays are performed in the future. Advances in computation and biotechnology enable more comprehensive analysis where all combinations of antibodies and antigens can be assessed in silico. The new methodology of Big Data analysis [35] enables the analysis of diverse data types where protein, nucleotide, structure, and functional data can be analyzed in combination. The well-annotated data are combined with specialized analytical tools, including statistical analyses, sequence analysis, and mathematical models to gain insights into biological processes, generate knowledge, and inform decisions about validation experiments. This study has shown that the majority of common HCV variants have not been studied in antibody neutralization studies. The knowledge of cross-neutralization is, therefore, incomplete and there is an urgent need for designing libraries of viruses that will be representative of the majority of HCV strains. These libraries will enable systematic testing of strains against the panels of antibodies and enable the design of universal broadly protective HCV vaccines.

Competing interests

The authors declare that they have no competing interests.

Authors' contributions

VB and JS designed the study. JS collected HCV data from public database and performed the analysis. VB and JS draft the manuscript.

Additional file 1

Potentially neutralizing motifs derived from full list of discontinuous peptides in the E2 protein dataset. (*.pdf). Click here for file

Additional file 2

Full list of ranked discontinuous peptides in the E2 protein dataset. (*.xls). Click here for file
  32 in total

1.  Sequence logos: a new way to display consensus sequences.

Authors:  T D Schneider; R M Stephens
Journal:  Nucleic Acids Res       Date:  1990-10-25       Impact factor: 16.971

Review 2.  Gapped BLAST and PSI-BLAST: a new generation of protein database search programs.

Authors:  S F Altschul; T L Madden; A A Schäffer; J Zhang; Z Zhang; W Miller; D J Lipman
Journal:  Nucleic Acids Res       Date:  1997-09-01       Impact factor: 16.971

Review 3.  Broadly neutralizing antiviral antibodies.

Authors:  Davide Corti; Antonio Lanzavecchia
Journal:  Annu Rev Immunol       Date:  2013-01-16       Impact factor: 28.527

4.  Characterization of truncated forms of hepatitis C virus glycoproteins.

Authors:  J P Michalak; C Wychowski; A Choukhi; J C Meunier; S Ung; C M Rice; J Dubuisson
Journal:  J Gen Virol       Date:  1997-09       Impact factor: 3.891

5.  Human broadly neutralizing antibodies to the envelope glycoprotein complex of hepatitis C virus.

Authors:  Erick Giang; Marcus Dorner; Jannick C Prentoe; Marlène Dreux; Matthew J Evans; Jens Bukh; Charles M Rice; Alexander Ploss; Dennis R Burton; Mansun Law
Journal:  Proc Natl Acad Sci U S A       Date:  2012-04-04       Impact factor: 11.205

6.  The increasing burden of mortality from viral hepatitis in the United States between 1999 and 2007.

Authors:  Kathleen N Ly; Jian Xing; R Monina Klevens; Ruth B Jiles; John W Ward; Scott D Holmberg
Journal:  Ann Intern Med       Date:  2012-02-21       Impact factor: 25.391

7.  Interferon alfa-2b alone or in combination with ribavirin as initial treatment for chronic hepatitis C. Hepatitis Interventional Therapy Group.

Authors:  J G McHutchison; S C Gordon; E R Schiff; M L Shiffman; W M Lee; V K Rustgi; Z D Goodman; M H Ling; S Cort; J K Albrecht
Journal:  N Engl J Med       Date:  1998-11-19       Impact factor: 91.245

Review 8.  Global epidemiology of hepatitis C virus infection: new estimates of age-specific antibody to HCV seroprevalence.

Authors:  Khayriyyah Mohd Hanafiah; Justina Groeger; Abraham D Flaxman; Steven T Wiersma
Journal:  Hepatology       Date:  2013-02-04       Impact factor: 17.425

9.  Structural analysis of a dengue cross-reactive antibody complexed with envelope domain III reveals the molecular basis of cross-reactivity.

Authors:  Claire M Midgley; Aleksandra Flanagan; Hai Bac Tran; Wanwisa Dejnirattisai; Kriangkrai Chawansuntati; Amonrat Jumnainsong; Wiyada Wongwiwat; Thaneeya Duangchinda; Juthathip Mongkolsapaya; Jonathan M Grimes; Gavin R Screaton
Journal:  J Immunol       Date:  2012-04-09       Impact factor: 5.422

10.  Human monoclonal antibodies to a novel cluster of conformational epitopes on HCV E2 with resistance to neutralization escape in a genotype 2a isolate.

Authors:  Zhen-yong Keck; Jinming Xia; Yong Wang; Wenyan Wang; Thomas Krey; Jannick Prentoe; Thomas Carlsen; Angela Ying-Jian Li; Arvind H Patel; Stanley M Lemon; Jens Bukh; Felix A Rey; Steven K H Foung
Journal:  PLoS Pathog       Date:  2012-04-12       Impact factor: 6.823

View more
  2 in total

1.  GIW and InCoB, two premier bioinformatics conferences in Asia with a combined 40 years of history.

Authors:  Christian Schönbach; Paul Horton; Siu-Ming Yiu; Tin Wee Tan; Shoba Ranganathan
Journal:  BMC Genomics       Date:  2015-12-09       Impact factor: 3.969

2.  Reverse epitope mapping of the E2 glycoprotein in antibody associated hepatitis C virus.

Authors:  Amruta S Naik; Ania Owsianka; Brendan A Palmer; Ciaran J O'Halloran; Nicole Walsh; Orla Crosbie; Elizabeth Kenny-Walsh; Arvind H Patel; Liam J Fanning
Journal:  PLoS One       Date:  2017-05-30       Impact factor: 3.240

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.