| Literature DB >> 25598792 |
Maemu Petronella Gededzha1, Maphahlanganye Jeffrey Mphahlele1, Selokela Gloria Selabe1.
Abstract
BACKGROUND: Hepatitis C virus (HCV) is one of the major causes of cirrhosis and hepatocellular carcinoma with an estimation of 185 million people with infection. The E2 is the main target for neutralizing antibody responses and the variation of this region is related to maintenance of persistent infection by emerging escape variants and subsequent development of chronic infection. While both E1 and E2 are hypervariable in nature, it is difficult to design vaccines or therapeutic drugs against them.Entities:
Keywords: Epitopes; Genotype; Hepatitis C Virus; Peptides
Year: 2014 PMID: 25598792 PMCID: PMC4286708 DOI: 10.5812/hepatmon.23660
Source DB: PubMed Journal: Hepat Mon ISSN: 1735-143X Impact factor: 0.660
Sequences of the HCV Primers Used in This Study
| Sequences | Primers | Reference | |
|---|---|---|---|
|
| this study | ||
| F1A | 1088 | GAC CAT TTC ATC ATC ATG TCC CA | |
| R1A | 1425 | TGT ATG CGG CGG CGA ACA AGA CC | |
| F2A | 1113 | CTT CGG AGG GCC GTT GAC TAC TTA GCG | |
| R2A | 1413 | CGA ACA AGA CCC CCC AGT GGG | |
|
| |||
| M105 | 1292 | ATG GCA TGG GAC ATG ATG ATG | ( |
| R1B | 2061 | TAG GCC CTA AGT TGC AGG GTG GA | this study |
| M106 | 1298 | TGG GAC ATG ATG ATG AAT TGG | ( |
| R2B | 2022 | CAA ACC CTG TGG AAT TCA TCC AG | this study |
|
| this study | ||
| F1C | 1743 | GGC TGG GGA ACT ATC AGC TAT | |
| R1C | 2636 | AAA CCC ATG AGT CCC CGC AGC C | |
| F2C | 1773 | TCG GGC CCC AGT GAT GAC AAG | |
| R2C | 2612 | AGC CGC GTT TAG GAC AAT GAC GTT CT |
Genetic Distances in E1 and E2 Sequences of Genotype 5a in This Study
| Genetic Distances[ | |||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Sequence | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 | 17 | 18 | |
|
| ZADGM7890 | ||||||||||||||||||
|
| ZADGM6544 | 0.12 | |||||||||||||||||
|
| ZADGM4227 | 0.13 | 0.12 | ||||||||||||||||
|
| ZADGM1908 | 0.14 | 0.12 | 0.13 | |||||||||||||||
|
| ZADGM1707 | 0.10 | 0.12 | 0.11 | 0.12 | ||||||||||||||
|
| ZADGM651 | 0.13 | 0.13 | 0.14 | 0.13 | 0.12 | |||||||||||||
|
| ZADGM308 | 0.12 | 0.12 | 0.12 | 0.12 | 0.10 | 0.11 | ||||||||||||
|
| ZADGM6485 | 0.13 | 0.10 | 0.13 | 0.12 | 0.12 | 0.12 | 0.10 | |||||||||||
|
| ZADGM4124 | 0.13 | 0.13 | 0.13 | 0.13 | 0.12 | 0.14 | 0.13 | 0.12 | ||||||||||
|
| ZADGM2439 | 0.14 | 0.15 | 0.15 | 0.15 | 0.12 | 0.14 | 0.12 | 0.13 | 0.14 | |||||||||
|
| ZADGM2352 | 0.13 | 0.14 | 0.14 | 0.14 | 0.13 | 0.12 | 0.11 | 0.12 | 0.14 | 0.13 | ||||||||
|
| ZADGM525gp | 0.14 | 0.14 | 0.15 | 0.14 | 0.12 | 0.15 | 0.14 | 0.14 | 0.14 | 0.16 | 0.16 | |||||||
|
| ZADGM869 | 0.14 | 0.14 | 0.15 | 0.13 | 0.13 | 0.10 | 0.13 | 0.13 | 0.14 | 0.14 | 0.14 | 0.17 | ||||||
|
| ZADGM3013 | 0.15 | 0.14 | 0.14 | 0.15 | 0.12 | 0.15 | 0.13 | 0.12 | 0.15 | 0.15 | 0.15 | 0.17 | 0.15 | |||||
|
| ZADGM0518 | 0.9 | 0.12 | 0.11 | 0.12 | 0.08 | 0.13 | 0.10 | 0.11 | 0.11 | 0.12 | 0.13 | 0.14 | 0.13 | 0.11 | ||||
|
| ZADGM2582 | 0.14 | 0.14 | 0.13 | 0.14 | 0.13 | 0.13 | 0.12 | 0.13 | 0.14 | 0.13 | 0.13 | 0.16 | 0.15 | 0.15 | 0.13 | |||
|
| ZADGM2088 | 0.14 | 0.13 | 0.13 | 0.13 | 0.12 | 0.12 | 0.11 | 0.12 | 0.13 | 0.14 | 0.12 | 0.13 | 0.14 | 0.14 | 0.12 | 0.12 | ||
|
| ZADGM1104 | 0.13 | 0.14 | 0.13 | 0.14 | 0.12 | 0.13 | 0.09 | 0.12 | 0.14 | 0.14 | 0.14 | 0.13 | 0.15 | 0.15 | 0.12 | 0.14 | 0.13 | |
a The values range between 0 (0%) and 1 (100%) substitutions per nucleotide site.
b The numbers 1-18 corresponds to the sequence number on the vertical side.
Probability of Glycosylation in E1 and E2 Sequences
| Probability at Glycosylation Site [ | |||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Sequence | E1 | No of Sites | E2 | No of Sites | |||||||||||||
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| ||||
|
| ZADGM7890 | + | ++ | ++ | + | - | 4 | + | ++ | - | - | + | ++ | + | + | + | 7 |
|
| ZADGM6544 | + | ++ | + | - | - | 3 | + | ++ | - | - | - | ++ | + | + | + | 6 |
|
| ZADGM4227 | + | ++ | ++ | + | - | 4 | ++ | ++ | - | - | + | - | + | + | + | 6 |
|
| ZADGM1908 | + | ++ | + | - | - | 3 | ++ | + | - | + | ++ | + | + | + | - | 7 |
|
| ZADGM1707 | + | ++ | + | + | - | 4 | ++ | + | + | + | + | ++ | + | + | + | 9 |
|
| ZADGM651 | + | ++ | ++ | + | - | 4 | ++ | + | - | - | + | + | + | + | - | 6 |
|
| ZADGM308 | + | ++ | ++ | - | - | 3 | ++ | - | + | - | + | ++ | + | + | - | 6 |
|
| ZADGM6485 | + | ++ | ++ | - | - | 3 | ++ | ++ | - | + | + | ++ | + | + | - | 7 |
|
| ZADGM4124 | + | ++ | + | + | - | 4 | ++ | + | - | - | + | + | + | + | + | 7 |
|
| ZADGM2439 | + | ++ | + | + | - | 4 | ++ | + | - | + | + | + | + | + | - | 7 |
|
| ZADGM2352 | + | ++ | ++ | - | - | 3 | ++ | ++ | - | - | + | ++ | + | + | - | 6 |
|
| ZADGM525gp | - | ++ | ++ | + | - | 3 | ++ | + | - | - | - | + | + | + | + | 6 |
|
| ZADGM869 | + | ++ | + | - | - | 3 | + | ++ | - | - | + | + | + | + | + | 7 |
|
| ZADGM3013 | + | ++ | + | + | - | 4 | ++ | ++ | ++ | + | + | ++ | + | + | + | 9 |
|
| ZADGM0518 | + | ++ | + | - | - | 3 | ++ | ++ | ++ | - | - | ++ | + | + | - | 6 |
|
| ZADGM2582 | + | ++ | + | - | - | 3 | ++ | + | - | + | + | ++ | + | + | + | 8 |
|
| ZADGM2088 | - | ++ | ++ | - | - | 2 | ++ | + | - | - | + | + | + | + | - | 6 |
|
| ZADGM1104 | + | ++ | ++ | - | - | 3 | ++ | ++ | + | + | + | + | + | + | + | 9 |
a Numbering is based on the M62321 full-length sequence.
b Glycosylation probability is shown by +++ (probability > 70%), ++ (probability between 60 and 70%), + (probability between 50 and 60%), and - (not predicted).
Predicted B-Cell Epitopes of HCV Genotype 5a and Their Antigenicity Score, Number of Allele and Conservancy (Percentage) in Different Genotypes
| Position | Predicted Epitopes | Antigen score | Genotype 1a | Genotype 1b | Genotype 2 | Genotype 3 | Genotype 4 | Genotype 6 |
|---|---|---|---|---|---|---|---|---|
|
| GPVYCFTPSPVVVGTT | 1.1613 | 92 | 97 | 88 | 94 | 89 | 73 |
|
| LPCSFTPTPALSTGLI | 0.5340 | 0 | 0 | 0 | 0 | 0 | 0 |
|
| LSTGLIHLHQNIVDTQ | 0.6639 | 0 | 0 | 0 | 0 | 0 | 2 |
Predicted Peptides for HCV E1 and E2 Conserved in Genotype 5a
| Position [ | Peptides | Length | Molecular Weight | Theoretical PI | Extinction Coefficient (/cm M) | Instability Index | Alphatic index | GRAVY | Composition of Hydrophobic AA’s, %[ | N-linked Glycosylation C [ | N-linked Phosphorylation |
|---|---|---|---|---|---|---|---|---|---|---|---|
|
| YHTNDCPNSSI | 14 | 1611.7 | 5.08 | 2980 | 17.36 | 69.29 | -0.343 | 21.4 | + | - |
|
| VDYLAGGAA | 9 | 835.9 | 3.80 | 1490 | -3.53 | 108.89 | 0.867 | 22.2 | - | - |
|
| CNCSIYSGH | 9 | 983 | 6.72 | 1615 | 5.69 | 43.33 | -0.056 | 11.1 | ++ | - |
|
| TGHRMAWDMMMNWSPT | 16 | 1952.2 | 6.41 | 11000 | 29.16 | -0.706 | 6.25 | 37.5 | - | - |
|
| HWGVLFAAAY | 10 | 1134.3 | 6.74 | 6990 | -4.25 | 98 | 1.040 | 40 | - | - |
|
| VKTCGAPPC | 9 | 875 | 8.03 | 125 | 30.68 | 43.33 | 0.311 | 11.1% | - | - |
|
| TDCFRKHP | 8 | 1003.1 | 7.92 | 0 | 5.15 | 0 | -1.512 | 12.5 | - | - |
|
| ACNWTRGERCDL | 12 | 1423.5 | 6.1 | 5625 | 27.31 | 40.83 | -0.908 | 16.7 | + | - |
|
| LSPLLHTTTQ | 10 | 1110.2 | 6.74 | - | 37.86 | 117 | 0.020 | 30% | - | - |
|
| AILPCSFTPTPALSTGLIHLHQNIVDTQ | 28 | 2988.4 | 5.97 | 0 | 28.19 | 115 | 0.421 | 32.1 | - | - |
|
| FLLLADAR | 8 | 918.1 | 5.84 | - | -1.86 | 171.25 | 1.225 | 50 | - | - |
a Numbering is based on the M62321 full-length sequence.
b list of hydrophobic amino acids (Leu, Val, Ile, Met, Phe and Trp).
c Glycosylation probability is shown by +++ (probability > 70%), ++ (probability between 60 and 70%), + (probability between 50 and 60%), and - (not present).