| Literature DB >> 21303499 |
Abida Shehzadi1, Shahid Ur Rehman, Muhammad Idrees.
Abstract
BACKGROUND: HCV is a positive sense RNA virus affecting approximately 180 million people world wide and about 10 million Pakistani populations. HCV genotype 3a is the major cause of infection in Pakistani population. One of the major problems of HCV infection especially in the developing countries that limits the limits the antiviral therapy is the long term treatment, high dosage and side effects. Studies of antigenic epitopes of viral sequences of a specific origin can provide an effective way to overcome the mutation rate and to determine the promiscuous binders to be used for epitope based subunit vaccine design. An in silico approach was applied for the analysis of entire HCV proteome of Pakistani origin, aimed to identify the viral epitopes and their conservancy in HCV genotypes 1, 2 and 3 of diverse origin.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21303499 PMCID: PMC3042956 DOI: 10.1186/1743-422X-8-55
Source DB: PubMed Journal: Virol J ISSN: 1743-422X Impact factor: 4.099
It comprises the data of HCV genome size, Proteins, Molecular weight and %age of highly repeated and least repeated amino acid residues in individual bases
| Bases | Proteins | aa Number | Highly repeated aa | % of repetition | Least repeated aa | % of repetition | ||
|---|---|---|---|---|---|---|---|---|
| Total bp | 9474 | Capsid | 114 | 12985.8 | R | 18.4 | C/F | 0.9 |
| A | 1974 | Core | 75 | 7638.88 | L | 16 | E/K/M/Y | 1.3 |
| C | 2700 | E1 | 190 | 20643.9 | V | 11.1 | E | 1.1 |
| G | 2622 | E2 | 350 | 38755.3 | L | 13.1 | K | 1.4 |
| T | 2178 | NS3 | 149 | 15423.6 | A/G | 11.4 | N | 0.7 |
| NS4a | 54 | 5751.69 | L/V | 14.8 | H/M/T/W | 1.9 | ||
| NS4b | 194 | 20167.5 | A | 13.4 | C | 0.4 | ||
| NS5a-1a | 62 | 6700.72 | G | 14.5 | E/D | 1.6 | ||
| NS5a-1b | 101 | 11224.6 | P | 11.9 | K/W | 1 | ||
Predicted HLA II epitopes HCV Proteins of Pakistani origin and their conservancy in Genotype 1, 2 and 3 worldwide
| Epitope start Position | Predicted T-cell epitopes | HLA alleles | HCV genotype 1 | HCV Genotype 2 | HCV Genotype 3 |
|---|---|---|---|---|---|
| 43 | 23 | LGVRATRK | LGVRATRK | LGVRATRK | |
| 36 | 15 | ||||
| 106 | 16 | WGP | WGP | WGPNDPRRR | |
| 34 | 24 | Y | Y | YVLPRRGPR | |
| 21 | 8 | ||||
| 35 | 9 | VLPRRGPRL | |||
| 45 | 25 | VRATRK | VRATRK | VRATRK | |
| 30 | 39 | VGGVY | VGGVY | VGGVYVLPR | |
| 15 | 6 | IRRPQDVKF | |||
| 95 | 28 | ||||
| 29 | 3 | IVGGVY | IVGGVY | IVGGVYVLP | |
| 82 | 10 | ||||
| 85 | 11 | ||||
| 33 | 1 | VY | VY | VYVLPRRGP | |
| 61 | 50 | FLLALLSCL | FLLALLSC | FLLALLSCL | |
| 64 | 45 | LALLSCL | LALLSCLIH * | ||
| 15 | 41 | ||||
| 24 | 44 | LVGAP | LVGAPVGGV * | ||
| 63 | 36 | LLALLSCL | LLALLSC | LLALLSCLI * | |
| 62 | 24 | FLLALLSCL | FLLALLSC | FLLALLSCL * | |
| 32 | 10 | VARALAHGV | VARALAHGV | ||
| 21 | 28 | YIPLVGAP | YIP | YIPLVGAPV | |
| 19 | 26 | MGYIPLVGA | MGYIP | MGYIPLVGA | |
| 58 | 41 | YVGATTASI * | |||
| 140 | 39 | MVVAHILRL* | |||
| 2 | 27 | WRNTSGLYV | |||
| 138 | 28 | ||||
| 56 | 21 | V | |||
| 9 | 31 | YVLTN | |||
| 161 | 15 | WGVLAG | WGV | WG | |
| 93 | 11 | FLVGQ | FLVGQAFTF | ||
| 181 | 91 | IIMVMFSGV | |||
| 130 | 35 | MMMNWSP | MMMNWSPA | ||
| 134 | 6 | WSPA | |||
| 132 | 14 | MNWSPA | |||
| 169 | 18 | YY | |||
| 47 | 21 | WTP | |||
| 172 | 25 | M | MQG | WTP | |
| 145 | 19 | ILRLPQTLF | |||
| 122 | 3 | ||||
| 151 | 48 | ||||
| 337 | 4 | WEF | |||
| 339 | 46 | F | |||
| 35 | 41 | ||||
| 342 | L | L | LVFLLLADA | ||
| 100 | 50 | ||||
| 198 | 47 | ||||
| 218 | 12 | ||||
| 62 | 45 | ||||
| 26 | 46 | ||||
| 57 | 12 | FYYHKFN | FYYHKFN | ||
| 83 | 26 | ||||
| 58 | 24 | ||||
| 286 | 17 | LLHSTTE | LLHSTTELA | ||
| 129 | 14 | VVVGTTD | VVVGTTDRLDD * | VVVGTTD | |
| 320 | 46 | VQYLYGVGS | VQYLYGVGS | ||
| 159 | 14 | ||||
| 293 | 7 | LAILPCSFT | |||
| 335 | 4 | LKWEF | |||
| 322 | 5 | YLYGVGS | YLYGVGSGM | ||
| 300 | 17 | FTPMPALST | |||
| 245 | 4 | ||||
| 18 | 26 | ||||
| 100 | 4 | ||||
| 257 | 19 | V | |||
| 206 | 8 | ||||
| 1 | 8 | ||||
| 267 | 10 | WTRGERC | |||
| 310 | 11 | IHLHQNIVD * | IHLHQNIVD | ||
| 101 | 51 | VRAHVLVRL | |||
| 62 | 50 | VILLTSLLY * | |||
| 73 | 24 | LVFDI | LVFDI | L | |
| 153 | 7 | LKDLAVATE * | |||
| 113 | 37 | ||||
| 130 | 11 | VGRWFNTYL * | |||
| 123 | 31 | FQM | |||
| 137 | 21 | YLYDHLAPM | |||
| 74 | 23 | VFDI | VFDI | ||
| 107 | 36 | LVRLCM | |||
| 108 | 51 | VRLCM | |||
| 89 | 33 | YFVRAHVLV | |||
| 11 | 15 | ||||
| 37 | 18 | I | Y | ||
| 33 | 8 | WWNQY | |||
| 185 | 10 | I | I | ILCGLPVSA | |
| 145 | 18 | MQHWAAAGL | |||
| 50 | 21 | VP | |||
| 88 | 35 | LYLIQ | |||
| 158 | 14 | VA | VA | VATEPVIFS | |
| 37 | 19 | YTICRCESAD * | |||
| 175 | 11 | WGADTAACG * | WGADTAACG * | WGADTAACG | |
| 4 | 46 | VQ | VQVLS | ||
| 43 | 42 | ||||
| 129 | 21 | VCTRGVAKA | VC | ||
| 24 | 13 | WTVYHGAG | WTVYHGAG | ||
| 84 | 18 | VIP | |||
| 138 | 45 | ||||
| 140 | 43 | FIPVE | |||
| 6 | 19 | ||||
| 53 | 29 | LVGWPAP | LVGWP | ||
| 27 | 22 | YHGAG | |||
| 14 | 10 | ||||
| 77 | 25 | LVTR | LVTR | ||
| 98 | 12 | LSPRPL | |||
| 124 | 44 | ||||
| 23 | 43 | VVIVG | VVIVG | VVIVGHIEL | |
| 3 | 43 | WVL | WVL | WVLLGGVLAA | |
| 4 | 40 | VL | VL | VLLGGVLAAL | |
| 38 | 11 | VPDKEVLYQ * | |||
| 24 | 8 | VIVGHIELG | |||
| 10 | 8 | LAALAAYCL | LAALAAYCLS | LAALAAYCLS | |
| 16 | 6 | YCLS | YCLSVGCVV | ||
| 26 | 9 | VGHIELGGK | |||
| 25 | 29 | IVGHIELGG | |||
| 20 | 15 | VGCVVIVGH | |||
| 9 | 9 | ||||
| 29 | 14 | IELGGKPAL | |||
| 81 | 41 | FFNILGGWV | |||
| 153 | 51 | ||||
| 152 | 51 | ||||
| 39 | 16 | WNF | WNF | WNFVSGIQY | |
| 165 | 35 | LVVGV | LVVGV | LVVGVICAA | |
| 82 | 32 | ||||
| 81 | 5 | FFNILGGWV | |||
| 63 | 9 | LMAF | LMAF | LMAF | |
| 27 | 35 | WQKLE | WQKLEAFWH * | ||
| 167 | 11 | VGV | VGV | VGVICAA | |
| 45 | 35 | ||||
| 64 | 23 | MAF | MAF | MAF | |
| 84 | 24 | ILGGWVA | ILGGWVA | ILGGWVATH | |
| 103 | 10 | V | VVSGLAGAA | ||
| 166 | 31 | VVGV | VVGV | VGVICAA | |
| 85 | 3 | LGGWVA | LGGWVA | LGGWVATHL | |
| 60 | 15 | VASLMAF | |||
| 41 | 8 | F | F | FVSGIQYLA | |
| 139 | 21 | FKIM | FKIMGGE | ||
| 9 | 14 | LQRATQQQA * | |||
| 122 | 6 | LDILAGYGA * | |||
| 104 | 3 | VSGLAGAAI | |||
| 39 | 51 | MR | FISCQKGY | MRLAGPRTC* | |
| 3 | 22 | F | FISCQKGYK * | ||
| 19 | 25 | V | |||
| 73 | 20 | LLRDE | LLRDE | LLRDEITFV * | |
| 16 | 33 | WRVAA | WRVAA | WRVAANSYV | |
| 55 | 4 | FTE | FTEVDGVRL ** | FTEVDGVRL | |
| 80 | 25 | FVVGLNSYA * | |||
| 32 | 16 | FHYITGATE | |||
| 61 | 27 | VRLHRYAP | VRLHRYAPP * | ||
| 87 | 20 | Y | VRLHRYAP | YAIGSQLPC * | |
| 23 | 14 | YVEV | YVEV | YVEVRRVGD | |
Bold amino acid residues in T-cell Epitope column indicates the anchor residues
Bold individual amino acid residues in HCV Genotype 1, 2 and 3 columns indicated the variation in peptide in comparison to the predicted epitope
*Indicates that one of the protein sequence selected for epitope conservancy either does not respond or have conservancy lower then 70%
** Indicates that only one of the protein sequence from selected sequences respond to epitope conservancy
D Indicates that amino acid residue in case of single/double variation diverted their group compared to primary epitope using pI value
DD Indicates that both amino acid residues in case of double variation diverted their group compared to primary epitope using pI value
Predicted HLA I epitopes HCV Proteins of Pakistani origin and their conservancy in Genotype 1, 2 and 3 worldwide
| Epitope start Position | Predicted T-cell epitopes | HLA alleles | HCV genotype 1 | HCV Genotype 2 | HCV Genotype 3 |
|---|---|---|---|---|---|
| 38 | 9 | ||||
| 35 | 25 | VLPRRGPRL | |||
| 55 | 8 | ||||
| 41 | 20 | R | RALEDGINF * | ||
| 7 | 15 | VIDTLTCGF | VIDT | VIDTLTCGF * | |
| 35 | 24 | ALAHGVR | ALAHGVRVL | ALAHGVRAL * | |
| 24 | 18 | LVGAP | LVGAPVGGV * | ||
| 26 | 9 | GAP | GAP | GAPVGGVAR * | |
| 135 | 14 | SPA | |||
| 86 | 19 | GD | GDVCGAV | GD | |
| 144 | 22 | HILRLPQTL * | |||
| 156 | 27 | IAGAHWGVL | IAGAHWG | ||
| 64 | 25 | ASIR | |||
| 285 | 11 | PLLHSTTE | PLLHSTTEL | ||
| 305 | 25 | AL | ALSTGLIHL | ||
| 227 | 20 | CSFT | CSFTPMPAL | ||
| 71 | 27 | ||||
| 157 | 28 | ||||
| 212 | 24 | ||||
| 6 | 25 | ||||
| 157 | 28 | ||||
| 172 | 6 | VITWGADTA * | |||
| 75 | 12 | FDI | FDI | FDI | |
| 70 | 15 | YPSL | |||
| 57 | 21 | G | G | GSRDGVILL | |
| 60 | 26 | DGVILLTSL * | |||
| 148 | 27 | WAA | WAAAGLKDL * | ||
| 50 | 11 | VPPLLARGS * | |||
| 46 | 28 | L | L | LQVWVPPLL * | |
| 117 | 16 | V | |||
| 65 | 23 | LTSLLYPSL | |||
| 6 | 48 | TLGAG | |||
| 73 | 31 | LVFDI | LVFDI | L | |
| 145 | 26 | MQHWAAAGL | |||
| 178 | 21 | DTAACGDI | DTAACGDI | DTAACGDIL | |
| 119 | 8 | GH | GHV | ||
| 27 | 14 | YHGAG | YHGAG | ||
| 128 | 8 | AVCTRGVAK | AVCTRGVAK * | ||
| 57 | 11 | PAP | P | ||
| 98 | 25 | LSPRPL | |||
| 95 | 24 | ||||
| 130 | 20 | CTRGVAKA | |||
| 7 | 24 | ||||
| 3 | 11 | WVL | WVL | WVLLGGVLA | |
| 23 | 21 | VVIVG | VVIVG | VVIVGHIEL | |
| 10 | 24 | LAALAAYCL * | LAALAAYCL | LAALAAYCL | |
| 5 | 27 | L | L | LLGGVLAAL | |
| 96 | 6 | PQSSSAFVV | |||
| 40 | 30 | NF | NF | NFVSGIQYL | |
| 81 | 13 | FFNILGGWV | |||
| 46 | 17 | ||||
| 102 | 9 | FV | FVVSGLAGA | ||
| 54 | 14 | LPGNPA | LPGNPA | LPGNPAVAS | |
| 161 | 14 | ||||
| 141 | 7 | IMGGE | |||
| 164 | 11 | ALVVGV | ALVVGV | ALVVGVICA | |
| 117 | 22 | LG | LG | LG | |
| 59 | 9 | A | A | AVASLMAF | |
| 152 | 15 | ||||
| 113 | 24 | GIGLG | |||
| 56 | 12 | GNPA | GNPA | GNPAVASLM | |
| 52 | 21 | STLPGNPA | STLPGNPAV | STLPGNPAV | |
| 85 | 26 | LGGWVA | LGGWVA | LGGWVATHL | |
| 145 | 11 | ||||
| 99 | 22 | SSAFVVSGL | |||
| 33 | 16 | HVKNGSMR | HVKNGSMR | HVKNGSMRL | |
| 49 | 6 | VPA | VPA | VPAAEFFTE | |
| 79 | 10 | TF | TF | TF | |
| 76 | 19 | DE | DE | DEITF | |
Bold amino acid residues in T-cell Epitope column indicates the anchor residues
Bold individual amino acid residues in HCV Genotype 1, 2 and 3 columns indicated the variation in peptide in comparison to the predicted epitope
*Indicates that one of the protein sequence selected for epitope conservancy either does not respond or have conservancy lower then 70%
** Indicates that only one of the protein sequence from selected sequences respond to epitope conservancy
D Indicates that amino acid residue in case of single/double variation diverted their group compared to primary epitope using pI value
DD Indicates that both amino acid residues in case of double variation diverted their group compared to primary epitope using pI value
Figure 1A comparative analysis of HCV 3a Predictive epitopes against MHC II alleles and their conservancy analysis in Genotype 1, 2 and 3 worldwide.
Figure 2A comparative analysis of HCV 3a Predictive epitopes predicted against MHC I and their conservancy analysis in Genotype 1, 2 and 3 worldwide.