| Literature DB >> 28264652 |
Shyamashree Banerjee1, Parth Sarthi Sen Gupta1, Amal Kumar Bandyopadhyay2.
Abstract
BACKGROUND: Japanese encephalitis virus (JEV) is a mosquito-borne flavivirus that causes Japanese Encephalitis (JE) and Acute Encephalitis Syndrome (AES) in humans. Genotype-I (as co-circulating cases with Genotype-III) was isolated in 2010 (JEV28, JEV21) and then in 2011 (JEV45) from Midnapur district, West Bengal (WB) for the first time from clinical patients who were previously been vaccinated with live attenuated SA14-14-2 strain. We apply bioinformatics and immunoinformatics on sequence and structure of E protein for analysis of crucial substitutions that might cause the genotypic transition, affecting protein-function and altering specificity of epitopes.Entities:
Keywords: B-cell & T-cell epitopes; Docking; Genotype I; Genotype III; Homology model; Japanese encephalitis virus; Midnapur; PEP-FOLD; SNP Energetics
Mesh:
Substances:
Year: 2017 PMID: 28264652 PMCID: PMC5339996 DOI: 10.1186/s12865-017-0197-9
Source DB: PubMed Journal: BMC Immunol ISSN: 1471-2172 Impact factor: 3.615
List of domain specific mutations in E protein of GI isolates in reference (Ref) to GIII strains
| Genotype I (GI) | Genotype III (GIII) | |||||||
|---|---|---|---|---|---|---|---|---|
| E Protein | Mutations | AB051292 | JN703381 | JN703382 | KC526872 | 3P54 | SA14 virulent | SA14-14-2 Non-virulent |
| Domain I | N--2H | . | + | + | + | N | N | N |
| N--2 T | + | . | . | . | N | N | N | |
| Y183F | + | . | . | . | Y | Y | Y | |
| R193K | + | . | . | . | R | R | K | |
| K138Ea | (+)a | (+)a | (+)a | (+)a | Kb | E | K | |
| V176Ia | (+)a | (+)a | (+)a | (+)a | Vb | I | V | |
| A177Ta | (+)a | (+)a | (+)a | (+)a | T | T | A | |
| Domain II | C-60Y | + | . | . | . | C | C | C |
| N103K | + | + | + | + | N | N | N | |
| F107La | (+)a | (+)a | (+)a | (+)a | Fb | L | F | |
| T129M | + | . | . | . | T | T | T | |
| A222S | + | + | + | + | A | A | A | |
| G244E | + | + | + | + | G | G | G | |
| G261S | . | + | + | + | G | G | G | |
| H264Qa | (+)a | (+)a | (+)a | (+)a | Q | Q | H | |
| M279Ka | (+)a | (+)a | (+)a | (+)a | Mb | K | M | |
| Domain III | V315A | + | + | + | + | V | V | V |
| S327T | + | + | + | + | S | S | S | |
| A366S | + | . | . | . | A | A | A | |
| V372L | + | . | + | + | V | V | V | |
| M374I | + | . | + | + | M | M | M | |
| G388K | + | . | + | + | G | G | G | |
| W396R | + | . | + | + | W | W | W | |
| Total mutations | 21 | 13 | 17 | 17 | 4 | 6 | Ref | |
| Net mutations | 15 | 7 | 11 | 11 | - | - | ||
Amino acids marked with bsign in 3P54 have been identified as mismatches between PDB and UNIPROT sequence (P27395)
All comparisons are made for first 406 residues of E protein (ecto domain). All three GI isolates of Midnapur/WB are taken with earlier Japan strain (Ishikawa). In GIII category, three sequences such as 3P54_A, virulent SA14 (ID M55506) and attenuated non-virulent SA14-14-2 (ID JN604986) are considered. + Sign indicates presence and dot indicates absence of mutations respectively. (+) indicates category of reported neurovirulent mutations [13]. Identical neurovirulent mutations of present isolates as SA14 are identified witha sign
Fig. 1Comparison of net substitutions for each domains and total of E protein among JEV28 (yellow), JEV45 (red) and Ishikawa isolates (cyan). JEV21 is not included in the plot as it is identical to JEV45
Fig. 2Residual plot of average physicochemical properties for GI isolates (Ishikawa, JEV28 and JEV45) in reference to SA14-14-2. Analyses were performed using PHYSICO [17] and PHYSICO2 [18] programs with an input of E proteins of SA14-14-2, Ishikawa, JEV28 and JEV45 in FASTA format. Percentile difference for each property was computed against the reference data and plotted using SYSTAT Sigma Plot v11.0
Fig. 3Backbone Cα-traces of homology model (a for Ishikawa, b for JEV45 and c for JEV28) of the ecto domain of PM0080325, PM0080324 and PM0080323 respectively. Domain I (red), domain II (yellow) and domain III (blue) are shown in different colors. Fusion loop (purple) and substitutions with respect to the template (SA14-14-2) are highlighted in each of the model based on their occurrence (see Table 1). Disulfide bonds (green) are also highlighted in each of the model based on their occurrence and only labeled in case of JEV45. Comparison of ANOLEA-profiles [61] of PM0080324 (green trace) with template (red trace) (D1), VERIFY3D (D2) analysis [31] and Ramachandran plot (D3) of main chain dihedral angles (core region is outlined in deep blue and allowed region in red) for residues (glycine as green circle and non-glycine as pink points) of the model along with PROCHECK [29] analysis (D4) are presented for model validation
Sequence and structure based evaluation of the effect of mutations (Table 1)
| Sequence based prediction | Structure based prediction | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Mutation | SIFT | PHD-SNP | SNAP | META-SNP | SDM | |||||
| Score | Effect | Score | Effect | Score | Effect | Score | Effect | ∆∆G | ||
| Domain I | N002H | 0.500 | N | 0.056 | N | 0.180 | N | 0.115 | N | +0.01 |
| K138Ea | 1.000 | N | 0.121 | N | 0.185 | N | 0.111 | N | +0.52 | |
| V176Ia | 0.400 | N | 0.047 | N | 0.160 | N | 0.090 | N | +0.22 | |
| A177Ta | 1.000 | N | 0.042 | N | 0.105 | N | 0.063 | N | −0.17b | |
| Y183F | 0.200 | N | 0.488 | N | 0.525 | N | 0.367 | N | +1.79 | |
| R193K | 0.450 | N | 0.490 | N | 0.340 | N | 0.414 | N | −0.81 | |
| Domain II | C060Y | 0.000 | D | 0.935 | D | 0.800 | D | 0.882 | D | −2.33 |
| N103K | 0.000 | D | 0.833 | D | 0.780 | D | 0.771 | D | +1.61 | |
| F107La | 1.000 | N | 0.218 | N | 0.310 | N | 0.155 | N | −0.97 | |
| T129M | 0.070 | N | 0.106 | N | 0.410 | N | 0.143 | N | +0.86 | |
| A222S | 0.420 | N | 0.177 | N | 0.115 | N | 0.162 | N | −0.77 | |
| G244E | 1.000 | N | 0.070 | N | 0.215 | N | 0.065 | N | +1.85 | |
| G261S | 0.200 | N | 0.152 | N | 0.105 | N | 0.134 | N | +1.74 | |
| H264Qa | 0.074 | N | 0.074 | N | 0.225 | N | 0.078 | N | −0.75b | |
| M279Ka | 0.390 | N | 0.159 | N | 0.255 | N | 0.121 | N | −1.97 | |
| Domain III | V315A | 0.620 | N | 0.066 | N | 0.400 | N | 0.094 | N | −1.63 |
| S327T | 0.820 | N | 0.067 | N | 0.175 | N | 0.103 | N | +0.97 | |
| A366S | 0.740 | N | 0.190 | N | 0.235 | N | 0.185 | N | −1.18 | |
| V372L | 0.310 | N | 0.129 | N | 0.245 | N | 0.164 | N | −0.79 | |
| M374I | 0.540 | N | 0.225 | N | 0.365 | N | 0.169 | N | +0.08 | |
| G388K | 0.010 | D | 0.910 | D | 0.775 | D | 0.793 | D | +3.00 | |
| W396R | 0.000 | D | 0.841 | D | 0.865 | D | 0.819 | D | −2.48 | |
D disease, N normal; aindicates these six mutations are not GI specific but also present in SA14, GIII isolates with reference to vaccine strain SA14-14-2 (Table 1);
bindicates ∆∆G was calculated in reverse mutation form i.e. T177A and Q264H as WT E protein possesses T and Q at these positions respectively Server based four independent methods for sequence of ecto domain of E protein and Site Directed Mutator (SDM) [41] method for structure of all isolates was used for the purpose (see Materials and Methods for details)
B cell specific antigenic peptide epitopes short listed from a large set of initial population based on their antigenic score, model energy, average ASA (Accessible surface area) and average conservation
| Start Position | Linear Epitope | Score | Model Energy | Av. ASA | Av. H |
|---|---|---|---|---|---|
| 325 | EL | 0.986 | −23.71 | 22.8 | 0.01 |
| 372 | VEMEPPFGDSYIVVGR | 0.977 | −24.54 | 37.1 | 0.02 |
| 100 | GWG | 0.956 | −29.81 | 42.1 | 0.02 |
| 69 | STVARCPTTGEAHNEKRADS | 0.926 | −12.95 | 59.3 | 0.01 |
| 395 | H | 0.876 | −23.38 | 97.2 | 0.02 |
| 121 | CTSKAIGRMIQPENIKY | 0.854 | −18.21 | 38.2 | 0.06 |
| 349 | TPVGRLVTVNPFVATSS | 0.923 | −20.55 | 39.9 | 0.04 |
| 48 | IEASQLAEVRSY | 0.967 | −32.65 | 31.4 | 0.06 |
NACCESS program [44] was used to generate residue accessibility data. Then average accessibility was computed using side chain relative accessibility values. Variability was calculated using full length E protein sequences (total 50) and positional Shannon entropy (H) [17, 18] was computed for each homologous position. The Av. Variability is the average of all residues value. Average is made up to 404 residues as the model has 404 residues like template
Fig. 4B cell specific conformational antigenic determinants (AI) and linear epitopes (AII). (AI, AII) In each case domains of E protein are presented by different colors (i.e. domain I red, domain II silver and domain III green). (B) Representative Epitope pairs (BI for G388K and BII for W396R) for comparison of main chain topology of wild-type (blue) and mutant-type (red). The wild-type (yellow) and mutant (cyan) type residues are shown in stick formats. Each of this peptide structure was generated using PEP-FOLD [43] followed by normalization and fixation of charge and potential. RMSD of mutant structure was computed in reference to its wild-type. F-L: Fusion loop; RGD-L: RGD loop
Prediction of MHC class I specific peptide epitopes
| Start | Peptide | MHC-I binding scorea | Proteosomal Cleavage scoreb | Allele | Antigenic Scorec |
|---|---|---|---|---|---|
| 59 | Y | 1.3–17.0 | 0.82 | HLA-A*24:02,HLA-A*23:01,HLA-A*29:02,HLA-B*18:01,HLA-B*39:01, | 0.9572 |
| 95 | GFTDRGWG | 4.4–15.5 | 0.85 | HLA-A*03:01,HLA-A*11:01,HLA-A*29:02,HLA-A*30:01,HLA-A*33:03, | 1.4597 |
| 382 | YIVVGR | 6.4–6.6 | 0.92 | HLA-A*03:01; HLA-A*11:01;HLA-A*68:01 | 2.5185 |
| 394 | HH | 8.9–13.7 | 0.70 | HLA-B*07:02, HLA-B*14:02 | 0.5088 |
aPercentile score vary in allele specific manner. The lower the score better is the binding specificity [46]. The range of scores was obtained by setting cut off. bProteosomal cleavage score is the average of two kinds of scores: constitutively expressed type and INF-γ induced type. cAntigenic score was computed using Doytchinova and Flower's (2007) method [47]
Three score criteria were followed to screen the epitopes: MHC class I binding score, proteosomal cleavage score and antigenicity score. Association of HLA alleles for each epitope was determined by a cut off for percentile rank value ≤ 20.0. Epitopes were further checked for their efficacy in interactions at the peptide binding groove (see below)
MHC class II specific top scored antigenic peptides along with associated alleles
| Start | Peptide | MHC-II binding scorea | Allele | Antigenic Scoreb |
|---|---|---|---|---|
| 55 | EVRSY | 0.25–1.76 | DRB1*03:05,DRB1*04:01,DRB1*04:08,DRB1*04:09,DRB1*04:07,DRB1*04:21, | 0.9127 |
| 95 | GFTDRGWG | 5.37–17.95 | DRB1*01:02,DRB1*03:05,DRB1*07:03,DRB1*08:01,DRB1*11:01,DRB1*11:28, | 0.9232 |
| 380 | DSYIVVGR | 0.1–1.97 | DRB1*03:06,DRB1*03:05,DRB1*03:01,DRB1*03:08,DRB1*03:07,DRB1*08:01, | 1.5861 |
| 383 | IVVGR | 0.4–4.04 | DRB1*03:06,DRB1*03:05,DRB1*03:01,DRB1*03:08,DRB1*03:07,DRB1*04:02, | 1.1724 |
aPercentile score vary in allele specific manner. The less the score better is the binding. The range of scores was obtained by setting cut off. bAntigenic score was computed using Doytchinova and Flower's (2007) method [47]
Epitopes were further checked for their efficacy in interactions at the peptide binding groove (see below)
Energetics of docking complex formed between epitopes and PBG of MHCs along with their conservation, accessibility and conformational stability values
| IEDB Server Analyses | Shannon Variability | NACCESS | Peptide | Docking Analyses of MHCs-Peptides | ||||||
|---|---|---|---|---|---|---|---|---|---|---|
| SL. NO. | Amino acid Position | Peptide | Average | Average side-chain Accessibility | CGE | GE | VDWa | ACE | HB | |
| MHC-I | 1 | 59 | Y | 0.02 | 23.5 | −4.70 | −42.89 | −16.11 | −5.89 | −4.74 |
| 2 | 95 | GFTDRGWG | 0.03 | 50.4 | −6.61 | −28.87 | −21.69 | −4.29 | −2.42 | |
| 3 | 382 | YIVVGR | 0.01 | 32.0 | −5.52 | −29.36 | −11.13 | −4.13 | −1.95 | |
| 4 | 394 | HH | 0.03 | 64.3 | −5.84 | −39.03 | −14.41 | −2.02 | −7.97 | |
| MHC-II | 1 | 55 | EVRSY | 0.04 | 24.7 | −16.96 | −48.65 | −15.18 | −9.98 | −2.92 |
| 2 | 95 | GFTDRGWG | 0.02 | 57.9 | −17.23 | −46.57 | −14.25 | −5.75 | −4.75 | |
| 3 | 380 | DSYIVVGR | 0.04 | 34.9 | −16.07 | −53.97 | −22.96 | −1.11 | −5.02 | |
| 4 | 383 | IVVGR | 0.00 | 36.8 | −15.34 | −52.90 | −13.06 | −3.63 | −2.61 | |
aVDW is the net stabilization due to Van der Waals interactions i.e. (attractive VDW – repulsive VDW); GE global stabilization energy, ACE atomic contact energy, HB hydrogen bond, CGE coarse-grained free-energy of peptide-structure used for docking [49]
Fig. 5Docking and structure based short listing of antigenic epitopes using human class I MHC (HLA-A i.e. 2X4O.pdb in a) and class II MHC (HLA-DRB1 i.e. 1DLH.pdb in b) crystal structures. Typical sequence and structure of docked peptide (blue color A & B) with white one as mutated residue. PBG indicates peptide binding groove which was mapped using NACCESS procedure [44] (see Materials and Methods for details). Two views per MHC class are shown for better visualization of binding pocket and binding groove. Chains of both classes of MHCs are shown by conventional notations