| Literature DB >> 29169357 |
Lorrany Dos Santos Franco1, Paloma Oliveira Vidal1, Jaime Henrique Amorim2.
Abstract
BACKGROUND: The arboviruses Zika virus (ZIKV) and Dengue virus (DENV) have important epidemiological impact in Brazil and other tropical regions of the world. Recently, it was shown that previous humoral immunity to DENV enhances ZIKV replication in vitro, which may lead to more severe forms of the disease. Thus, traditional approaches of vaccine development aiming to control viral infection through neutralizing antibodies may induce cross-reactive enhancing antibodies. In contrast, cellular immune response was shown to be capable of controlling DENV infection independently of antibodies. The aim of the present study was to design a flavivirus NS5 protein capable of inducing a cellular immune response against DENV and ZIKV.Entities:
Keywords: Bioinformatics; Dengue virus; Epitopes; Vaccine; Zika virus
Mesh:
Substances:
Year: 2017 PMID: 29169357 PMCID: PMC5701345 DOI: 10.1186/s12929-017-0395-z
Source DB: PubMed Journal: J Biomed Sci ISSN: 1021-7770 Impact factor: 8.410
Fig. 1Evolutionary history of selected amino acid sequences of ZIKV and DENV NS5 proteins. The tree shows that NS5 sequences from Brazilian ZIKV isolates grouped with those from Asian ZIKV lineage. The evolutionary history was inferred by using the Maximum Likelihood method based on the JTT matrix-based model [1]. The percentage of trees in which the associated taxa clustered together is shown next to the branches. Initial tree(s) for the heuristic search were obtained automatically by applying Neighbor-Join and BioNJ algorithms to a matrix of pairwise distances estimated using a JTT model, and then selecting the topology with superior log likelihood value. The tree is drawn to scale, with branch lengths measured in the number of substitutions per site. The analysis involved 195 amino acid sequences. All positions containing gaps and missing data were eliminated. There were a total of 896 positions in the final dataset. Evolutionary analyses were conducted in MEGA7
Fig. 2Consensus sequence of the ZIKV NS5 protein. Amino acid sequences from ZIKV NS5 database were used to construct a consensus sequence. The methyltransferase MTase domain is shown in gray, the linker region between MTase and RdRp domains is shown in black and underlined and the RdRp domain is shown black and bold
Frequency of class I and class II HLA alelles (≥ 3%) in the Brazilian population
| Class |
| Alelle | Frequence |
|---|---|---|---|
| Class I | HLA-A | A*01:01 | 0.091 |
| A*02:01 | 0.192 | ||
| A*02:02 | 0.040 | ||
| A*02:24 | 0.040 | ||
| A*03:01 | 0.051 | ||
| A*11:01 | 0.061 | ||
| A*24:02 | 0.081 | ||
| A*29:02 | 0.030 | ||
| A*31:01 | 0.040 | ||
| A*33:03 | 0.030 | ||
| A*68:02 | 0.035 | ||
| HLA-B | B*07:02 | 0.101 | |
| B*08:01 | 0.043 | ||
| B*08:05 | 0.036 | ||
| B*18:01 | 0.051 | ||
| B*35:01 | 0.051 | ||
| B*44:03 | 0.035 | ||
| B*51:01 | 0.051 | ||
| HLA-C | C*02:02 | 0.033 | |
| C*03:03 | 0.048 | ||
| C*03:04 | 0.048 | ||
| C*04:01 | 0.138 | ||
| C*05:01 | 0.062 | ||
| C*07:01 | 0.157 | ||
| C*07:02 | 0.081 | ||
| C*08:02 | 0.076 | ||
| C*12:03 | 0.038 | ||
| C*15:02 | 0.033 | ||
| C*17:01 | 0.033 | ||
| Class II | HLA-DRB1 | DRB1*01:01 | 0.055 |
| DRB1*01:02 | 0.035 | ||
| DRB1*03:01 | 0.121 | ||
| DRB1*04:01 | 0.035 | ||
| DRB1*07:01 | 0.075 | ||
| DRB1*08:04 | 0.030 | ||
| DRB1*11:01 | 0.060 | ||
| DRB1*11:04 | 0.035 | ||
| DRB1*13:01 | 0.070 | ||
| DRB1*13:02 | 0.050 | ||
| DRB1*14:01 | 0.035 | ||
| DRB1*15:01 | 0.050 | ||
| DRB1*15:03 | 0.045 |
ZIKV NS5 predicted epitopes 100% conserved among ZIKV lineages and DENV serotypes and their respective population coverage rates (%) considering the Brazilian population
| Epitope | Locationa | Alleles | Population coverage |
|---|---|---|---|
| LSRNSTHEMY | 211–220 | HLA-A*01:01 | 13.87% |
| CVYNMMGKR | 451–459 | HLA-A*03:01, HLA-A*31:01, HLA-A*33:03 | 12.06% |
| CVYNMMGKREK | 451–461 | HLA-A*03:01, HLA-A*11:01 | 12.06% |
| AIWYMWLGAR | 474–483 | HLA-A*31:01 | 11.06% |
| WYMWLGAR | 476–483 | HLA-A*31:01, HLA-A*33:03, HLA-C*04:01 | 33.73% |
| RAIWYMWLGAR | 473–483 | HLA-A*33:03 | 3.9% |
| IWYMWLGAR | 475–483 | HLA-A*31:01, HLA-A*33:03 | 11.06% |
| LEFEALGF | 485–492 | HLA-B*18:01, HLA-C*04:01 | 8.0% |
| YADDTAGW | 532–539 | HLA-C*03:03, HLA-C*05:01, HLA-C*08:02, HLA-C*17:01 | 9.64% |
| RAIWYMWL | 473–480 | HLA-C*03:03, HLA-C*07:02, HLA-C*15:02 | 9.64% |
| YADDTAGWDT | 532–541 | HLA-C*05:01, HLA-C*08:02 | 9.7% |
| YADDTAGWDTR | 532–542 | HLA-C*05:01, HLA-C*08:02 | 9.7% |
| YADDTAGWD | 532–540 | HLA-C*05:01, HLA-C*12:03 | 9.7% |
| ADDTAGWD | 533–540 | HLA-C*05:01 | 9.7% |
| GRGGWSYY | 83–90 | HLA-C*07:01, HLA-C*07:02 | 23.83% |
| SRNSTHEM | 212–219 | HLA-C*07:01, HLA-C*07:02 | 23.83% |
| SRNSTHEMY | 212–220 | HLA-C*07:01, HLA-C*07:02 | 23.83% |
| SRNSTHEMYW | 212–221 | HLA-C*07:01 | 23.83% |
| SRAIWYMWL | 472–480 | HLA-C*07:02 | 19.15% |
aLocation of the predicted epitope into the ZIKV NS5 consensus amino acid sequence
Total population coverage of selected epitopes for different regions of the world
| Epitope | North America | Central America | South America | Europe | East Africa | West Africa | Central Africa | North Africa | South Africa | Northeast Asia | South Asia | East Asia | Southeast Asia | Southwest Asia | Oceania |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| LSRNSTHEMY | 12.72% | 0.00% | 6.03% | 25.67% | 11.24% | 7.31% | 2.84% | 14.18% | 0.22% | 3.44% | 13.80% | 2.55% | 2.10% | 14.66% | 11.38% |
| CVYNMMGKR | 13.11% | 0.00% | 6.37% | 25.05% | 7.49% | 9.48% | 12.29% | 8.96% | 0.14% | 3.05% | 12.33% | 1.47% | 1.18% | 9.15% | 4.04% |
| CVYNMMGKREK | 13.11% | 0.00% | 6.37% | 25.05% | 7.49% | 9.48% | 12.29% | 8.96% | 0.14% | 3.05% | 12.33% | 1.47% | 1.18% | 9.15% | 4.04% |
| AIWYMWLGAR | 7.09% | 0.00% | 16.56% | 4.72% | 2.11% | 1.71% | 1.14% | 3.40% | 0.07% | 4.08% | 8.42% | 13.92% | 2.65% | 5.82% | 3.28% |
| WYMWLGAR | 7.09% | 0.00% | 16.56% | 4.72% | 2.11% | 1.71% | 1.14% | 3.40% | 0.07% | 4.08% | 8.42% | 13.92% | 2.65% | 5.82% | 3.28% |
| RAIWYMWLGAR | 6.62% | 0.00% | 2.55% | 0.91% | 2.34% | 12.54% | 7.21% | 6.08% | 0.07% | 11.17% | 15.47% | 16.92% | 16.79% | 5.21% | 0.64% |
| IWYMWLGAR | 7.09% | 0.00% | 16.56% | 4.72% | 2.11% | 1.71% | 1.14% | 3.40% | 0.07% | 4.08% | 8.42% | 13.92% | 2.65% | 5.82% | 3.28% |
| LEFEALGF | 6.11% | 0.00% | 4.86% | 11.8% | 8.12% | 4.85% | 3.75% | 5.85% | 0.06% | 0.68% | 2.51% | 0.0% | 4.64% | 5.91% | 1.22% |
| YADDTAGW | 6.48% | 0.00% | 4.03% | 8.6% | 1.72% | 0.32% | 0.9% | 0.99% | 0.13% | 9.52% | 2.98% | 20.72% | 7.96% | 1.75% | 12.2% |
| RAIWYMWL | 6.48% | 0.00% | 4.03% | 8.6% | 1.72% | 0.32% | 0.9% | 0.99% | 0.13% | 9.52% | 2.98% | 20.72% | 7.96% | 1.75% | 12.2% |
| YADDTAGWDT | 7.51% | 0.00% | 6.38% | 10.3% | 2.6% | 2.6% | 2.62% | 5.92% | 0.00% | 0.93% | 4.09% | 1.06% | 0.15% | 3.5% | 5.44% |
| YADDTAGWDTR | 7.51% | 0.00% | 6.38% | 10.3% | 2.6% | 2.6% | 2.62% | 5.92% | 0.00% | 0.93% | 4.09% | 1.06% | 0.15% | 3.5% | 5.44% |
| YADDTAGWD | 7.51% | 0.00% | 6.38% | 10.3% | 2.6% | 2.6% | 2.62% | 5.92% | 0.00% | 0.93% | 4.09% | 1.06% | 0.15% | 3.5% | 5.44% |
| ADDTAGWD | 7.51% | 0.00% | 6.38% | 10.30% | 2.60% | 2.60% | 2.62% | 5.92% | 0.00% | 0.93% | 4.09% | 1.06% | 0.15% | 3.50% | 5.44% |
| GRGGWSYY | 16.98% | 0.00% | 13.05% | 25.11% | 21.52% | 21.54% | 23.17% | 24.53% | 0.39% | 1.53% | 11.44% | 11.64% | 7.22% | 11.85% | 4.22% |
| SRNSTHEM | 16.98% | 0.00% | 13.05% | 25.11% | 21.52% | 21.54% | 23.17% | 24.53% | 0.39% | 1.53% | 11.44% | 11.64% | 7.22% | 11.85% | 4.22% |
| SRNSTHEMY | 16.98% | 0.00% | 13.05% | 25.11% | 21.52% | 21.54% | 23.17% | 24.53% | 0.39% | 1.53% | 11.44% | 11.64% | 7.22% | 11.85% | 4.22% |
| SRNSTHEMYW | 16.98% | 0.00% | 13.05% | 25.11% | 21.52% | 21.54% | 23.17% | 24.53% | 25.18% | 1.53% | 11.44% | 11.64% | 7.22% | 11.85% | 4.22% |
| SRAIWYMWL | 21.28% | 0.00% | 28.04% | 22.28% | 6.29% | 4.16% | 12.72% | 7.26% | 0.28% | 28.5% | 16.05% | 20.09% | 33.1% | 12.49% | 16.87% |
| Total | 68.5% | 0.00% | 64.63% | 81.76% | 50.55% | 51.39% | 53.14% | 58.57% | 1.0% | 51.69% | 63.85% | 65.32% | 59.37% | 54.4% | 48.67% |
Molecular modeling results of the amino acid consensus sequence of ZIKV NS5
| Ranka | PDB Hitb | TM-scorec | RM SDd | IDENe | Covf |
|---|---|---|---|---|---|
| 1 |
| 0.977 | 0.30 | 0.960 | 0.978 |
| 2 |
| 0.972 | 0.93 | 0.694 | 0.980 |
| 3 |
| 0.714 | 4.18 | 0.518 | 0.784 |
| 4 |
| 0.616 | 2.98 | 0.689 | 0.660 |
| 5 |
| 0.615 | 1.87 | 0.677 | 0.631 |
| 6 |
| 0.517 | 3.62 | 0.139 | 0.568 |
| 7 |
| 0.516 | 3.57 | 0.151 | 0.570 |
| 8 |
| 0.512 | 2.41 | 0.694 | 0.534 |
| 9 |
| 0.501 | 4.16 | 0.148 | 0.566 |
| 10 |
| 0.497 | 4.24 | 0.132 | 0.564 |
aRanking of proteins is based on TM-score of the structural alignment between the query structure and known structures in the PDB library
bDigital object identifier of the top 10 proteins deposited in the PDB library which have the closest structural similarity to the ZIKV conosensus NS5;
cAlignment score with the most similar protein structure found in PDB library;
dRoot-mean-square deviation of atomic positions (root-mean-square deviation, RMSD) between residues that are structurally aligned by TM-align
ePercentage of sequence identity in the structurally aligned region;
fCov represents the coverage of the alignment by TM-align and is equal to the number of structurally aligned residues divided by length of the query protein
Fig. 3Distribution of class I HLA predicted epitopes along the ZIKV NS5 consensus 3D model. Epitopes found to be 100% conserved among ZIKV lineages and DENV serotypes were shown to be homogeneously distributed along the protein structure. Three regions containing epitopes specific for HLA-A presentation were found (a). One of them is located at the MTase domain and contains only one epitope (Region 1, epitope LSRNSTHEMY) (b). Two other regions (2 and 3) which contain together six epitopes are located at the RdRp domain (c). Two epitopes are located in region 2 (Region 2, epitopes CVYNMMGKR and CVYNMMGKREK) and four in region 3 (Region 3, epitopes AIWYMWLGAR, WYMWLGAR, RAIWYMWLGAR and IWYMWLGAR). One region containing only one epitope specific for HLA-B presentation is located at the RdRp domain (Region 1, epitope LEFEALGF) (d). Five regions containing epitopes specific for HLA-C presentation were found (e). Two of them are located at the MTase domain. Region 1 contains three epitopes (Region 1, epitopes SRNSTHEM, SRNSTHEMY and SRNSTHEMYW) and region 2 contains one epitope (Region 2, epitope GRGGWSYY) (f). Three other regions are located at the RdRp domain: region 3 contains three epitopes (Region 3, epitopes RAIWYMWL, WYMWLGAR and SRAIWYMWL), region 4 contains one epitope (Region 4, epitope LEFEALGF) and region 5 contains four epitopes (Region 5, epitopes YADDTAGWDT, YADDTAGWDTR, YADDTAGWD and ADDTAGWD) (g)