| Literature DB >> 32104691 |
Chulmin Park1, Won-Bok Kim1, Sung-Yeon Cho1,2, Eun-Jee Oh3, Hyeyoung Lee4, Kyungjoon Kang5, Yoonsuk Lee5, Dong-Gun Lee1,2.
Abstract
Even in countries that are currently not facing a flavivirus epidemic, the spread of mosquito-borne flaviviruses presents an increasing public threat, owing to climate change, international travel, and other factors. Many of these countries lack the resources (viral strains, clinical specimens, etc.) needed for the research that could help cope with the threat imposed by flaviviruses, and therefore, an alternative approach is needed. Using an in silico approach to global databases, we aimed to design and develop flavivirus NS1 recombinant proteins with due consideration towards antigenic variation. NS1 genes analyzed in this study included a total of 6,823 sequences, from Dengue virus (DENV), Japanese encephalitis virus (JEV), West Nile virus (WNV), Zika virus (ZIKV), and Yellow fever virus (YKV). We extracted and analyzed 316 DENV NS1 sequence types (STs), 59 JEV STs, 75 WNV STs, 30 YFV STs, and 43 ZIKV STs using a simple algorithm based on phylogenetic analysis. STs were reclassified according to the variation of the major epitope by MHC II binding. 78 DENV epitope type (EpT), 29 JEV EpTs, 29 WNV EpTs, 12 YFV EpTs, and 5 ZIKV EpTs were extracted according to their major epitopes. Also, frequency results showed that there were dominant EpTs in all flavivirus. Fifteen STs were selected and purified for the expression of recombinant antigen in Escherichia coli by sodium dodecyl sulfate extraction. Our study details a novel in silico approach for the development of flavivirus diagnostics, including a simple way to screen the important peptide regions.Entities:
Year: 2020 PMID: 32104691 PMCID: PMC7040382 DOI: 10.1155/2020/3865707
Source DB: PubMed Journal: Biomed Res Int Impact factor: 3.411
Figure 1Overview of the in silico approach and the screening using the Escherichia coli expression system in this study.
Figure 2Frequency analysis of (a) DENV, (b) WNV (>200 of maximum frequency), and (c) JEV, YFV, and ZIKV (<200 of maximum frequency) NS1 genes. The x-axis indicates the sequence type (ST) of each flavivirus, while the y-axis represents the frequencies of each ST within the flavivirus NS1 genes registered in GenBank. These data were analyzed using blastp.
Figure 3Phylogenetic analysis of the consensus sequence of each flavivirus NS1 using the Maximum Likelihood method, based on the JTT matrix-based model.
Prediction of MHC class II binding peptide in DENV NS1 using the T-cell epitope prediction tool at IEDB.
| Amino acid positiona | Human allele | Mean percentile rankb | Mean IC50c | Core peptided | Homology (%)e | Epitope IDf | |
|---|---|---|---|---|---|---|---|
| DENV-1 | 28–48 | HLA-DRB | 0.207 | 49 |
| 99.56 | 869135 |
|
| 99.56 | ||||||
|
| 99.56 | ||||||
|
| 99.44 | ||||||
| SPKRLSAAI | 98.89 | ||||||
| 95–113 | HLA-DRB | 0.228 | 60.6 | ILAQGKKMI | 96.89 | 738503 | |
| GKKMIRPQP | 98.11 | ||||||
| KKMIRPQPM | 98.38 | ||||||
| 157–175 | HLA-DPA/DPB | 0.574 | 79.22 | GFGIFTTNI | 98.33 | 869721 | |
| IFTTNIWLK | 98.22 | ||||||
|
| 99.44 | ||||||
| 204–224 | HLA-DRB | 0.217 | 45.64 | SEKNETWKL | 99.0 | 195579 | |
| NETWKLARA | 98.67 | ||||||
| ETWKLARAS | 98.78 | ||||||
| TWKLARASF | 99.22 | ||||||
| WKLARASFI | 99.44 | ||||||
| KLARASFIE | 99.44 | ||||||
|
| |||||||
| DENV-2 | 14–33 | HLA-DRB | 0.4 | 64.5 | KCGSGIFIT | 96.67 | |
| IFITDNVHT | 96.33 | ||||||
| FITDNVHTW | 96.22 | ||||||
| 28–43 | HLA-DRB | 0.65 | 146 | TEQYKFQPE | 98.56 | 539828 | |
| YKFQPESPS | 98.33 | ||||||
| FQPESPSKL | 98.67 | ||||||
| 157–176 | HLA-DPA/DPB | 0.614 | 118.25 | GVFTTNIWL | 98.11 | 866460 | |
| VFTTNIWLK | 96.11 | 869722 | |||||
|
| 97.33 | 866831 | |||||
| 192–211 | HLA-DQA/DQB | 0.215 | 456.83 | DMGYWIESA | 98.44 | 867040 | |
| MGYWIESAL | 98.11 | ||||||
| 204–223 | HLA-DRB | 0.062 | 595.67 | SALNDTWKM | 94.33 | 539079 | |
| TWKMEKASF | 96.0 | ||||||
| WKMEKASFI | 96.11 | ||||||
| 229–244 | HLA-DQA/DQB | 0.88 | 542.5 | WSNGVLESE | 99.22 | ||
| GVLESEMII | 98.11 | ||||||
| 317–335 | HLA-DRB | 0.46 | 415.8 | TLPPLRYRG | 99.11 | 180396 | |
| YRGEDGCWY | 99.22 | ||||||
| RGEDGCWYG | 99.5 | ||||||
| GEDGCWYGM | 99.67 | ||||||
| EDGCWYGME | 99.78 | ||||||
| DGCWYGMEI | 100 | ||||||
|
| |||||||
| DENV-3 | 28–48 | HLA-DRB | 0.435 | 48.15 |
| 99.56 | 190996 |
|
| 99.56 | ||||||
|
| 99.56 | ||||||
|
| 99.56 | ||||||
| SPKRLATAI | 99.78 | ||||||
| 157–176 | HLA-DPA/DPB | 0.703 | 128.15 | GVFTTNIWL | 99.44 | 866460 | |
| VFTTNIWLK | 99.11 | ||||||
|
| 99.11 | ||||||
| 192–210 | HLA-DQA/DQB | 0.702 | 819.6 | DMGYWIESQ | 99.78 | 190715 | |
| 206–223 | HLA-DRB | 0.806 | 78.11 | SWKLEKASL | 97.56 | 539717 | |
| WKLEKASLI | 97.56 | ||||||
|
| |||||||
| DENV-4 | 14–34 | HLA-DRB | 0.193 | 52.86 | KCGSGIFVV | 95.11 | 46388 |
| IFVVDNVHT | 95.11 | ||||||
| FVVDNVHTW | 95.33 | ||||||
| HTWTEQYKF | 99.78 | ||||||
|
| 99.78 | ||||||
| 28–46 | HLA-DRB | 0.282 | 95.6 | YKFQPESPA | 99.56 | 539829 | |
| FQPESPARL | 99.56 | ||||||
| QPESPARLA | 99.56 | ||||||
These data were extracted by filtering at below 1.0 percentile rank. Bold letters indicate the same core peptide among DENV serotypes. aNS1 amino acid sequence number. bMean of percentile ranks of the predicted peptides in this region. cMean of IC50 by SMM-align method for the predicted peptides in this region. dData predicted by the combinatorial peptide library, SMM-align, and NN-align methods. eValues calculated as mean of the homology for each amino acid sequence of each ST. fThese represent epitopes associated with the positive result of T-cell response (T-cell assay such as ELISPOT, ICS assay, and MHC ligand assay), previously reported at some reference. Epitope ID is assigned at IEDB. These epitopes include all or part of the core peptides in the table.
MHC class I binding peptide common in DENV types predicted by the T-cell epitope prediction tool at IEDB.
| Start no. of amino acid position | Major allele | Peptide of DENV-1 | Peptide of DENV-2 | Peptide of DENV-3 | Peptide of DENV-4 |
|---|---|---|---|---|---|
| 20 | HLA-B |
| FITDNVHTW |
| FVVDNVHTW |
| 26 | HLA-A | H | H | H | H |
| 38 | HLA-B |
| SPSKLASAI |
| SPARLASAI |
| 107 | HLA-B |
| QPTELKYSW |
| PASDLKYSW |
| 160 | HLA-B | FGIFTTNIW |
|
| FGMFTTNIW |
| 243 | HLA-B | IPKIYGGPI | IPKNFAGPV | IPKSLAGPI | IPKSYAGPF |
| 252 | HLA-B | SQHNYRPGY | SQHNYRPGY | SQHNHRPGY | SQHNYRQGY |
| 303 | HLA-B | VTGKIIHEW | ASGKLITEW | VSGKLIHEW | ASGKLVTQW |
These data were extracted by filtering at below 0.6 percentile rank. Bold letters indicate that the peptide is shared with the MHC class II binding core peptide.
Prediction of MHC class II binding peptides in JEV NS1 by the T-cell epitope prediction tool at IEDB.
| Genotype | Amino acid positiona (epitope ID)d | Human allele | Mean percentile rankb | Core peptidec |
|---|---|---|---|---|
| Genotype 1 & 3 | 14–32 | HLA-DRB | 0.568 |
|
| 91–109 | HLA-DRB | 0.474 | RYRSAPKRL | |
| 143–159 | HLA-DQA/DQB | 0.687 | CPDEHRAWN | |
| 148–168 | HLA-DRB | 0.317 |
| |
| 192–211 (539105, 540128) | HLA-DQA/DQB | 0.502 | DLSYWIESR | |
| 267–287 | HLA-DRB, HLA-DQA/DQB | 0.116 | WDENGIVLD | |
| 317–335 | HLA-DRB | 0.698 | SLPPLRFRT | |
| 332–350 | HLA-DRB | 0.316 | GMEIRPVRH | |
|
| ||||
| Genotype 5 | 14–32 | HLA-DRB | 0.568 |
|
| 144–158 | HLA-DQA/DQB | 0.95 |
| |
| 148–168 | HLA-DRB | 0.284 |
| |
| 192–211 (539105, 540128) | HLA-DQA/DQB | 0.195 | DLSYWIESH | |
| 267–287 | HLA-DRB | 0.171 | DENEITLDF | |
| 332–349 | HLA-DRB | 0.458 | GMEIRPMKH | |
These data were extracted by filtering at below 1.0 percentile rank. Bold letters indicate the same core peptide among JEV genotypes. aNS1 amino acid sequence number. bMean of percentile ranks of the predicted peptides in this region. cValues predicted by the combinatorial peptide library, SMM-align, and NN-align methods. dThese represent epitopes associated with the positive result of T-cell response (T-cell assay such as ELISPOT, ICS assay, and MHC ligand assay), previously reported at some reference. Epitope ID is assigned at IEDB. These epitopes include all or part of the core peptides in the table.
Prediction of MHC class II binding peptide in NS1 from WNV, YFV, and ZIKV by the T-cell epitope prediction tool at IEDB.
| Amino acid positiona (epitope ID)d | Human allele | Meanb | Core peptidec |
|---|---|---|---|
|
| |||
| 14–32 | HLA-DRB | 0.642 |
|
| 91–109 | HLA-DRB | 0.102 | MYK |
| 144–158 | HLA-DQA/DQB | 0.96 | QNRAWNSLE |
| 154–174 (21175) | HLA-DRB, HLA-DPA/DPB | 0.611 | EVEDFGFGL |
| 188–211 (159026) | HLA-DRB, HLA-DQA/DQB | 0.244 | AIHSDLSYW |
| 317–336 (57994) | HLA-DRB | 0.293 | YQTDSGCWY |
|
| |||
|
| |||
| 15–31 (10088) | HLA-DQA/DQB | 0.91 | IFIFRDSDD |
| 64–78 (113828) | HLA-DRB, HLA-DQA/DQB | 0.162 | EHEMWRSRA |
| 71–95 (113319, 137424) | HLA-DQA/DQB | 0.633 | DEINAIFEE |
| 144–158 (181115] | HLA-DPA/DPB, HLA-DQA/DQB | 0.804 | NRVWNSFQI |
| 169–183 (137404, 232424] | HLA-DRB | 0.01 | MDAVFEYTI |
| 207–221 | HLA-DQA/DQB | 0.7 | WMIHTLEAL |
| 222–237 (231830152485) | HLA-DRB | 0.675 | KECEWPLTH |
|
| |||
|
| |||
| 14–31 (569047, 863166, 871319) | HLA-DRB | 0.89 | RCGTGVFVY |
| 28–46 (862875, 862438) | HLA-DRB | 0.69 | WRDRYKYHP |
| 67–86 (862872, 862852) | HLA-DQA/DQB | 0.57 | SVEGELNAI |
| 142–160 (862457, 862672, 863124] | HLA-DPA/DPB | 0.321 | HRAWNSFLV |
| 157–160 (862889, 862596] | HLA-DPA/DPB | 0.819 | GVFHTSVWL |
| 193–201 (862673] | HLA-DQA/DQB | 0.668 | DLGYWIESE |
These data were extracted by filtering at below 1.0 percentile rank. Bold letters indicate the same core peptide as that of JEV. aNS1 amino acid sequence number. bMean of percentile ranks of the predicted peptides in this region. cValues predicted by the combinatorial peptide library, SMM-align, and NN-align methods. dThese represent epitopes associated with the positive result of T-cell response (T-cell assay such as ELISPOT, ICS assay, and MHC ligand assay), previously reported at some reference. Epitope ID is assigned at IEDB. These epitopes include all or part of the core peptides in the table.
The reclassification of each STs by the locus of major epitopes.
| DENV1 | DENV2 | DENV3 | DENV4 | JEV | WNV | YFV | ZIKV | |
|---|---|---|---|---|---|---|---|---|
| Major epitope locusa | E1 (28–46) | E1 (14–43) | E1 (28–46) | E1 (14–43) | E1 (14–29) | E1 (14–29) | E1 (19–29) | E1 (14–42) |
| E2 (96–109) | E2 (161–171) | E2 (161–171) | E2 (96–105) | E2 (96–105) | E2 (64–93) | E2 (70–84) | ||
| E3 (159–171) | E3 (197–218) | E3 (197–205) | E3 (143–168) | E3 (146–171) | E3 (147–183) | E3 (147–172) | ||
| E4 (204–219) | E4 (232–243) | E4 (209–218) | E4 (197–206) | E4 (193–205) | E4 (211–236) | E4 (197–205) | ||
| E5 (317–335) | E5 (268–281) | E5 (317–331) | ||||||
| E6 (317–346) | ||||||||
|
| ||||||||
| STb | 113 | 94 | 64 | 45 | 59 | 75 | 30 | 43 |
|
| ||||||||
| EpTc | 31 | 33 | 9 | 5 | 29 | 29 | 12 | 5 |
|
| ||||||||
| Frequency of major EpT (allelic profile) (%)d | EpT 1 (1-1-1-1) (73.9%) | EpT 1 (1-1-1-1-1) (31.5%) | EpT 1 (1-1-1-1) (84.3%) | EpT 1(1) (67.7%) | EpT 1 (1-1-1-1-1-1) (37.9%) | EpT 1 (1-1-1-1-1) (78.6%) | EpT 1 (1-1-1-1) (23.3%) | EpT 1 (1-1-1-1) (88.4%) |
| EpT 2 (1-6-5-1) (10.1%) | EpT 2 (1-1-3-1-1) (38.2%) | EpT 2 (1-1-1-1) (13.3%) | EpT 1(2) (24.4%) | EpT 1 (1-1-2-3-1-1) (30.7%) | EpT 2 (1-4-1-1) (20.6%) | |||
| EpT 3 (1-1-1-2) (5.9%) | ||||||||
aMajor epitope locus is the region in which the core peptides (Tables 1–4) are distributed. The number in parentheses is the amino acid sequence number. bSequence type (ST) was classified by the phylogenetic analysis. The numbers in this row represent the total sum of each ST. cAll the variations of epitopes were assigned allele numbers and combined into an allelic profile and assigned an epitope type (EpT). The numbers in this row represent the total sum of each EpT. dThe frequency registered on GenBank.
Figure 4SDS polyacrylamide gel electrophoresis (PAGE) analysis showing the example of Ni-NTA purification of flavivirus NS1 recombinant protein by the Escherichia coli expression system (a), western blot analysis of DENV NS1 (b), and western blot analysis of ZIKV NS1 recombinant proteins (c). (a) Each fraction was loaded as follows: 1, E. coli lysate expressing DENV 1 NS1 recombinant protein after IPTG induction; 2, fraction (unbound) through-out in Ni-NTA column; 3, washing fraction at 50 mM imidazole; 4, elute 1 fraction at 150 mM imidazole; 5, elute 2 fraction at 150 mM imidazole; 6, E. coli lysate expressing DENV type 2 NS1 recombinant protein; 7, fraction (unbound) through-out in Ni-NTA column; 8, washing fraction; 9, elute 1 fraction; 10, elute 2 fraction, M: PageRuler protein ladder (Abcam plc., Cambridge, UK). (b) Anti-DENV NS1 primary antibody (titer 1 : 2,000) and anti-mouse IgG secondary antibody (1 : 20,000) were used. 1–4; DENV type 1 NS1 recombinant protein, 5-6; DENV type 2 NS1 recombinant protein, 7-8; DENV type 3 and 4 NS1 recombinant protein, 9–12; DENV type 1–4 NS1 recombinant protein (Native Antigen Company), 13; JEV-NS1 (Native Antigen Company), 14; TBEV NS1 (Native Antigen Company), 15; WNV-NS1 (Native Antigen Company), 16; YFV-NS1 (Native Antigen Company), 17; ZIKV Uganda strain NS1 (Native Antigen Company), 18; ZIKV Suriname strain NS1 (Native Antigen Company). (c) Anti-ZIKV NS1 primary antibody (titer 1 : 2,000) and anti-mouse IgG secondary antibody (1 : 20,000) were used. 1; WNV NS1 recombinant protein, 2; YFV NS1 recombinant protein, 3; ZIKV NS1 recombinant protein, 4; JEV NS1 recombinant protein, 5; DENV NS1 recombinant protein, 6; ZIKV Uganda strain NS1 (Native Antigen Company), 7; DENV type 1 NS1 (Native Antigen Company), 8; DENV type 2 (Native Antigen Company), 9; DENV type 3 (Native Antigen Company), 10; DENV type 4 (Native Antigen Company), M; PageRuler protein ladder (Abcam).