Literature DB >> 28670016

Systematic analysis of protein identity between Zika virus and other arthropod-borne viruses.

Hsiao-Han Chang1, Roland G Huber2, Peter J Bond2, Yonatan H Grad3, David Camerini4, Sebastian Maurer-Stroh2, Marc Lipsitch1.   

Abstract

OBJECTIVE: To analyse the proportions of protein identity between Zika virus and dengue, Japanese encephalitis, yellow fever, West Nile and chikungunya viruses as well as polymorphism between different Zika virus strains.
METHODS: We used published protein sequences for the Zika virus and obtained protein sequences for the other viruses from the National Center for Biotechnology Information (NCBI) protein database or the NCBI virus variation resource. We used BLASTP to find regions of identity between viruses. We quantified the identity between the Zika virus and each of the other viruses, as well as within-Zika virus polymorphism for all amino acid k-mers across the proteome, with k ranging from 6 to 100. We assessed accessibility of protein fragments by calculating the solvent accessible surface area for the envelope and nonstructural-1 (NS1) proteins.
FINDINGS: In total, we identified 294 Zika virus protein fragments with both low proportion of identity with other viruses and low levels of polymorphisms among Zika virus strains. The list includes protein fragments from all Zika virus proteins, except NS3. NS4A has the highest number (190 k-mers) of protein fragments on the list.
CONCLUSION: We provide a candidate list of protein fragments that could be used when developing a sensitive and specific serological test to detect previous Zika virus infections.

Entities:  

Mesh:

Year:  2016        PMID: 28670016      PMCID: PMC5487971          DOI: 10.2471/BLT.16.182105

Source DB:  PubMed          Journal:  Bull World Health Organ        ISSN: 0042-9686            Impact factor:   9.408


Introduction

Monitoring the geographic and the demographic distribution of people infected with Zika virus is important for informing decision-makers and researchers during the ongoing epidemic. Health officials also need further knowledge about the associations between Zika virus infection and its sequelae, such as microcephaly and Guillain–Barré syndrome. However, the absence of a sensitive and specific serological test for detecting prior Zika virus infection impedes research. According to the World Health Organization’s Target product profiles for better diagnostic tests for Zika virus infection, such a test must be able to differentiate between chikungunya, dengue and Zika viruses, since these mosquito-borne arboviruses can be co-circulating and can cause similar symptoms. Dengue and Zika viruses belong to the virus family Flaviviridae, while chikungunya virus belongs to the Togaviridae family. Although they belong to different virus families, Zika and chikungunya viruses share some similarities in envelope protein folding and membrane fusion mechanisms. Active Zika virus infections can be detected by nucleic acid-based diagnostic tools., However, developing serological diagnostic tests to detect previous Zika virus infections has been challenging, because of cross-reactivity between antibodies against different arboviruses.– Hence, current serological assays, such as enzyme-linked immunosorbent assay (ELISA) and plaque reduction neutralization tests, may not be able to distinguish if a person has been infected with Zika virus or another flavivirus or if a person has received a previous yellow fever or Japanese encephalitis vaccination., A study has shown that neutralizing monoclonal antibodies generated against recombinant fragments of the envelope protein of dengue virus serotype 2 tend to be cross-reactive among flaviviruses, while nonneutralizing antibodies seem to be virus specific. We hypothesize that immunogenic protein regions with sequence dissimilarity may exist across arthropod-borne viruses (arboviruses) and that antibodies targeting these regions may be less likely to be cross-reactive. Identifying such regions could aid the development of specific microarray-based serological tests, such as a peptide microarray, to detect Zika virus and/or other related viruses. A peptide microarray is a high-throughput method for detecting interactions between peptides and antibodies and is composed of multiple spots of peptides on a solid surface. We also hypothesize that protein regions that are more conserved among different strains of the Zika virus are more likely to contribute to the sensitivity of the peptide microarray. Thus, to identify Zika virus conserved protein fragments that are variable among other virus species, we analysed proportions of protein sequence identity across virus species and protein polymorphism among different strains of Zika virus. We analysed the flaviviruses Zika, dengue, West Nile, Japanese encephalitis and yellow fever, and the alphavirus chikungunya.

Methods

We used publicly available proteomic sequencing data (Table 1). For the Zika virus, we used data set A from Faria et al. We downloaded the protein sequences of Japanese encephalitis virus, yellow fever virus and chikungunya virus from the National Center for Biotechnology Information (NCBI) protein database and the sequences for dengue virus serotypes 1–4 and West Nile virus from NCBI virus variation resource.
Table 1

Proteomic sequencing data used to compare identity between viruses and within viruses

SpeciesCollection dateWHO RegionNo. of samples
ZIKV1947–2015African, Americas, Western Pacific34
DENV101/01/2010– 06/01/2016African, Americas, European, South-East Asia, Western Pacific171
DENV201/01/2010– 06/01/2016Americas, Eastern Mediterranean, South-East Asia, Western Pacific158
DENV301/01/2010– 06/01/2016Americas, Eastern Mediterranean, South-East Asia, Western Pacific62
DENV401/01/2010– 06/01/2016Americas, South-East Asia, Western Pacific58
WNV01/01/2008–06/01/2016Americas, European, South-East Asia, 44
JEV1951–2012South-East Asia, Western Pacific19
YFV1981–2016African, Americas, Western Pacific31
CHIKV1953-2015African, Americas, European, South-East Asia, Western Pacific212

CHIKV: chikungunya virus; DENV1−4: dengue virus serotype 1; JEV: Japanese encephalitis virus; WHO: World Health Organization; WNV: West Nile virus; YFV; yellow fever virus; ZIKV: Zika virus.

Note: For ZIKV, we used data set A from Faria et al. We downloaded the protein sequences of JEV, YFV and CHIKV from the National Center of Biotechnology Information (NCBI) protein database and sequences for DENV serotypes 1–4 and WNV from NCBI virus variation resource.

CHIKV: chikungunya virus; DENV1−4: dengue virus serotype 1; JEV: Japanese encephalitis virus; WHO: World Health Organization; WNV: West Nile virus; YFV; yellow fever virus; ZIKV: Zika virus. Note: For ZIKV, we used data set A from Faria et al. We downloaded the protein sequences of JEV, YFV and CHIKV from the National Center of Biotechnology Information (NCBI) protein database and sequences for DENV serotypes 1–4 and WNV from NCBI virus variation resource. We used BLASTP to find regions of identity between arboviruses, applying a default Expect (E)-value threshold of 10, that is the expected number of hits of the observed similarity, by chance, is fewer than 10. The results are robust and we obtained the same results when E-value thresholds were 5 or 50. When comparing the chikungunya and the Zika viruses, we used an E-value threshold of 1000, because chikungunya does not belong to the Flaviviridae family and we could not identify any regions of similarity when using an E-value threshold of 10. For all protein fragments across the proteome, we calculated the proportion of shared amino acids between virus species and polymorphism among different Zika virus strains. We analysed protein fragments of different lengths, so called k-mers (where k is the amino acid length of the protein fragment), with k equal to 6 or ranging from 10 to 100. We used a sliding window approach, where we moved the window one amino acid at a time along the proteome to include every possible k-mer. To be conservative, we identified protein fragment identity between species by the maximum identity among all the pairs of strains for each window considered. For analysing the identity with dengue virus, we used the highest identity between the Zika virus and all four serotypes of the dengue virus for each window considered. To assess if protein identity between the Zika virus and each of the dengue serotype was significantly associated with polymorphism within each dengue virus serotype, we calculated P-values by using Pearson's correlation test. To identify polymorphisms within viruses, we used both the average pairwise difference and the proportion of polymorphic sites. Average pairwise difference is calculated by averaging the proportions of differences in peptide sequences from all pairs of the virus strains. We chose to plot the proportion of polymorphic sites in the figures because it is less sensitive to population structure and/or sampling bias. To identify potential protein fragments that could be used for diagnostic tests, we selected k-mers with low proportion of identity between the Zika virus and other arboviruses as well as low polymorphism between different strains of Zika virus as lead candidate protein fragments. The rationale for this approach was that fragments with low between-species identity and low within-species polymorphism are most likely to have both the required specificity and sensitivity for such tests. We chose k-mers in the bottom quintile of values of identity and polymorphism for each k-mer length. Insights into protein structures are critical for assessing the possible antigenicity of peptides, because buried peptides are less likely to be antigenic. To determine if any of the fragments are exposed or buried in the two Zika virus proteins with available protein structures, the envelope protein and the non-structural (NS) protein 1, we calculated the solvent accessible surface area for each amino acid. We used the published structures of dimeric NS1 (protein data bank identification, PDB ID: 5GS6) and the envelope protein in the biological assembly of the mature virus (PDB ID: 5IRE). To calculate the solvent accessible surface area, we used the linear combinations of pairwise overlaps method and used 10 Å2 as the upper limit for buried residues, as this value corresponds to half the surface area of a single water molecule. The regions at the C-terminal end of the dengue virus envelope protein interact with the viral lipid membrane and are unlikely to be exposed. Due to the high structural similarity of the envelope proteins between dengue and Zika viruses, we assume that the region from residue 404 to the C-terminus in Zika virus envelope protein is also buried. For the lead candidate list, we excluded the k-mers without any continuous exposed peptides longer than five amino acids in the two proteins, because exposed peptides are more likely to be antigenic. The threshold of five amino acids was chosen because 99.7% of experimentally determined antigenic B-cell epitopes for flaviviruses found in Virus Pathogen Database and Analysis Resource database are longer than five amino acids. We obtained the list of theses epitopes through the database’s web site at http://www.viprbrc.org/.

Results

On average, Zika virus shares 55.6% amino acid sequence identity with dengue virus, 46.0% with yellow fever virus, 56.1% with Japanese encephalitis virus, 57.0% with West Nile virus and 1.3% with chikungunya virus. The identity between Zika virus and other viruses and Zika virus polymorphism for all k-mers are available from the corresponding author. As an example, Fig. 1 and Fig. 2 show the identity between Zika virus and other viruses investigated and polymorphisms within the Zika virus for all 50-mer peptides.
Fig. 1

Zika virus polymorphism versus identity between Zika virus and other arboviruses, 50-mers across the Zika virus proteome

Fig. 2

Sliding-window identity between Zika virus and other flaviviruses and within-Zika virus polymorphism

Zika virus polymorphism versus identity between Zika virus and other arboviruses, 50-mers across the Zika virus proteome E: envelope; NS; non-structural; prM; precursor membrane. Notes: 50-mers across the Zika virus proteome were analysed, using a sliding window approach. Fifty-mers containing known epitopes in non-Zika virus flaviviruses are shown in green. Polymorphic sites are the sites that vary among different strains of Zika virus. Sliding-window identity between Zika virus and other flaviviruses and within-Zika virus polymorphism E: envelope; NS; non-structural; prM; precursor membrane. Notes: The window size is 50 amino acids and step size is 5 amino acids. The light green shaded area (position 430–500) shows low between-species identity and low within-Zika virus polymorphism. Fig. 3 shows protein fragments mapped to the corresponding envelope or NS1 proteins. The exposed areas of the proteins show regions with both low identity with other flaviviruses and low Zika virus polymorphism.
Fig. 3

Mapping per-site identity and polymorphism onto the structures of Zika virus envelope protein and nonstructural protein 1 dimer

Mapping per-site identity and polymorphism onto the structures of Zika virus envelope protein and nonstructural protein 1 dimer NS1: nonstructural protein 1. Notes: Sites polymorphic within Zika virus or with high identity between Zika virus and other viruses could compromise respectively sensitivity and specificity of a serological test. Thus the most useful sites are expected to be those that lack identity with other viruses and lack polymorphism within Zika virus, shown in red. Identity between a comparator virus and Zika virus is shown in yellow; polymorphism within Zika virus is shown in blue; and sites with both properties are shown in green. The side view of the envelope protein shows the two transmembrane helices on the bottom. These helices likely remain buried within the lipid bilayer envelope and hence are unavailable for interactions with antibodies. In the structures including all viruses, sites are shown as homologous if they are homologous between Zika virus and any of the flaviviruses. Homologous regions are larger for dengue virus because sites are shown as homologous if they are homologous between Zika virus and any of the four dengue virus serotypes. The lead candidate list for developing a specific and sensitive microarray-based serological test contains 294 protein fragments. These fragments have low similarity between viruses, low polymorphism within the Zika virus and continuous exposed peptides longer than five amino acids (Table 2; available at: http://www.who.int/bulletin/volumes/95/7/16-182105). The list excluded 10.9% (36/330) of k-mers containing previously identified B-cell epitopes for other flaviviruses than Zika, because they are likely to be cross-reactive. Protein fragments from all Zika virus proteins, except NS3, are present in the list. NS4A has the highest number (190 k-mers) of candidate protein fragments (Table 3).
Table 2

The lead candidate list of Zika virus protein fragments with low proportion of identity with other flaviviruses and low polymorphism

Position in proteome, aa
ProteinAverage pairwise differencePolymorphic sites, %k-meraHomology with other flaviviruses, %
Peptide sequence
StartEndDENVJEVYFVWNV
2695capsid C0.00780.0429700.48570.48570.22860.5000PFGGLKRLPAGLLLGHGPIRMVLAILAFLRFTAIKPSLGLINRWGSVGKKEAMEIIKKFKKDLAAMLRII
2897capsid C0.00190.0286700.50000.47140.22860.5000GGLKRLPAGLLLGHGPIRMVLAILAFLRFTAIKPSLGLINRWGSVGKKEAMEIIKKFKKDLAAMLRIINA
32101capsid C0.00770.0429700.50000.47140.25710.4857RLPAGLLLGHGPIRMVLAILAFLRFTAIKPSLGLINRWGSVGKKEAMEIIKKFKKDLAAMLRIINARKEK
33102capsid C0.00770.0429700.48570.47140.25710.4857LPAGLLLGHGPIRMVLAILAFLRFTAIKPSLGLINRWGSVGKKEAMEIIKKFKKDLAAMLRIINARKEKK
34103capsid C0.00770.0429700.48570.48570.25710.5000PAGLLLGHGPIRMVLAILAFLRFTAIKPSLGLINRWGSVGKKEAMEIIKKFKKDLAAMLRIINARKEKKR
35104capsid C0.00770.0429700.48570.48570.25710.5000AGLLLGHGPIRMVLAILAFLRFTAIKPSLGLINRWGSVGKKEAMEIIKKFKKDLAAMLRIINARKEKKRR
4594capsid C0.00270.0400500.48000.46000.26000.4800RMVLAILAFLRFTAIKPSLGLINRWGSVGKKEAMEIIKKFKKDLAAMLRI
5493capsid C0.00330.0500400.47500.40000.25000.4250LRFTAIKPSLGLINRWGSVGKKEAMEIIKKFKKDLAAMLR
5594capsid C0.00330.0500400.47500.40000.25000.4250RFTAIKPSLGLINRWGSVGKKEAMEIIKKFKKDLAAMLRI
5695capsid C0.00330.0500400.47500.40000.25000.4250FTAIKPSLGLINRWGSVGKKEAMEIIKKFKKDLAAMLRII
5796capsid C0.00330.0500400.47500.40000.22500.4250TAIKPSLGLINRWGSVGKKEAMEIIKKFKKDLAAMLRIIN
5897capsid C0.00330.0500400.47500.37500.20000.4000AIKPSLGLINRWGSVGKKEAMEIIKKFKKDLAAMLRIINA
5998capsid C0.00330.0500400.47500.37500.22500.4000IKPSLGLINRWGSVGKKEAMEIIKKFKKDLAAMLRIINAR
6099capsid C0.00330.0500400.47500.40000.25000.3750KPSLGLINRWGSVGKKEAMEIIKKFKKDLAAMLRIINARK
61100capsid C0.00330.0500400.47500.40000.22500.3750PSLGLINRWGSVGKKEAMEIIKKFKKDLAAMLRIINARKE
8792capsid C0.00000.000060.33330.16670.16670.1667DLAAML
131136pr0.00000.000060.16670.16670.00000.1667AYYMYL
132137pr0.00000.000060.16670.16670.00000.1667YYMYLD
133138pr0.00000.000060.16670.16670.00000.1667YMYLDR
231240membrane0.00000.0000100.30000.30000.20000.3000SQTWLESREY
411450envelope0.00440.0750400.45000.42500.22500.3750CSKKMTGKSIQPENLEYRIMLSVHGSQHSGMIGHETDENR
412451envelope0.00440.0750400.42500.42500.20000.3500SKKMTGKSIQPENLEYRIMLSVHGSQHSGMIGHETDENRA
413452envelope0.00440.0750400.42500.42500.22500.3750KKMTGKSIQPENLEYRIMLSVHGSQHSGMIGHETDENRAK
414453envelope0.00440.0750400.42500.42500.20000.3750KMTGKSIQPENLEYRIMLSVHGSQHSGMIGHETDENRAKV
415454envelope0.00440.0750400.45000.40000.20000.3500MTGKSIQPENLEYRIMLSVHGSQHSGMIGHETDENRAKVE
419448envelope0.00200.0333300.46670.40000.20000.3667SIQPENLEYRIMLSVHGSQHSGMIGHETDE
420449envelope0.00200.0333300.46670.40000.20000.3667IQPENLEYRIMLSVHGSQHSGMIGHETDEN
421450envelope0.00200.0333300.46670.36670.20000.3333QPENLEYRIMLSVHGSQHSGMIGHETDENR
422451envelope0.00200.0333300.46670.36670.20000.3000PENLEYRIMLSVHGSQHSGMIGHETDENRA
436445envelope0.00000.0000100.40000.30000.20000.3000SQHSGMIGHE
438447envelope0.00000.0000100.40000.30000.20000.3000HSGMIGHETD
439448envelope0.00000.0000100.40000.30000.20000.3000SGMIGHETDE
440449envelope0.00000.0000100.40000.20000.20000.2000GMIGHETDEN
441450envelope0.00000.0000100.40000.20000.20000.2000MIGHETDENR
442451envelope0.00000.0000100.40000.30000.20000.2000IGHETDENRA
626665envelope0.00290.0500400.40000.47500.30000.4750KVPAQMAVDMQTLTPVGRLITANPVITESTENSKMMLELD
627666envelope0.00290.0500400.40000.47500.30000.4750VPAQMAVDMQTLTPVGRLITANPVITESTENSKMMLELDP
629658envelope0.00200.0333300.43330.46670.30000.4333AQMAVDMQTLTPVGRLITANPVITESTENS
913918NS10.00000.000060.33330.00000.00000.0000FVRAAK
913922NS10.00000.0000100.40000.10000.20000.2000FVRAAKTNNS
914919NS10.00000.000060.33330.00000.00000.0000VRAAKT
915920NS10.00000.000060.33330.16670.16670.1667RAAKTN
12941343NS2A0.00430.0600500.36000.36000.30000.3200LAILAALTPLARGTLLVAWRAGLATCGGFMLLSLKGKGSVKKNLPFVMAL
12951344NS2A0.00430.0600500.38000.36000.28000.3200AILAALTPLARGTLLVAWRAGLATCGGFMLLSLKGKGSVKKNLPFVMALG
12991318NS2A0.00000.0000200.35000.35000.25000.3500ALTPLARGTLLVAWRAGLAT
12991338NS2A0.00180.0250400.35000.32500.25000.3250ALTPLARGTLLVAWRAGLATCGGFMLLSLKGKGSVKKNLP
12991348NS2A0.00430.0600500.36000.34000.30000.3000ALTPLARGTLLVAWRAGLATCGGFMLLSLKGKGSVKKNLPFVMALGLTAV
13001319NS2A0.00000.0000200.35000.35000.30000.3500LTPLARGTLLVAWRAGLATC
13001339NS2A0.00180.0250400.35000.32500.25000.3250LTPLARGTLLVAWRAGLATCGGFMLLSLKGKGSVKKNLPF
13001349NS2A0.00430.0600500.36000.34000.30000.3000LTPLARGTLLVAWRAGLATCGGFMLLSLKGKGSVKKNLPFVMALGLTAVR
13011320NS2A0.00000.0000200.35000.35000.25000.3500TPLARGTLLVAWRAGLATCG
13011340NS2A0.00180.0250400.37500.30000.25000.3000TPLARGTLLVAWRAGLATCGGFMLLSLKGKGSVKKNLPFV
13021311NS2A0.00000.0000100.40000.30000.20000.3000PLARGTLLVA
13021321NS2A0.00000.0000200.35000.30000.20000.3000PLARGTLLVAWRAGLATCGG
13031322NS2A0.00000.0000200.35000.25000.15000.2500LARGTLLVAWRAGLATCGGF
13041323NS2A0.00000.0000200.35000.25000.15000.2500ARGTLLVAWRAGLATCGGFM
13051324NS2A0.00000.0000200.40000.30000.15000.3000RGTLLVAWRAGLATCGGFML
13091318NS2A0.00000.0000100.40000.30000.20000.3000LVAWRAGLAT
13101319NS2A0.00000.0000100.40000.20000.20000.2000VAWRAGLATC
13111320NS2A0.00000.0000100.40000.30000.20000.3000AWRAGLATCG
13121321NS2A0.00000.0000100.30000.30000.20000.3000WRAGLATCGG
13131322NS2A0.00000.0000100.20000.30000.20000.3000RAGLATCGGF
13141323NS2A0.00000.0000100.10000.20000.20000.2000AGLATCGGFM
13151324NS2A0.00000.0000100.20000.30000.20000.3000GLATCGGFML
13171322NS2A0.00000.000060.16670.16670.16670.1667ATCGGF
13181323NS2A0.00000.000060.16670.16670.16670.1667TCGGFM
13261331NS2A0.00000.000060.16670.16670.16670.1667SLKGKG
13281333NS2A0.00000.000060.16670.16670.16670.1667KGKGSV
13281337NS2A0.00000.0000100.30000.30000.20000.3000KGKGSVKKNL
13541363NS2A0.00000.0000100.30000.30000.20000.3000INVVGLLLLT
14591468NS2B0.00000.0000100.30000.30000.20000.3000GPPMREIILK
14601469NS2B0.00000.0000100.30000.20000.20000.2000PPMREIILKV
14611470NS2B0.00000.0000100.30000.20000.20000.3000PMREIILKVV
14621467NS2B0.00000.000060.33330.16670.16670.1667MREIIL
14621471NS2B0.00000.0000100.40000.10000.10000.2000MREIILKVVL
14631472NS2B0.00000.0000100.40000.10000.10000.2000REIILKVVLM
14741493NS2B0.00000.0000200.40000.25000.20000.3500ICGMNPIAIPFAAGAWYVYV
14751494NS2B0.00000.0000200.40000.30000.25000.3000CGMNPIAIPFAAGAWYVYVK
14761495NS2B0.00000.0000200.40000.35000.25000.3000GMNPIAIPFAAGAWYVYVKT
14771496NS2B0.00000.0000200.35000.35000.25000.3000MNPIAIPFAAGAWYVYVKTG
14781497NS2B0.00000.0000200.35000.40000.25000.3500NPIAIPFAAGAWYVYVKTGK
14831492NS2B0.00000.0000100.40000.20000.20000.3000PFAAGAWYVY
14841493NS2B0.00000.0000100.30000.20000.20000.2000FAAGAWYVYV
21162215NS4A0.00910.06001000.39000.35000.35000.4400GAAFGVMEALGTLPGHMTERFQEAIDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEI
21172216NS4A0.00910.06001000.39000.35000.34000.4400AAFGVMEALGTLPGHMTERFQEAIDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIE
21182217NS4A0.00840.05001000.40000.35000.34000.4400AFGVMEALGTLPGHMTERFQEAIDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEP
21192218NS4A0.00840.05001000.40000.34000.33000.4400FGVMEALGTLPGHMTERFQEAIDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEPA
21202179NS4A0.00570.0500600.43330.33330.35000.3833GVMEALGTLPGHMTERFQEAIDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTV
21202189NS4A0.00490.0429700.38570.32860.32860.4286GVMEALGTLPGHMTERFQEAIDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMR
21202199NS4A0.00520.0500800.41250.38750.33750.4625GVMEALGTLPGHMTERFQEAIDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFG
21202209NS4A0.00460.0444900.38890.35560.33330.4667GVMEALGTLPGHMTERFQEAIDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWL
21202219NS4A0.00410.04001000.39000.34000.33000.4400GVMEALGTLPGHMTERFQEAIDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEPAR
21212180NS4A0.00570.0500600.43330.33330.35000.3833VMEALGTLPGHMTERFQEAIDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVS
21212190NS4A0.00490.0429700.38570.32860.32860.4143VMEALGTLPGHMTERFQEAIDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRN
21212200NS4A0.00520.0500800.41250.38750.35000.4500VMEALGTLPGHMTERFQEAIDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGM
21212210NS4A0.00460.0444900.38890.35560.34440.4556VMEALGTLPGHMTERFQEAIDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLM
21212220NS4A0.00410.04001000.40000.35000.34000.4400VMEALGTLPGHMTERFQEAIDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEPARI
21222181NS4A0.00570.0500600.43330.33330.35000.4000MEALGTLPGHMTERFQEAIDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSL
21222191NS4A0.00490.0429700.40000.34290.34290.4286MEALGTLPGHMTERFQEAIDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNK
21222201NS4A0.00520.0500800.41250.38750.35000.4625MEALGTLPGHMTERFQEAIDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMV
21222211NS4A0.00460.0444900.40000.36670.34440.4667MEALGTLPGHMTERFQEAIDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMW
21222221NS4A0.00410.04001000.41000.36000.34000.4500MEALGTLPGHMTERFQEAIDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEPARIA
21232182NS4A0.00570.0500600.45000.35000.36670.4167EALGTLPGHMTERFQEAIDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLG
21232192NS4A0.00490.0429700.41430.35710.35710.4429EALGTLPGHMTERFQEAIDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKG
21232202NS4A0.00520.0500800.42500.38750.36250.4750EALGTLPGHMTERFQEAIDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVT
21232212NS4A0.00460.0444900.40000.36670.35560.4667EALGTLPGHMTERFQEAIDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWL
21232222NS4A0.00410.04001000.41000.36000.34000.4500EALGTLPGHMTERFQEAIDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEPARIAC
21242183NS4A0.00240.0333600.45000.33330.36670.4000ALGTLPGHMTERFQEAIDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGI
21242193NS4A0.00200.0286700.42860.35710.37140.4429ALGTLPGHMTERFQEAIDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGI
21242203NS4A0.00270.0375800.42500.38750.36250.4750ALGTLPGHMTERFQEAIDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTL
21242213NS4A0.00240.0333900.40000.35560.35560.4556ALGTLPGHMTERFQEAIDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLS
21242223NS4A0.00210.03001000.41000.35000.34000.4400ALGTLPGHMTERFQEAIDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEPARIACV
21252184NS4A0.00240.0333600.45000.35000.36670.4167LGTLPGHMTERFQEAIDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIF
21252194NS4A0.00200.0286700.44290.37140.37140.4571LGTLPGHMTERFQEAIDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIG
21252204NS4A0.00270.0375800.42500.38750.36250.4875LGTLPGHMTERFQEAIDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLG
21252214NS4A0.00240.0333900.41110.36670.35560.4667LGTLPGHMTERFQEAIDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSE
21252224NS4A0.00210.03001000.41000.36000.34000.4500LGTLPGHMTERFQEAIDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEPARIACVL
21262185NS4A0.00240.0333600.45000.33330.35000.4167GTLPGHMTERFQEAIDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFF
21262195NS4A0.00200.0286700.45710.37140.35710.4571GTLPGHMTERFQEAIDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGK
21262205NS4A0.00270.0375800.42500.37500.35000.4875GTLPGHMTERFQEAIDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGA
21262215NS4A0.00240.0333900.42220.35560.34440.4556GTLPGHMTERFQEAIDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEI
21262225NS4A0.00210.03001000.42000.35000.33000.4400GTLPGHMTERFQEAIDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEPARIACVLI
21272186NS4A0.00240.0333600.43330.31670.35000.4000TLPGHMTERFQEAIDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFV
21272196NS4A0.00200.0286700.45710.37140.37140.4429TLPGHMTERFQEAIDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKM
21272206NS4A0.00270.0375800.42500.36250.35000.4750TLPGHMTERFQEAIDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGAS
21272216NS4A0.00240.0333900.42220.34440.34440.4444TLPGHMTERFQEAIDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIE
21272226NS4A0.00210.03001000.42000.35000.33000.4300TLPGHMTERFQEAIDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEPARIACVLIV
21282187NS4A0.00240.0333600.43330.31670.35000.4167LPGHMTERFQEAIDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVL
21282197NS4A0.00200.0286700.45710.38570.37140.4571LPGHMTERFQEAIDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMG
21282207NS4A0.00270.0375800.42500.36250.35000.4750LPGHMTERFQEAIDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASA
21282217NS4A0.00240.0333900.43330.34440.35560.4444LPGHMTERFQEAIDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEP
21282227NS4A0.00210.03001000.42000.35000.33000.4300LPGHMTERFQEAIDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEPARIACVLIVV
21292188NS4A0.00240.0333600.43330.33330.35000.4333PGHMTERFQEAIDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLM
21292198NS4A0.00200.0286700.44290.38570.35710.4571PGHMTERFQEAIDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGF
21292208NS4A0.00270.0375800.41250.36250.33750.4750PGHMTERFQEAIDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAW
21292218NS4A0.00240.0333900.42220.34440.34440.4444PGHMTERFQEAIDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEPA
21292228NS4A0.00210.03001000.42000.35000.33000.4300PGHMTERFQEAIDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEPARIACVLIVVF
21302179NS4A0.00290.0400500.46000.32000.36000.3600GHMTERFQEAIDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTV
21302189NS4A0.00240.0333600.41670.31670.33330.4167GHMTERFQEAIDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMR
21302199NS4A0.00310.0429700.44290.38570.34290.4571GHMTERFQEAIDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFG
21302209NS4A0.00270.0375800.41250.35000.33750.4625GHMTERFQEAIDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWL
21302219NS4A0.00240.0333900.41110.33330.33330.4333GHMTERFQEAIDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEPAR
21302229NS4A0.00210.03001000.41000.35000.32000.4300GHMTERFQEAIDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEPARIACVLIVVFL
21312180NS4A0.00290.0400500.46000.32000.36000.3800HMTERFQEAIDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVS
21312190NS4A0.00240.0333600.41670.31670.33330.4167HMTERFQEAIDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRN
21312200NS4A0.00310.0429700.44290.38570.35710.4571HMTERFQEAIDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGM
21312210NS4A0.00270.0375800.41250.35000.35000.4625HMTERFQEAIDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLM
21312220NS4A0.00240.0333900.42220.34440.34440.4444HMTERFQEAIDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEPARI
21312230NS4A0.00210.03001000.42000.36000.33000.4400HMTERFQEAIDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEPARIACVLIVVFLL
21322181NS4A0.00290.0400500.44000.30000.36000.3800MTERFQEAIDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSL
21322191NS4A0.00240.0333600.43330.31670.35000.4167MTERFQEAIDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNK
21322201NS4A0.00310.0429700.44290.37140.35710.4571MTERFQEAIDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMV
21322211NS4A0.00270.0375800.42500.35000.35000.4625MTERFQEAIDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMW
21322221NS4A0.00240.0333900.43330.34440.34440.4444MTERFQEAIDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEPARIA
21322231NS4A0.00210.03001000.42000.35000.33000.4300MTERFQEAIDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEPARIACVLIVVFLLL
21332192NS4A0.00240.0333600.45000.33330.36670.4333TERFQEAIDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKG
21332202NS4A0.00310.0429700.45710.37140.37140.4714TERFQEAIDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVT
21332212NS4A0.00270.0375800.42500.35000.36250.4625TERFQEAIDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWL
21332222NS4A0.00240.0333900.43330.34440.34440.4444TERFQEAIDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEPARIAC
21332232NS4A0.00210.03001000.43000.36000.34000.4300TERFQEAIDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEPARIACVLIVVFLLLV
21342203NS4A0.00310.0429700.45710.38570.37140.4857ERFQEAIDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTL
21342213NS4A0.00270.0375800.42500.35000.36250.4625ERFQEAIDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLS
21342223NS4A0.00240.0333900.43330.34440.34440.4444ERFQEAIDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEPARIACV
21342233NS4A0.00210.03001000.43000.37000.35000.4400ERFQEAIDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEPARIACVLIVVFLLLVV
21352204NS4A0.00310.0429700.45710.37140.37140.5000RFQEAIDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLG
21352214NS4A0.00270.0375800.43750.35000.36250.4750RFQEAIDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSE
21352224NS4A0.00240.0333900.43330.34440.34440.4556RFQEAIDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEPARIACVL
21352234NS4A0.00210.03001000.44000.37000.35000.4500RFQEAIDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEPARIACVLIVVFLLLVVL
21362215NS4A0.00270.0375800.43750.35000.36250.4750FQEAIDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEI
21362225NS4A0.00240.0333900.43330.34440.34440.4556FQEAIDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEPARIACVLI
21362235NS4A0.00210.03001000.44000.38000.35000.4600FQEAIDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEPARIACVLIVVFLLLVVLI
21372216NS4A0.00270.0375800.43750.35000.36250.4750QEAIDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIE
21372226NS4A0.00240.0333900.43330.35560.34440.4556QEAIDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEPARIACVLIV
21372236NS4A0.00210.03001000.45000.39000.36000.4700QEAIDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEPARIACVLIVVFLLLVVLIP
21382217NS4A0.00270.0375800.45000.35000.37500.4750EAIDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEP
21382227NS4A0.00240.0333900.43330.35560.34440.4556EAIDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEPARIACVLIVV
21382237NS4A0.00210.03001000.46000.40000.37000.4800EAIDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEPARIACVLIVVFLLLVVLIPE
21392208NS4A0.00310.0429700.44290.35710.35710.5000AIDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAW
21392218NS4A0.00270.0375800.45000.33750.36250.4625AIDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEPA
21392228NS4A0.00240.0333900.44440.34440.34440.4444AIDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEPARIACVLIVVF
21392238NS4A0.00210.03001000.47000.40000.37000.4800AIDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEPARIACVLIVVFLLLVVLIPEP
21402189NS4A0.00290.0400500.46000.30000.36000.4400IDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMR
21402199NS4A0.00360.0500600.48330.38330.36670.4833IDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFG
21402209NS4A0.00310.0429700.44290.34290.35710.4857IDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWL
21402219NS4A0.00270.0375800.43750.32500.35000.4500IDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEPAR
21402229NS4A0.00240.0333900.43330.34440.33330.4444IDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEPARIACVLIVVFL
21402239NS4A0.00210.03001000.47000.40000.36000.4800IDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEPARIACVLIVVFLLLVVLIPEPE
21412190NS4A0.00290.0400500.46000.30000.36000.4400DNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRN
21412210NS4A0.00310.0429700.44290.34290.37140.4857DNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLM
21412220NS4A0.00270.0375800.45000.33750.36250.4625DNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEPARI
21412230NS4A0.00240.0333900.44440.35560.34440.4556DNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEPARIACVLIVVFLL
21412240NS4A0.00210.03001000.48000.41000.36000.4900DNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEPARIACVLIVVFLLLVVLIPEPEK
21422191NS4A0.00290.0400500.46000.30000.36000.4400NLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNK
21422201NS4A0.00360.0500600.46670.36670.36670.4833NLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMV
21422211NS4A0.00310.0429700.44290.34290.35710.4857NLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMW
21422221NS4A0.00270.0375800.45000.33750.35000.4625NLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEPARIA
21422231NS4A0.00240.0333900.43330.34440.33330.4444NLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEPARIACVLIVVFLLL
21422241NS4A0.00210.03001000.48000.41000.36000.4900NLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEPARIACVLIVVFLLLVVLIPEPEKQ
21432212NS4A0.00310.0429700.42860.34290.37140.4857LAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWL
21432222NS4A0.00270.0375800.43750.33750.35000.4625LAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEPARIAC
21432232NS4A0.00240.0333900.43330.35560.34440.4444LAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEPARIACVLIVVFLLLV
21432242NS4A0.00210.03001000.48000.42000.37000.5000LAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEPARIACVLIVVFLLLVVLIPEPEKQR
21442213NS4A0.00310.0429700.42860.34290.37140.4857AVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLS
21442223NS4A0.00270.0375800.43750.33750.35000.4625AVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEPARIACV
21442233NS4A0.00240.0333900.43330.36670.35560.4556AVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEPARIACVLIVVFLLLVV
21452214NS4A0.00310.0429700.44290.35710.37140.5000VLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSE
21452224NS4A0.00270.0375800.43750.35000.35000.4750VLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEPARIACVL
21452234NS4A0.00240.0333900.44440.37780.35560.4667VLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEPARIACVLIVVFLLLVVL
21462215NS4A0.00310.0429700.45710.35710.35710.4857LMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEI
21462225NS4A0.00270.0375800.45000.35000.33750.4625LMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEPARIACVLI
21462235NS4A0.00240.0333900.45560.38890.34440.4667LMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEPARIACVLIVVFLLLVVLI
21472216NS4A0.00310.0429700.44290.35710.34290.4857MRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIE
21472226NS4A0.00270.0375800.43750.36250.32500.4625MRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEPARIACVLIV
21472236NS4A0.00240.0333900.45560.40000.35560.4778MRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEPARIACVLIVVFLLLVVLIP
21482217NS4A0.00310.0429700.45710.35710.35710.4857RAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEP
21482227NS4A0.00270.0375800.43750.36250.32500.4625RAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEPARIACVLIVV
21482237NS4A0.00240.0333900.46670.41110.36670.4889RAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEPARIACVLIVVFLLLVVLIPE
21492218NS4A0.00310.0429700.45710.35710.35710.4857AETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEPA
21492228NS4A0.00270.0375800.45000.36250.33750.4625AETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEPARIACVLIVVF
21492238NS4A0.00240.0333900.47780.42220.37780.5000AETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEPARIACVLIVVFLLLVVLIPEP
21502219NS4A0.00310.0429700.45710.34290.35710.4714ETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEPAR
21502229NS4A0.00270.0375800.45000.36250.33750.4625ETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEPARIACVLIVVFL
21502239NS4A0.00240.0333900.48890.42220.37780.5000ETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEPARIACVLIVVFLLLVVLIPEPE
21512220NS4A0.00310.0429700.45710.34290.35710.4714TGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEPARI
21512230NS4A0.00270.0375800.45000.36250.33750.4625TGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEPARIACVLIVVFLL
21512240NS4A0.00240.0333900.48890.42220.36670.5000TGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEPARIACVLIVVFLLLVVLIPEPEK
21522221NS4A0.00310.0429700.47140.35710.35710.4857GSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEPARIA
21522231NS4A0.00270.0375800.45000.36250.33750.4625GSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEPARIACVLIVVFLLL
21532222NS4A0.00310.0429700.45710.34290.34290.4714SRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEPARIAC
21532232NS4A0.00270.0375800.45000.36250.33750.4500SRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEPARIACVLIVVFLLLV
21542223NS4A0.00310.0429700.45710.34290.32860.4714RPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEPARIACV
21542233NS4A0.00270.0375800.45000.37500.33750.4625RPYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEPARIACVLIVVFLLLVV
21552224NS4A0.00310.0429700.44290.35710.31430.4714PYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEPARIACVL
21552234NS4A0.00270.0375800.45000.38750.32500.4625PYKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEPARIACVLIVVFLLLVVL
21562225NS4A0.00310.0429700.45710.35710.31430.4714YKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEPARIACVLI
21562235NS4A0.00270.0375800.46250.40000.33750.4750YKAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEPARIACVLIVVFLLLVVLI
21572226NS4A0.00310.0429700.44290.37140.30000.4714KAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEPARIACVLIV
21572236NS4A0.00270.0375800.46250.41250.33750.4875KAAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEPARIACVLIVVFLLLVVLIP
21582227NS4A0.00200.0286700.44290.37140.30000.4714AAAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEPARIACVLIVV
21592228NS4A0.00200.0286700.45710.37140.31430.4714AAAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEPARIACVLIVVF
21602219NS4A0.00240.0333600.45000.35000.31670.4833AAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEPAR
21602229NS4A0.00200.0286700.44290.37140.30000.4714AAQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEPARIACVLIVVFL
21612230NS4A0.00200.0286700.45710.38570.31430.4857AQLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEPARIACVLIVVFLL
21622231NS4A0.00200.0286700.45710.38570.31430.4857QLPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEPARIACVLIVVFLLL
21632232NS4A0.00100.0143700.47140.40000.32860.4857LPETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEPARIACVLIVVFLLLV
21642233NS4A0.00100.0143700.45710.40000.34290.4857PETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEPARIACVLIVVFLLLVV
21652234NS4A0.00100.0143700.45710.40000.32860.4857ETLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEPARIACVLIVVFLLLVVL
21662235NS4A0.00100.0143700.45710.41430.32860.5000TLETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEPARIACVLIVVFLLLVVLI
21682227NS4A0.00120.0167600.43330.36670.30000.4833ETIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEPARIACVLIVV
21692228NS4A0.00120.0167600.43330.35000.31670.4833TIMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEPARIACVLIVVF
21702229NS4A0.00120.0167600.41670.35000.31670.4833IMLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEPARIACVLIVVFL
21712230NS4A0.00120.0167600.43330.35000.33330.4833MLLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEPARIACVLIVVFLL
21722231NS4A0.00120.0167600.41670.35000.31670.4833LLGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEPARIACVLIVVFLLL
21732232NS4A0.00120.0167600.41670.35000.31670.4667LGLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEPARIACVLIVVFLLLV
21742233NS4A0.00120.0167600.41670.36670.33330.4833GLLGTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEPARIACVLIVVFLLLVV
21772226NS4A0.00140.0200500.40000.36000.30000.4800GTVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEPARIACVLIV
21782227NS4A0.00140.0200500.38000.36000.28000.4800TVSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEPARIACVLIVV
21792228NS4A0.00140.0200500.40000.36000.30000.4800VSLGIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEPARIACVLIVVF
21822231NS4A0.00140.0200500.42000.40000.32000.4800GIFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEPARIACVLIVVFLLL
21832232NS4A0.00140.0200500.42000.40000.32000.4600IFFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEPARIACVLIVVFLLLV
21842233NS4A0.00140.0200500.40000.42000.34000.4800FFVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEPARIACVLIVVFLLLVV
21852234NS4A0.00140.0200500.40000.42000.34000.4800FVLMRNKGIGKMGFGMVTLGASAWLMWLSEIEPARIACVLIVVFLLLVVL
21862235NS4A0.00140.0200500.42000.44000.36000.4800VLMRNKGIGKMGFGMVTLGASAWLMWLSEIEPARIACVLIVVFLLLVVLI
21882227NS4A0.00180.0250400.40000.40000.32500.4500MRNKGIGKMGFGMVTLGASAWLMWLSEIEPARIACVLIVV
21892228NS4A0.00180.0250400.40000.37500.32500.4250RNKGIGKMGFGMVTLGASAWLMWLSEIEPARIACVLIVVF
21902229NS4A0.00180.0250400.40000.40000.32500.4500NKGIGKMGFGMVTLGASAWLMWLSEIEPARIACVLIVVFL
21922231NS4A0.00180.0250400.40000.40000.32500.4500GIGKMGFGMVTLGASAWLMWLSEIEPARIACVLIVVFLLL
21932232NS4A0.00180.0250400.40000.40000.32500.4250IGKMGFGMVTLGASAWLMWLSEIEPARIACVLIVVFLLLV
22032222NS4A0.00000.0000200.40000.25000.30000.3500LGASAWLMWLSEIEPARIAC
22072226NS4A0.00000.0000200.40000.30000.30000.2500AWLMWLSEIEPARIACVLIV
22082227NS4A0.00000.0000200.40000.30000.30000.2500WLMWLSEIEPARIACVLIVV
22102229NS4A0.00000.0000200.40000.35000.30000.3000MWLSEIEPARIACVLIVVFL
22122231NS4A0.00000.0000200.40000.35000.30000.3000LSEIEPARIACVLIVVFLLL
23162335NS4B0.00000.0000200.40000.35000.30000.4000TPAVQHAVTTSYNNYSLMAM
23172326NS4B0.00000.0000100.30000.30000.20000.3000PAVQHAVTTS
23182323NS4B0.00000.000060.16670.16670.16670.1667AVQHAV
23182327NS4B0.00000.0000100.20000.30000.20000.3000AVQHAVTTSY
23182337NS4B0.00000.0000200.35000.25000.30000.3000AVQHAVTTSYNNYSLMAMAT
23192328NS4B0.00000.0000100.20000.30000.20000.3000VQHAVTTSYN
23192338NS4B0.00000.0000200.40000.30000.30000.3500VQHAVTTSYNNYSLMAMATQ
24182427NS4B0.00000.0000100.30000.30000.20000.3000VVTDIDTMTI
24192428NS4B0.00000.0000100.30000.20000.20000.2000VTDIDTMTID
24222427NS4B0.00000.000060.16670.00000.16670.0000IDTMTI
24232428NS4B0.00000.000060.33330.00000.00000.0000DTMTID
24532458NS4B0.00000.000060.33330.00000.16670.0000TAWGWG
24532462NS4B0.00000.0000100.40000.30000.20000.3000TAWGWGEAGA
24542459NS4B0.00000.000060.33330.16670.16670.1667AWGWGE
27032708NS50.00000.000060.33330.16670.16670.1667YTSTMM
27042709NS50.00000.000060.33330.16670.16670.1667TSTMME
27052710NS50.00000.000060.33330.16670.16670.1667STMMET
34033412NS50.00000.0000100.30000.00000.20000.0000STQVRYLGEE
34043413NS50.00000.0000100.30000.00000.20000.0000TQVRYLGEEG
34053414NS50.00000.0000100.40000.00000.20000.0000QVRYLGEEGS
34083413NS50.00000.000060.33330.00000.16670.0000YLGEEG

aa: amino acid; DENV: dengue virus; JEV: Japanese encephalitis virus; NS: non-structural; pr: precursor; WNV: West Nile virus; YFV: yellow fever virus.

a K-mer is the protein fragment’s length in amino acids.

Note: None of the peptides had any homology with chikungunya virus.

Table 3

The number of Zika virus protein fragments selected as lead candidates for developing a serological test

ProteinNo. of protein fragments
6-mer10-mer20-mer30-mer40-mer50-mer60-mer70-mer80-mer90-mer100-merTotal
Capsid C1000810600016
prM310000000004
E0605700000018
NS1310000000004
NS2A41070340000028
NS2B1750000000013
NS3000000000000
NS4A00505142444383228190
NS4B5630000000014
NS5430000000007
Total213420523192450383228294

E: envelope; NS; non-structural; prM; precursor membrane.

Note: mer refers to the amino acids length of the protein fragment. Candidate proteins were selected based on identity between Zika virus and other viruses, within-Zika virus polymorphism, and protein structure.

aa: amino acid; DENV: dengue virus; JEV: Japanese encephalitis virus; NS: non-structural; pr: precursor; WNV: West Nile virus; YFV: yellow fever virus. a K-mer is the protein fragment’s length in amino acids. Note: None of the peptides had any homology with chikungunya virus. E: envelope; NS; non-structural; prM; precursor membrane. Note: mer refers to the amino acids length of the protein fragment. Candidate proteins were selected based on identity between Zika virus and other viruses, within-Zika virus polymorphism, and protein structure. As Zika virus infection is associated with birth defects that are not seen in other flavivirus infections, we compared identity and polymorphism of proteins between flaviviruses. Overall, the level of identity between Zika virus and other flaviviruses is similar to the level of identity seen when comparing other flaviviruses with each other (available from the corresponding author). In contrast, one region (amino acid positions 430–500 in the proteome) in the envelope protein shows both low identity between Zika virus and other flaviviruses and low polymorphism within Zika virus (Fig. 2) and the relative polymorphism of NS2A and NS2B is on average 53.6% and 69.5% lower in Zika virus than in other flaviviruses, respectively (Fig. 4).
Fig. 4

Normalized within-species polymorphism for each gene of each virus

Normalized within-species polymorphism for each gene of each virus E: envelope; NS; non-structural; prM; precursor membrane. Notes: The proportion of polymorphic sites of each gene is normalized by the highest proportion of polymorphic sites for each virus. Protein identity between dengue and Zika viruses is negatively associated with polymorphism within the dengue virus proteins (P-values < 0.01 for all dengue serotypes; Fig. 5). This result can be explained by so-called negative selection, i.e. protein regions under stronger selective constraints tend to be more conserved and have higher identity between species and lower polymorphism within species. We did not observe a similar association for within-Zika virus polymorphism, which might be due to fewer strains analysed and/or smaller effective size of the global Zika virus population from which sequences were sampled, resulting in lower selection efficiency.
Fig. 5

Dengue virus polymorphism versus identity with Zika virus

Dengue virus polymorphism versus identity with Zika virus E: envelope; NS; non-structural; prM; precursor membrane. Notes: 50-mers across the Dengue virus proteome were analysed, using a sliding window approach. Dengue virus polymorphism is negatively associated with the identity between dengue virus and Zika virus.

Discussion

Here we identified regions within the Zika virus proteome that have low identity with other viruses and low within-species polymorphism. These regions may be used to develop new serological diagnostic tests to detect Zika virus infection. However, for some of the identified regions, their antigenic properties are unknown and, therefore, these regions would first need to be evaluated for such properties. The regions identified as antigenic could then be used for developing a peptide microarray, where a collection of identified peptides are displayed on a surface. Antibodies generated during a previous Zika virus infection will then be able to bind to these displayed peptides. The read-out of the microarray is the fluorescent signal generated by fluorescence-coupled secondary antibodies that have bound to the serum antibody–peptide complexes. An advantage of assessing multiple peptides simultaneously in one test is that individual peptides do not need to generate a strong signal, since the intensities of signals of all different antibody–peptide complexes can be incorporated into a composite signal. Through statistical modelling the signal generated can be used to distinguish Zika virus infection and other infections. Microarrays also have a greater potential to identify prior virus infections than neutralization-based assays, because microarrays can detect a broader range of antibodies than only antibodies that neutralize the virus and protect against infections. Peptide microarrays have been used to differentiate between serological responses to closely related bacterial pathogens and to detect previous viral infections. The computational selection strategy used here represents a targeted approach, which reduces the number of potential candidate peptides. These peptides could be used for creating a peptide–antibody signature for a given viral infection. Once the signature is identified, a diagnostic test employing only the most important peptides contributing to that signature can be designed and produced. While our computational analysis of k-mers focused on linear epitopes, specific and sensitive linear epitopes together may be sufficient to distinguish different arboviruses. Moreover, depending on how a serological diagnostic test is produced, some of the longer k-mers might fold with sufficient similarity to their native folding to present conformational epitopes. Our analysis showed that NS1 protein polymorphism is low. Therefore, using peptides from the NS1 protein for diagnostic test might result in a high-sensitivity test for detecting antibodies against Zika virus from different geographical locations. On the contrary, the identity of NS1 protein across flaviviruses is not particularly low compared to other proteins (third highest among 10 proteins), suggesting that NS1 is not the top candidate protein for low cross-reactivity. Recently, Euroimmun AG (Lübeck, Germany) developed a Zika virus ELISA for immunoglobulins (Ig)M and IgG, based on the NS1 protein. Preliminary results show that the test is Zika virus specific., However, the small sample size, the fact that the samples were not from regions with endemic dengue and the lack of samples from patients with different stages of infection weaken the conclusion., Moreover, because each diagnostic test has its advantages and disadvantages, having multiple approaches available is helpful for providing an accurate diagnosis. A sensitive and specific diagnostic test detecting several arbovirus infections simultaneously would be valuable, so that only one assay is required to diagnose active and previous flavivirus infection(s). While we designed the sequence analysis for specificity and sensitivity of detection of Zika virus infection, the same type of analysis could be used for identifying specific and sensitive markers for each arbovirus. By including specific and sensitive markers from all arboviruses in the same peptide microarray, the microarray has the potential to detect several arbovirus infections simultaneously. To further dissect the molecular mechanism leading to the Zika virus sequelae not seen with other flaviviruses, the protein fragments presented in the candidate list may be useful. The low polymorphisms in NS2A and NS2B proteins might be good candidates to start investigating the possible molecular link between Zika virus and microcephaly and Guillain–Barré syndrome. Peptide-sequence identity is unlikely to fully predict cross-reactivity due to other factors, such as glycosylation. Nonetheless, this analysis based on publicly available sequences provides a step towards the development of a serological test that can distinguish previous Zika virus and co-circulating arbovirus infections.
  25 in total

1.  Reactivity of serum samples from patients with a flavivirus infection measured by immunofluorescence assay and ELISA.

Authors:  Penelopie Koraka; Herve Zeller; Matthias Niedrig; Albert D M E Osterhaus; Jan Groen
Journal:  Microbes Infect       Date:  2002-10       Impact factor: 2.700

2.  Use of recombinant fusion proteins and monoclonal antibodies to define linear and discontinuous antigenic sites on the dengue virus envelope glycoprotein.

Authors:  F Megret; J P Hugnot; A Falconar; M K Gentry; D M Morens; J M Murray; J J Schlesinger; P J Wright; P Young; M H Van Regenmortel
Journal:  Virology       Date:  1992-04       Impact factor: 3.616

3.  Predicting antigenic determinants in proteins: looking for unidimensional solutions to a three-dimensional problem?

Authors:  M H Van Regenmortel; J L Pellequer
Journal:  Pept Res       Date:  1994 Jul-Aug

4.  Pushing the Envelope: Dengue Viral Membrane Coaxed into Shape by Molecular Simulations.

Authors:  Jan K Marzinek; Daniel A Holdbrook; Roland G Huber; Chandra Verma; Peter J Bond
Journal:  Structure       Date:  2016-07-07       Impact factor: 5.006

5.  Zika virus infections imported to Italy: clinical, immunological and virological findings, and public health implications.

Authors:  Lorenzo Zammarchi; Giulia Stella; Antonia Mantella; Dario Bartolozzi; Dennis Tappe; Stephan Günther; Lisa Oestereich; Daniel Cadar; César Muñoz-Fontela; Alessandro Bartoloni; Jonas Schmidt-Chanasit
Journal:  J Clin Virol       Date:  2014-12-12       Impact factor: 3.168

6.  High specificity of a novel Zika virus ELISA in European patients after exposure to different flaviviruses.

Authors:  Daniela Huzly; Ingeborg Hanselmann; Jonas Schmidt-Chanasit; Marcus Panning
Journal:  Euro Surveill       Date:  2016-04-21

7.  Genetic and serologic properties of Zika virus associated with an epidemic, Yap State, Micronesia, 2007.

Authors:  Robert S Lanciotti; Olga L Kosoy; Janeen J Laven; Jason O Velez; Amy J Lambert; Alison J Johnson; Stephanie M Stanfield; Mark R Duffy
Journal:  Emerg Infect Dis       Date:  2008-08       Impact factor: 6.883

8.  Natural selection of protein structural and functional properties: a single nucleotide polymorphism perspective.

Authors:  Jinfeng Liu; Yan Zhang; Xingye Lei; Zemin Zhang
Journal:  Genome Biol       Date:  2008-04-08       Impact factor: 13.583

9.  Virus Variation Resource--recent updates and future directions.

Authors:  J Rodney Brister; Yiming Bao; Sergey A Zhdanov; Yuri Ostapchuck; Vyacheslav Chetvernin; Boris Kiryutin; Leonid Zaslavsky; Michael Kimelman; Tatiana A Tatusova
Journal:  Nucleic Acids Res       Date:  2013-12-04       Impact factor: 16.971

Review 10.  Background review for diagnostic test development for Zika virus infection.

Authors:  Rémi N Charrel; Isabelle Leparc-Goffart; Suzan Pas; Xavier de Lamballerie; Marion Koopmans; Chantal Reusken
Journal:  Bull World Health Organ       Date:  2016-08-01       Impact factor: 9.408

View more
  21 in total

1.  Structural basis for Zika envelope domain III recognition by a germline version of a recurrent neutralizing antibody.

Authors:  Shannon R Esswein; Harry B Gristick; Andrea Jurado; Avery Peace; Jennifer R Keeffe; Yu E Lee; Alisa V Voll; Mohsan Saeed; Michel C Nussenzweig; Charles M Rice; Davide F Robbiani; Margaret R MacDonald; Pamela J Bjorkman
Journal:  Proc Natl Acad Sci U S A       Date:  2020-04-22       Impact factor: 11.205

2.  Maternally Acquired Zika Antibodies Enhance Dengue Disease Severity in Mice.

Authors:  Angela M Fowler; William W Tang; Matthew P Young; Anila Mamidi; Karla M Viramontes; Melanie D McCauley; Aaron F Carlin; Robert T Schooley; Jesica Swanstrom; Ralph S Baric; Jennifer Govero; Michael S Diamond; Sujan Shresta
Journal:  Cell Host Microbe       Date:  2018-11-14       Impact factor: 21.023

3.  A plant-produced vaccine protects mice against lethal West Nile virus infection without enhancing Zika or dengue virus infectivity.

Authors:  Huafang Lai; Amber M Paul; Haiyan Sun; Junyun He; Ming Yang; Fengwei Bai; Qiang Chen
Journal:  Vaccine       Date:  2018-02-26       Impact factor: 3.641

4.  Development of Zika Virus Serological Testing Strategies in New York State.

Authors:  William T Lee; Susan J Wong; Karen E Kulas; Alan P Dupuis; Anne F Payne; Laura D Kramer; Amy B Dean; Kirsten St George; Jennifer L White; Jamie N Sommer; Michel Ledizet; Ronald J Limberger
Journal:  J Clin Microbiol       Date:  2018-02-22       Impact factor: 5.948

5.  Structural landscape of the complete genomes of dengue virus serotypes and other viral hemorrhagic fevers.

Authors:  Riccardo Delli Ponti; Marek Mutwil
Journal:  BMC Genomics       Date:  2021-05-17       Impact factor: 3.969

6.  Diagnosis of Zika Virus Infection by Peptide Array and Enzyme-Linked Immunosorbent Assay.

Authors:  Nischay Mishra; Adrian Caciula; Adam Price; Riddhi Thakkar; James Ng; Lokendra V Chauhan; Komal Jain; Xiaoyu Che; Diego A Espinosa; Magelda Montoya Cruz; Angel Balmaseda; Eric H Sullivan; Jigar J Patel; Richard G Jarman; Jennifer L Rakeman; Christina T Egan; Chantal B E M Reusken; Marion P G Koopmans; Eva Harris; Rafal Tokarz; Thomas Briese; W Ian Lipkin
Journal:  mBio       Date:  2018-03-06       Impact factor: 7.867

Review 7.  Advanced "lab-on-a-chip" to detect viruses - Current challenges and future perspectives.

Authors:  Jianjian Zhuang; Juxin Yin; Shaowu Lv; Ben Wang; Ying Mu
Journal:  Biosens Bioelectron       Date:  2020-05-12       Impact factor: 10.618

8.  T cell immunity rather than antibody mediates cross-protection against Zika virus infection conferred by a live attenuated Japanese encephalitis SA14-14-2 vaccine.

Authors:  Ran Wang; Zida Zhen; Lance Turtle; Baohua Hou; Yueqi Li; Na Wu; Na Gao; Dongying Fan; Hui Chen; Jing An
Journal:  Appl Microbiol Biotechnol       Date:  2020-06-15       Impact factor: 4.813

9.  A Label and Probe-Free Zika Virus Immunosensor Prussian Blue@carbon Nanotube-Based for Amperometric Detection of the NS2B Protein.

Authors:  Bárbara V M Silva; Marli T Cordeiro; Marco A B Rodrigues; Ernesto T A Marques; Rosa F Dutra
Journal:  Biosensors (Basel)       Date:  2021-05-16

10.  Antibody responses to Zika virus proteins in pregnant and non-pregnant macaques.

Authors:  Anna S Heffron; Emma L Mohr; David Baker; Amelia K Haj; Connor R Buechler; Adam Bailey; Dawn M Dudley; Christina M Newman; Mariel S Mohns; Michelle Koenig; Meghan E Breitbach; Mustafa Rasheed; Laurel M Stewart; Jens Eickhoff; Richard S Pinapati; Erica Beckman; Hanying Li; Jigar Patel; John C Tan; David H O'Connor
Journal:  PLoS Negl Trop Dis       Date:  2018-11-27
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.