| Literature DB >> 25056894 |
Matthew Cotten1, Velislava Petrova1, My V T Phan2, Maia A Rabaa3, Simon J Watson1, Swee Hoe Ong1, Paul Kellam4, Stephen Baker5.
Abstract
UNLABELLED: Norovirus is a highly transmissible infectious agent that causes epidemic gastroenteritis in susceptible children and adults. Norovirus infections can be severe and can be initiated from an exceptionally small number of viral particles. Detailed genome sequence data are useful for tracking norovirus transmission and evolution. To address this need, we have developed a whole-genome deep-sequencing method that generates entire genome sequences from small amounts of clinical specimens. This novel approach employs an algorithm for reverse transcription and PCR amplification primer design using all of the publically available norovirus sequence data. Deep sequencing and de novo assembly were used to generate norovirus genomes from a large set of diarrheal patients attending three hospitals in Ho Chi Minh City, Vietnam, over a 2.5-year period. Positive-selection analysis and direct examination of protein changes in the virus over time identified codons in the regions encoding proteins VP1, p48 (NS1-2), and p22 (NS4) under positive selection and expands the known targets of norovirus evolutionary pressure. IMPORTANCE: The high transmissibility and rapid evolutionary rate of norovirus, combined with a short-lived host immune responses, are thought to be the reasons why the virus causes the majority of pediatric viral diarrhea cases. The evolutionary patterns of this RNA virus have been described in detail for only a portion of the virus genome and never for a virus from a detailed urban tropical setting. We provide a detailed sequence description of the noroviruses circulating in three Ho Chi Minh City hospitals over a 2.5-year period. This study identified patterns of virus change in known sites of host immune response and identified three additional regions of the virus genome under selection that were not previously recognized. In addition, the method described here provides a robust full-genome sequencing platform for community-based virus surveillance.Entities:
Mesh:
Substances:
Year: 2014 PMID: 25056894 PMCID: PMC4178781 DOI: 10.1128/JVI.01333-14
Source DB: PubMed Journal: J Virol ISSN: 0022-538X Impact factor: 5.103
FIG 1Primer design and function for full-genome deep-sequencing amplification. (Left panel) Virtual PCR showing the mapping of the designed primers to a norovirus GII.4 genome (GenBank accession no. JQ613552). Colored circles indicate the position of each primer and the number of mismatches; gray bars indicate the predicted sizes of the PCR products. A schematic of the ORF organization of the virus is shown at the top. (Right panel) The PCR products from 14 samples for individual primer pairs for amplicons 1, 2, and 3. Amplification of sample 7 failed. Lanes: c, water control; m, size marker. The sizes of relevant marker bands (in kilobase pairs) are indicated to the right.
Primers used in this study
| Primer | Sequence | Strand | Position | GC_fraction | Norovirus GII genomes (517) | Norovirus genomes (753) | |||
|---|---|---|---|---|---|---|---|---|---|
| 0 MM | 0–3 MM | 0 MM | 0–3 MM | ||||||
| UNP_47 | GTGAATGAAGATGGCGTCTAAC | Plus | 1 | 55.52 | 0.45 | 98 | 100 | 83 | 84 |
| UNP_45 | TCTAACGACGCTTCCGCTG | Plus | 17 | 58.30 | 0.58 | 75 | 96 | 62 | 80 |
| UNP_201R | GCAATGGCCACCTCCTCAT | Minus | 2808 | 57.95 | 0.58 | 97 | 100 | 80 | 85 |
| UNP_226R | TTGGCCTCCTCCTCTTCACA | Minus | 2850 | 58.21 | 0.55 | 92 | 99 | 76 | 82 |
| UNP_339 | GGCAAGAAGCACACAGCC | Plus | 2660 | 57.48 | 0.61 | 88 | 92 | 73 | 78 |
| UNP_1316 | TGGTCCAAGCCACAAGTGG | Plus | 2519 | 58.05 | 0.58 | 11 | 100 | 13 | 89 |
| UNP_82 | GACCTCTGGGACGAGGTTG | Minus | 5150 | 57.41 | 0.63 | 87 | 96 | 70 | 80 |
| UNP_135 | CTCCACCAGGGGCTTGTAC | Minus | 5271 | 57.63 | 0.63 | 89 | 94 | 73 | 78 |
| UNP_2 | GGGAGGGCGATCGCAAT | Plus | 5049 | 57.57 | 0.65 | 88 | 96 | 72 | 79 |
| UNP_23 | TTGTGAATGAAGATGGCGTCGA | Plus | 5079 | 58.53 | 0.45 | 56 | 100 | 42 | 84 |
| UNP_100 | GCCAGTCCAGGAGTCCAA | Minus | 7447 | 56.43 | 0.61 | 74 | 97 | 61 | 83 |
| UNP_44 | GCACGGTTGAGACTGTGC | Minus | 7418 | 57.27 | 0.61 | 84 | 98 | 69 | 82 |
| UNP_20 | CGAGGGGAGTCACGGGT | Minus | 7493 | 58.34 | 0.71 | 86 | 97 | 70 | 83 |
Primer mapping position in the norovirus GII.4 genome (GenBank accession no. JQ613552).
The T was calculated with a Python script that approximates the Breslauer method (59).
All GenBank database entries (July 2014) for norovirus GII (taxonomic identification no. 142786; length, 7,000 to 8,000 nt; 517 entries).
Percentage of norovirus GII genomes (n = 517) showing perfect homology to the primer.
Percentage of norovirus GII genomes (n = 517) showing the target sequence for the primer with up to three mismatches.
All GenBank database entries (July 2014) for norovirus (taxonomic identification no. 122929; length, 7,000 to 8,000 nt; 753 entries).
Percentage of norovirus genomes (n = 753) showing perfect homology to the primer.
Percentage of norovirus genomes (n = 753) showing the target sequence for the primer with up to three mismatches.
GenBank and ENA accession numbers, genetic cluster, and sample collection date
| Virus | GenBank accession no. | ENA accession no. | Cluster | Sample collection date |
|---|---|---|---|---|
| Hu_GII_10116_2009_VNM | ERR212491 | 1 | 9/7/2009 | |
| Hu_GII_10054_2009_VNM | ERR225641 | 1 | 21/5/2009 | |
| Hu_GII_10114_2009_VNM | ERR212490 | 1 | 9/7/2009 | |
| Hu_GII_10313_2010_VNM | ERR217285 | 4 | 22/2/2010 | |
| Hu_GII_30212_2009_VNM | ERR223539 | 5 | 6/10/2009 | |
| Hu_GII_10148_2009_VNM | ERR212498 | 2 | 11/8/2009 | |
| Hu_GII_C2H-18_2011_VNM | ERR225628 | 3 | 30/8/2011 | |
| Hu_GII_10110_2009_VNM | ERR212489 | 1 | 6/7/2009 | |
| Hu_GII_10325_2010_VNM | ERR217290 | 4 | 26/2/2010 | |
| Hu_GII_10101_2009_VNM | ERR212487 | 1 | 25/6/2009 | |
| Hu_GII_10002_2009_VNM | ERR225635 | 1 | 4/5/2009 | |
| Hu_GII_30351_2009_VNM | ERR138007 | 4 | 17/12/2009 | |
| Hu_GII_30448_2010_VNM | ERR223547 | 6 | 29/1/2010 | |
| Hu_GII_30468_2010_VNM | ERR223549 | 5 | 24/2/2010 | |
| Hu_GII_10247_2009_VNM | ERR217278 | 1 | 10/12/2009 | |
| Hu_GII_10193_2009_VNM | ERR138002 | 1 | 05/10/2009 | |
| Hu_GII_20419_2010_VNM | ERR223554 | 5 | 1/2/2010 | |
| Hu_GII_10236_2009_VNM | ERR217280 | 1 | 19/11/2009 | |
| Hu_GII_20088_2009_VNM | ERR223553 | 8 | 28/7/2009 | |
| Hu_GII_20118_2009_VNM | ERR212481 | 1 | 28/8/2009 | |
| Hu_GII_C2H-20_2011_VNM | ERR225629 | 5 | 5/9/2011 | |
| Hu_GII_10173_2009_VNM | ERR212503 | 1 | 11/9/2009 | |
| Hu_GII_10136_2009_VNM | ERR212495 | 1 | 3/8/2009 | |
| Hu_GII_C2033_2010_VNM | ERR212484 | 6 | 28/6/2010 | |
| Hu_GII_20151_2009_VNM | ERR212467 | 1 | 16/9/2009 | |
| Hu_GII_20460_2010_VNM | ERR223530 | 5 | 4/3/2010 | |
| Hu_GII_10199_2009_VNM | ERR217283 | 1 | 20/10/2009 | |
| Hu_GII_C2007_2010_VNM | ERR138011 | 4 | 2/4/2010 | |
| Hu_GII_20066_2009_VNM | ERR212470 | 1 | 14/7/2009 | |
| Hu_GII_20479_2010_VNM | ERR223531 | 5 | 16/3/2010 | |
| Hu_GII_10012_2009_VNM | ERR225637 | 1 | 7/5/2009 | |
| Hu_GII_C2H-24_2011_VNM | ERR225631 | 5 | 16/9/2011 | |
| Hu_GII_10062_2009_VNM | ERR225642 | 1 | 29/5/2009 | |
| Hu_GII_20494_2010_VNM | ERR223534 | 1 | 19/3/2010 | |
| Hu_GII_10079_2009_VNM | ERR212486 | 1 | 11/6/2009 | |
| Hu_GII_C2H-31_2011_VNM | ERR217269 | 3 | 28/9/2011 | |
| Hu_GII_10285_2010_VNM | ERR217288 | 4 | 18/1/2010 | |
| Hu_GII_30399_2010_VNM | ERR138008 | 1 | 11/1/2010 | |
| Hu_GII_10182_2009_VNM | ERR212507 | 1 | 17/9/2009 | |
| Hu_GII_30082_2009_VNM | ERR223537 | 8 | 23/6/2009 | |
| Hu_GII_10158_2009_VNM | ERR212499 | 1 | 21/8/2009 | |
| Hu_GII_10176_2009_VNM | ERR212504 | 1 | 14/9/2009 | |
| Hu_GII_20150_2009_VNM | ERR212466 | 1 | 15/9/2009 | |
| Hu_GII_10204_2009_VNM | ERR217287 | 1 | 29/10/2009 | |
| Hu_GII_10034_2009_VNM | ERR225638 | 1 | 15/5/2009 | |
| Hu_GII_10163_2009_VNM | ERR212501 | 1 | 28/8/2009 | |
| Hu_GII_10075_2009_VNM | ERR225644 | 1 | 9/6/2009 | |
| Hu_GII_10074_2009_VNM | ERR225643 | 1 | 9/6/2009 | |
| Hu_GII_C2H-25_2011_VNM | ERR225632 | 5 | 20/9/2011 | |
| Hu_GII_C2H-27_2011_VNM | ERR225633 | 5 | 21/9/2011 | |
| Hu_GII_20486_2010_VNM | ERR223532 | 7 | 18/3/2010 | |
| Hu_GII_30116_2009_VNM | ERR223538 | 7 | 9/7/2009 | |
| Hu_GII_10108_2009_VNM | ERR212488 | 1 | 2/7/2009 | |
| Hu_GII_30241_2009_VNM | ERR223540 | 1 | 26/10/2009 | |
| Hu_GII_30443_2010_VNM | ERR223546 | 8 | 28/1/2010 | |
| Hu_GII_20092_2009_VNM | ERR212474 | 1 | 31/7/2009 | |
| Hu_GII_20079_2009_VNM | ERR212473 | 1 | 24/7/2009 | |
| Hu_GII_10137_2009_VNM | ERR212496 | 1 | 4/8/2009 | |
| Hu_GII_10051_2009_VNM | ERR225640 | 1 | 21/5/2009 | |
| Hu_GII_C2H-36_2011_VNM | ERR217266 | 3 | 25/10/2011 | |
| Hu_GII_20145_2009_VNM | ERR212464 | 1 | 11/9/2009 | |
| Hu_GII_20188_2009_VNM | ERR138004 | 1 | 7/10/2009 | |
| Hu_GII_20094_2009_VNM | ERR212476 | 2 | 3/8/2009 | |
| Hu_GII_20357_2009_VNM | ERR138005 | 4 | 30/12/2009 | |
| Hu_GII_C2418_2010_VNM | ERR138012 | 4 | 1/11/2010 | |
| Hu_GII_10195_2009_VNM | ERR217284 | 1 | 13/10/2009 | |
| Hu_GII_20067_2009_VNM | ERR212471 | 1 | 15/7/2009 | |
| Hu_GII_C2H-47_2011_VNM | ERR217275 | 5 | 3/11/2011 | |
| Hu_GII_20107_2009_VNM | ERR212477 | 1 | 20/8/2009 | |
| Hu_GII_30473_2010_VNM | ERR223550 | 7 | 1/3/2010 | |
| Hu_GII_10078_2009_VNM | ERR225645 | 1 | 10/6/2009 | |
| Hu_GII_20108_2009_VNM | ERR212478 | 1 | 20/8/2009 | |
| Hu_GII_20154_2009_VNM | ERR212469 | 1 | 16/9/2009 | |
| Hu_GII_30381_2010_VNM | ERR223544 | 5 | 4/1/2010 | |
| Hu_GII_C2H-48_2011_VNM | ERR217271 | 5 | 4/11/2011 | |
| Hu_GII_C2035_2010_VNM | ERR138006 | 4 | 28/6/2010 | |
| Hu_GII_10127_2009_VNM | ERR212492 | 1 | 17/7/2009 | |
| Hu_GII_10194_2009_VNM | ERR217286 | 1 | 13/10/2009 | |
| Hu_GII_30257_2009_VNM | ERR223541 | 1 | 30/10/2009 | |
| Hu_GII_10129_2009_VNM | ERR212493 | 1 | 20/7/2009 | |
| Hu_GII_C2H-50_2011_VNM | ERR217268 | 3 | 22/11/2011 | |
| Hu_GII_30303_2009_VNM | ERR223542 | 5 | 23/11/2009 | |
| Hu_GII_C2H-55_2011_VNM | ERR212509 | 3 | 25/11/2011 | |
| Hu_GII_C2365_2010_VNM | ERR212485 | 5 | 15/9/2010 | |
| Hu_GII_C2H-44_2011_VNM | ERR217276 | 3 | 31/10/2011 | |
| Hu_GII_10169_2009_VNM | ERR212502 | 1 | 4/9/2009 | |
| Hu_GII_20093_2009_VNM | ERR212475 | 1 | 3/8/2009 | |
| Hu_GII_10255_2009_VNM | ERR217273 | 1 | 15/12/2009 | |
| Hu_GII_C2H-62_2011_VNM | ERR217267 | 3 | 14/12/2011 | |
| Hu_GII_10235_2009_VNM | ERR217274 | 1 | 19/11/2009 | |
| Hu_GII_20146_2009_VNM | ERR212465 | 1 | 11/9/2009 | |
| Hu_GII_20123_2009_VNM | ERR212479 | 1 | 1/9/2009 | |
| Hu_GII_20370_2010_VNM | ERR212461 | 5 | 6/1/2010 | |
| Hu_GII_C2H-45_2011_VNM | ERR217277 | 5 | 2/11/2011 | |
| Hu_GII_10183_2009_VNM | ERR217289 | 1 | 22/9/2009 | |
| Hu_GII_20069_2009_VNM | ERR212472 | 1 | 16/7/2009 | |
| Hu_GII_C2H-43_2011_VNM | ERR217281 | 3 | 14/10/2011 | |
| Hu_GII_10145_2009_VNM | ERR212497 | 1 | 7/8/2009 | |
| Hu_GII_C2H-52_2011_VNM | ERR212508 | 3 | 24/11/2011 | |
| Hu_GII_10160_2009_VNM | ERR212500 | 1 | 26/8/2009 | |
| Hu_GII_10223_2009_VNM | ERR217272 | 1 | 6/11/2009 | |
| Hu_GII_10003_2009_VNM | ERR225636 | 1 | 5/5/2009 | |
| Hu_GII_30192_2010_VNM | ERR138003 | 1 | 21/9/2009 | |
| Hu_GII_20493_2010_VNM | ERR223533 | 5 | 19/3/2010 | |
| Hu_GII_10131_2009_VNM | ERR212494 | 1 | 22/7/2009 | |
| Hu_GII_10238_2009_VNM | ERR217282 | 1 | 19/11/2009 | |
| Hu_GII_30400_2010_VNM | ERR223545 | 5 | 11/1/2010 | |
| Hu_GII_20153_2009_VNM | ERR212468 | 1 | 16/9/2009 | |
| Hu_GII_10037_2009_VNM | ERR225639 | 1 | 15/5/2009 | |
| Hu_GII_20144_2009_VNM | ERR212463 | 1 | 10/9/2009 | |
| Hu_GII_C2H-39_2011_VNM | ERR217270 | 5 | 24/10/2011 | |
| Hu_GII_20122_2009_VNM | ERR212480 | 1 | 31/8/2009 |
GenBank accession number, accessible at http://www.ncbi.nlm.nih.gov/nuccore/.
European Nucleotide Archive (ENA) accession number, accessible at http://www.ebi.ac.uk/ena/.
Genetic cluster as defined in Fig. 2.
Day/month/year of sample collection.
FIG 2ML phylogenetic tree of the 112 HCMC GII genomes in this study and 89 GII.4 genomes from the same HCMC cohort that were sequenced separately and made publically available in the GenBank database plus selected global reference genomes. The eight phylogenetic clusters of norovirus identified in this study are marked with colored bars. Bootstrap support of ≥0.85 at key nodes is indicated with asterisks. The tree is midpoint rooted for clarity, and all horizontal branch lengths are drawn to a scale of nucleotide substitutions per site.
Phylogenetic clusters identified in this study
| Phylogenetic cluster | Closest genome | Genotype by RIVM algorithm | No. of genomes | Frequency (%) in 201 genomes |
|---|---|---|---|---|
| 1 | NV_GII_VNM_2009_KC175360 | GII.P4.DH06b_GII.4.DH06b | 140 | 69.65 (67.6) |
| NV_GII_VNM_2009_KC175395 | ||||
| 2 | NV_GII_VNM_2009_KC175402 | GII.P4.DH06b_GII.4.DH06b | 2 | |
| 3 | NV_GII4_TW_2007_JN400615 | GII.P4.DH06b_GII.4.DH06b | 12 | |
| NV_GII4_Ehime_2007_AB541241 | ||||
| 4 | NV_GII_VNM_2010_KC175383 | GII.P4.NO09_GII.4.NO09 | 20 | 9.95 (9.5) |
| 5 | NV_Pune_2007_EU921389 | GII.P21_GII.3 | 19 | 9.45 (10.2) |
| 6 | NV_Pune_2007_EU921389 | GII.Pg_GII.12 | 2 | 0.99 (0.6) |
| NV_GII2_12_Wahroonga_2009_JQ613568 | ||||
| 7 | NV_GII_Gifu_1999_AB084071 (<50%) | GII.P7_GII.6 | 3 | 1.49 (2.5) |
| 8 | NV_GII_Gifu_1999_AB084071 (<50%) | GII.P7_GII.6 | 3 | 1.49 (2.5) |
Phylogenetic classification (see Fig. 2).
Based on the number of reads mapped.
Based on the RIVM algorithm (47).
The values in parentheses are genotype frequency percentages determined by My et al. (31).
PCR and genome sequencing success by norovirus genotype
| Genotype | No. of samples | Amplicon 1 | Amplicon 2 | Amplicon 3 | No. of genomes | (%) Successful |
|---|---|---|---|---|---|---|
| GII.4 | 60 | 55 | 55 | 57 | 55 | 92 |
| GII (non-GII.4) | 58 | 48 | 45 | 53 | 45 | 74 |
| GI | 10 | 7 | 4 | 4 | 2 | 20 |
| GII.2 | 5 | 4 | 1 | 5 | 2 | 40 |
| GII.3 | 34 | 26 | 27 | 27 | 26 | 77 |
| GII.6 | 8 | 8 | 7 | 8 | 7 | 88 |
| GII.7 | 2 | 2 | 0 | 2 | 0 | 0 |
| GII.9 | 1 | 1 | 0 | 1 | 1 | 100 |
| GII.12 | 2 | 1 | 2 | 2 | 1 | 50 |
| GII.13 | 6 | 5 | 6 | 6 | 5 | 83 |
Sample genotype previously determined (30, 31).
Successful RT-PCR amplification of sufficient DNA (ca. 0.4 μg) for Illumina library preparation. The values shown are the number of successful amplicons generated.
Yield of >95% of the full genome.
FIG 3Temporal appearance of the HCMC norovirus GII genotypes during the study period. Genomes were stratified by genotype (from Fig. 2), color coded, and plotted by date of sample isolation.
Evolutionary rates
| Sequence set and genomic region | Mean rate (95% HPD) | Substitution model |
|---|---|---|
| Cluster 1, GII.P4 Den Haag 2006b_GII.4 Den Haag 2006b | ||
| Whole genome | 6.15 (5.39–6.86) | SRD06 |
| ORF1 | 5.94 (5.04–6.94) | SRD06 |
| ORF2 | 5.69 (4.54–6.90) | SRD06 |
| ORF3 | 8.99 (6.59–11.6) | SRD06 |
| p48 (NS1-2) | 6.60 (4.83–8.47) | GTR+G |
| NTPase (NS3) | 5.41 (4.04–6.89) | GTR+G |
| p22 (NS4) | 8.21 (5.48–11.11) | GTR+G |
| VPg (NS5) | 5.95 (3.27–8.94) | HKY+G |
| 3CLpro (NS6) | 6.03 (3.57–8.61) | GTR+G |
| RdRp (NS7) | 4.74 (3.50–6.10) | GTR+G |
| Cluster 4, GII.P4 New Orleans 2009_GII.4 New Orleans 2009 | ||
| Whole genome | 5.73 (3.74–7.81) | GTR+G |
| ORF1 | 4.03 (1.77–6.33) | HKY+G |
| ORF2 | 5.60 (0.68–9.82) | HKY+G |
| ORF3 | ||
| Cluster 5, GII.P21_GII.3 | ||
| Whole genome | 5.34 (4.06–6.82) | SRD06 |
| ORF1 | 4.81 (3.45–6.17) | SRD06 |
| ORF2 | 5.99 (3.75–8.39) | SRD06 |
| ORF3 | 7.38 (2.06–13.9) | SRD06 |
Evolutionary rates were measured as 10−3 substitution per site per year. The mean evolutionary rate (10−3 substitution per site per year) and the 95% upper and lower HPD intervals were determined as described in Materials and Methods.
There was insufficient signal for the algorithms to return a reliable evolutionary rate for ORF3 region of sequences from GII.4 cluster 4.
FIG 4Summary of evolutionary rates inferred for the genomic regions of GII.4 cluster 1. Evolutionary rates were estimated as described in Materials and Methods, and mean values are indicated by colored circles, and error bars show 95% confidence intervals. The region of the norovirus genome used for calculation is labeled, and the two regions with rates higher than that of the full genome are in red.
FIG 5Changes in the GII.4 cluster 1 VP1 protein. The protein sequences were aligned, and amino acid differences from the reconstructed ancestral sequence of cluster 1 were determined and marked with vertical colored bars, with the new amino acid residue color coded as shown at the bottom; the gray bar indicates a gap in the query sequence. The sequences were ordered by sample date, with the earliest samples at the bottom of the graph. Functional domains of the VP1 protein are indicated at the top of the graph and include the shell domain and the protruding 1 (P1) and protruding 2 (P2) domains. The locations of blockade epitopes A and E are also indicated (B_A and B_E, respectively). The histogram in the second panel from the top indicates the total number of changes at each position. The protein changes occurring in more than four samples are annotated with the parental amino acid, the position, and the new amino acid. Codons found to be under positive selection by MEME or FUBAR are indicated with red asterisks.
Positive-selection analysis
| Codon position | FUBAR | MEME |
|---|---|---|
| ORF1 (p48 [NS1-2]) 79 | 0.991 | 0.037 |
| ORF2 (VP1) 106 | 0.993 | 0.014 |
| ORF2 (VP1) 298 | 0.984 | >0.05 |
| ORF3 (VP2) 144 | 0.983 | 0.043 |
Codons under positive selection in 140 cluster 1 norovirus genomes sequenced in this study as detected by FUBAR (44) and the MEME (43).
Posterior probability values obtained by FUBAR are shown.
P values obtained by MEME are shown.
The analysis of ORF3 covered the first 247 of the protein's 268 codons.
FIG 6Changes in cluster 1 minor capsid protein VP2 over time. Protein changes were analyzed and are depicted as described in the legend to Fig. 5. The functional domains of the VP2 protein, including the VP1-interacting region (VP1_ID), are marked at the top. The histogram in the second panel from the top indicates the total number of changes at each position. A codon found to be under positive selection by the MEME or FUBAR is indicated with a red asterisk.
FIG 7Changes in cluster 1 p22 (NS4) proteins over time. Protein changes were analyzed and are depicted as described in the legend to Fig. 5. The functional domains of the p22 (NS4) protein, including the MERES domain, are marked at the top. The histogram in the second panel from the top shows the total number of changes at each position.
FIG 8Changes in cluster 1 p48 (NS1-2) proteins over time. Protein changes were analyzed and are depicted as described in the legend to Fig. 5. The functional domains of the p48 (NS1-2) protein, including the transmembrane (TM) domain, are marked at the top. The histogram in the second panel from the top shows the total number of changes at each position. A codon found to be under positive selection by the MEME or FUBAR is indicated with a red asterisk.