Literature DB >> 21529385

Evolution of new genotype of West Nile virus in North America.

Allison R McMullen1, Fiona J May, Li Li, Hilda Guzman, Rudy Bueno, James A Dennett, Robert B Tesh, Alan D T Barrett.   

Abstract

Previous studies of North American isolates of West Nile virus (WNV) during 1999-2005 suggested that the virus had reached genetic homeostasis in North America. However, genomic sequencing of WNV isolates from Harris County, Texas, during 2002-2009 suggests that this is not the case. Three new genetic groups have been identified in Texas since 2005. Spread of the southwestern US genotype (SW/WN03) from the Arizona/Colorado/northern Mexico region to California, Illinois, New Mexico, New York, North Dakota, and the Texas Gulf Coast demonstrates continued evolution of WNV. Thus, WNV continues to evolve in North America, as demonstrated by selection of this new genotype. Continued surveillance of the virus is essential as it continues to evolve in the New World.

Entities:  

Mesh:

Substances:

Year:  2011        PMID: 21529385      PMCID: PMC3321787          DOI: 10.3201/eid1705.101707

Source DB:  PubMed          Journal:  Emerg Infect Dis        ISSN: 1080-6040            Impact factor:   6.883


West Nile virus (WNV) is a mosquito-borne flavivirus belonging to the Japanese encephalitis serogroup and maintained in an enzootic cycle between mosquitoes (primarily Culex spp.) and birds. Mammals such as horses and humans act as dead-end hosts. Most human infections are asymptomatic; West Nile fever develops in ≈20% of infected patients and neuroinvasive disease develops in <1% (). WNV was first isolated in Uganda in 1937 and was generally associated with sporadic outbreaks of mild, febrile illness until the 1990s, when several epidemics of neuroinvasive disease were reported in northern Africa, eastern Europe, and Russia (–). In 1999, WNV was first isolated in North America from human and bird samples during an outbreak of encephalitic disease in New York. After this outbreak, WNV rapidly spread across the United States north to Canada and south to the Caribbean region, Mexico, and Central and South America. By 2002, the original WNV genotype isolated in New York, known as NY99, was displaced by a new genotype, designated the North American (NA) or WN02 genotype (hereafter termed NA/WN02 genotype) (,). This genotype is characterized by 13 conserved nt changes, 1 of which results in an amino acid substitution, V159A, in the envelope (E) protein. The NA/WN02 genotype is believed to have become dominant in North America because of its ability to more efficiently disseminate in mosquitoes than the original NY99 virus genotype (–). Beasley et al. () first identified the NA/WN02 genotype in Texas in 2002, and further studies showed that this genotype had spread throughout the Upper Texas Gulf Coast and to other regions in the United States (). Additional studies examined phenotypic changes in WNV isolates from the Upper Texas Gulf Coast region during 2003 and identified co-circulation of small-plaque, temperature-sensitive, mouse-attenuated and large-plaque, non–temperature-sensitive, mouse-virulent strains (–). Subsequent studies of the E gene of viruses isolated through 2006 suggested that since the emergence of the NA/WN02 genotype, WNV in North America is either genetically homeostatic () or its growth rate is decreasing (). We examined genetic variation in selected WNV strains since 2005 from the Upper Texas Gulf Coast region, in particular, Harris County, Texas, USA (Houston metropolitan area). We report the isolation of genetic variants that demonstrate the continuing evolution of WNV in North America. We also show that the southwestern US genotype first identified in Arizona, Colorado, and northern Mexico in 2003 (termed SW/WN03 genotype) has now spread to the Upper Texas Gulf Coast region.

Materials and Methods

Virus Isolates

Virus isolates were obtained from the World Reference Center for Emerging Viruses and Arboviruses at the University of Texas Medical Branch (UTMB) in Galveston, Texas. All new isolates used in this study were originally made from mosquito pools or the brains of naturally infected birds cultured in Vero cells at UTMB. Each isolate was given a second passage in Vero cells to generate a working stock and stored at –80°C.

Reverse Transcription–PCR

Viral RNA was extracted from 140 μL of infected Vero cell supernatant by using the QIAamp Viral RNA Mini Kit (QIAGEN, Valencia, CA, USA) per the manufacturer’s directions. Full-genome sequencing was performed by consensus overlapping sequencing of PCR products with primers based on the published sequence of WNV NY-99 flamingo 382–99 (GenBank accession no. AF196835). Reverse transcription–PCR was performed by using the Titan One Tube RT-PCR Kit (Roche Applied Science, Indianapolis, IN, USA) (primers and PCR conditions are available by request). PCR products were subjected to electrophoresis on 1% agarose gels and purified by using the QIAquick Gel Extraction Kit (QIAGEN).

Sequencing and Analysis

Purified PCR products were sequenced in both directions by using the Protein Chemistry or Molecular Genomics Core Laboratories at UTMB. Sequences were edited and assembled by using ContigExpress in the VectorNTI program suite (Invitrogen, Carlsbad, CA, USA). Full-length coding sequences were aligned with all published full-length North American WNV isolate sequences available in GenBank (as of November 2010) and isolate WNV IS-98 STD by using MUSCLE in Seaview version 4 (). The final open reading frame (ORF) alignment contained 244 sequences of 10,299 nt (3,433 aa residues). A second alignment was made by using MUSCLE; this aligment contained 33 sequences from the Upper Texas Gulf Coast region. This alignment contained 11,030 nt and contained the entire ORF and portions of the 3′ and 5′ untranslated region (UTR). Phylogenetic trees were inferred using the neighbor-joining (NJ) method in the Phylip package () and the maximum-likelihood (ML) method by using PhyML (). MODELTEST, in conjunction with PAUP, was used to identify generalized time reversible + I + Γ4 as the best-fit nucleotide substitution model to be used in phylogenetic analyses (,). To assess robustness of the phylogenetic methods used, we used the NJ method and 1,000 bootstrap replicates. The ML method used 100 bootstrap replicates for the entire North American alignment and 1,000 bootstrap replicates for the Upper Texas Gulf Coast region alignment. IS-98 STD was used as outgroup for the entire North American WNV alignment, and NY99 was used as outgroup for the Upper Texas Gulf Coast region alignment.

Recombination Detection and Selection Analysis

Screening for recombination was performed on the first 9,999 nt of the North American WNV ORF alignment by using single-break point analysis on the Datamonkey server (–). This screening verified absence of recombination in sequences before running the selection analyses. The first 9,999 nt were selected because of constraints on sequence length by the programs used. Using the ratio of nonsynonymous (dN) to synonymous (dS) nucleotide substitutions, we examined the genome for sites of positive selection. Positive selection was defined as dN>dS and a p value <1.0. Using the Datamonkey web server (,), we used 3 methods to detect site specific nonneutral selection: single-likelihood counting (SLAC), fixed effects likelihood (FEL), and internal FEL (IFEL) (,). BioEdit was used to create datasets for the first 9,999 nt of the ORF and for each gene (capsid [C], premembrane [prM], E, nonstructural protein 1 [NS1], NS2A, NS2B, NS3, NS4A, NS4B, and NS5) for these analyses ().

Results

Viral Isolates

Viruses used in this study were isolated during 2005–2009 from mosquitoes or dead birds collected in Harris County. There were 111 isolates: 14 from 2005, 11 from 2006, 36 from 2007, one from 2008, and 49 from 2009. The genomic sequences of 17 geotemporally representative isolates were determined and compared with other WNV strains from the Upper Texas Gulf Coast region (Harris, Jefferson, and Montgomery Counties) isolated during 2002–2005 and sequenced in our laboratory (Table 1) (,,).
Table 1

West Nile viruses from the Upper Texas Gulf Coast, USA, used to study genotype evolution, 2002–2009

StrainSourceCountyCollection yearGenBank accession no.
TX2002–1HumanUnknown2002DQ164198
TX2002–2HumanUnknown2002DQ164205
TVP8533HumanJefferson2002AY218294
Bird 114Blue jayHarris2002GU827998
Bird1153Mourning doveHarris2003AY712945
Bird1171Great-tailed grackleHarris2003AY712946
v4095Culex quinquefasciatus mosquitoHarris2003GU828002
v4380Cx. quinquefasciatus mosquitoHarris2003GU828001
Bird1881Mourning doveJefferson2003GU828003
Bird1519Blue jayMontgomery2003GU828004
Bird1576Blue jayMontgomery2003GU827999
Bird1175Blue jayHarris2003GU828000
Bird1461Blue jayHarris2003AY712947
v4369Cx. quinquefasciatus mosquitoHarris2003AY712948
Bird3588Blue jayHarris2004DQ164206
TX5058Blue jayHarris2005JF415929
M12214Cx. quinquefasciatus mosquitoHarris2005JF415915
TX5810Common grackleHarris2006JF415916
M6019Cx. quinquefasciatus mosquitoHarris2006JF415930
TX6276Northern mockingbirdHarris2006JF415916
TX6647Blue jayHarris2007JF415917
TX6747Blue jayHarris2007JF415918
M19433Aedes albopictus mosquitoHarris2007JF415919
TX7191Blue jayHarris2007JF415920
TX7558Blue jayHarris2008JF415921
M37012Cx. quinquefasciatus mosquitoHarris2009JF415922
M37906Cx. quinquefasciatus mosquitoHarris2009JF415923
TX7827Blue jayHarris2009JF415924
M38488Ae. albopictus mosquitoHarris2009JF415925
M20140Cx. quinquefasciatus mosquitoHarris2009JF415926
M20141Ae. albopictus mosquitoHarris2009JF415927
M20122Cx. quinquefasciatus mosquitoHarris2009JF415928

Harris County Isolates, 2005–2009

Nucleotide Changes

The genome of WNV is a single-stranded, positive-sense RNA molecule; the NY99 strain contains 11,029 nt. The genome encodes a 5′ UTR (genomic nt 1–96) and a 3′ UTR (genomic nt 10,396–11,029). The UTRs flank a single ORF that encodes 10 proteins; 3 structural proteins (C, prM/M, and E) and 7 nonstructural proteins (NS1, NS2A, NS2B, NS3, NS4A, NS4B, and NS5). When compared with NY99, the prototype WNV strain for North America, the 17 WNV isolates we analyzed in this study had 38–60 nt (0.35%–0.54%) differences; most changes were synonymous. Nine of the 13 conserved nt changes characteristic of the NA/WN02 genotype were found in all newly sequenced isolates (Table 2). One 2006 isolate (TX6276), two 2007 isolates (TX6747 and TX7191), and six of seven 2009 isolates (M37012, M37906, M39488, M20140, M20141, and M20122) encoded a C at nt 660 and 6238, which was identical to that in the NY99 strain. Two isolates (M12214 from 2005 and M19433 from 2007) encoded a C at nt 6426, and 2 isolates (TX5810 and M6019, both from 2006) encoded a U at nt 9352, again identical to the NY99 strain. One 2005 isolate (TX5058) contained a 6-nt (nt 10471–10476) deletion in the 3′ UTR, and one 2007 isolate (TX7191) and two 2009 isolates (M37906 and TX7827) contained a 1-nt deletion at nt position 49/50 in the 5′ UTR.
Table 2

Nucleotide sequence changes in West Nile virus NA/WN02 genotype, Upper Texas Gulf Coast, USA*

StrainYearprM
E
NS2A
NS3
NS4B
NS5
3′ UTR
6601442246637744146480361386238642669967938935210851
NY991999CUCUACCCCCUCA
Bird 1142002UC†UUGUUUUUCUG
M122142005UCUUGUUU · UCUG
TX5058UCUUGUUUUUCUG
TX58102006UCUUGUUUUUC · G
M6019UCUUGUUUUUC · G
TX 6276 · CUUGUU · UUCUG
TX 66472007UCUUGUUUUUCUG
TX6747 · CUUGUU · UUCUG
M19433UCUUGUUU · UCUG
TX7191 · CUUGUU · UUCUG
TX 75582008UCUUGUUUUUCUG
M 370122009 · CUUGUU · UUCUG
M 37906 · CUUGUU · UUCUG
TX 7827UCUUGUUUUUCUG
M 39488 · CUUGUU · UUCUG
M 20140 · CUUGUU · UUCUG
M 20141 · CUUGUU · UUCUG
M20122 · CUUGUU · UUCUG

*PrM, premembrane; E, envelope; NS, nonstructural; UTR, untranslated region. Values indicate nucleotide position within each gene. Dots indicate no change from NY99 isolate.
†Encodes for amino acid substitution E-V159A.

*PrM, premembrane; E, envelope; NS, nonstructural; UTR, untranslated region. Values indicate nucleotide position within each gene. Dots indicate no change from NY99 isolate.
†Encodes for amino acid substitution E-V159A.

Amino Acid Substitutions

Deduced amino acid sequences were compared and substitutions were identified for 41 residues (2 in C, 4 in prM/M, 6 in E, 0 in NS1, 6 in NS2A, 2 in NS2B, 8 in NS3, 2 in NS4A, 4 in NS4B, and 7 in NS5); each isolate contained 3–7 substitutions (Table A1). Eight (C-T109I, E- T70I, E-V159A, E-I460M/L, NS2A-R98G, NS2A-A137V, NS4A-A85T, and NS4B-I240M) of the 41 substitutions were found in >1 isolate. All isolates contained the E-V159A substitution present in the NA/WN02 genotype.
Table A1

Deduced amino acid substitutions for West Nile virus isolates, Upper Texas Gulf Coast, USA*

GenePosition1999
2005
2006
2007
2008
2009
NY99M12214TX5058TX5810M6019TX6276TX6647TX6747M19433TX7191TX7558M37012M37906TX7827M38488M20140M20141M20122
C
39*D · · · N · · · · · · · · · · · · ·
109
T

·
·

·
·
·

·
·
·
·

·

·
·
·
·
I
I
·
PrM/M
88R · · · K · · · · · · · · · · · · ·
122V · · · · · I · · · · · · · · · · ·
140V · · · · · · I · · · · · · · · · ·
156
V

·
·

·
·
·

·
·
·
·

·

·
·
I
·
·
·
·
E
49E · · · · K · · · · · · · · · · · ·
70T · · · · · · · · · · · I · I · · ·
159VAAAAAAAAAAAAAAAAA
431V · · · · · · · · · · · · · · · · F
460I · · · · M · · · · · LL · LLLL
475
N

·
·

·
S
·

·
·
·
·

·

·
·
·
·
·
·
·
NS2A
90M · · · · · V · · · · · · V · · · ·
98R · · GG · · · · · · · · · · · · ·
118Y · · · · · · · · · · · · · · · · H
124IV · · · · · · · · · · · · · · · ·
137A · · · · · · · · · · · · · · VV ·
224
A

V
·

·
·
·

·
·
·
·

·

·
·
·
·
·
·
·
NS2B
41A · · · · · · · · · V · · · · · · ·
99
M

·
·

·
·
·

·
·
·
T

·

·
·
·
·
·
·
·
NS3
106V · · · · · · · · · · · · · A · · ·
160S · · · · · · · A · · · · · · · · ·
162I · · · · · · · · M · · · · · · · ·
213I · · · · · · · · · · · · · M · · ·
355Y · · · · · · · · · · · · H · · · ·
365SG · · · · · · · · · · · · · · ·
486F · L · · · · · · · · · · · · · · ·
562
R

·
·

·
·
·

K
·
·
·

·

·
·
·
·
·
·
·
NS4A
65M · · · · · · T · · · · · · · · · ·
85
A

T
·

·
·
·

T
·
T
·

T

·
·
·
·
·
·
·
NS4B
33L · · · F · · · · · · · · · · · · ·
116T · · · · · · A · · · · · · · · · ·
176I · · · · V · · · · · · · · · · · ·
240
I

·
·

M
M
M

·
·
·
·

·

M
M
·
M
M
M
M
NS521K · · · · · R · · · · · · · · · · ·
91M · · · · · · · V · · · · · · · · ·
314K · · · · · · · R · · · · · · · · ·
395M · · · · · · · · · · · I · · · · ·
422R · K · · · · · · · · · · · · · · ·
640L · · P · · · · · · · · · · · · · ·
661S · · T · · · · · · · · · · · · · ·
860A · · · · · · · · T · · · · · · · ·

*C, capsid; PrM/M, premembrane/membrane; E, envelope, NS nonstructural. Dot indicates no change from NY99 isolate.

Upper Texas Gulf Coast Region Isolates, 2002–2009

WNV was first detected in the Upper Texas Gulf Coast region in 2002 (). During 2002–2004, isolates from the Upper Texas Gulf Coast region were divided genetically into 3 groups (groups 1–3) () and showed 0.30%–0.40% divergence compared with NY99. Isolates from 2005–2009 (groups 4–6; see below for their definitions) have significantly greater divergence (0.40%–0.70%; p<9.4 × 10–9) from NY99 (Table 3). When compared with isolates from 2002–2004 (groups 1–3), we found that recent Upper Texas Gulf Coast isolates from 2005–2009 (groups 4–6) have nucleotide divergence rates ranging from 0.50% to 0.80%. Because of the high number of synonymous nucleotide mutations, the deduced amino acid sequences of all isolates exhibited a higher level of conservation; divergence rates ranged from 0.10% to 0.30% compared with NY99.
Table 3

Percentage nucleotide and amino acid sequence divergence of West Nile virus isolates, Upper Texas Gulf Coast, USA, 2002–2009*

Group†NY99Group 1Group 2Group 3Group 4Group 5Group 6
NY990.1–0.20.1–0.20.10.1–0.20.20.2–0.3
Group 10.3–0.40.1–0.30.1–0.30.1–0.30.2–0.30.2–0.4
Group 20.30.3–0.40.1–0.20.2–0.30.2–0.30.2–0.3
Group 30.30.1–0.40.2–0.30.1–0.30.2–0.30.2–0.3
Group 40.5–0.60.5–0.70.5–0.60.5–0.60.2–0.30.2–0.3
Group 50.5–0.70.5–0.80.4–0.70.4–0.70.7–1.00.3
Group 60.4–0.50.4–0.60.4–0.50.4–0.50.7–0.80.6–0.8

*Amino acid sequence divergence is shown above the diagonal, and nucleotide sequence divergence is shown in below the diagonal.
†Group 1: bird 114 (2002), bird 1171 (2003), bird 1153 (2003); group 2: bird 1519 (2003), v4369 (2003), v4095 (2003), bird 1881 (2003), v4380 (2003); group 3: bird 1576 (2003), bird 1175 (2003), TX2003; group 4: TX6376 (2006), M20141 (2009), M20140 (2009), M37906 (2009), M39488 (2009), M20122 (2009), M37102 (2009); group 5: M12214 (2005), M19433 (2007), TX6647 (2007), TX7558 (2008); group 6: M6019 (2006), TX5810 (2006).

*Amino acid sequence divergence is shown above the diagonal, and nucleotide sequence divergence is shown in below the diagonal.
†Group 1: bird 114 (2002), bird 1171 (2003), bird 1153 (2003); group 2: bird 1519 (2003), v4369 (2003), v4095 (2003), bird 1881 (2003), v4380 (2003); group 3: bird 1576 (2003), bird 1175 (2003), TX2003; group 4: TX6376 (2006), M20141 (2009), M20140 (2009), M37906 (2009), M39488 (2009), M20122 (2009), M37102 (2009); group 5: M12214 (2005), M19433 (2007), TX6647 (2007), TX7558 (2008); group 6: M6019 (2006), TX5810 (2006). With the exception of conserved nucleotide mutations in the NA/WN02 genotype, there were a few nucleotide changes or deduced amino acid substitutions that were shared by >1 isolate from 2002–2004 and 1 of the newly sequenced isolates from 2005–2009. Nucleotide changes at 11 positions were shared between >1 isolate from 2002–2004 and the newly sequenced isolates from 2005–2009, with only 1 aa substitution, NS4B-I240M, found in >1 isolate from both groups.

Phylogenetic Analysis

Phylogenetic trees were generated by NJ and ML analyses by using only the polyprotein sequence of 34 isolates: the 17 newly sequenced isolates, 16 published sequences of isolates from the Upper Texas Gulf Coast region, and NY99 (Figure 1). Both methods produced trees with similar topology. In addition to groups 1, 2, and 3 identified in Upper Texas Gulf Coast region isolates obtained in 2002–2003 (), we identified 3 other phylogenetic groups in this study.
Figure 1

Maximum-likelihood phylogenetic tree of Upper Texas Gulf Coast, USA, West Nile virus isolates, 2002–2009. The tree was inferred from open reading frame sequences of 33 Upper Texas Gulf Coast isolates and NY99 by using PhyML () and rooted with IS-98 STD. The outgroup has been removed. Bootstrap values are for 1,000 replicates and only values >500 are shown. Groups 1–3 were previously identified by May et al. (). Red, isolates sequenced in this study. Scale bar indicates nucleotide substitutions per site.

Maximum-likelihood phylogenetic tree of Upper Texas Gulf Coast, USA, West Nile virus isolates, 2002–2009. The tree was inferred from open reading frame sequences of 33 Upper Texas Gulf Coast isolates and NY99 by using PhyML () and rooted with IS-98 STD. The outgroup has been removed. Bootstrap values are for 1,000 replicates and only values >500 are shown. Groups 1–3 were previously identified by May et al. (). Red, isolates sequenced in this study. Scale bar indicates nucleotide substitutions per site. Group 4 is composed of 6 of 7 isolates from 2009 and 1 isolate from 2006 (TX6276). Group 5 is composed of 4 isolates: 1 from 2005 (M12214), 2 from 2007 (M19433 and TX6647), and 1 from 2008 (TX7558). Group 6 is composed of 2 isolates from 2006 (M6019 and TX5810) and 2 sequenced isolates from 2002. Groups 4 and 5 are supported by high bootstrap values; group 6 has a lower bootstrap value. Within group 4, all 2009 isolates contained an E-I460E substitution. All group 5 isolates contain the amino acid substitution NS4B-A85T, and all isolates in groups 4 and 6 contain the NS4B-I240M substitution. Four isolates did not fall into these 6 groups: TX5058 (2005), TX6747 and TX7191 (2007), and TX7827 (2009). The TX7828 2009 isolate, which did not fall into group 4 with the other 2009 isolates, is the only 2009 isolate sequenced in this study that was isolated from a bird (blue jay). However, we had only 1 isolate from 2008, and WNV activity was low in Harris County in 2008 (R. Bueno and R. Tesh, unpub. data). This finding may have been caused by Hurricane Ike, which hit the Upper Texas Gulf Coast in September 2008. A second phylogenetic analysis was undertaken that used isolates from this study and all published full-length WNV sequences from North America available on GenBank (Figure 2). NJ and ML methods produced trees with similar topology. Within the larger tree, there were analogous groupings of previously and newly sequenced Harris County isolates, as shown in Figure 1. Three of the Harris County groups form distinct clusters of isolates within the NA/WN02 genotype and may represent formation of new genotypes or clusters. Group 1 Harris County isolates (2002–2003) cluster with a grouping of isolates from California from 2003–2008, and group 4 isolates (2006–2009) cluster with several isolates from New York (2008) and 1 isolate from Illinois (2006). Group 5 isolates from Harris County cluster with isolates from the southwestern United States and northern Mexico (called the SW/WN03 genotype because the first isolates were identified in Arizona and Colorado in 2003).
Figure 2

Maximum-likelihood phylogenetic tree showing all published, full open reading frame North American West Nile virus isolates, 2002–2009 (A), and enlargement showing SW/WN03 genotype (B). Red, isolates sequenced in this study. Scale bar in panel A indicates nucleotide substitutions per site. Numbers on the right in panel B indicate groups.

Maximum-likelihood phylogenetic tree showing all published, full open reading frame North American West Nile virus isolates, 2002–2009 (A), and enlargement showing SW/WN03 genotype (B). Red, isolates sequenced in this study. Scale bar in panel A indicates nucleotide substitutions per site. Numbers on the right in panel B indicate groups.

SW/WN03 Genotype

This genotype is composed of 5 groups on the basis of nucleotide and amino acid sequences and phylogenetic analysis. SW Group 1 is composed of 2 isolates: WNV-1/US/BID-V4093/2007 from New York and CO2003–2 from Colorado. SW Group 2 is composed of 4 isolates, 3 from Texas that were sequenced in this study: M12214 (2005), TX6647 (2007), TX7558 (2008), and BSL2–05 from South Dakota in 2005. SW Group 3 is composed of 6 isolates: 2 from Illinois (WNV-1/US/BID-V4369/2004 and WNV-1/US/BID-V4378/2005), 2 from New York (WNV-1/US/BID-V4806/2005 and WNV-1/US/BID-V4624/2008), and 2 from New Mexico (04–237NM and 04–238NM). SW Group 4 is composed of 8 isolates from Mexico in 2003 (TVP9115, TVP9118–TVP9222). SW Group 5 is the largest and is composed of 22 isolates: 10 from Arizona (2003–2006), 2 from New Mexico (2005), 2 from Colorado (2004), 3 from California (2005, 2007–2008), and 5 from Texas (M19433, which was sequenced in this study, and 4 isolates from west Texas). Further examination of sequences within the SW/WN03 genotype showed that they share some or all of a signature of 13 nt changes (different from those of the NA/WN02 genotype), including 2 aa substitutions, NS4B-A85T and NS5-K314R (Table A2). Isolates in SW group 1 contain 4 of the 13 changes (nt positions 6238, 6721, 7269, and 9264). SW group 2 isolates contain 5 changes (nt positions 6238, 6721, 8550, 9264, and 9660). SW group 3 isolates have 7 changes (nt positions 1320, 6238, 6721, 8550, 8621, 9264, and 9660). SW group 4 isolates have 6 changes (nt positions 1320, 6238, 6721, 8550, 8621, and 9660). SW group 5 isolates have all 13 nt changes (nt positions 1320, 1974, 3399, 6238, 6721, 6765, 6936, 7269, 8550, 8621, 9264, 9660, and 10062). Isolates in SW groups 1 and 2 contain only 1 (NS4B-A85T) of the 2 aa substitutions, and isolates in SW groups 3, 4, and 5 contain both amino acid substitutions.
Table A2

Genotype nucleotide changes in SW/WN03 isolate of West Nile virus, Upper Texas Gulf Coast, USA*

Strain
State
Year
Group
E
NS1
NS3
NS4A
NS4B
NS5
3′ UTR
1320
1974
3399
6238
6721†
6765
6936
7269
8550
8621‡
9264
9660
10062
NY99NY1999ACUCG U UUCAUCU
CO 2003–2CO20031···A··C··C··
HM488201NY2007···UA··C··C··
M12214 TX 2005 2 · · · U A · · · U · C U ·
BSL2–05SD2005···UA···U·CU·
TX 6647 TX 2007 · · · U A · · · U · C U ·
TX 7558 TX 2008 · · · U A · · · U · C U ·
HM488191IL20043G··UA···UGCU·
04–237NMNM2004G··UA···UGCU·
04–236NMNM2004G··UA···UGCU·
HM488198IL2005G··UA···UGCU·
HM756675NY2005G··UA···UGCU·
HM488239NY2008G··UA···UGCU·
TVP9117MEX20034G···A···UG·U·
TVP9223MEX2003G··UA···UG·U·
TVP9115MEX2003G··UA···UG·U·
TVP9222MEX2003G··UA···UG·U·
TVP9218MEX2003G··UA···UG·U·
TVP9220MEX2003G··UA···UG·U·
TVP9219MEX2003G··UA···UG·U·
TVP9221MEX2003G··UA···UG·U·
AZ-03–1623AZ20035GUCUACCCUGCUC
AZ-03 03–1799AZ2003GUCUACCCUGCUC
AZ-03–1681AZ2003GUCUACCCUGCUC
BSL5–04AZ2004GUCUACCCUGCUC
04–252AZAZ2004GUCUACCCUGCUC
04–251AZAZ2004GUCUACCCUGCUC
AZ2004AZ2004GUCUACCCUGCUC
04–216COCO2004GUCUACCCUGCUC
04–219COCO2004GUCUACCCUGCUC
BSL13–05AZ2005GUCUACCCUGCUC
142WG-AZ05PIAZ2005GUCUACCCUGCUC
132WG-CA05LACA2005GUCUACCCUGC·C
HM756677NM2005GUCUACCCUGCUC
009WG-NM05LCNM2005GUCUACCCUGCUC
GCTX1–2005TX2005GUCUACCCUG··C
007WG-TX05EPTX2005GUCUACCCUGCUC
144WG-AZ06PIAZ2006GUCUAC·CUGCUC
011WG-TX06EPTX2006GUCUACCCUGCUC
194WG-CA07LACA2007GUCUACCCUGCUC
013WG-TX07EPTX2007GUCUACCCUGCUC
M19433 TX 2007 G U C U A C C C U G C U C
JPW080814–01CA2008GUCUACCCUGCUC

*E, envelope; NS, nonstructural; UTR, untranslated region; NY, New York; CO, Colorado; TX, Texas; SD, South Dakota; IL, Illinois, NM, New Mexico; MEX, Mexico, AZ, Arizona; CA, California. Dots indicate no change from NY99 isolate. Isolates in boldface were sequenced in this study.
†NS4A-A85T.
‡NS5-K314R.
§Nucleotide changes compared with NY99 isolate.

Selection Pressures

Recombination analysis using single-break point analysis was performed on the first 3,333 codons of the ORF of the North American WNV alignment to rule out recombination before performing selection pressure analysis. As expected, no evidence of recombination was detected. Selection pressures on the WNV genome were examined by using 3 methods: SLAC, FEL, and IFEL (Table 4). These methods estimated the ratio of nonsynonymous (dN) to synonymous (dS) amino acid substitutions in 10 datasets representing the first 9,999 nt (3,333 aa residues) of the ORF and each protein (C, prM/E, NS1, NS2A, NS2B, NS3, NS4A, NS4B, and NS5) for the complete North American WNV alignment of 244 genomes. On examination of the ORF, 3 residues were identified for positive selection in >2 of the 3 methods. E-V431I (codon position 721 in ORF) was identified by FEL (p = 0.057) and IFEL (p = 0.072), NS2A-A224V/T (codon position 1367 in ORF) was identified by SLAC (p = 0.087) and FEL (p = 0.096), and NS4A-A85T (codon position 2209 in ORF) was identified by SLAC (p = 0.087), FEL (p = 0.011), and IFEL (p = 0.067). When selection analysis was performed on each gene, only E-V431I was identified for positive selection (FEL, p = 0.059 and IFEL, p = 0.065). An additional residue, NS5-K314R, was also identified for positive selection by FEL (p = 0.042) and IFEL (p = 0.042).
Table 4

Positive and negative selection results for West Nile virus isolates, Upper Texas Gulf Coast, USA, 2002–2009*

ProteinAmino acid residues relative to ORFLength of protein, aaOverall dN/dSSingle-likelihood ancestor counting†

Fixed effects
likelihood†

Internal fixed effects likelihood†
Positive selectionNegative selectionPositive selectionNegative selectionPositive selectionNegative selection
ORF1–3,333‡3,3330.110224686191625
C1–1231230.270020902
prM124–2901660.1340702513
E291–7915000.11901517213
NS1792–11433510.13401714844
NS2A1144–13742300.13001514813
NS2B1375–15051300.1180601901
NS31506–21246180.083042010106
NS4A2125–22731480.1350712203
NS4B2274–25222480.11201304908
NS52529–34339040.0980612148410

*ORF, open reading frame; dN, nonsynonymous nucleotide substitutions; dS, synonymous nucleotide substitutions; C, capsid; prM, premembrane; E, envelope; NS, nonstructural.
†No. sites where p<0.1.
‡Only the first 3,333 aa residues of the ORF were used for these analyses because of program constraints on alignment size.

*ORF, open reading frame; dN, nonsynonymous nucleotide substitutions; dS, synonymous nucleotide substitutions; C, capsid; prM, premembrane; E, envelope; NS, nonstructural.
†No. sites where p<0.1.
‡Only the first 3,333 aa residues of the ORF were used for these analyses because of program constraints on alignment size. NS4A-A85T and NS5-K314R are the 2 aa residues that identify the SW/WN03 genotype described. Residue E-V431I is found in a California cluster (California isolates 2003–2008). Substitutions at NS2A-224 (codon position 1367 in ORF) are found in 5 NY99 genotype isolates (A224T) and 4 SW/NA03 genotype isolates (A224V).

Discussion

To date, most genetic and phylogenetic studies of WNV have focused on partial genome sequencing, primarily of the E protein gene. Although studies of the E protein gene are helpful in understanding the evolution of WNV in North America, they provide few phylogenetically informative sites; analysis of genomic sequences is more informative (,–). Similarly, although many studies have examined the evolutionary dynamics of WNV soon after its introduction into North America, only 1 published study has examined isolates since 2006 (). To our knowledge, none have been published that examined isolates from 2007 or more recently. For these reasons, we examined evolution of WNV by using genomic sequences from 1999–2009. We focused on the Upper Texas Gulf Coast region because of availability of multiple isolates from the same localities each year since the first detection of the virus in Texas in 2002. These isolates were obtained as part of an ongoing surveillance program of WNV activity in Harris County. The isolates sequenced in this study demonstrate that the NA/WN02 genotype has been maintained during 2002–2009 in Harris County. All 17 isolates sequenced contained 9 of 13 nt changes associated with the NA/WN02 genotype reported by Davis et al. (), including the amino acid substitution E-V159A. However, since 2005, reversion to the NY99 genotype was seen at 4 nt positions. Nine isolates contained a C at nt positions 660 and 6238, three isolates had a U at nt position 6426, and two isolates had a C at nt position 9352. Although isolates sequenced in our study display a high degree of similarity, they have major differences. It appears that >3 genetic groups of isolates were co-circulating in Harris County over the study period. Thus, there is continued genetic diversity of WNV over time, at least in the Upper Texas Gulf Coast region, rather than the genetic homeostasis in North America, which was proposed on the basis of using E gene sequences of viruses isolated through 2005 (). One group, group 4, contains isolates from 2006 and 2009. A second group, group 5, contains isolates from 2005, 2007, and 2008. A third group, group 6, contains isolates from 2006 plus 2 sequenced isolates from 2002. All other 2002–2005 isolates sequenced previously fall into other groups (groups 1–3) (). Four isolates, TX5058 from 2005, TX6747 and TX7191 from 2007, and TX7827 from 2009, did not fall into any of the 6 groups and may represent single isolates that did not have any advantage and thus became extinct. When compared with all North American WNV isolates, we found 3 distinctive clusters of isolates within the NA/WN02 genotype. Each cluster contained several isolates from the Upper Texas Gulf Coast region, in addition to other isolates. The first cluster contains group 4 isolates, in addition to four 2008 isolates from New York and one 2006 isolate from Illinois. The second cluster is composed primarily of isolates from California, in addition to group 1 Upper Texas Gulf Coast isolates and 3 additional isolates from Colorado, Connecticut, and Illinois. Group 5 isolates from Harris County cluster with isolates from the southwestern United States (Arizona and Colorado) and from northern Mexico, which were first identified in 2003. This SW/WN03 genotype shares some or all of 13 nt changes, which encode for 2 aa substitutions. Our data indicate that this genotype is spreading into new areas. It has been identified in California, Illinois, New Mexico, New York, North Dakota, and Texas since 2003. Two of these clusters, the California cluster and the cluster we have called the SW/WN03 genotype, are further supported by using selection analysis. This analysis has shown that there is potential for positive selection at E-V431I in the California cluster and at both of the amino acid substitutions (NS4A-A85T, NS5-K314R) in the SW/WN03 genotype. This finding further provides evidence of the potential role of this emerging genotype. Potential roles of single amino acid substitutions within the WNV genome should also be noted. The single amino acid change, E-V159A, which occurred in the NA/WN02 genotype, was shown to decrease the extrinsic incubation period of the virus in mosquitoes, which enabled that genotype to displace the NY99 genotype (). Brault et al. () reported that the NS3-T249P substitution increased virulence in American crows. The NS3-T249P substitution has undergone positive selection but the E-V159A change has not, yet both cause phenotypic changes. We speculate that positive selection of NS4A-A85T and NS5-K314R induces a phenotypic change in WNV. Previous studies in our laboratory that focused on the E protein gene concluded that WNV is experiencing a genetic stasis or decrease in its growth rate after establishment of the NA/WN02 genotype (). However, none of these studies have phylogenetically examined the entire genome of WNV. This study of genomic sequences demonstrates evolution of WNV, at least in the Upper Texas Gulf Coast region, and potential emergence of a new genotype in the southwestern United States (SW/WN03 genotype). Further experiments are needed to investigate potential phenotypic changes that occur in conjunction with the noted genotype changes and to determine if the SW/WN03 genotype will replace the current dominant NA/WN02 genotype.
  31 in total

1.  Limited evolution of West Nile virus has occurred during its southwesterly spread in the United States.

Authors:  David W C Beasley; C Todd Davis; Hilda Guzman; Dana L Vanlandingham; Amelia P A Travassos da Rosa; Ray E Parsons; Stephen Higgs; Robert B Tesh; Alan D T Barrett
Journal:  Virology       Date:  2003-05-10       Impact factor: 3.616

2.  A simple, fast, and accurate algorithm to estimate large phylogenies by maximum likelihood.

Authors:  Stéphane Guindon; Olivier Gascuel
Journal:  Syst Biol       Date:  2003-10       Impact factor: 15.683

3.  Datamonkey 2010: a suite of phylogenetic analysis tools for evolutionary biology.

Authors:  Wayne Delport; Art F Y Poon; Simon D W Frost; Sergei L Kosakovsky Pond
Journal:  Bioinformatics       Date:  2010-07-29       Impact factor: 6.937

4.  Not so different after all: a comparison of methods for detecting amino acid sites under selection.

Authors:  Sergei L Kosakovsky Pond; Simon D W Frost
Journal:  Mol Biol Evol       Date:  2005-02-09       Impact factor: 16.240

5.  MODELTEST: testing the model of DNA substitution.

Authors:  D Posada; K A Crandall
Journal:  Bioinformatics       Date:  1998       Impact factor: 6.937

6.  West Nile: a deadly virus?

Authors:  B Le Guenno; A Bougermouh; T Azzam; R Bouakaz
Journal:  Lancet       Date:  1996-11-09       Impact factor: 79.321

7.  West Nile encephalitis epidemic in southeastern Romania.

Authors:  T F Tsai; F Popovici; C Cernescu; G L Campbell; N I Nedelcu
Journal:  Lancet       Date:  1998-09-05       Impact factor: 79.321

8.  Emergence of attenuated West Nile virus variants in Texas, 2003.

Authors:  C Todd Davis; David W C Beasley; Hilda Guzman; Marina Siirin; Ray E Parsons; Robert B Tesh; Alan D T Barrett
Journal:  Virology       Date:  2004-12-05       Impact factor: 3.616

9.  Genetic and phenotypic variation of West Nile virus in New York, 2000-2003.

Authors:  Gregory D Ebel; Justin Carricaburu; David Young; Kristen A Bernard; Laura D Kramer
Journal:  Am J Trop Med Hyg       Date:  2004-10       Impact factor: 2.345

10.  Isolation of two strains of West Nile virus during an outbreak in southern Russia, 1999.

Authors:  D K Lvov; A M Butenko; V L Gromashevsky; V P Larichev; S Y Gaidamovich; O I Vyshemirsky; A N Zhukov; V V Lazorenko; V N Salko; A I Kovtunov; K M Galimzyanov; A E Platonov; T N Morozova; N V Khutoretskaya; E O Shishkina; T M Skvortsova
Journal:  Emerg Infect Dis       Date:  2000 Jul-Aug       Impact factor: 6.883

View more
  38 in total

Review 1.  West Nile virus: A re-emerging pathogen revisited.

Authors:  Miguel A Martín-Acebes; Juan-Carlos Saiz
Journal:  World J Virol       Date:  2012-04-12

2.  Molecular evolution of West Nile virus in a northern temperate region: Connecticut, USA 1999-2008.

Authors:  Philip M Armstrong; Charles R Vossbrinck; Theodore G Andreadis; John F Anderson; Kendra N Pesko; Ruchi M Newman; Niall J Lennon; Bruce W Birren; Gregory D Ebel; Mathew R Henn
Journal:  Virology       Date:  2011-07-01       Impact factor: 3.616

3.  Fluid Spatial Dynamics of West Nile Virus in the United States: Rapid Spread in a Permissive Host Environment.

Authors:  Francesca Di Giallonardo; Jemma L Geoghegan; Douglas E Docherty; Robert G McLean; Michael C Zody; James Qu; Xiao Yang; Bruce W Birren; Christine M Malboeuf; Ruchi M Newman; Hon S Ip; Edward C Holmes
Journal:  J Virol       Date:  2015-10-28       Impact factor: 5.103

4.  Many human RNA viruses show extraordinarily stringent selective constraints on protein evolution.

Authors:  Jinn-Jy Lin; Maloyjo Joyraj Bhattacharjee; Chun-Ping Yu; Yan Yuan Tseng; Wen-Hsiung Li
Journal:  Proc Natl Acad Sci U S A       Date:  2019-09-04       Impact factor: 11.205

5.  Dynamic transmission of West Nile virus across the United States-Mexican border.

Authors:  Brian R Mann; Allison R McMullen; Hilda Guzman; Robert B Tesh; Alan D T Barrett
Journal:  Virology       Date:  2012-11-07       Impact factor: 3.616

6.  A Live Attenuated Chimeric West Nile Virus Vaccine, rWN/DEN4Δ30, Is Well Tolerated and Immunogenic in Flavivirus-Naive Older Adult Volunteers.

Authors:  Kristen K Pierce; Stephen S Whitehead; Beth D Kirkpatrick; Palmtama L Grier; Adrienne Jarvis; Heather Kenney; Marya P Carmolli; Cynthia Reynolds; Cecilia M Tibery; Janece Lovchik; Anna Janiak; Catherine J Luke; Anna P Durbin; Alexander G Pletnev
Journal:  J Infect Dis       Date:  2016-10-25       Impact factor: 5.226

7.  West Nile Virus fidelity modulates the capacity for host cycling and adaptation.

Authors:  Haley S Caldwell; Kiet Ngo; Janice D Pata; Laura D Kramer; Alexander T Ciota
Journal:  J Gen Virol       Date:  2020-02-18       Impact factor: 3.891

Review 8.  Factors shaping the adaptive landscape for arboviruses: implications for the emergence of disease.

Authors:  Lark L Coffey; Naomi Forrester; Konstantin Tsetsarkin; Nikos Vasilakis; Scott C Weaver
Journal:  Future Microbiol       Date:  2013-02       Impact factor: 3.165

9.  Genotype-specific variation in West Nile virus dispersal in California.

Authors:  Nisha K Duggal; William K Reisen; Ying Fang; Ruchi M Newman; Xiao Yang; Gregory D Ebel; Aaron C Brault
Journal:  Virology       Date:  2015-07-25       Impact factor: 3.616

10.  Extrinsic Incubation Rate is Not Accelerated in Recent California Strains of West Nile Virus in Culex tarsalis (Diptera: Culicidae).

Authors:  Mary E Danforth; William K Reisen; Christopher M Barker
Journal:  J Med Entomol       Date:  2015-07-02       Impact factor: 2.278

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.