| Literature DB >> 28771475 |
Naomi L Forrester1, Joel O Wertheim2, Vivian G Dugan3, Albert J Auguste1, David Lin4, A Paige Adams1, Rubing Chen1, Rodion Gorchakov1, Grace Leal1, Jose G Estrada-Franco1, Jyotsna Pandya1, Rebecca A Halpin3, Kumar Hari4, Ravi Jain4, Timothy B Stockwell5, Suman R Das5, David E Wentworth3, Martin D Smith6, Sergei L Kosakovsky Pond2, Scott C Weaver1.
Abstract
Venezuelan equine encephalitis (VEE) complex alphaviruses are important re-emerging arboviruses that cause life-threatening disease in equids during epizootics as well as spillover human infections. We conducted a comprehensive analysis of VEE complex alphaviruses by sequencing the genomes of 94 strains and performing phylogenetic analyses of 130 isolates using complete open reading frames for the nonstructural and structural polyproteins. Our analyses confirmed purifying selection as a major mechanism influencing the evolution of these viruses as well as a confounding factor in molecular clock dating of ancestors. Times to most recent common ancestors (tMRCAs) could be robustly estimated only for the more recently diverged subtypes; the tMRCA of the ID/IAB/IC/II and IE clades of VEE virus (VEEV) were estimated at ca. 149-973 years ago. Evolution of the IE subtype has been characterized by a significant evolutionary shift from the rest of the VEEV complex, with an increase in structural protein substitutions that are unique to this group, possibly reflecting adaptation to its unique enzootic mosquito vector Culex (Melanoconion) taeniopus. Our inferred tree topologies suggest that VEEV is maintained primarily in situ, with only occasional spread to neighboring countries, probably reflecting the limited mobility of rodent hosts and mosquito vectors.Entities:
Mesh:
Year: 2017 PMID: 28771475 PMCID: PMC5557581 DOI: 10.1371/journal.pntd.0005693
Source DB: PubMed Journal: PLoS Negl Trop Dis ISSN: 1935-2727
The subtypes of VEE complex alphaviruses and their transmission cycles.
| Virus | Subtype | Abbreviation | First Isolation | Geographic Range | Vertebrate Host Range | Mosquito vector | Human Disease | Endemic/Epidemic |
|---|---|---|---|---|---|---|---|---|
| Venezuelan equine encephalitis | IAB | VEEV | Venezuela 1938 | Trinidad, Peru, Colombia, Guatemala-Mexico-Texas | Horses/humans | Yes | Epidemic | |
| IC | Colombia, 1962 | Colombia, Venezuela, Peru | Horses/humans | Epidemic | ||||
| ID | Colombia, 1961 | South and Central America | Rodents | Endemic | ||||
| IE | Panama, 1961 | Central America | Rodents, horses, humans | Epidemic/Endemic | ||||
| Everglades | II | EVEV | Florida, USA, 1963 | Florida | Birds | No | Endemic | |
| Mucambo | IIIA | MUCV | Trinidad, 1954 | South America | Unknown | Culex portesi | yes | Endemic |
| Tonate | IIIB | TONV | French Guiana, 1973 | South and Central America | Unknown | Unknown | No | Unknown |
| 71D1252 | IIIC | South America | Unknown | Unknown | No | Unknown | ||
| Pixuna | IV | PIXV | Brazil, 1964 | South America | Unknown | Unknown | No | Unknown |
| Cabassou | V | CABV | French Guiana, 1968 | French Guiana | Unknown | No | Unknown | |
| Rio Negro (AG80_663) | VI | RNV | Argentina, 1980 | Argentina | Unknown | Unknown | No | Unknown |
| Mosso das Pedras (78V3531) | IF | MDPV | Brazil, 1978 | Brazil | Unknown | No | Unknown |
Fig 1Maximum likelihood phylogeny for VEEV complex.
Relevant internal nodes are identified.
Fig 2Evidence of branch length underestimation in the VEE complex shown via comparison of tree lengths inferred under Branch-Site REL (BSREL) and GTR+Γ4 substitution models.
Letters correspond to nodes in maximum likelihood tree. The aBSREL analysis performed with an optimized number of rate classes is shown in black. BSREL analysis performed with a fixed number of three rate classes is shown in gray. Instances in which only a black letter is shown indicate that aBSREL and BSREL produced identical results. The dashed line depicts x = y, an unbiased analysis.
Fig 3The genetic relationships of the II/ID/IAB/IC subtypes of the VEEV complex.
MCC tree as determined by coalescent analysis for the ID/II strains with the IAB and IC strains included. All branches had posterior probabilities of >0.97 except for the branches marked with a *, which did not have significant support. The time to most recent common ancestors (tMRCA) dates for individual branches as identified from the MCC tree are indicated on significant branches with the highest posterior density (HPD) in brackets, dates of clades supported by BSREL were emboldened. Subtype IAB strains are highlighted by the grey box.
Fig 4The genetic relationships of the IE subtype of the VEEV complex.
MCC tree as determined by coalescent analysis for the IE strains. All branches had posterior probabilities of >0.97 except for the branches marked with a *, which did not have significant support. Time to most recent common ancestor (tMRCA) as determined by the MCC tree were added to major branches with the HPD in brackets. If the clades were supported by BSREL the tMRCA and HPD were emboldened.
Codon positions predicted to be under episodic positive selection, showing the amino acid changes and the proteins altered.
Amino acid position is defined using the Genome sequence 3908 (Accession number U55350 in the NCBI database) and numbered against the non-structural and structural polyproteins.
| Protein | Amino acid position (ORF) | p-value | Amino acid change | Protein function |
|---|---|---|---|---|
| nsP1 | 3 | 0.021 | K—> T | NA |
| nsP1 | 130 | 0.016 | T—> Q/R | Viral methyltransferase |
| nsP1 | 444 | 0.002 | N—> P/S/D | NA |
| nsP1 | 487 | 0.049 | I—> V/L | NA |
| nsP2 | 884 | 0.007 | P—> S | Viral Helicase |
| nsP3 | 1344 | 0.045 | Y—> M/I/E/S | Appr -1 -p processing enzyme |
| nsP4 | 1921 | 0.031 | S—> V/A | RNA polymerase |
| nsP4 | 1991 | 0.043 | L—> E | RNA polymerase |
| nsP4 | 2162 | 0.014 | M—> L/A | RNA polymerase |
| nsP4 | 2397 | <0.001 | C—>S/R | RNA polymerase |
| Capsid | 10 | 0.014 | M—> T | Capsid |
| Capsid | 66 | 0.007 | P—> K/Q/R/S | Capsid |
| Capsid | 92 | 0.025 | K—> G/P/Q/R/S | Capsid |
| 0.028 | ||||
| E3 | 303 | 0.006 | A—> S/V | E3 glycoprotein |
| E2 | 451 | 0.046 | D—> E/G/N/K | E2 glycoprotein |
| E2 | 527 | 0.044 | G—> R | E2 glycoprotein |
| E2 | 533 | 0.016 | E—> D/K | E2 glycoprotein |
| E2 | 547 | 0.001 | T—> K/S/R/Q | E2 glycoprotein |
| E2 | 647 | 0.003 | E—> H/N/K | E2 glycoprotein |
| E1 | 819 | 0.038 | T—> S | E1 glycoprotein |
| E1 | 976 | 0.020 | A—> V | E1 glycoprotein |
| E1 | 1153 | 0.025 | K—> R | E1 glycoprotein |
1Protein change associated with epidemic strains of IE in 1996 and 2003 [58].