| Literature DB >> 35737310 |
Bernal León1,2, Gabriel González3, Alessandro Nicoli4, Alicia Rojas5, Antonella Di Pizio4, Lisbeth Ramirez-Carvajal6, Carlos Jimenez7.
Abstract
Venezuelan Equine Encephalitis virus (VEEV) is an arboviral pathogen in tropical America that causes lethal encephalitis in horses and humans. VEEV is classified into six subtypes (I to VI). Subtype I viruses are divided into epizootic (IAB and IC) and endemic strains (ID and IE) that can produce outbreaks or sporadic diseases, respectively. The objective of this study was to reconstruct the phylogeny and the molecular clock of sequences of VEEV subtype I complex and identify mutations within sequences belonging to epizootic or enzootic subtypes focusing on a sequence isolated from a mare in Costa Rica. Bayesian phylogeny of the VEEV subtype I complex tree with 110 VEEV complete genomes was analyzed. Evidence of positive selection was evaluated with Datamonkey server algorithms. The putative effects of mutations on the 3D protein structure in the Costa Rica sequence were evaluated. The phylogenetic analysis showed that Subtype IE-VEEV diverged earlier than other subtypes, Costa Rican VEEV-IE ancestors came from Nicaragua in 1963 and Guatemala in 1907. Among the observed non-synonymous mutations, only 17 amino acids changed lateral chain groups. Fourteen mutations located in the NSP3, E1, and E2 genes are unique in this sequence, highlighting the importance of E1-E2 genes in VEEV evolution.Entities:
Keywords: Bayesian; Costa Rica; E1/E2 dimer; VEEV; evolution; homology modeling; horse encephalitis; mutations; phylogenetic
Year: 2022 PMID: 35737310 PMCID: PMC9229380 DOI: 10.3390/vetsci9060258
Source DB: PubMed Journal: Vet Sci ISSN: 2306-7381
Figure 1Bayesian inference phylogenetic tree depicting VEEV subtype I topology. Branch labels show the posterior probability, and the node indicates the probability of the subtype origin of the MRCA.
Positive selection by gene, branch, and site was obtained with three different methods.
| BUSTED | FEL | ABSREL | ||
|---|---|---|---|---|
| PP | PN | |||
| NSP1 535 | NF | 0 | 216 | NF |
| MK796243 | NF | 0 | 0 | NF |
| NSP2 794 | F | 0 | 527 | NF |
| MK796243 | NF | 2 * | 0 | NF |
| NSP3 558 | NF | 2 | 269 | F, 1/163 |
| NSP4 607 | F | 0 | 418 | NF |
| MK796243 | NF | 2 * | 0 | NF |
| CAPSIDE 285 | NF | 0 | 157 | NF |
| MK796243 | NF | 0 | 0 | NF |
| E1 442 | NF | 1 | 234 | NF |
| MK796243 | NF | 2 * | 0 | NF |
| E2 423 | NF | 0 | 259 | NF |
| MK796243 | NF | 2 * | 0 | NF |
| E3 59 | NF | 0 | 34 | NF |
| MK796243 | NF | 0 | 0 | NF |
NF: positive selection was not found, F: positive selection was found, PP: pervasive positive/diversifying selection, PN: pervasive negative/purifying selection. * the pervasive positive/diversifying selection sites are shown in Table 3.
Amino acid distribution in sites with positive selection.
| SUBTYPE # Seq | |||||||
|---|---|---|---|---|---|---|---|
| Site | aa | IAB 8 | IC 38 | ID 17 | IE-1 2 | IE-2 25 | IE-3 20 |
|
| |||||||
| 343 | P | 2 | |||||
| L | 1 | 12 | |||||
| S | 24 | 7 | |||||
| - | 1 | ||||||
| N | 8 | 38 | 16 | ||||
| D | 1 | ||||||
| - | 1 | ||||||
| 389 | C | 3 | |||||
| R | 2 | 22 | 1 | ||||
| H | 19 | ||||||
| E | 1 | ||||||
| G | 8 | 38 | 16 | ||||
|
| |||||||
| 347 | A | 10 | 2 | 25 | 20 | ||
| T | 8 | 37 | 7 | ||||
| S | 1 | ||||||
Amino acid composition among the sequences.
| Pacific IE-2 | Pacific IE-2 | Pacific IE-2 | Pacific IE-2 | Caribbean IE-3 | Caribbean IE-3 | Panama IE-1 | ||
|---|---|---|---|---|---|---|---|---|
| MK796243 | KC344441 | KC344435 | AF448535 | KC344446 | KC344457 | KC344432 | ||
| NSP1 (535aa) | ||||||||
| 120 | Sπ | T | A | A | A | A | A | |
| 508 | Eπ | A | A | A | A | A | A | |
| NSP2 | ||||||||
| 75 | H ** | Q | Q | Q | Q | Q | Q | |
| 454 * | E | D | D | D | D | D | D | |
| 495 * | Aπ | T | T | S | T | T | T | |
| 535 | T | T | A | A | A | A | A | |
| 617 | Y | H | H | Y | H | H | H | |
| NSP3 | ||||||||
| 65 | I ** | L | L | L | L | L | I | |
| 344 * | P | P | T | L | T | T | T | |
| 350 | Sπ | P | P | P | P | P | P | |
| 356 | Sπ | S | P | P | P | P | P | |
| 366 | Dπ | E | E | E | E | E | D | |
| 389 | Cπ *** | S | S | S | S | S | S | |
| 439 | Nπ | T | T | T | T | T | T | |
| 452 | K | K | Q | Q | Q | Q | R | |
| 465 * | Gπ | E | E | E | E | E | E | |
| 470 | Sπ | T | T | T | T | T | T | |
| 528 | Cπ | S | S | S | S | S | S | |
| NSP4 | ||||||||
| 105 * | V | L | L | L | L | L | L | |
| 235 * | D | E | E | E | E | E | E | |
| E1 | ||||||||
| 208 | V ** | I | V | V | I | I | I | |
| 211 * | Tπ | S | S | S | S | S | S | |
| 389 * | Aπ | T | T | T | T | T | T | |
| E2 | ||||||||
| 81 * | Rπ | Q | Q | Q | Q | Q | Q | |
| 182 * | Rπ | S | S | S | S | S | S | |
| 282 | Lπ | S | S | S | S | S | S |
The colors highlighting amino acids represent changes in chemical properties: yellow, red, purple, and light blue indicate positively charged, negatively charged, hydrophobic, and polar amino acids, respectively, while green is either C, G, or P. The acidic E is changed to the hydrophobic A in position 508 of NSP1, the G is changed to the acidic E in position 479 of NSP3, in position 389 of E1 the hydrophobic A is changed with the polar T and in the 282 in E2 the hydrophobic is changed to the polar S. * the pervasive positive/diversifying selection sites according to FEL. ** Histidine (H) at position 75 of NSP2 is shared among all the sequences of the IAB, ID, and IC subtypes, Isoleucine (I) at position 65 of the NSP3 protein is shared with the 2 sequences IE-1 and all the sequences of the IAB, ID, and IC subtypes, Valine (V) at position 208 of the SPE1 protein is shared with 17 sequences of IE-2 group and all sequences of the IAB, 16/17 ID, and IC subtypes. *** In position 389 of the NSP3, the CR sequence has a C, while the subtypes IAB and IC have a G.
Figure 2E1/E2 heterodimer model (A) and zoomed view of the interface between E1 and E2 at the position T389 (B). E1 and E2 are colored in lime green and violet, respectively. Cartoon and transparent surface styles are used. (A) Positions affected by mutations in E1 and E2 are shown as dark green and purple spheres, respectively. Calculated ΔΔG values for each mutant are reported. (B) Hydrogen bonds are represented as purple dashed lines. Only residues involving side chains in the interactions are shown and labeled: Y338 in E2 and T389 and H390 in E1 are displayed as sticks with atom-type coloring. The images were prepared with ChimeraX [35].