| Literature DB >> 27054586 |
Shea N Gardner1, Kevin McLoughlin1, Nicholas A Be2, Jonathan Allen1, Scott C Weaver3, Naomi Forrester3, Mathilde Guerbois3, Crystal Jaing2.
Abstract
Venezuelan equine encephalitis virus (VEEV) is a mosquito-borne alphavirus that has caused large outbreaks of severe illness in both horses and humans. New approaches are needed to rapidly infer the origin of a newly discovered VEEV strain, estimate its equine amplification and resultant epidemic potential, and predict human virulence phenotype. We performed whole genome single nucleotide polymorphism (SNP) analysis of all available VEE antigenic complex genomes, verified that a SNP-based phylogeny accurately captured the features of a phylogenetic tree based on multiple sequence alignment, and developed a high resolution genome-wide SNP microarray. We used the microarray to analyze a broad panel of VEEV isolates, found excellent concordance between array- and sequence-based SNP calls, genotyped unsequenced isolates, and placed them on a phylogeny with sequenced genomes. The microarray successfully genotyped VEEV directly from tissue samples of an infected mouse, bypassing the need for viral isolation, culture and genomic sequencing. Finally, we identified genomic variants associated with serotypes and host species, revealing a complex relationship between genotype and phenotype.Entities:
Mesh:
Year: 2016 PMID: 27054586 PMCID: PMC4824352 DOI: 10.1371/journal.pone.0152604
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Gene regions from which SNPs were extracted.
| Gene | Coordinates on TC-83 genome |
|---|---|
| E1 | 10000–11327 |
| E2 | 8563–9843 |
| E3 | 8386–8574 |
| Capsid | 7562–8396 |
| All SNPs | 1–11446 |
Numbers of SNPs identified in VEEV genomes, by gene region.
| Gene | Number of SNPs |
|---|---|
| E1 | 1268 |
| E2 | 1384 |
| E3 | 262 |
| Capsid | 937 |
Fig 1SNP phylogeny of VEEV isolates by parsimony.
Strains are labeled by serotype-country-year collected-strain-host. Country codes are GA = Guatemala, PE = Peru, NI = Nicaragua, VE = Venezuela, CO = Colombia, TR = Trinidad, PA = Panama, US = USA, EC = Ecuador, ME = Mexico, BE = Belize, HO = Honduras, BR = Brazil, AR = Argentina, FG = French Guiana. Host codes are hor = horse, don = donkey, hum = human, mos = mosquito, ham = hamster, mus = mouse. u = unknown. Strains are colored by serotype (blue = IE, green = ID, red = IC, and purple = IAB). Hosts from which the strains were collected are indicated with symbols at the branch tips (red circles = human, orange circles = horses, blue circles = mosquitos, and green squares = hamsters). Counts of the number of alleles shared uniquely by the sequences down each branch are shown at the nodes in blue.
Fig 2Decision tree for prediction of serotype from SNP alleles.
Notations above internal nodes indicate SNP position in the TC-83 genome and alleles corresponding to left and right branches. Numbers below terminal nodes are numbers of isolates in node with serotypes IAB/IC/ID/IE respectively.
Fig 3Decision tree for prediction of host type from SNP alleles.
Notations above internal nodes are as in Fig 2. Numbers below terminal nodes are numbers of isolates in node collected from large/small host types, respectively.
Accuracy, positive (PPV) and negative predictive value (NPV), true positive (TPR) and negative (TNR) rates for serotype and host type predictions.
| Phenotype | Accuracy | PPV | NPV | TPR | TNR |
|---|---|---|---|---|---|
| 94.8% | - | - | - | - | |
| IAB | 98.5% | 90.0% | 99.2% | 90.0% | 99.2% |
| IC | 97.0% | 71.4% | 100.0% | 100.0% | 96.8% |
| ID | 95.6% | 97.9% | 94.3% | 90.2% | 98.8% |
| IE | 98.5% | 98.4% | 98.6% | 98.4% | 98.6% |
| 87.4% | - | - | - | - | |
| large | 87.4% | 90.7% | 85.9% | 75.0% | 95.2% |
| small | 87.4% | 85.9% | 90.7% | 95.2% | 75.0% |
(PPV) = TP/(TP + FP), (NPV) = TN/(TN+FN), TPR = TP/(TP+FN), and TNR = TN/(TN+FP).
Comparison of trees from multiple sequence alignment versus all SNPs, and trees from SNPs located in a single gene versus all SNPs.
| Tree comparison | Splits Found in 2nd tree | Total Splits in SNP tree | Fraction splits in SNP tree found in 2nd tree |
|---|---|---|---|
| All SNPs vs MSA | 112 | 146 | 0.77 |
| All SNPs vs E1 | 84 | 146 | 0.58 |
| All SNPs vs E2 | 72 | 146 | 0.49 |
| All SNPs vs E3 | 68 | 146 | 0.47 |
| All SNPs vs capsid | 54 | 146 | 0.37 |