| Literature DB >> 26678286 |
Chieh-Hua Lin, Yu-Bin Wang, Shu-Hwa Chen, Chao Agnes Hsiung, Chung-Yen Lin.
Abstract
Enteroviruses (EV) with different genotypes cause diverse infectious diseases in humans and mammals. A correct EV typing result is crucial for effective medical treatment and disease control; however, the emergence of novel viral strains has impaired the performance of available diagnostic tools. Here, we present a web-based tool, named EVIDENCE (EnteroVirus In DEep conception, http://symbiont.iis.sinica.edu.tw/evidence), for EV genotyping and recombination detection. We introduce the idea of using mixed-ranking scores to evaluate the fitness of prototypes based on relatedness and on the genome regions of interest. Using phylogenetic methods, the most possible genotype is determined based on the closest neighbor among the selected references. To detect possible recombination events, EVIDENCE calculates the sequence distance and phylogenetic relationship among sequences of all sliding windows scanning over the whole genome. Detected recombination events are plotted in an interactive figure for viewing of fine details. In addition, all EV sequences available in GenBank were collected and revised using the latest classification and nomenclature of EV in EVIDENCE. These sequences are built into the database and are retrieved in an indexed catalog, or can be searched for by keywords or by sequence similarity. EVIDENCE is the first web-based tool containing pipelines for genotyping and recombination detection, with updated, built-in, and complete reference sequences to improve sensitivity and specificity. The use of EVIDENCE can accelerate genotype identification, aiding clinical diagnosis and enhancing our understanding of EV evolution.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26678286 PMCID: PMC4682392 DOI: 10.1186/1471-2164-16-S12-S8
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Genotype re-classification procedures.
Figure 2EVIDENCE interface. (A) Sequence Search. In DB search, keywords or multiple conditions can be used to search for related sequences. In BLAST search, single or multiple sequences are used to BLAST against all EV sequences to search for similar sequences. (B) Genotyping. First, the query sequence is compared to the GTRefSet (the scan region step). The result table contains indices of mixed-ranking scores presenting segment similarity. References for phylogenetic analysis are selected from this table. (C) Recombination Detection. The potential recombination events can be detected through observing swaps of the most similar reference (plot similarity) or changes of the relatively closest neighbor (bootscanning) in the genome segments.
Figure 3Statistics of GTRefSet. (A) The species composition of the database. Enterovirus B and Rhinovirus A collectively contribute to about half of the total known genotypes. (B) Genome region view.
| Species | GenBank Nomenclature | Assigned by EVIDENCE | Sequences with Congruent Assignment (% *) |
|---|---|---|---|
| Enterovirus A | 15875 | 16057 | 15794 (99.49%) |
| Enterovirus B | 16432 | 16502 | 16238 (98.82%) |
| Enterovirus C | 8224 | 8202 | 8146 (99.05%) |
| Enterovirus D | 1031 | 1132 | 1031 (100%) |
| Enterovirus E | 73 | 43 | 41 (56.16%) |
| Enterovirus F | 5 | 43 | 5 (100%) |
| Enterovirus G | 196 | 216 | 190 (96.94%) |
| Enterovirus H | 10 | 10 | 10 (100%) |
| Enterovirus J | 9 | 11 | 5 (55.56%) |
| Rhinovirus A | 2733 | 5088 | 2697 (98.68%) |
| Rhinovirus B | 608 | 1101 | 592 (97.37%) |
| Rhinovirus C | 2309 | 3690 | 2259 (97.83%) |
| unclassified Enterovirus unclassified Rhinovirus | 4957 | 367 | 119 (7.4%) |
Remark: * The percentage is calculated as follows:
| Original Species | Original Genotype | #Sequences | Region2 (sequence number) |
|---|---|---|---|
| Enterovirus A | Coxsackievirus A2 | 11 | 5'UTR (1), 2C (2), 3D (8) |
| Enterovirus A | Coxsackievirus A3 | 2 | 5'UTR (1), 3D (1) |
| Enterovirus A | Coxsackievirus A4 | 13 | 2C (2), 3D (11) |
| Enterovirus A | Coxsackievirus A5 | 2 | 3D (2) |
| Enterovirus A | Coxsackievirus A6 | 74 | 5'UTR (69), 2C (2), 3D (3) |
| Enterovirus A | Coxsackievirus A7 | 7 | 2C (2), 3D (5) |
| Enterovirus A | Coxsackievirus A8 | 6 | 2C (1), 3D (5) |
| Enterovirus A | Coxsackievirus A10 | 94 | 5'UTR (85), VP4-VP2 (2), 2C (3), 3D (4) |
| Enterovirus A | Coxsackievirus A14 | 3 | VP1 (1), 2C (1), 3D (1) |
| Enterovirus A | Coxsackievirus A16 | 1 | 3D (1) |
| Enterovirus A | Enterovirus A76 | 1 | 3D (1) |
| Enterovirus A | NA1 | 54 | 5'UTR (28), 2BC (2), 3D (24) |
| Enterovirus B | Echovirus 4 | 3 | 5'UTR (3) |
| Enterovirus B | Echovirus 9 | 5 | 5'UTR (5) |
| Enterovirus B | NA1 | 10 | 5'UTR (10) |
| unclassified Enterovirus | UE1 | 22 | 5'UTR (10), VP4 (3), VP1 (3), 3D (6) |
Remarks:
1. Abbreviations in use: NA, Not assigned; UE, including all classes under unclassified Enterovirus.
2. The sequenced region is a partial or complete segment.
| Species | #Sequence | Region | #Genotype | Number (%) of Genotype Discrepancies | References | |
|---|---|---|---|---|---|---|
| EVIDENCE | RIVM Genotyping Tool | |||||
| 49 | 5'UTR | 2 | 0 | 49 (100%) | [ | |
| 59 | VP1 | 8 | 0 | 0 | [ | |
| 3 | CG | 3 | 0 | 0 | [ | |
| 101 | VP1/VP2 | 22 | 6 (5.9%) | 6 (5.9%) | [ | |
| 13 | VP1 | 6 | 1 (7.7%) | 2 (15.4%) | [ | |
| 1 | VP1 | 1 | 0 | 1 (100%) | [ | |
| 5 | VP1 | NA | 5 (100%) | 5 (100%) | [ | |
Figure 4Detection of recombination events in a highly virulent enterovirus strain CA16/GD09/119. The ML tree of the GD09/119 complete genome sequence to the ten top ranked prototype references in GTRefSet is shown in panel A. The five closest references (CV-A4, CV-A16, CV-A7, EV-A71, and EV-A120) were selected for recombination detection, which was plotted using Bootscanning (panel B) and similarity (panel C). The results of the potential recombination analyses of GD09/119 to CVA4, CVA16, and one of CV-A7 (panel D), EV-A71 (panel E), or EV-A120 (panel F), are shown as bootscanning plots.