| Literature DB >> 26717191 |
Abstract
Phylogenetic analysis has yet to uncover the early origins of flaviviruses. In this study, I mined a database of expressed sequence tags in order to discover novel flavivirus sequences. Flavivirus sequences were identified in a pool of mRNA extracted from the sea spider Endeis spinosa (Pycnogonida, Pantopoda). Reconstruction of the translated sequences and BLAST analysis matched the sequence to the flavivirus NS5 gene. Additional sequences corresponding to envelope and the NS5 MTase domain were also identified. Phylogenetic analysis of homologous NS5 sequences revealed that Endeis spinosa NS5 (ESNS5) is likely related to classical insect-specific flaviviruses. It is unclear if ESNS5 represents genetic material from an active viral infection or an integrated viral genome. These data raise the possibility that classical insect-specific flaviviruses and perhaps medically relevant flaviviruses, evolved from progenitors that infected marine arthropods.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26717191 PMCID: PMC4699914 DOI: 10.1371/journal.pone.0146037
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Top 10 tblastn hits of Sepik virus polyprotein against the expressed sequence tags (est) database.
| Best hit | Accession number | E-value |
|---|---|---|
|
| GI:283511872 | 6e-71 |
|
| GI:283498993 | 1e-70 |
|
| GI:283545672 | 2e-68 |
|
| GI:283545171 | 2e-67 |
|
| GI:283541209 | 1e-66 |
|
| GI:283513154 | 2e-66 |
|
| GI:283534952 | 6e-66 |
|
| GI:283535103 | 7e-66 |
|
| GI:283545130 | 9e-66 |
|
| GI:283545705 | 9e-66 |
Top 10 nucleotide BLAST hits of Sepik virus genome against the expressed sequence tags (est) database.
| Best hit | Accession number | E-value |
|---|---|---|
|
| GI:283541362 | 8e-16 |
|
| GI:283538836 | 8e-16 |
|
| GI:283511744 | 8e-16 |
|
| GI:283498993 | 8e-16 |
|
| GI:283545705 | 4e-14 |
|
| GI:283545672 | 4e-14 |
|
| GI:283545171 | 4e-14 |
|
| GI:283545118 | 4e-14 |
|
| GI:283541204 | 4e-14 |
|
| GI:283535170 | 4e-14 |
Fig 1Alignment of ESNS5 and Rio Bravo virus NS5 RdRp.
Amino acid sequence alignment was performed using reconstructed ESNS5 and Rio Bravo virus NS5 RdRp (Rio) using Clustal Omega. Stars represent perfect homology, colons represent partial homology, and periods represent weak homology. Dashes represent deleted sequence.
Top 10 protein BLAST hits of reconstructed ESNS5 against the non-redundant protein sequence (nr) database.
| Best hit | Accession number | E-value |
|---|---|---|
| Rio Bravo virus NS5 | GI:27735371 | 3e-180 |
| Montana myotis leukoencephalitis virus NS5 | GI:27669998 | 2e-178 |
| Deer tick virus NS5 | GI:164504431 | 2e-178 |
| Tick-borne encephalitis virus NS5 | GI:27596786 | 3e-178 |
| Alkhumra hemorrhagic fever virus NS5 | GI:27545515 | 1e-177 |
| Modoc virus NS5 | GI:25121731 | 7e-177 |
| Powassan virus NS5 | GI:16945860 | 9e-176 |
| Langat virus NS5 | GI:25121534 | 1e-175 |
| Apoi virus NS5 | GI:27697405 | 3e-175 |
| Dengue virus type 1 NS5 | GI:25014069 | 5e-174 |
Fig 2Phylogenetic position of ESNS5.
The evolutionary history of ESNS5 and almost all known flaviviruses was inferred by generating homologous NS5 amino acid sequences using MUSCLE multiple sequence alignment and Gblocks 0.91b software, followed by the Maximum Likelihood methods based on the JTT matrix-based models. The tree with the highest log likelihood (-27149.5525) is shown. The percentage of trees in which the associated taxa clustered together is shown next to the branches. Initial trees for the heuristic search were obtained by applying the Neighbor-Joining method to a matrix of pairwise distances estimated using a JTT model. The tree is drawn to scale, with branch lengths measured in the number of substitutions per site. The analysis involved 74 sequences. All positions containing gaps and missing data were eliminated. There were a total of 496 positions in the final dataset. Evolutionary analysis was conducted in MEGA6.