| Literature DB >> 31976084 |
Darren J Obbard1, Mang Shi2, Katherine E Roberts3, Ben Longdon3, Alice B Dennis4.
Abstract
Metagenomic sequencing has revolutionised our knowledge of virus diversity, with new virus sequences being reported faster than ever before. However, virus discovery from metagenomic sequencing usually depends on detectable homology: without a sufficiently close relative, so-called 'dark' virus sequences remain unrecognisable. An alternative approach is to use virus-identification methods that do not depend on detecting homology, such as virus recognition by host antiviral immunity. For example, virus-derived small RNAs have previously been used to propose 'dark' virus sequences associated with the Drosophilidae (Diptera). Here, we combine published Drosophila data with a comprehensive search of transcriptomic sequences and selected meta-transcriptomic datasets to identify a completely new lineage of segmented positive-sense single-stranded RNA viruses that we provisionally refer to as the Quenyaviruses. Each of the five segments contains a single open reading frame, with most encoding proteins showing no detectable similarity to characterised viruses, and one sharing a small number of residues with the RNA-dependent RNA polymerases of single- and double-stranded RNA viruses. Using these sequences, we identify close relatives in approximately 20 arthropods, including insects, crustaceans, spiders, and a myriapod. Using a more conserved sequence from the putative polymerase, we further identify relatives in meta-transcriptomic datasets from gut, gill, and lung tissues of vertebrates, reflecting infections of vertebrates or of their associated parasites. Our data illustrate the utility of small RNAs to detect viruses with limited sequence conservation, and provide robust evidence for a new deeply divergent and phylogenetically distinct RNA virus lineage.Entities:
Keywords: RNA interference; RNA virus; arthropod; dark virus; metagenome
Year: 2020 PMID: 31976084 PMCID: PMC6966834 DOI: 10.1093/ve/vez061
Source DB: PubMed Journal: Virus Evol ISSN: 2057-1577
Figure 1.Virus segments and sequencing coverage. Panels show the structure and fold-coverage for each of the five segments (columns), for each of the four founding Quenyaviruses (rows). Graphs represent fold-coverage on a log10 scale, with the structure of the segment annotated below to scale (dark: coding, pale: non-coding). Assembled contigs that terminated with a poly-A tract are denoted ‘AA’, and potentially incomplete open reading frames indicated with a jagged edge.
Figure 2.Phylogenetic trees for each of the viral segments. Panels A–E show maximum-likelihood phylogenetic trees for segments 1–5, inferred from amino-acid sequences. Panel E shows the tree for the most conserved segment, which encodes a putative RdRp. Trees are mid-point rooted, and the scale bar represents 0.5 substitutions per site. The four viruses marked in bold are the founding members of the clade, those underlined come from nominally vertebrate metagenomic datasets, and species names in italic denote sequences from public transcriptomes. One, J. sambac (marked superscript 1), came from a plant transcriptome contaminated with the whitefly D. kirkaldyi. Note that some aspects of tree topology appear to be consistent among segments, suggesting that reassortment may be limited. Sequence alignments and tree files are provided via Figshare doi:10.6084/m9.figshare.11341982.
Figure 3.Relationship of the Quenyaviruses to other RNA viruses. Unrooted phylogenetic trees showing the possible relationships between the RdRp (segment 5) of Quenyaviruses and RdRps of representatives from other groups of RNA viruses. Trees were inferred by maximum-likelihood using IQtree from alignments using T-coffee modes ‘expresso’ (A) and ‘accurate’ (B), or using a Bayesian approach (C) that co-infers the tree and alignment. None of the deep relationships had any support in the Bayesian analysis, although all of the major clades were recovered and many of the relationships between them are the same as those in (B). Sequence alignments are provided via Figshare doi:10.6084/m9.figshare.11341982.