Literature DB >> 22848734

Discovery of novel dsRNA viral sequences by in silico cloning and implications for viral diversity, host range and evolution.

Huiquan Liu1, Yanping Fu, Jiatao Xie, Jiasen Cheng, Said A Ghabrial, Guoqing Li, Xianhong Yi, Daohong Jiang.   

Abstract

Genome sequence of viruses can contribute greatly to the study of viral evolution, diversity and the interaction between viruses and hosts. Traditional molecular cloning methods for obtaining RNA viral genomes are time-consuming and often difficult because many viruses occur in extremely low titers. DsRNA viruses in the families, Partitiviridae, Totiviridae, Endornaviridae, Chrysoviridae, and other related unclassified dsRNA viruses are generally associated with symptomless or persistent infections of their hosts. These characteristics indicate that samples or materials derived from eukaryotic organisms used to construct cDNA libraries and EST sequencing might carry these viruses, which were not easily detected by the researchers. Therefore, the EST databases may include numerous unknown viral sequences. In this study, we performed in silico cloning, a procedure for obtaining full or partial cDNA sequence of a gene by bioinformatics analysis, using known dsRNA viral sequences as queries to search against NCBI Expressed Sequence Tag (EST) database. From this analysis, we obtained 119 novel virus-like sequences related to members of the families, Endornaviridae, Chrysoviridae, Partitiviridae, and Totiviridae. Many of them were identified in cDNA libraries of eukaryotic lineages, which were not known to be hosts for these viruses. Furthermore, comprehensive phylogenetic analysis of these newly discovered virus-like sequences with known dsRNA viruses revealed that these dsRNA viruses may have co-evolved with respective host supergroups over a long evolutionary time while potential horizontal transmissions of viruses between different host supergroups also is possible. We also found that some of the plant partitiviruses may have originated from fungal viruses by horizontal transmissions. These findings extend our knowledge of the diversity and possible host range of dsRNA viruses and offer insight into the origin and evolution of relevant viruses with their hosts.

Entities:  

Mesh:

Substances:

Year:  2012        PMID: 22848734      PMCID: PMC3407116          DOI: 10.1371/journal.pone.0042147

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

The genome sequence of viruses can contribute greatly to the study of viral evolution, diversity and interactions between viruses and their hosts. Traditional methods for obtaining RNA viral sequences include the use of techniques such as dsRNA isolation, cDNA library construction and molecular cloning [1], [2]. Though powerful in discovering unknown viruses, these methods are time-consuming and often hindered by difficulties in cultivation and extremely low titers for many viruses. Amplification by PCR based on known viral sequences is most efficient for detecting viruses, but only leads to discovery of known or similar viruses. Recent viral metagenomic studies overcame these limitations and provided a promising method for investigation of unrefined viral diversity, in which viral particles are first partially purified and then viral sequences are randomly amplified before sub-cloning and sequencing [3], [4]. By using this approach, numerous previously unknown viruses have been discovered in environmental and clinical samples [5], [6], [7], [8]. However, a disadvantage of this method is difficulties to identify the host range of detected viruses. With the advance of next generation sequencing (NGS) technologies, another culture-independent approach for virus discovery is developed by deep sequencing and assembly of virus-derived small silencing RNAs [9], [10], [11]. This approach can identify both plant and invertebrate viruses occurring at extremely low titers without purification of viral particles and amplification of viral sequences. Double-stranded (ds) viruses infecting eukaryotes are grouped into seven families: Birnaviridae, Picobirnaviridae, Reoviridae, Endornaviridae, Chrysoviridae, Partitiviridae, and Totiviridae [12]. Though many members in the first three families cause serious diseases, viruses in the latter four families are generally associated with latent infections and have little or no overt effects on their hosts. Members of family Totiviridae possess a monopartite genome encoding, in most cases, only a capsid protein (CP) and an RNA-dependent RNA polymerase (RdRp) [12], [13]. These viruses mainly infect fungi and protozoa. Viruses in the family Partitiviridae infect fungi, plants or apicomplexan protozoa and possess a bipartite genome separately encoding the CP and RdRp [12], [13], [14], [15], [16]. The family Chrysoviridae encompasses viruses with quadripartite genomes that code for RdRp, CP, and two unknown proteins P3 and P4 [12], [13], [17]. Currently, the known host range of chrysoviruses is limited to fungi. Members of family Endornaviridae comprise large dsRNAs encoding a single long polypeptide with typical viral RNA helicases (Hels), UDP-glucosyltransferases (UGTs), and RdRps [18]. Endornaviruses have been reported in plants, fungi, and oomycetes. Recently, several monopartite dsRNA viruses distantly related to totiviruses and partitiviruses were found in plants [19], [20], [21]. In addition, a novel bipartite dsRNA virus and a novel quadripartite dsRNA virus phylogenetically related to chrysoviruses and totiviruses were reported in fungi [22], [23]. These viruses may belong to novel dsRNA viral families [19], [22], [23]. Hence, our understanding of the diversity and host range of dsRNA viruses must rely on the discovery of additional new viruses. The partitiviruses, totiviruses, chrysoviruses, and endornaviruses as well as other related unclassified dsRNA viruses do not have extracellular routes for infection and are transmitted vertically via cell division or horizontally via cell fusions. Therefore, it has been suggested that these viruses may have co-evolved with their hosts [24], [25]. However, Koonin and co-authors [26] suggested that horizontal transmission of viruses between plants and fungi (interkingdom host jumping) might have been particularly important in the evolution of the family Partitiviridae. Indeed, recent phylogenetic analyses based on amino acid sequences of RdRps of partitiviruses suggest that partitiviral horizontal transmission between fungi and plants may have occurred [15]. However, due to limited availability of genomic sequences of representative viruses, evolutionary relationships of these viruses with their hosts remain to be elucidated. Considering that many dsRNA viruses are associated with symptomless or persistent infections of their hosts, samples or materials derived from eukaryotic organisms used to construct cDNA libraries for EST sequencing may carry viruses. Viral RNAs may have been cloned and sequenced together with host RNAs. Therefore, the EST databases may include numerous viral ESTs that are treated as contaminating sequences. These viral ESTs, however, are important to understand the host range and evolution of dsRNA viruses. With this in mind, we performed in silico cloning, a procedure of obtaining full or partial cDNA sequence of a gene by bioinformatics analysis, using the known dsRNA viral sequences as queries to search against NCBI Expressed Sequence Tag (EST) database. In this study, we obtained numerous virus-like sequences related to members of families Endornaviridae, Chrysoviridae, Partitiviridae, and Totiviridae. Many of them were discovered from eukaryotic lineages that were not known to be hosts to these viruses. Furthermore, we conducted comprehensive phylogenetic analysis with these newly identified virus-like sequences and known dsRNA viruses. Results from this study extended our knowledge of the diversity and possible host range of dsRNA viruses and offered insight into the origin and evolution of relevant viruses with their hosts.

Results and Discussion

Identification of novel partitivirus-like sequences

By using the in silico cloning method, we obtained 91 virus-like sequences (contigs or singletons) that were most closely related to members of the family Partitiviridae (Table 1 and Data S1). Among these partitivirus-like sequences, 47 were RdRp-like and 44 were CP-like sequences. Despite the fact that most of them represented only partitiviral genomic fragments, some assembled contigs contains complete or near full-length sequence of RdRp or CP (Fig. 1). Most of the partitivirus-like sequences (18) were discovered from plant cDNA libraries, including those of 10 monocots, 19 eudicots, and 3 conifers, but only two were found in fungal cDNA libraries, although these viruses are common in different fungal species. It is possible that plant materials carrying partitiviruses used to construct cDNA libraries render viral detection more difficult than fungal materials. In fact, the concentration of partitiviruses in plants was often low while it was relatively high in fungi. On the other hand, the currently available fungal EST data are limited.
Table 1

Partitivirus-like sequences discovered from cDNA libraries of eukaryotic organisms.

Organism groupOrganisma) No. of RdRpNo. of CPTotalc)
Contigb) SingletonContigb) Singleton
Plants
monocots Asparagus officinalis (garden asparagus)11
Festuca arundinacea (tall fescue)11
Festuca pratensis (meadow ryegrass)6 (1)43 (2)13
Lolium perenne (perennial ryegrass)11
Lolium multiflorum (Italian ryegrass)123
Avena barbata (slender oat)1 (1)1 (1)2
Secale cereale (rye)112
Leymus cinereus x Leymus triticoides (basin, creeping wild rye)21 (1)3
Pseudoroegneria spicata (beardless wheatgrass)23
Agrostis capillaris (waipu)11
eudicots Triphysaria versicolor (Yellowbeak owl's-clover)11
Triphysaria pusilla (dwarf owl's clover)22
Lactuca perennis (blue lettuce)11
Medicago sativa (alfalfa)1 (1)22
Trifolium pratense (red clover)1 (1)2 (1)3
Quercus mongolica var. crispula (Mongolian oak)213
Eschscholzia californica (California poppy)112
Artemisia annua (sweet wormwood)123
Allium cepa (onion)1 (1)1
Brassica rapa subsp. pekinensis (Chinese cabbage)123
Raphanus raphanistrum subsp. raphanistrum (wild radish)1 (1)214
Raphanus raphanistrum subsp. maritimus (Sea Radish)1 (1)45
Helianthus annuus (common sunflower)1 (1)34
Helianthus paradoxus (Puzzle Sunflower)11
Gossypium hirsutum (upland cotton)11
Chenopodium glaucum (oakleaf goosefoot)11
Cannabis sativa (hemp)21 (1)3
Cynara scolymus (artichoke)11
Salvia fruticosa (Greek sage)33
conifers Picea glauca & Picea sitchensis (white spruce & Sitka spruce)* 1 (1)2 (1)3
Pseudotsuga menziesii var. menziesii (Douglas-fir)11
Fungi Conidiobolus coronatus 11
Pisolithus microcarpus 11
Animals Schistosoma mansoni (flatworms)11
Botrylloides leachi (tunicates)11
Ascaris suum (pig roundworm)112
Phlebotomus papatasi (Sandfly)11
Onychiurus arcticus (Tullberg)11
Mengenilla chobauti (twisted-wing parasites)11
Gadus morhua (Atlantic cod)11
Plants/Fungi Oryza sativa (rice) infected by Magnaporthe grisea 11
Capsicum annuum (Chili Pepper) infected by Pytophthora capsici 112
Total432621271791

Partitivirus-like ESTs were detected from the cDNA libraries of these organisms. Note the possibility that certain viral sequences are possible not from the annotated host organisms.

ESTs from the two species are high identical and are assembled into one contig.

The numbers in parentheses indicate the numbers of contigs which are corresponding to complete or near full-length sequence of RdRp or CP segments. Note the possibility that virus ESTs used to generate a contig are possible not from the same virus.

Several sequences in certain species were possibly corresponding to one viral segment but were not assembled into one contig due to sequencing gaps.

Figure 1

Representative full-length sequences of partitiviral RdRp or CP segments assembled from EST libraries of different plants.

Multiple ESTs were used to construct contigs where overlapping regions of EST sequences show over 97% sequence identity. Bracket indicates that the RdRp and CP-like sequences were possibly from one virus. Rectangular boxes indicate ORFs.

Representative full-length sequences of partitiviral RdRp or CP segments assembled from EST libraries of different plants.

Multiple ESTs were used to construct contigs where overlapping regions of EST sequences show over 97% sequence identity. Bracket indicates that the RdRp and CP-like sequences were possibly from one virus. Rectangular boxes indicate ORFs. Partitivirus-like ESTs were detected from the cDNA libraries of these organisms. Note the possibility that certain viral sequences are possible not from the annotated host organisms. ESTs from the two species are high identical and are assembled into one contig. The numbers in parentheses indicate the numbers of contigs which are corresponding to complete or near full-length sequence of RdRp or CP segments. Note the possibility that virus ESTs used to generate a contig are possible not from the same virus. Several sequences in certain species were possibly corresponding to one viral segment but were not assembled into one contig due to sequencing gaps. Interestingly, though partitiviruses have not been isolated from animals thus far, we discovered 8 partitivirus-like sequences from the cDNA libraries of 7 animal species. Most of these virus-like sequences shared low amino acid (aa) identities (<60%) with those of known partitiviruses, suggesting that they may represent new viral species in the Partitiviridae family. However, some of them, such as a few sequences from wild radish (Raphanus raphanistrum subsp. raphanistrum) and sea radish (R. raphanistrum subsp. maritimus) have high sequence identity (>90% aa) to RdRp or CP of R. sativus cryptic virus 2 (RsCV-2) and RsCV-3, two partitiviruses reported in cultivated radish R. sativus-root cv. Yidianhong, respectively, suggesting that the same or similar viruses infected different host species. Three partitivirus-like sequences were identified in the cDNA libraries of plant samples infected by fungi. These sequences were either from fungal viruses or plant viruses. In addition, since mycoviruses are commonly found in different species of endophytic fungi of grasses [27], we cannot rule out the possibility that some virus-like sequences found in cDNA libraries of plants could actually be derived from viruses of endophytic fungi. In fact, it is yet to be determined if some of the characterized dsRNA viruses from plants are true plant or fungal viruses [13], [28].

Identification of novel toti-, chrys- and endornavirus-like sequences

Eight virus-like sequences that were distinctly related to members of the family Totiviridae were discovered from cDNA libraries of plant, rust fungi, arthropods and diatoms (Table 2 and Data S1). The CP-like sequence in Tamarix androssowii was most closely related to that of black raspberry virus F, a toti-like virus whose sequence is publicly available only in the database. All of the totivirus-like sequences shared only low sequence identity (<50%) with known totiviruses, suggesting that they may represent the genomes of novel totivirus-like species.
Table 2

Totivirus, chrysovirus, endornavirus and Southern tomato virus (STV)-like sequences discovered from cDNA libraries of eukaryotic organisms.

Organism groupOrganisma) No. of totiviral RdRpNo. of totiviral CP
Contigb) SingletonContigb) Singleton
Diatoms Phaeodactylum tricornutum (diatom)1
Fungi Puccinia striiformis f. sp. Tritici (wheat stripe rust)1
Uromyces appendiculatus (Bean rust)2
Animals Lepeophtheirus salmonis (salmon louse)1
Nilaparvata lugens (brown planthopper)1
Epiphyas postvittana (light brown apple moth)1
Plants Tamarix androssowii 1

Virus-like ESTs were detected from the cDNA libraries of these organisms. Note the possibility that certain viral sequences are possible not from the annotated host organisms.

Note the possibility that virus ESTs used to generate a contig are possible not from the same virus.

Virus-like ESTs were detected from the cDNA libraries of these organisms. Note the possibility that certain viral sequences are possible not from the annotated host organisms. Note the possibility that virus ESTs used to generate a contig are possible not from the same virus. Fourteen virus-like sequences from plant cDNA libraries were closely related to Southern tomato virus (STV) and three other related unclassified viruses isolated from plants [19], [20], [21], with genome organizations similar to totiviruses. In addition, an RdRp-like sequence from microsporidian (Antonospora locustae) also was related to these four plant viruses (Table 2 and Data S1). One chrysovirus RdRp-like and two p3 protein-like sequences were found in cDNA libraries of sweet wormwood (Artemisia annua) and garden zinnia (Zinnia violacea) (Table 2 and Data S1). These three sequences were distantly related to known chrysoviruses and each of them was most closely related to different viruses, suggesting that they may be derived from three distinct, novel viruses. We also identified three virus-like sequences that were distantly related to the polyprotein of members of the family Endornaviridae from cDNA libraries of animals, protozoans, and plants (Table 2 and Data S1).

Phylogenetic analysis of partitivirus-like sequences

To evaluate the phylogenetic relationships of the partitivirus-like sequences identified by in silico cloning with known viruses, we constructed maximum likelihood phylogenetic trees with amino acid sequences of RdRp or CP protein sequences. We also added recently reported endogenous partitivirus-like sequences in the phylogenetic analysis [29], [30]. As shown in Fig. 2, the RdRp tree mainly was divided into four large clades: I–IV. The CP tree has similar clades except that the CPs of viruses in clade IV of the RdRp tree formed two distinct clusters: IV-1 and IV-2 (Fig. 3). The virus-like sequences discovered here were distributed within the sub-clades of RdRp and CP trees, strongly suggesting that they were derived from members of the Partitiviridae family. These new viral sequences nearly doubled the amount of partitiviral sequences currently available in the public database, remarkably expanding the known diversity of partitiviruses.
Figure 2

Maximum likelihood tree of partitiviral RdRp-like sequences.

The phylogenetic tree were built using PhyML-mixtures based on a multiple sequence alignment generated using COBALT with the Word Size parameter setting to 3. The base tree was drawn using Interactive Tree Of Life Version 2.1.1 (http://itol.embl.de/). The tree is midpoint rooted. The p-values of approximate likelihood ratios (SH-aLRT) plotted as circle marks on the branches (only p-values>0.5 are indicated) and circle size is proportional to the the p-values. The p-values for the branches relevant to our conclusions are shown. Scale bars correspond to 0.2 amino acid substitutions per site.

Figure 3

Maximum likelihood tree of partitiviral CP-like sequences.

The phylogenetic tree were built using PhyML-mixtures based on a multiple sequence alignments generated using COBALT with the Word Size parameter setting to 3 and Constraint E-value parameter setting to 0.1. The base tree was drawn using Interactive Tree Of Life Version 2.1.1 (http://itol.embl.de/). The tree is midpoint rooted. The p-values of approximate likelihood ratios (SH-aLRT) plotted as circle marks on the branches (only p-values>0.5 are indicated) and circle size is proportional to the p-values. The p-values for the branches relevant to our conclusions are shown. Scale bars correspond to 0.2 amino acid substitutions per site.

Maximum likelihood tree of partitiviral RdRp-like sequences.

The phylogenetic tree were built using PhyML-mixtures based on a multiple sequence alignment generated using COBALT with the Word Size parameter setting to 3. The base tree was drawn using Interactive Tree Of Life Version 2.1.1 (http://itol.embl.de/). The tree is midpoint rooted. The p-values of approximate likelihood ratios (SH-aLRT) plotted as circle marks on the branches (only p-values>0.5 are indicated) and circle size is proportional to the the p-values. The p-values for the branches relevant to our conclusions are shown. Scale bars correspond to 0.2 amino acid substitutions per site.

Maximum likelihood tree of partitiviral CP-like sequences.

The phylogenetic tree were built using PhyML-mixtures based on a multiple sequence alignments generated using COBALT with the Word Size parameter setting to 3 and Constraint E-value parameter setting to 0.1. The base tree was drawn using Interactive Tree Of Life Version 2.1.1 (http://itol.embl.de/). The tree is midpoint rooted. The p-values of approximate likelihood ratios (SH-aLRT) plotted as circle marks on the branches (only p-values>0.5 are indicated) and circle size is proportional to the p-values. The p-values for the branches relevant to our conclusions are shown. Scale bars correspond to 0.2 amino acid substitutions per site. The partitivirus-like sequences derived from cDNA libraries of fungi were most closely related to known fungal viruses (Fig. 2). The sequences discovered from cDNA libraries of animals Sandfly (Phlebotomus papatasi) and pig roundworm (Ascaris suum) were most closely related to integrated viral sequences in animal genomes (Fig. 2). Likewise, most of the new partitivirus-like sequences from cDNA libraries of plant samples clustered with reported plant viruses and/or integrated viruses in plant genomes. The plant virus-like sequences were generally most similar to each other and formed distinct clusters (Fig. 2 and 3). A more reasonable explanation for these results is that these partitivirus-like sequences were indeed derived from the current annotated organisms, although it is also possible that a trace fungal entophytes in the tissues of annotated organism might be the true source for certain sequences. Clade I and II mainly consisted of mixtures of partitivirus-like sequences from plants and fungi. The mosaic distribution of plant and fungal viruses indicates the possibility of viral transmission between plants and fungi. Moreover, many of the plant viral clusters were composed of viruses from different plant families or classes (monocots or eudicots) but their phylogenies seem not to be topologically congruent with that of their hosts, suggesting that these plant partitiviruses may not be ancient origin. Considering that the branches of fungal viruses were generally locating at the base of plant viral clades (Fig. 2 and 3), it is likely that these plant partitiviruses evolved from viruses of fungi by interkingdom host jumping. Clade IV was mainly composed of plant partitivirus-like sequences while clade IV was mainly composed of fungal partitivirus-like sequences (Fig. 2 and 3). Some of animal viral RdRp-like sequences were distantly related to those from plants and fungi and their branches were clustering deeply within each clade (Fig. 2). Furthermore, four partitivirus-like sequences from animals and protozoans clustered together and formed an extra small clade branching deeply in the RdRp tree (Fig. 2). The point of divergence of plant, fungi and animal virus-like sequences correspond to deep branching in the phylogenetic tree, implying that virus-host co-evolution seems to be possible although it is impossible to link them to any specific time scale. However, RdRp-like sequences from cDNA libraries of Tullberg (Onychiurus arcticus) and slender oat (Avena barbata) clustered together in clade III (Fig. 2). Likewise, the CP-like sequences from cDNA libraries of flatworm (Schistosoma mansoni) and twisted-wing parasite (Mengenilla chobauti) were most closely related to those of certain fungal viruses in clade I and III (Fig. 3). If these sequences were indeed derived from viruses infecting the annotated hosts, these results clearly suggest that horizontal transmission of these viruses occurred between animals and fungi or between animals and plants.

Phylogenetic analysis of toti- and chrysovirus-like sequences

We constructed phylogenetic tree for the toti- and chrysovirus-like sequences discovered here with all available RdRps from members of the families Chrysoviridae and Totiviridae as well as other totivirus-related unclassified viruses. Recently identified endogenous totivirus-like sequences [29] were also included in this analysis. We found that toti- and chrysovirus-like sequences actually comprise diverse viral lineages (Fig. 4). Most of the newly identified virus-like sequences were placed within the sub-clades of phylogenetic tree strongly suggesting that they were derived from members of these viral families. All STV-like sequences from plants and fungi clustered together and constituted a distinct clade (clade I). The two fungal STV-like sequences located at the base of this clade, which was more distantly related to plant sequences, suggesting that these viruses may have co-evolved with their host (fungi and plants). Similarly, the deep branching of virus-like sequence from diatoms, protozoans and fungi in clade II and these from animals in clade III is also likely to be the result of coevolution between viruses and hosts. However, it is difficult to determine how deep the co-evolution is. The three plant viruses in clade IV, however, were possibly originated from fungal viruses via interkingdom host jumping, because their branches located within cluster of fungal viruses. Likewise, the unclassified Cucurbit yellow-associated virus may have evolved from insect viruses of clade V (Fig. 4), if this virus is indeed a plant virus.
Figure 4

Maximum likelihood tree of totiviral, chrysoviral, and STV RdRp-like sequences.

The phylogenetic tree was built using PhyML-mixtures based on a multiple sequence alignment generated using COBALT with the Word Size parameter setting to 3. The base tree was drawn using Interactive Tree Of Life Version 2.1.1 (http://itol.embl.de/). The totivirus-like group represented by Southern tomato virus (STV) was chosen as outgroup for presentation purposes. The p-values of approximate likelihood ratios (SH-aLRT) plotted as circle marks on the branches (only p-values>0.5 are indicated) and circle size is proportional to the p-values. The p-values for the branches relevant to our conclusions are shown. Scale bars correspond to 0.2 amino acid substitutions per site. Asterisk (*) indicates the chrysoviral clade.

Maximum likelihood tree of totiviral, chrysoviral, and STV RdRp-like sequences.

The phylogenetic tree was built using PhyML-mixtures based on a multiple sequence alignment generated using COBALT with the Word Size parameter setting to 3. The base tree was drawn using Interactive Tree Of Life Version 2.1.1 (http://itol.embl.de/). The totivirus-like group represented by Southern tomato virus (STV) was chosen as outgroup for presentation purposes. The p-values of approximate likelihood ratios (SH-aLRT) plotted as circle marks on the branches (only p-values>0.5 are indicated) and circle size is proportional to the p-values. The p-values for the branches relevant to our conclusions are shown. Scale bars correspond to 0.2 amino acid substitutions per site. Asterisk (*) indicates the chrysoviral clade. The chrysovirus-like sequences clustered together and formed a distinct clade in the phylogenetic tree (Fig. 4). The only two sequences from plants branching at the base of one sub-clade, possible represent the co-evolved viral lineage in plants.

Phylogenetic analysis of endornavirus-like sequences

Phylogenetic analysis of endornavirus-like sequences shown that the plant viral sequences were generally clustered together and fungal viral sequences were generally branched at the base of plant viral clusters (Fig. 5). This is likely to be the results of coevolution of viruses with their hosts. However, horizontal transmission of viruses may have occurred between plants and fungi or chromalveolates as the fungal virus Helicobasidium mompa dsRNA virus N10 and two chromalveolate viruses: Phytophthora endornavirus 1 and Ichthyophthirius multifiliis contig were most closely related to certain plant viruses, respectively (Fig. 5). The virus-like sequence from Sea lice (Caligus rogercresseyi) clustered with those of two fungal endornaviruses of Helicobasidium mompa and thereby possibly evolved from fungal viruses. The virus-like sequence from white spruce (Picea glauca) was distantly related to other endornavirus-like sequences, possibly representing a new dsRNA viral lineage related to endornaviruses.
Figure 5

Maximum likelihood tree of polyproteins of endornavirus-like sequences.

The phylogenetic tree was built using PhyML-mixtures based on a multiple sequence alignment generated using COBALT with the default parameter setting. The tree was rooted with closteroviruses Pineapple mealybug wilt-associated virus 1 and Plum bark necrosis and stem pitting-associated virus, which belong to the “alpha-like” supergroup of ss(+)RNA viruses. Only p values for the approximate likelihood ratios (SH test) of >0.5 (50%) are indicated. Scale bars correspond to 0.5 amino acid substitution per site.

Maximum likelihood tree of polyproteins of endornavirus-like sequences.

The phylogenetic tree was built using PhyML-mixtures based on a multiple sequence alignment generated using COBALT with the default parameter setting. The tree was rooted with closteroviruses Pineapple mealybug wilt-associated virus 1 and Plum bark necrosis and stem pitting-associated virus, which belong to the “alpha-like” supergroup of ss(+)RNA viruses. Only p values for the approximate likelihood ratios (SH test) of >0.5 (50%) are indicated. Scale bars correspond to 0.5 amino acid substitution per site.

The potential host range of dsRNA viruses

The dsRNA virus-like sequences discovered here are either from integrated viral sequences or infecting viruses. Although it is not certain whether all of the virus-like sequences are indeed derived from the annotated host organisms, they may indicate the potential hosts for these viruses and extend the possible host range of viruses. Members of the family Partitiviridae commonly occur in plants and fungi. To date, only one member of this family, the Cryptosporidium parvum virus 1, was found to infect apicomplexan protozoa of genus Cryptosporidium. The discovery of partitivirus-like sequences from animal cDNA libraries in this study together with our previous finding of endogenous partitiviral sequences in arthropod genomes [29] clearly suggest that these viruses can also infect animals (Fig. 6).
Figure 6

A tree of eukaryotes showing known host range of partitvirus-, totivirus-, chrysovirus- and endornavirus-like viruses.

This tree was drawn base on The Tree of Life Web Project (http://tolweb.org/).

A tree of eukaryotes showing known host range of partitvirus-, totivirus-, chrysovirus- and endornavirus-like viruses.

This tree was drawn base on The Tree of Life Web Project (http://tolweb.org/). Totiviruses are known to infect fungi and protozoa and recently found in arthropods [11], [31], [32], [33] and fish [34]. In addition, three totiviruses and four STV-like viruses from plants have been published [19], [20], [21], [35] or are publicly available in the database. Our study further extended the possible host range of these viruses to other eukaryotic lineages (Fig. 6). Though the only known hosts of viruses in the family Chrysoviridae are fungi, Anthurium mosaic-associated virus, a chryso-like virus infecting monocots is available in the database. Furthermore, we discovered three chryso-like sequences from two dicot plant species, suggesting that plants also are possible hosts of chrysoviruses. Endornaviruses were originally discovered in plants and later found in fungi and Stramenopiles [18]. Our finding further extends the potential host range of these viruses to animals (Fig. 6). Altogether, totiviruses and STV-like viruses have the broadest host range, including four of the five supergroups of eukaryotes: Unikonta, Plantae, Chromalveolata and Excavata [36]. However, chrysoviruses were only distributed in the former two supergroups. The potential host range of partitiviruses and endornaviruses includes three eukaryotic supergroups: Unikonta, Plantae, Chromalveolata. With the discovery of more and more viruses, the host range of these viruses will be further extended.

Interaction and evolution of dsRNA viruses with their hosts

The partiti-, toti-, chryso-, and endornaviruses have similar features that they are transmitted vertically via spores and do not have extracellular routes for infection and have little or no effects on their hosts [12], [13]. These characteristics indicate that these viruses may have co-evolved with their hosts [24], [25], but there is no clear phylogenetic evidence supporting this. These viruses were initially found in fungi or plants but shown in recent reports and this study to be widely distributed in eukaryotic supergroups. The origins of Partitiviridae and Totiviridae have been revealed to be ancient and antedate the radiation of eukaryotic supergroups [26]. Our phylogenetic analysis showed that some lineages of these dsRNA families from different host supergroups are branching deeply in the phylogenetic tree. The phylogenetic pattern indicates that these viral lineages may have co-evolved with their host supergroups. However, it is difficult to determine how deep the co-evolution is. The extent of co-evolution could either be complete co-evolution that virus split corresponding to the host split or partial co-evolution that ancestral viruses transmitted between different host supergroups followed by co-diverged with respective hosts. Our data also revealed that recent horizontal transmission of these viruses may have occurred between different host supergroups (such as fungi and plants). Although interkingdom host jumping is not occurred easily because it would require entry into the germline, there is sufficient opportunity to occur during the long evolutionary history with their hosts.

A simple, effective approach for discovery of novel viruses

In this study, by using the in silico cloning approach, we discovered numerous novel virus-like sequences from NCBI EST database, representing members of families Partitiviridae, Totiviridae, Endornaviridae, and Chrysoviridae. The new sequences identified in this study are either from novel viruses or from known but yet unsequenced viruses. These viral sequences provide extended the potential host range of related viruses and help to shed light on their origin and evolution. We failed to find EST sequences similar to three other dsRNA families, Birnaviridae, Picobirnaviridae and Reoviridae. It is possible that samples for cDNA library construction containing viruses are easily detected due to the fact that these viruses generally cause serious disease on their hosts. This method we used seemed to be more effective for identification of plant viruses. Considering the extensive diversity of plant viruses and the rapid increase of plant EST sequencing, the in silico cloning approach may has a broad application prospect in plant virology. In fact, similar approaches were used to identify several families of plant positive-sense ssRNA viruses. Numerous EST sequences similar to these viruses were also found. Many of them may belong to new species or genera. We also found many plant virus-like sequences from cDNA libraries of insects and nematodes, suggesting new viral vectors or new host range for these plant viral lineages (Liu Huiquan et al., unpublished data).

Conclusions

In this study, we demonstrated the application of the in silico cloning approach for discovery of novel dsRNA viral sequences. By using this method, we obtained 91 partitivirus-like, 22 toti- or STV-like, 3 chrysovirus-like and 3 endornavirus-like novel sequences. Some of these virus-like sequences were discovered from eukaryotic lineages which are not known to be hosts for these viruses. Furthermore, phylogenetic analysis of these new virus-like sequences with those of known dsRNA viruses revealed that these viruses may have co-evolved with respective host supergroups over a long evolutionary time frame while potential viral horizontal transmission was also likely to be occurred between different host supergroups. The phylogenetic analysis also revealed that some of plant partitiviruses may have originated from mycoviruses by interkingdom host jumping. Our findings extend the diversity and possible host range of dsRNA viruses and offers insight into the origin and evolution of relevant viruses with their hosts.

Materials and Methods

DsRNA viruses cloning in silico

We firstly selected and downloaded the protein sequences derived from representative viruses in each genus of dsRNA families, Birnaviridae, Picobirnaviridae, Reoviridae Endornaviridae, Chrysoviridae, Partitiviridae, and Totiviridae from viral genome databases at the NCBI website (http://www.ncbi.nlm.nih.gov/genomes/GenomesHome.cgi?taxid=10239). These viral sequences were then used as seed queries to search against the NCBI EST database by Netblast (blastcl3) program with tBLASTn strategy. All non-redundant matches from these searches with E-values≤1e-5 were extracted and were divided into different groups according to different sources of species. The ESTs in each group were used to construct contigs with CAP3 sequence assembly program where overlapping regions of EST sequences show at least 97% sequence identity. The resulting contigs and singletons were used as BLASTx queries against the non-redundant (NR) protein database to confirm the assembly quality and the relationships between these and the known viruses. If the contigs and singletons from one species have high sequence identity (95% DNA) with the reported viruses in the same species, these sequences were discarded and not analyses further. The analyses were completed by March 2011.

Sequence alignment and phylogenetic analysis

The software package DNAMAN 7 (Lynnon Biosoft, USA) was used for sequence annotations, including nucleotide statistics and ORF searching. The putative peptides of viral contigs and singletons were obtained according to BLASTx hits and ORF predictions were checked manually. Multiple alignments of protein sequences were constructed using COBALT [37] and manually edited. To give the best alignment, the alignment parameter Constraint E-value and Word Size were adjusted for different datasets. Maximum likelihood (ML) phylogenies were estimated using amino acid sequence alignments with PhyML-mixtures [38], [39], assuming the EX2 mixture model [39] and SPR tree topologies search strategy. Gaps in alignment are systematically treated as unknown characters. The reliability of internal branches was evaluated based on SH-like approximate likelihood ratio test (SH-aLRT) statistics. Summary of virus-like EST sequences obtained by cloning . (XLS) Click here for additional data file.
  31 in total

1.  Isolation and full-length sequence analysis of Armigeres subalbatus totivirus, the first totivirus isolate from mosquitoes representing a proposed novel genus (Artivirus) of the family Totiviridae.

Authors:  Yougang Zhai; Houssam Attoui; Fauziah Mohd Jaafar; Huan-Qin Wang; Yu-Xi Cao; Suo-Ping Fan; Yang-Xin Sun; Li-Dong Liu; Peter P C Mertens; Wei-San Meng; David Wang; Guodong Liang
Journal:  J Gen Virol       Date:  2010-08-11       Impact factor: 3.891

2.  A novel monopartite dsRNA virus from rhododendron.

Authors:  S Sabanadzovic; N Abou Ghanem-Sabanadzovic; R A Valverde
Journal:  Arch Virol       Date:  2010-08-19       Impact factor: 2.574

3.  COBALT: constraint-based alignment tool for multiple protein sequences.

Authors:  Jason S Papadopoulos; Richa Agarwala
Journal:  Bioinformatics       Date:  2007-03-01       Impact factor: 6.937

4.  Plant-feeding insects harbor double-stranded RNA viruses encoding a novel proline-alanine rich protein and a polymerase distantly related to that of fungal viruses.

Authors:  Allyn Spear; Mark S Sisterson; Raymond Yokomi; Drake C Stenger
Journal:  Virology       Date:  2010-06-11       Impact factor: 3.616

5.  Blueberry latent virus: an amalgam of the Partitiviridae and Totiviridae.

Authors:  Robert R Martin; Jing Zhou; Ioannis E Tzanetakis
Journal:  Virus Res       Date:  2010-10-01       Impact factor: 3.303

6.  A novel bipartite double-stranded RNA Mycovirus from the white root rot Fungus Rosellinia necatrix: molecular and biological characterization, taxonomic considerations, and potential for biological control.

Authors:  Sotaro Chiba; Lakha Salaipeth; Yu-Hsin Lin; Atsuko Sasaki; Satoko Kanematsu; Nobuhiro Suzuki
Journal:  J Virol       Date:  2009-10-14       Impact factor: 5.103

7.  Southern tomato virus: The link between the families Totiviridae and Partitiviridae.

Authors:  Sead Sabanadzovic; Rodrigo A Valverde; Judith K Brown; Robert R Martin; Ioannis E Tzanetakis
Journal:  Virus Res       Date:  2009-01-22       Impact factor: 3.303

8.  Widespread endogenization of genome sequences of non-retroviral RNA viruses into plant genomes.

Authors:  Sotaro Chiba; Hideki Kondo; Akio Tani; Daisuke Saisho; Wataru Sakamoto; Satoko Kanematsu; Nobuhiro Suzuki
Journal:  PLoS Pathog       Date:  2011-07-14       Impact factor: 6.823

9.  Origin, adaptation and evolutionary pathways of fungal viruses.

Authors:  S A Ghabrial
Journal:  Virus Genes       Date:  1998       Impact factor: 2.332

10.  Rapid identification of known and new RNA viruses from animal tissues.

Authors:  Joseph G Victoria; Amit Kapoor; Kent Dupuis; David P Schnurr; Eric L Delwart
Journal:  PLoS Pathog       Date:  2008-09-26       Impact factor: 6.823

View more
  17 in total

1.  Amalga-like virus infecting Antonospora locustae, a microsporidian pathogen of grasshoppers, plus related viruses associated with other arthropods.

Authors:  Jesse D Pyle; Patrick J Keeling; Max L Nibert
Journal:  Virus Res       Date:  2017-03-04       Impact factor: 3.303

2.  Nucleotide sequence of Zygosaccharomyces bailii virus Z: Evidence for +1 programmed ribosomal frameshifting and for assignment to family Amalgaviridae.

Authors:  Delphine Depierreux; Minh Vong; Max L Nibert
Journal:  Virus Res       Date:  2016-03-04       Impact factor: 3.303

3.  In silico cloning and B/T cell epitope prediction of triosephosphate isomerase from Echinococcus granulosus.

Authors:  Fen Wang; Bin Ye
Journal:  Parasitol Res       Date:  2016-06-09       Impact factor: 2.289

4.  The dsRNA Virus Papaya Meleira Virus and an ssRNA Virus Are Associated with Papaya Sticky Disease.

Authors:  Tathiana Ferreira Sá Antunes; Raquel J Vionette Amaral; José Aires Ventura; Marcio Tadeu Godinho; Josiane G Amaral; Flávia O Souza; Poliane Alfenas Zerbini; Francisco Murilo Zerbini; Patricia Machado Bueno Fernandes
Journal:  PLoS One       Date:  2016-05-11       Impact factor: 3.240

5.  The alkalophilic fungus Sodiomyces alkalinus hosts beta- and gammapartitiviruses together with a new fusarivirus.

Authors:  Lenka Hrabáková; Alexey A Grum-Grzhimaylo; Igor Koloniuk; Alfons J M Debets; Tatiana Sarkisova; Karel Petrzik
Journal:  PLoS One       Date:  2017-11-29       Impact factor: 3.240

6.  Identification of Two Novel Amalgaviruses in the Common Eelgrass (Zostera marina) and in Silico Analysis of the Amalgavirus +1 Programmed Ribosomal Frameshifting Sites.

Authors:  Dongbin Park; Chul Jun Goh; Hyein Kim; Yoonsoo Hahn
Journal:  Plant Pathol J       Date:  2018-04-01       Impact factor: 1.795

7.  A +1 ribosomal frameshifting motif prevalent among plant amalgaviruses.

Authors:  Max L Nibert; Jesse D Pyle; Andrew E Firth
Journal:  Virology       Date:  2016-09-04       Impact factor: 3.616

Review 8.  Origins and evolution of viruses of eukaryotes: The ultimate modularity.

Authors:  Eugene V Koonin; Valerian V Dolja; Mart Krupovic
Journal:  Virology       Date:  2015-03-12       Impact factor: 3.616

9.  Virus-host co-evolution under a modified nuclear genetic code.

Authors:  Derek J Taylor; Matthew J Ballinger; Shaun M Bowman; Jeremy A Bruenn
Journal:  PeerJ       Date:  2013-03-05       Impact factor: 2.984

10.  dsRNA-seq Reveals Novel RNA Virus and Virus-Like Putative Complete Genome Sequences from Hymeniacidon sp. Sponge.

Authors:  Syun-Ichi Urayama; Yoshihiro Takaki; Daisuke Hagiwara; Takuro Nunoura
Journal:  Microbes Environ       Date:  2020       Impact factor: 2.912

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.