Literature DB >> 19812076

Sigma viruses from three species of Drosophila form a major new clade in the rhabdovirus phylogeny.

Ben Longdon1, Darren J Obbard, Francis M Jiggins.   

Abstract

The sigma virus (DMelSV), which is a natural pathogen of Drosophila melanogaster, is the only Drosophila-specific rhabdovirus that has been described. We have discovered two new rhabdoviruses, D. obscura and D. affinis, which we have named DObsSV and DAffSV, respectively. We sequenced the complete genomes of DObsSV and DMelSV, and the L gene from DAffSV. Combining these data with sequences from a wide range of other rhabdoviruses, we found that the three sigma viruses form a distinct clade which is a sister group to the Dimarhabdovirus supergroup, and the high levels of divergence between these viruses suggest that they deserve to be recognized as a new genus. Furthermore, our analysis produced the most robustly supported phylogeny of the Rhabdoviridae to date, allowing us to reconstruct the major transitions that have occurred during the evolution of the family. Our data suggest that the bias towards research into plants and vertebrates means that much of the diversity of rhabdoviruses has been missed, and rhabdoviruses may be common pathogens of insects.

Entities:  

Mesh:

Substances:

Year:  2009        PMID: 19812076      PMCID: PMC2842628          DOI: 10.1098/rspb.2009.1472

Source DB:  PubMed          Journal:  Proc Biol Sci        ISSN: 0962-8452            Impact factor:   5.349


Introduction

Rhabdoviruses are single-stranded negative sense RNA viruses in the order Mononegavirales. The family is diverse and has a wide host range, infecting plants, invertebrates and vertebrates (ICTVdB 2006). Rhabdoviruses were originally classified as a family based on their shared bullet-shaped morphology and on serological evidence, but genome sequencing has since confirmed their shared ancestry (Fu 2005). The rhabdoviruses are divided into six genera. The genus Lyssavirus infects a range of mammals and includes the rabies virus. The genera Cytorhabdovirus and Nucleorhabdovirus are arthropod-vectored and infect plants, while the genus Novirhabdovirus infects various species of fish. Members of the genera Vesiculovirus and Ephemerovirus infect a wide range of animals including fishes, invertebrates and mammals, and together form the dimarhabdovirus super group (Bourhy ). A large proportion of the known dimarhabdoviruses have been isolated from vertebrates and arthropods, which are thought to vector them. The full diversity of the rhabdovirus family is unknown because of a strong sampling bias towards lineages of agronomic and medical importance (Fu 2005; Ammar ). One area of neglect is the study of rhabdoviruses in arthropod hosts. As the majority of known dimarhabdoviruses, cytorhabdoviruses and nucleorhabdoviruses are arthropod-vectored (often insect-vectored), by studying arthropod-specific rhabdoviruses we may be able to understand how and why these viruses evolved traits such as virulence towards vertebrates. The only arthropod-specific rhabdovirus that has been described to date is the sigma virus (DMelSV), which is a natural pathogen of Drosophila melanogaster (L'Heritier & Teissier 1937; Contamine & Gaumer 2008). Sigma has an unusual mode of transmission, in that it is only transmitted vertically (through both eggs and sperm), and does not move horizontally between hosts. It was initially placed in the Rhabdoviridae based on its bullet-shaped viral particles (Berkalof ; Teninges 1968), and this has subsequently been confirmed using sequence data (Bjorklund ). However, only about half of DMelSV's approximately 12.7 kb genome has previously been sequenced (Teninges ), and the full sequence of the L gene—which encodes the RNA-dependant RNA polymerase (RDRP)—is unknown (Huszar & Imler 2008). This has hampered phylogenetic analyses of DMelSV because the L gene contains conserved domains that are useful in determining the evolutionary relationships between distantly related viruses (Poch et al. 1989, 1990; Bourhy ). Previous phylogenies that have included DMelSV have been based on the less-conserved N gene, but many have lacked strong statistical support or only included a few closely related viruses. This may explain why the different studies have found conflicting results, either placing DMelSV as a sister group to the vesiculoviruses, or as an outgroup to the ephemeroviruses and vesiculoviruses (Bjorklund ; Hogenhout ; Fu 2005; Kuzmin ). It is possible that rhabdoviruses may be common pathogens in insect populations. Flies infected with DMelSV become paralysed or die on exposure to high concentrations of CO2, whereas uninfected flies recover, and similar symptoms occur when other rhabdoviruses are injected into mosquitoes or Drosophila (Rosen 1980; Shroyer & Rosen 1983). It has also been noted that aphids have reduced longevity after CO2 exposure following rhabdovirus injection (Sylvester & Richardson 1992). There have been reports of CO2 sensitivity occurring in at least 15 other species of Drosophila (Brun & Plus 1980) and in Culex mosquitoes (Shroyer & Rosen 1983), suggesting that rhabdoviruses may be common in insects. The most extensive of these studies looked at CO2 sensitivity in D. affinis and D. athabasca, and found that the sensitivity was caused by a vertically transmitted infectious agent (Williamson 1959, 1961). However, it is not known if this agent is a rhabdovirus, as other viruses (e.g. DXV which was isolated from cell culture) can also cause sensitivity to anoxia in Drosophila (Teninges ). In this study we have identified two new rhabdoviruses associated with CO2 sensitivity in D. obscura and D. affinis. To see where these new Drosophila rhabdoviruses are placed within the phylogeny, we sequenced the L gene from all viruses. In addition, we have completed the genome sequence of DMelSV and the new virus in D. obscura. The L gene of these viruses was combined with all the rhabdoviruses L gene sequences available from public databases to produce the most comprehensive phylogeny of the Rhabdoviridae published to date.

Material and methods

Identifying and sequencing viruses

Drosophila affinis were collected from Raleigh NC, USA and D. obscura were collected from Essex, UK in the summer/autumn of 2007. Flies were collected by netting from yeasted fruit baits, and isofemale lines were created by placing single females in a vial of Drosophila medium and allowing them to lay eggs. Offspring were then exposed to pure CO2 for 15 min at 12°C, then placed at room temperature and examined 30 min later. The lines where the flies were dead or paralysed were used for RNA extractions. CO2-sensitive lines were stabilized (Brun & Plus 1980) by selecting female offspring that transmitted the virus to 100 per cent of their offspring and maintained in the laboratory for over 15 generations. RNA was also extracted from two lines of D. melanogaster infected with the Hap23 and Ap30 isolates of DMelSV (Gay 1978; Carpenter ). Ap30 is known to be genetically distinct from all the other DMelSV isolates that have been sequenced. Total RNA was extracted using Trizol reagent (Invitrogen Corp, San Diego, CA, USA) in a chloroform–isoproponal extraction. RNA was then reverse-transcribed with MMLV reverse transcriptase (Invitrogen Corp) using random hexamer primers. The L gene of rhabdoviruses contains highly conserved domains (Poch et al. 1989, 1990), and is the most conserved gene in rhabdoviruses and other non-segmented negative sense RNA viruses (Fu 2005). This conservation is useful in designing polymerase chain reaction (PCR) primers which will work on a range of rhabdoviruses. Rhabdovirus L gene sequences were downloaded from GenBank and were aligned (as amino acids) using ClustalW. We manually designed degenerate primers that are conserved across most of the dimarhabdoviruses (electronic supplementary material, table S1). PCR reactions with all primer combinations were carried out using a touchdown PCR cycle. PCR products were treated with exonuclease 1 and shrimp alkaline phosphatase to remove unused PCR primers and dNTPs, and then sequenced directly using BigDye reagents (ABI, Carlsbad California, USA) on an ABI capillary sequencer. A sequence's similarity to rhabdoviruses was confirmed using a tBLASTN search of GenBank. Once a small region of the L gene had been sequenced, 3′ RACE (rapid amplification of cDNA ends) was used to reach the 3′-end of the L gene mRNA. RNA was reverse-transcribed using superscript (Invitrogen Corp) and a T linker primer (5′- GATCGAT[17]VN -3′). Products were then purified using a PCR purification column kit (Qiagen Corp, MD, USA), and concentrated to a volume of 10–20 µl in a rotoevaporator. A PCR reaction (Long-Range PCR kit, Invitrogen Corp) was carried out using 2 µl of the cDNA using a T-linker primer and a gene-specific forward primer. In some cases a nested PCR was required on the first PCR (which was diluted 1:10 first). Products were then sequenced by primer walking and sequences were assembled using Sequencher (v. 4.5/4.8; Gene Codes Corp). To obtain the remainder of the L gene and to attempt to obtain the rest of the genome, 3′-RACE was carried out on the viral genome itself. A polyA tail was added to the 3′-end of the virus using polyA polymerase (PAP). Approximately 5 µg of total RNA, 4 units (0.8 µl) PAP (New England Biolabs), 2 µl 10× PAP buffer, 2 µl rATP (10 mM) (Promega Corporation) and RNase-free water to 20 µl was incubated at 37°C for 40 min. The RNA was then purified using a spin column kit (Zymo clean, Cambridge Biosciences, UK). The eluted RNA was then reverse-transcribed using superscript (Invitrogen Corp) and a T linker primer. A PCR reaction was carried out using 2 µl of the cDNA using a T-linker and a gene-specific primer (Long Range PCR kit, Invitrogen Corp). In some cases a nested PCR was required on the first PCR (which was diluted 1:10 first). Products were then sequenced by primer walking using the methods described above. Although most of the 3′-end of the DMelSV genome has already been sequenced, the 3′ leader sequence is unknown. Therefore, we also used this approach to acquire the DMelSV leader sequence. To obtain the 5′ genomic trailer sequence, and to determine the N gene transcription initiation site 5′-RACE was used. For the 5′-RACE on the viral genome, a gene-specific primer was used for a reverse transcription reaction using superscript RT (Invitrogen Corp), whereas for the 5′-RACE on mRNA sequences a T-linker primer was used. Two 5′-RACE methods were used. In the first 1 µl of BSA (20×) and 1 µl of Manganese (20×) were added to the reverse transcription reaction. Twenty microlitres of the cDNA was then incubated overnight at 16°C with 5 µl buffer 2 (New England Biolabs), 6 µl dNTPs (2 mM), 1 µl Klenow enzyme (5000 U µl−1) (New England Biolabs), 1 µl (50 µM) TS-short primer (5′-GGTCTGGAGCTAGTGTTGTGGG-3′) and 17 µl water. This was then purified in a spin column PCR purification kit (Qiagen Corp), and was used with a gene-specific primer and the TS short primer for PCR amplification. In the second method the cDNA was first purified using a spin column purification kit (Qiagen Corp). A polyA tail was added to the cDNA by incubating 21.5 µl of the purified cDNA with 1 µl of terminal transferase (30 U µl−1) (Promega Corp), 6 µl 5× terminal transferase buffer and 1.5 µl of dATP (2 mM) at 37°C for 40 min, then at 70°C for 10 min. This was then used for PCRs with a gene-specific primer and a T-linker primer. In both methods nested PCRs on the first round of PCRs was required, after diluting the samples 1:10. Once the initial sequences from the RACE were obtained, new primers were designed along the length of the gene. These were used for PCR reactions on random-hexamer reverse-transcribed cDNA, which were then sequenced in both directions (see above) to obtain high-quality sequence data. The GenBank accession numbers for our new sequences are GQ375258 (DMelSV-HAP23), AM689309 (DMelSV-Ap30), GQ410979 (DObsSV) and GQ410980 (DAffSV).

Phylogenetic analysis

To infer the phylogeny, we obtained all the available full-length L gene sequences from GenBank. L gene coding sequences and the three sigma virus L gene sequences were aligned as translated amino acid sequences using ClustalW. As some of the sequences are highly divergent, we employed three different approaches for aligning the sequences to ensure our results were robust and not sensitive to the alignment used. First, we aligned the full-length L gene sequences from all of the viruses, then the most conserved region of the L gene from all of the viruses (corresponding to nucleotides 1284–3862 of rabies virus L gene coding region, GenBank accession NC_001542), and finally the full L gene sequences of the dimarhabdoviruses (with three lyssaviruses as an outgroup). The conserved region and the alignments of the dimarhabdoviruses alignments are likely to be the most robust, as they do not include very different sequences that are hard to align. Human parainfluenza virus 1 was also included in the first two of these alignments as an outgroup to root the tree. The phylogeny of these sequences was reconstructed using both a Bayesian and a maximum-likelihood approach. Bayesian posterior support values are less conservative than maximum-likelihood bootstrap support, and so both the values can be used as an upper and lower support for nodes (Douady ). In addition to nucleotide models, we ran Bayesian analysis with amino acid sequences and models of protein evolution. In total we carried out nine analyses, using three different methods of inference and three different sequence alignments. Phylogenies were created from the amino acid alignments translated back into nucleotides. For the maximum-likelihood trees, ModelTest (v. 3.7) (Posada & Crandall 1998) was used to estimate the model of sequence evolution and the analysis was run in PAUP (v. 4.0b10) (Swofford 1993). A parsimony tree created from tree bisection and reconnection with a heuristic search was used as a starting tree for the maximum-likelihood analysis. A general time-reversible model with a gamma distribution of rate variation and proportion of invariable sites was used. The maximum-likelihood analysis used a heuristic search with a nearest neighbour interchange algorithm. The substitution rate parameters, shape of the gamma distribution and proportion of invariable sites used were those estimated by ModelTest. Support for the nodes was calculated by bootstrapping and trees were drawn using FigTree (v. 1.2; http://tree.bio.ed.ac.uk/software/figtree/). Bayesian trees were created using the MrBayes program (v. 3.1.2) (Huelsenbeck & Ronquist 2001). A general time-reversible model was used with a gamma distribution and a proportion of invariable sites, with parameters estimated from the data during the analysis. As there is likely to be a considerable amount of noise from third codon positions between these divergent sequences, a site-specific rate model was used allowing each codon position to have its own rate. Two runs of four chains were run for 2 000 000 MCMC generations (20 000 000 for the conserved region tree), with trees being sampled every 100 generations. In addition, Bayesian amino acid trees were created using the MrBayes programme (v. 3.1.2) (Huelsenbeck & Ronquist 2001). A fixed rate model of protein evolution was assumed, and the phylogeny was reconstructed using a model jumping method. This allows switching between different models of amino acid substitution during the MCMC process, and all the models contribute to the final result and are weighted according to their posterior probability. A gamma distribution of rate variation among sites was used, with the shape estimated from the data. Two runs of four chains were run for 5 000 000 MCMC generations (1 000 000 for the dimarhabdovirus alignment tree), with trees being sampled every 100 generations. The average s.d. of split frequencies between the two runs approaching zero, and the log-likelihood values of the cold chain becoming stable, were used to assess when to stop the run. The first 25 per cent of the trees were discarded to ensure that the chains had reached stationarity, and a consensus tree was created from the remaining trees. Figures were created using FigTree (v. 1.2) (http://tree.bio.ed.ac.uk/software/figtree/). To compare the topology of the sigma virus phylogeny with the Drosophila phylogeny, we reconstructed the maximum-likelihood phylogeny of Dimarhabdoviruses using the full-length L gene alignment under the constraint that the sigma virus phylogeny follows that of the hosts (i.e. D. affinis and D. obscura form a monophyletic group). We then tested whether the likelihood of the constrained tree was significantly less than the unconstrained tree using a Shimodaira–Hasegawa test (SH test) in PAUP using the maximum-likelihood dimarhabodvirus trees (Shimodaira & Hasegawa 1999). Accession numbers for the sequences used in the phylogenetic analysis are available as supplementary materials.

Results

Identification of two new Drosophila sigma viruses

In samples from wild populations, we detected one line of D. affinis from Raleigh NC, USA and two lines of D. obscura from Essex, UK that were paralysed or died after exposure to CO2. To test whether these lines were infected with a rhabdovirus, we created cDNA from the flies and attempted to amplify a region of the RDRP gene using PCR primers designed in conserved sequences. All the CO2-sensitive lines produced a PCR product, which was sequenced and confirmed to be rhabdovirus-like by BLAST searches. These viruses were tentatively named as D. affinis sigma virus (DAffSV) and D. obscura sigma virus (DObsSV).

Genome sequences

We next attempted to sequence the genomes of these newly discovered viruses and the D. melanogaster sigma virus (DMelSV). Our strategy was to first use the short sequences produced with the conserved primers as the basis for 3′-RACE on both the L gene mRNA and the negative sense genome, and then to sequence the trailer sequence using 5′-RACE. This allowed us to completely sequence the genome of one DMelSV isolate, and to sequence all of the genome except the short 3′ leader and 5′ trailer sequences from a second DMelSV isolate. We sequenced the whole genome except the short 5′ trailer sequence for one DObsSV isolate. We also sequenced the entire L gene of DAffSV, but were unable to retrieve the remainder of the genome, possibly owing to a poly-A region causing mispriming of the T-linker primer.

DObsSV genome

The genome of DObsSV (excluding the 5′ trailer) is 12 676 bp long. There are six open reading frames which, based on their predicted protein sequence and gene order, appear to be homologous to the N-P-X-M-G-L genes in DMelSV (figure 1). The N, P and X genes are in reading frame one, the M and the L genes are in frame two and the G gene is in frame three.
Figure 1.

The sigma virus genomes. Numbers below show the position of the start codon, numbers above represent the stop codon. Dotted lines represent parts of the genome we were unable to sequence. In DMelSV the mRNA transcripts for the M and the G genes overlap by 33 bps, but the open reading frames do not overlap. Note that the X gene has also been referred to as gene 3 in some literature.

The sigma virus genomes. Numbers below show the position of the start codon, numbers above represent the stop codon. Dotted lines represent parts of the genome we were unable to sequence. In DMelSV the mRNA transcripts for the M and the G genes overlap by 33 bps, but the open reading frames do not overlap. Note that the X gene has also been referred to as gene 3 in some literature. To annotate the coding sequence, we have assumed that each open reading frame starts at the first AUG occurring after the previous transcription termination sequence, and continues to the first stop codon. These coding regions make up 96 per cent of the genome, with the L gene covering 51 per cent of the total genome (figure 1). A tBLASTn search of the NCBI nucleotide collection using the predicted protein sequences of these genes returned significant alignments (blast alignment scores over 80) with homologous genes from other rhabdoviruses for the N, G and L genes. The M gene (which is thought to encode the matrix protein in DMelSV) had a weakly significant alignment to Flanders virus M gene and no significant alignments were found with the P or X genes, which are the least conserved genes in the genome. To predict the structure and function of the P and X, we used PHYRE (Kelley & Sternberg 2009), which compares the query sequence with proteins of known structure and function, Interproscan, which searches for protein signatures in the InterPro database (Zdobnov & Apweiler 2001), and SignalP, which predicts signal peptides (Bendtsen ). The X gene contains a signal peptide (SignalP: p = 0.98), but no predicted transmembrane regions, and has regions that are similar to viron RNA polymerases (PHYRE: 90% estimated precision), as has been reported for its homologue in DMelSV (Landesdevauchelle ). However, it also shares similarities to topoisomerases, signal proteins and toxin molecules. We identified structures in the P gene as a viral RNA polymerase (PHYRE: 85% estimated precision). In the non-coding regions, the motif 3′-GGUACUUUUUUU-5′ is found after all of the first five open reading frames in the genome. Based on its homology to other rhabdoviruses it is likely that it acts as the transcription termination sequence, and the seven U residues trigger polyadenylation of mRNAs (Huszar & Imler 2008). At the 3′ end of the genome there is a 3′ leader sequence of 99 bases before the first ATG. 5′ RACE on viral mRNAs failed, possibly owing to the T-linker primer annealing to polyA regions in the positive sense genomic strand, meaning we were unable to confirm the transcription initiation sequence.

DMelSV genome

The genome of DMelSV is 12 625 bp long, the first five genes of which have been published previously (Teninges ; Bras ; Landesdevauchelle ). The newly sequenced L gene open reading frame is 6389 bases long and compromises 51 per cent of the total genome (figure 1). By using RACE to confirm the sequence at the start of the N gene and end of the L gene, we were able to annotate the 54 bp 3′ leader sequence and 180 bp trailer sequence. In DMelSV, the transcription initiation site is 3′-GUUGUNG-5′ (Teninges ) for all the genes bar the N, where it is 3′-UUGUUG-5′. The transcription initiation site occurs shortly after the previous transcription termination signal, with the exception of the M/G gene junction where the M gene and G gene mRNAs overlap by 33 bases (Teninges ). The protein-coding regions, however, do not overlap. In addition we found the G gene to be 21 amino acids longer than described in its original GenBank annotation owing to what was possibly a sequencing error causing a false stop codon (accession number X91062) (Landesdevauchelle ). In sequencing the 5′ trailer region and comparing this with the L gene mRNA we found the same transcription termination sequence (3′-GUACUUUUUUU-5′) as previously reported (Teninges ), at the end of the L gene.

DAffSV L gene

We sequenced the entire L gene and the 5′ trailer of DAffSV (figure 1). The predicted protein-coding sequence of the L gene has significant tBLASTN alignments to other Rhabdovirus L genes. The transcription termination sequence is the same as DMelSV (3′-GAUCUUUUUUU-5′) based on comparing where the L gene mRNA terminates (sequenced by 3′ RACE on the mRNA) with the genome sequence (sequenced by 5′ RACE on genomic RNA).

Sequence conservation

The amount of protein sequence divergence between the three sigma viruses is very similar, suggesting that they all diverged at a similar time (electronic supplementary material, table S2). However, the different genes in the genome have very different levels of amino acid sequence conservation (electronic supplementary material, table S2), with the L gene being the most conserved, and the P, X and M genes the least conserved. There is a high level of amino acid sequence divergence between the three Drosophila sigma viruses (electronic supplementary material, table S3). Comparing the amino acid sequences of the L genes of the sigma viruses with those from related clades, we see that DMelSV, DObsSV and DAffSV share only a slightly higher sequence identity to one another than they do to viruses in a range of different rhabdovirus genera (electronic supplementary material, table S3). Furthermore, the amino acid sequence divergence between the three sigma viruses is only slightly less than that seen when rhabdoviruses in different genera are compared, and is similar to the maximum divergence seen between rhabdoviruses in the same genus.

Phylogeny of the Rhabdoviridae

The phylogeny of the Rhabdoviridae was reconstructed from both full-length and conserved regions of L gene sequence. The three sigma viruses form a well-supported monophyletic group that is distinct from the other rhabdoviruses (figures 2 and 3). As was seen in the analysis of sequence identity, the divergence between the viruses is substantial, and similar to that between the most divergent members of some genera. Therefore, these viruses constitute a major new group of rhabdoviruses.
Figure 2.

Phylogeny of the Rhabdviridae from full-length nucleotide sequences of the L gene. The tree was reconstructed using the Bayesian method, node labels are posterior supports and labels in brackets are maximum-likelihood bootstrap supports (500 replicates). The tree is rooted with human parainfluenza virus 1. The following viruses are unassigned rhabdoviruses; Flanders virus, Iranian maize mosaic virus, starry flounder rhabdovirus, Siniperca chuatsi rhabdovirus, wongabel virus, taro vein chlorosis virus, maize fine streak virus, lettuce yellow mottle virus, orchid fleck virus, but have been included in the different genera for illustrative purposes, based on their position in the phylogeny.

Figure 3.

Bayesian nucleotide tree using full-length sequence alignments for the dimarhabdovirus and sigma virus clades. Node labels are posterior supports, labels in brackets are bootstrap supports taken from the maximum likelihood tree with 1000 bootstrap replicates. The tree is rooted with three lyssaviruses (West Caucasian bat virus, Mokola virus 86101RCA and rabies virus).

Phylogeny of the Rhabdviridae from full-length nucleotide sequences of the L gene. The tree was reconstructed using the Bayesian method, node labels are posterior supports and labels in brackets are maximum-likelihood bootstrap supports (500 replicates). The tree is rooted with human parainfluenza virus 1. The following viruses are unassigned rhabdoviruses; Flanders virus, Iranian maize mosaic virus, starry flounder rhabdovirus, Siniperca chuatsi rhabdovirus, wongabel virus, taro vein chlorosis virus, maize fine streak virus, lettuce yellow mottle virus, orchid fleck virus, but have been included in the different genera for illustrative purposes, based on their position in the phylogeny. Bayesian nucleotide tree using full-length sequence alignments for the dimarhabdovirus and sigma virus clades. Node labels are posterior supports, labels in brackets are bootstrap supports taken from the maximum likelihood tree with 1000 bootstrap replicates. The tree is rooted with three lyssaviruses (West Caucasian bat virus, Mokola virus 86101RCA and rabies virus). Although the sigma viruses form a well-supported monophyletic clade, the relationships between the three viruses are uncertain. While the Bayesian analysis gives significant posterior support for relationships shown in figure 3, the more conservative maximum-likelihood bootstrapping does not support this topology (figure 3). As DMelSV is vertically transmitted, we were interested in whether the topology of the virus phylogeny differs from that of the host, which would indicate that the virus has switched hosts during its evolution rather than co-speciating with them. However, when we forced the topology of the sigma virus phylogeny to match the host phylogeny, there was no significant reduction in the likelihood of the tree (SH test: p = 0.173, difference in log likelihood=5.24). Therefore, we are unable to reject the hypothesis that the host and viral tree topologies are the same. Our analysis produced a robust and well-supported phylogeny of the rhabdoviruses (figure 2). The rhabdoviruses contain two major clades, with the fish-infecting novirhabdoviruses forming a clade basal to all the other genera. In the other group, the arthropod-vectored plant viruses (cytorhabdoviruses and nucleorhabdoviruses) form a clade that is a sister group to the lyssaviruses, sigma viruses and the dimarhabdovirus supergroup. The dimarhabdovirus group (figure 3) contains the vesiculoviruses, the ephemeroviruses and some other viruses which are unassigned or have only tentatively been placed to this group (ICTVdB 2006). The sigma virus clade forms a sister group to all the other dimarhabdoviruses. To assess whether our results are sensitive to the sequence alignment or method of phylogenetic reconstruction, we produced a total of six Bayesian trees and three maximum-likelihood trees (see §2). There was greater resolution in the Bayesian nucleotide trees with rate variation between codon positions; hence we presented these trees in the figures. However, when different methods of analysis were used, similar tree topologies were inferred. Furthermore, the conserved region alignment (electronic supplementary material, figure S1) and full-length sequence alignment (figure 2) lead to the same general conclusions. In addition to the uncertain relationships among the sigma viruses, there are other minor inconsistencies between the different trees. In the lyssavirus genus, depending on the method and alignment used, the branching order of the clade containing Arravan, Khujand and rabies viruses switched positions, possibly owing to the very short branch lengths in this group (Kuzmin ). Also the branching order between taro vein chlorosis virus, Iranian maize mosaic and maize mosaic virus was sensitive to the method and alignment used.

Discussion

Drosophila sigma viruses

We have discovered two new rhabdoviruses in D. affinis and D. obscura, which together with DMelSV, brings the total number of insect-restricted rhabdoviruses to three. We sequenced the complete genomes of two of these viruses and partial genome of the third, and found that they form a major new clade on the rhabdovirus phylogeny (figure 2; see also Hogenhout ; Kuzmin ). This new clade does not fit into any existing genera as it is a sister group to the dimarhabdoviruses, which itself contains two genera. Additionally, the divergence between these viruses is greater than that seen within four of the six previously classified genera. We therefore suggest these three Drosophila viruses be regarded as a new genus. As DMelSV, and probably the other sigma viruses, are vertically transmitted, it is interesting to ask whether the sigma viruses have co-speciated with their hosts or have moved horizontally between species during their evolution. If parasites have moved between hosts, this can result in incongruence between host and parasite phylogenies. However, all the three sigma viruses diverged from their common ancestor at a same time, and we are unable to tell whether or not the viral phylogeny matches the host phylogeny. Despite this, it seems likely that the viruses have switched between hosts owing to the length of branches on the tree. If these viruses had co-speciated with their hosts, we would expect the DAffSV and DObsSV to be much more closely related to one another than to DMelSV, given that D. obscura and D. affinis diverged from each other approximately 15–18 Myr and from D. melanogaster approximately 30–35 Myr (Gao ). This is not the case, as the viruses all shared a common ancestor at a same time, suggesting that horizontal transfer has occurred. As these sigma viruses are all probably vertically transmitted, it is not clear how they could move between species. One possibility is that they can be vectored by the parasitic mites that feed on Drosophila. These mites have been shown to vector Spiroplasma bacteria (Jaenike ) and are suspected to transfer transposable elements between species of Drosophila (Loreto ). Furthermore, we have found DObsSV in mites removed from wild-caught flies (B. Longdon 2008, unpublished data), although we would highlight it is not known if the virus replicates in the mites or if they can transmit sigma horizontally. It is possible that rhabdoviruses may be common parasites of insects. DAffSV was discovered in D. affinis, where there had been previous reports of CO2 sensitivity (Williamson 1961) and CO2 sensitivity has been described in 15 other species of Drosophila (Brun & Plus 1980). Therefore, our results suggest that many of these species may also be infected (although other viruses may cause flies to die in anoxic conditions, see §1 and Teninges ). The second of these new viruses was found in D. obscura where CO2 sensitivity had not previously been reported. Given that we performed only a limited sampling of a few species, this also suggests that there may be many other insect rhabdoviruses waiting to be discovered. Although the new sigma virus isolates are anciently divergent from other rhabdoviruses, their genomes are typical of the family. The genomes of DObsSV and DMelSV are similar, both containing six open reading frames, which correspond to the N-P-X-M-G-L genes (3′–5′). Based on sequence conservation and from the analysis of predicted proteins, five of the six genes are homologous to genes found in other rhabdoviruses and probably have similar functions. In contrast, the X genes in DObsSV and DMelSV share no detectable sequence similarity either to each other or to other rhabdovirus genes. Although both X genes encode proteins with a signal peptide and domains similar to viral RNA polymerases, their function remains a matter for speculation. Interestingly, the cytorhabdoviruses, the nucleorhabdoviruses and the wongabel and Flanders viruses all contain at least one gene between the P and M genes, some of which are similar sizes, raising the possibility that these genes may be orthologous to the X gene, but have diverged to such an extent that there is no detectable sequence similarity between them.

Rhabdovirus phylogeny

Our analysis has produced the most robustly supported phylogeny of the Rhabdoviridae to date, with the members of the various genera forming distinct, well-supported clades. By using conserved L gene sequence, we are able to root our trees using human parainfluenza virus 1, which allows us to examine the branching order at the base of the tree for the first time. Furthermore, in previous phylogenetic analyses the boundaries between genera in the dimarhabdovirus super group have been unclear (Bourhy ). This has been resolved in our analyses, which has well-supported fine-scale resolution within this clade. It has been suggested that the N gene should be used to obtain fine-scale resolution (Kuzmin ). However, we have found that the more rapidly evolving regions of the L gene coupled to its large size (approx. 6 kb) provides a much greater phylogenetic resolution than the N gene even when looking at closely related viruses. Furthermore, using sequence from less-conserved regions such as the N gene can result in inaccurate sequence alignments, which in turn can result in an incorrect phylogeny (Ogden & Rosenberg 2006). This may explain why previous analyses have sometimes placed DMelSV in very different places in the rhabdoviruses tree (Hogenhout ; Kuzmin et al. 2006, 2009). In addition, using the L gene has the benefit of allowing rapid detection of novel rhabdoviruses, by using a diagnostic PCR with conserved degenerate primers. It is striking how viruses which infect similar hosts have a strong tendency to cluster together on the phylogeny, indicating that it is rare for rhabdoviruses to switch between distantly related hosts. Although it has been suggested that the ancestor of the Rhabdoviridae may have infected insects (Hogenhout ), our results suggest that it may be premature to draw any conclusions on the origin of the group, as there appear to be two equally parsimonious models. Specifically, because the fish-infecting novirhabdoviruses are sister to all other groups, it is possible either that (i) the common ancestor of the rhabdovirus infected arthropods (insects or other crustaceans) and switched to fish on the lineage leading to the novirhabdoviruses, or (ii) the common ancestor infected fish and switched to arthropods on the lineage leading to the other six clades. Perhaps, more importantly, because there are likely to be many undiscovered rhabdoviruses in many different groups of hosts, any inferences about the ancestral ecology of this group would be extremely tentative at best. Nevertheless, it is likely that the ancestor of six of the seven major clades (all rhabdoviruses other than the novirhabdoviruses) infected arthropods, as these clades (with the exception of the lyssaviruses) include viruses which infect arthropods. There have been a number of transitions between host taxa. There is evidence for three switches between aquatic and terrestrial habitats—one between the fish infecting novirhabdoviruses and the terrestrial viruses, and two within the dimarhabdoviruses—and a single transition to infect plants in the cytorhabdoviruses and nucleorhabdoviruses. In the clade containing the sigma viruses, dimarhabdoviruses and lyssaviruses, there has either been two events leading to these viruses gaining the ability to infect vertebrates, or one gain followed by a loss in the sigma virus clade. The incomplete sampling of rhabdoviruses means that there may be many more major host switches to be discovered. In our phylogeny, the sigma viruses are a sister group to the dimarhabdoviruses, which suggests that the common ancestor of this group infected arthropods. Support for this argument comes from the ability of vesicular stomatitis virus to replicate in a range of insects, including sand flies, black flies, Drosophila, leafhoppers and moths (Tesh ; Lastra & Esparza 1976; Rosen 1980; Mead ). In addition, like the sigma virus, this virus can be transmitted transovarially in sandflies (Tesh ). It is even possible that all the dimarhabdoviruses may infect arthropods, including the fish viruses that are usually assumed to be vertebrate-specific. For example, a virus 99 per cent identical to spring viraemia of carp has been found in penaeid shrimps (Johnson ), we have found that EST libraries from fish lice contain rhabdovirus-like sequences (B. Longdon 2008, unpublished observation), and spring viraemia of carp can be vectored by sea lice in the laboratory (Ahne ).

Conclusions

From a quick survey of Drosophila we have found two new viruses, which together with DMelSV form a major new clade in the Rhabdoviridae. It is possible that there is a great deal of diversity in this family yet to be discovered, and a more extensive survey for new rhabdoviruses may uncover viruses from a wide diversity host taxa and further our understanding of the relationships among the Rhabdoviridae.
  35 in total

1.  Comparison of Bayesian and maximum likelihood bootstrap measures of phylogenetic reliability.

Authors:  Christophe J Douady; Frédéric Delsuc; Yan Boucher; W Ford Doolittle; Emmanuel J P Douzery
Journal:  Mol Biol Evol       Date:  2003-02       Impact factor: 16.240

2.  Plant and animal rhabdovirus host range: a bug's view.

Authors:  Saskia A Hogenhout; Margaret G Redinbaugh; El-Desouky Ammar
Journal:  Trends Microbiol       Date:  2003-06       Impact factor: 17.079

3.  Improved prediction of signal peptides: SignalP 3.0.

Authors:  Jannick Dyrløv Bendtsen; Henrik Nielsen; Gunnar von Heijne; Søren Brunak
Journal:  J Mol Biol       Date:  2004-07-16       Impact factor: 5.469

4.  Protein structure prediction on the Web: a case study using the Phyre server.

Authors:  Lawrence A Kelley; Michael J E Sternberg
Journal:  Nat Protoc       Date:  2009       Impact factor: 13.491

5.  Revisiting horizontal transfer of transposable elements in Drosophila.

Authors:  E L S Loreto; C M A Carareto; P Capy
Journal:  Heredity (Edinb)       Date:  2008-04-23       Impact factor: 3.821

6.  [Demonstration of virions in Drosophila infected by the hereditary sigma virus].

Authors:  A Berkaloff; J C Bregliano; A Ohanessian
Journal:  C R Acad Hebd Seances Acad Sci D       Date:  1965-05-31

7.  [Demonstration of sigma viruses in the cells of the stabilized male germinal line of Drosophila].

Authors:  D Teninges
Journal:  Arch Gesamte Virusforsch       Date:  1968

8.  Black fly involvement in the epidemic transmission of vesicular stomatitis New Jersey virus (Rhabdoviridae: Vesiculovirus).

Authors:  Daniel G Mead; Elizabeth W Howerth; Molly D Murphy; Elmer W Gray; Raymond Noblet; David E Stallknecht
Journal:  Vector Borne Zoonotic Dis       Date:  2004       Impact factor: 2.133

9.  The recent spread of a vertically transmitted virus through populations of Drosophila melanogaster.

Authors:  Jennifer A Carpenter; Darren J Obbard; Xulio Maside; Francis M Jiggins
Journal:  Mol Ecol       Date:  2007-08-23       Impact factor: 6.185

10.  Vesicular stomatitis virus (Indiana serotype): transovarial transmission by phlebotomine sandlies.

Authors:  R B Tesh; B N Chaniotis; K M Johnson
Journal:  Science       Date:  1972-03-31       Impact factor: 47.728

View more
  35 in total

1.  Eilat virus host range restriction is present at multiple levels of the virus life cycle.

Authors:  Farooq Nasar; Rodion V Gorchakov; Robert B Tesh; Scott C Weaver
Journal:  J Virol       Date:  2014-11-12       Impact factor: 5.103

2.  Application of broad-spectrum resequencing microarray for genotyping rhabdoviruses.

Authors:  Laurent Dacheux; Nicolas Berthet; Gabriel Dissard; Edward C Holmes; Olivier Delmas; Florence Larrous; Ghislaine Guigon; Philip Dickinson; Ousmane Faye; Amadou A Sall; Iain G Old; Katherine Kong; Giulia C Kennedy; Jean-Claude Manuguerra; Stewart T Cole; Valérie Caro; Antoine Gessain; Hervé Bourhy
Journal:  J Virol       Date:  2010-07-07       Impact factor: 5.103

3.  The prevalence and persistence of sigma virus, a biparentally transmitted parasite of Drosophila melanogaster.

Authors:  Marta L Wayne; Gabriela M Blohm; Mollie E Brooks; Kerry L Regan; Brennin Y Brown; Michael Barfield; Robert D Holt; Benjamin M Bolker
Journal:  Evol Ecol Res       Date:  2011

4.  Ledantevirus: a proposed new genus in the Rhabdoviridae has a strong ecological association with bats.

Authors:  Kim R Blasdell; Hilda Guzman; Steven G Widen; Cadhla Firth; Thomas G Wood; Edward C Holmes; Robert B Tesh; Nikos Vasilakis; Peter J Walker
Journal:  Am J Trop Med Hyg       Date:  2014-12-08       Impact factor: 2.345

5.  Almendravirus: A Proposed New Genus of Rhabdoviruses Isolated from Mosquitoes in Tropical Regions of the Americas.

Authors:  Maria Angelica Contreras; Gillian Eastwood; Hilda Guzman; Vsevolod Popov; Chelsea Savit; Sandra Uribe; Laura D Kramer; Thomas G Wood; Steven G Widen; Durland Fish; Robert B Tesh; Nikos Vasilakis; Peter J Walker
Journal:  Am J Trop Med Hyg       Date:  2016-10-31       Impact factor: 2.345

Review 6.  Viruses and antiviral immunity in Drosophila.

Authors:  Jie Xu; Sara Cherry
Journal:  Dev Comp Immunol       Date:  2013-05-13       Impact factor: 3.636

7.  Identification of a novel rhabdovirus in Spodoptera frugiperda cell lines.

Authors:  Hailun Ma; Teresa A Galvin; Dustin R Glasner; Syed Shaheduzzaman; Arifa S Khan
Journal:  J Virol       Date:  2014-03-26       Impact factor: 5.103

8.  Genetic characterization of K13965, a strain of Oak Vale virus from Western Australia.

Authors:  Phenix-Lan Quan; David T Williams; Cheryl A Johansen; Komal Jain; Alexandra Petrosov; Sinead M Diviney; Alla Tashmukhamedova; Stephen K Hutchison; Robert B Tesh; John S Mackenzie; Thomas Briese; W Ian Lipkin
Journal:  Virus Res       Date:  2011-06-29       Impact factor: 3.303

9.  Virus recognition by Toll-7 activates antiviral autophagy in Drosophila.

Authors:  Margaret Nakamoto; Ryan H Moy; Jie Xu; Shelly Bambina; Ari Yasunaga; Spencer S Shelly; Beth Gold; Sara Cherry
Journal:  Immunity       Date:  2012-03-29       Impact factor: 31.745

10.  Partitiviruses Infecting Drosophila melanogaster and Aedes aegypti Exhibit Efficient Biparental Vertical Transmission.

Authors:  Shaun T Cross; Bernadette L Maertens; Tillie J Dunham; Case P Rodgers; Ali L Brehm; Megan R Miller; Alissa M Williams; Brian D Foy; Mark D Stenglein
Journal:  J Virol       Date:  2020-09-29       Impact factor: 5.103

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.