Literature DB >> 21673192

An insect nidovirus emerging from a primary tropical rainforest.

Florian Zirkel1, Andreas Kurth, Phenix-Lan Quan, Thomas Briese, Heinz Ellerbrok, Georg Pauli, Fabian H Leendertz, W Ian Lipkin, John Ziebuhr, Christian Drosten, Sandra Junglen.   

Abstract

Tropical rainforests show the highest level of terrestrial biodiversity and may be an important contributor to microbial diversity. Exploitation of these ecosystems may foster the emergence of novel pathogens. We report the discovery of the first insect-associated nidovirus, tentatively named Cavally virus (CAVV). CAVV was found with a prevalence of 9.3% during a survey of mosquito-associated viruses along an anthropogenic disturbance gradient in Côte d'Ivoire. Analysis of habitat-specific virus diversity and ancestral state reconstruction demonstrated an origin of CAVV in a pristine rainforest with subsequent spread into agriculture and human settlements. Virus extension from the forest was associated with a decrease in virus diversity (P<0.01) and an increase in virus prevalence (P<0.00001). CAVV is an enveloped virus with large surface projections. The RNA genome comprises 20,108 nucleotides with seven major open reading frames (ORFs). ORF1a and -1b encode two large proteins that share essential features with phylogenetically higher representatives of the order Nidovirales, including the families Coronavirinae and Torovirinae, but also with families in a basal phylogenetic relationship, including the families Roniviridae and Arteriviridae. Genetic markers uniquely conserved in nidoviruses, such as an endoribonuclease- and helicase-associated zinc-binding domain, are conserved in CAVV. ORF2a and -2b are predicted to code for structural proteins S and N, respectively, while ORF3a and -3b encode proteins with membrane-spanning regions. CAVV produces three subgenomic mRNAs with 5' leader sequences (of different lengths) derived from the 5' end of the genome. This novel cluster of mosquito-associated nidoviruses is likely to represent a novel family within the order Nidovirales.
© 2011 Zirkel et al.

Entities:  

Mesh:

Substances:

Year:  2011        PMID: 21673192      PMCID: PMC3111606          DOI: 10.1128/mBio.00077-11

Source DB:  PubMed          Journal:  mBio            Impact factor:   7.867


Introduction

Tropical rainforests comprise the highest level of terrestrial biodiversity (1–3). Microbes are dependent on their hosts, and hot spots of host biodiversity may also be rich in pathogen diversity. Intensive land use changes in West African rainforests began in the 1960s by industrial deforestation, leaving only remnants of the equatorial forest belt (4–6). Patches of forest are typically surrounded by agricultural land and human settlements. Logging is linked to a profound decline in biodiversity (4, 7–12). Declines in biodiversity are associated with an increased risk of infectious disease emergence in humans and domestic animals (reviewed in reference 13). Nevertheless, links between anthropogenic habitat modification and virus emergence remain to be confirmed (13, 14). We recently conducted a survey of mosquito-borne viruses within and around a primary forest habitat in Côte d’Ivoire, West Africa (14). Findings of highly divergent viruses suggested an extension of the diversity of several virus taxa in primary forest habitats (15, 16). We also identified short sequence fragments of an unusual virus in insects that was distantly related to coronaviruses (14). Here we report the genomic sequence and organization of this first insect nidovirus and its evolutionary divergence along an anthropogenic disturbance gradient extending from primary forest into human settlements. The order Nidovirales comprises the families Coronaviridae (subfamilies Coronavirinae [CoV] and Torovirinae [ToV]), as well as the monogeneric families Arteriviridae (ArV, genus Arterivirus) and Roniviridae (RoV, genus Okavirus) (17). Nidoviruses have a wide range of hosts, including crustaceans (RoV) (18), fish (ToV) (19), birds (CoV) (20), and a variety of mammals (ArV, CoV, and ToV) (17, 21–23). With plus-stranded genomes between 26 and 32 kb, CoV, ToV, and RoV have the largest known RNA genomes and are referred to as “large” nidoviruses (17). Genomes of the “small” nidoviruses (ArV) comprise 13 to 16 kb. All nidoviruses encode two replicase polyproteins, pp1a and pp1ab, encoded by open reading frame 1a (ORF1a) and ORF1b located at the 5′ end of the genome and followed by genes encoding structural proteins and, in most cases, several accessory (nonessential) proteins. A distinctive feature of nidoviruses is their transcription strategy. Genes downstream of the replicase polyprotein gene are expressed from a nested set of 3′-coterminal subgenomic mRNAs (in Latin, nidus means nest) (24–27). In this study, analyses of subgenomic mRNAs and major features of genome organization and phylogenetic analysis were employed for a taxonomic classification of a novel insect nidovirus. The summation of results suggests the discovery of a previously unrecognized family within the order Nidovirales.

RESULTS AND DISCUSSION

Prevalence and divergence of CAVV along a gradient of habitat modifications.

Between February and June 2004, 7,067 mosquitoes were trapped along an anthropogenic disturbance gradient in the area of the Taї National Park, Côte d’Ivoire. The gradient comprised sampling sites in the primary (pristine) forest, in secondary (modified) forest, in agriculturally exploited forest edge areas, and in adjacent human settlements (14–16). Initial analyses by cell culture and electron microscopy yielded viral particles with CoV-like morphology in supernatants of C6/36 cells infected with mosquito homogenates. A short genome fragment with low but significant identity to ORF1b of CoV was identified (14). This virus was tentatively named Cavally virus (CAVV), after a river near the sites where mosquitoes were caught. Extensive attempts at virus isolation from adult female mosquitoes pooled in small numbers (up to 23 mosquitoes per pool) yielded virus in 40 (9.3%) of the 432 pools tested. As shown in Table 1, virus was most frequently isolated from Culex mosquitoes, especially Culex nebulosus. Mosquitoes of the genera Aedes, Anopheles, and Uranotaenia were also found to be infected but at lower rates. CAVV was present in all sampled habitat types but with the highest prevalence in human settlements (Table 1). Analysis of variance showed the observed virus isolation rate in human settlements (and only there) to be significantly different from the mean isolation rates (F test, P < 0.00001) and from all isolation rates in any other habitat. In order to gain insight into the genetic diversity of CAVV, a 603-nucleotide (nt) genome fragment representing the 1b polymerase gene region was amplified and sequenced from all positive pools. The pairwise genetic distance between the isolates was up to 15% at the nucleotide level and up to 9.9% at the amino acid level. In particular, the nucleotide distance of isolate CAVV/A4/CI/2004 from all other isolates ranged between 13.5 and 15%, whereas the range of distances was below 1.8% for all other CAVV isolates. This suggested the presence of a diversified virus population comprising two different clusters whose distance is compatible with different species.
TABLE 1

CAVV isolation from pools of female mosquitoes

Sampling sitePoolMosquito speciesNo. of mosquitoesCPEa % CAVV isolation (no. positive/total)Genetic distanceb (%)
CampsA4 Culex spp.203 dpi7.5 (5/67)0.1
A21 Culex (Eumelanomyia) spp.10Weak
A26C. nebulosus103 dpi
A28C. nebulosus227 dpi
A31 Culex spp.92nd P, 7 dpi
Primary forestB14Female, NDc171st P, 3 dpi4.1 (4/98)1.4
B15C. nebulosus12nd P, 7 dpi
B40Female, ND23 dpi
B42C. nebulosus, C. cinerellus93 dpi
Secondary forestC45C. nebulosus163 dpi5.1 (5/99)0.5
C64 Culex (Eumelanomyia) spp.107 dpi
C65 Culex (Eumelanomyia) spp.83 dpi
C79 Aedes harrisoni 61st P, 7 dpi
C82 Anopheles spp.202 dpi
PlantationsD27 Aedes harrisoni 27 dpi9 (9/100)0.1
D28 Anopheles spp.23 dpi
D42C. decens203 dpi
D50C. nebulosus203 dpi
D51C. nebulosus203 dpi
D60Female, ND151st P, 5 dpi
D61Female, ND111st P, 5 dpi
E9 Uranotaenia chorleyi 73 dpi
Human settlementsF2Female, ND23Weak26.5 (18/68)0.3
F3 Aedes harrisoni 91st P, 5 dpi
F24C. nebulosus303 dpi
F25C. nebulosus213 dpi
F26C. nebulosus503 dpi
F27C. nebulosus401st P, 5 dpi
F28C. nebulosus203 dpi
F30C. nebulosus203 dpi
F32C. nebulosus203 dpi
F33C. nebulosus123 dpi
F35C. nebulosus163 dpi
F47 Culicinae spp.108 dpi
F52C. quinquefasciatus203 dpi
F53C. quinquefasciatus81st P, 3 dpi
F54C. antennatus203 dpi
F55C. antennatus98 dpi
F56 Culex spp.182nd P, 8 dpi
F63 Anopheles spp.212nd P, 8 dpi

 Shown is the time point when the first CPE was visible. P, passage; dpi, days postinfection.

 Maximal pairwise nucleotide distance with a 603-bp RT-PCR fragment in the ORF1b gene.

 ND, not determined.

CAVV isolation from pools of female mosquitoes Shown is the time point when the first CPE was visible. P, passage; dpi, days postinfection. Maximal pairwise nucleotide distance with a 603-bp RT-PCR fragment in the ORF1b gene. ND, not determined. CAVV is an insect-associated virus whose hosts were encountered along a gradient of anthropogenic habitat modifications. To examine possible influences of habitat modification on virus diversity, average nucleotide distances over sequence pairs were determined within the amplified 603-nt fragments from all pools and were assigned to one of the habitat types from where the respective isolates originated, i.e., (i) primary forest or research camps within a primary forest (9 isolates), (ii) secondary forest (5 isolates), (iii) plantations (6 isolates), and (iv) villages (19 isolates). The mean numbers of nucleotide exchanges in pairwise sequence comparisons were determined within the samples pertaining to each habitat type. CAVV/A4/CI/2004 was excluded to prevent bias from a virus that may be evolutionarily disconnected from the main clade (putative second species). In habitat types i to iv, the mean pairwise exchange rates within the 603-nt fragment were 4.35, 2.8, 0.6, and 2, respectively. Even though isolates from villages predominated numerically in the data set, the level of diversification was highest in primary forest (chi-square test on exchange rates, P < 0.01). To estimate the habitat association of the putative common ancestor of the 39 virus isolates, sequence fragments were subjected to phylogenetic analysis using a minimum evolution model, considering either the numbers of pairwise synonymous nucleotide exchanges or the percentages of overall (synonymous and nonsynonymous) pairwise nucleotide exchanges. CAVV/A4/CI/2004 was used as an outgroup. Phylogenies were subjected to ancestral state reconstruction under a parsimony assumption. These simple models of phylogeny and trait evolution were chosen because the ordered and sparse pattern of nucleotide exchange across the alignment suggested the absence of multiple or saturating exchanges. As shown in Fig. 1, ancestral state reconstruction suggested the most recent common ancestor linked to the outgroup, as well as the common ancestor of the ingroup, to have existed in a primary forest habitat. This matches the general concept wherein novel viruses are being transferred to areas of human settlement in the course of agricultural exploitation of primary habitats.
FIG 1

CAVV ancestral state reconstruction based on a maximum-parsimony algorithm. Minimum evolution phylogenies based on the numbers of synonymous exchanges (A) and the percent nucleotide distance (B) were calculated in Mega 5 (66). CAVV strain A4 served as an outgroup. Tip traits in Mesquite () were defined by the habitats from which the virus isolates were taken (identified by color code at the bottom left of panel B). The circles at each root point identify by colored segments the likelihood of the root point taxon having existed in the habitat shown.

CAVV ancestral state reconstruction based on a maximum-parsimony algorithm. Minimum evolution phylogenies based on the numbers of synonymous exchanges (A) and the percent nucleotide distance (B) were calculated in Mega 5 (66). CAVV strain A4 served as an outgroup. Tip traits in Mesquite () were defined by the habitats from which the virus isolates were taken (identified by color code at the bottom left of panel B). The circles at each root point identify by colored segments the likelihood of the root point taxon having existed in the habitat shown. We tested 269 pools of 1,716 adult male mosquitoes collected during this survey for CAVV infection by specific real-time reverse transcription (RT)-PCR (14). CAVV was not detected in male mosquitoes, suggesting that CAVV is likely dependent on amplifying vertebrate hosts.

Viral growth and morphology.

Virus growth kinetics and morphological presentation in insect cells were studied. Cells showed strong cytopathic effects (CPE) manifesting in aggregation of cells at 48 h postinfection (hpi) (Fig. 2A and B). Virus replication was measured by real-time RT-PCR every 3 h for 2 days. Maximal RNA concentrations were reached at 15 to 18 hpi, indicating a fast replication cycle (Fig. 2C). Enveloped, spherical CoV-like virions with a mean diameter of 120 nm and large, club-shaped surface projections were detected in cell culture supernatants at 48 hpi by negative staining electron microscopy (Fig. 3). In ultrathin sections of fixed insect cells analyzed by transmission electron microscopy at 48 hpi, vesicles containing spherical, potentially enveloped particles were observed in the cytoplasm of infected cells. These were 50 to 60 nm in diameter and lacked surface projections (Fig. 3A and B). A role for these vesicles filled with virions resembling steps during virus maturation, as shown for other plus-strand RNA viruses, remains to be investigated (28–30). Furthermore, tubular structures likely of viral origin were detected in the cytoplasm of infected cells (Fig. 3A). Separation or adsorption of putative virions on cell membranes is shown in Fig. 3C. These particles were morphologically indistinguishable from typical particles encountered in cleared cell culture supernatant (Fig. 3D).
FIG 2

CAVV growth on insect cells. CPE in C6/36 cells infected with CAVV was observed at 48 hpi (B) compared to mock-infected C6/36 cells (A). Numbers of CAVV genome copies per milliliter in cell culture supernatant of C6/36 cells infected with CAVV at MOIs of 0.1, 0.01, and 0.001 were measured by RT-PCR at 0 to 48 hpi (C).

FIG 3

CAVV replication and morphology as observed by transmission electron microscopy. Ultrathin sections of C6/36 cells infected with CAVV at 48 hpi (A to C), showing an overview of the cytoplasm of infected cells (A; bar, 1 µm; V, vesicle with virus formation; arrowhead, mitochondrion; arrow, tubular structures likely of viral origin) and a higher magnification of vesicles containing spherical, potentially enveloped particles (B; bar, 100 nm) and separation or adsorption of putative virions on cell membranes (C; bar, 100 nm; arrowhead, spikes on virus surface). Negative staining (1% uranyl acetate) of CAVV sedimented by ultracentrifugation through a 36% sucrose cushion (D; bar, 100 nm). It should be noted that better EM results were obtained at 48 hpi than at 24 hpi.

CAVV growth on insect cells. CPE in C6/36 cells infected with CAVV was observed at 48 hpi (B) compared to mock-infected C6/36 cells (A). Numbers of CAVV genome copies per milliliter in cell culture supernatant of C6/36 cells infected with CAVV at MOIs of 0.1, 0.01, and 0.001 were measured by RT-PCR at 0 to 48 hpi (C). CAVV replication and morphology as observed by transmission electron microscopy. Ultrathin sections of C6/36 cells infected with CAVV at 48 hpi (A to C), showing an overview of the cytoplasm of infected cells (A; bar, 1 µm; V, vesicle with virus formation; arrowhead, mitochondrion; arrow, tubular structures likely of viral origin) and a higher magnification of vesicles containing spherical, potentially enveloped particles (B; bar, 100 nm) and separation or adsorption of putative virions on cell membranes (C; bar, 100 nm; arrowhead, spikes on virus surface). Negative staining (1% uranyl acetate) of CAVV sedimented by ultracentrifugation through a 36% sucrose cushion (D; bar, 100 nm). It should be noted that better EM results were obtained at 48 hpi than at 24 hpi. Full genome sequencing of isolate CAVV/C79/CI/2004 was achieved by a combination of adaptor-based random RT-PCR and ultradeep sequencing. The entire CAVV genome comprised 20,108 nt, excluding the 3′ poly(A) tail (GenBank accession number HM746600), a size intermediate between that of arteriviruses (13 to 16 kb) and CoV or RoV (26 to 32 kb) (17). The genome contained seven major ORFs, as well as untranslated regions of 362 and 570 nt at the 5′ and 3′ ends, respectively (Table 2; Fig. 4A). In an effort to identify potential functional domains, the seven ORFs were compared by psiBLAST to a database restricted to nidoviruses (Table 2).
TABLE 2

Predicted genome organization of CAVV

ORFFrameGenome position (nt)No. of amino acidsPredicted protein size (kDa)Predicted protein
1a+3363–78622,500288Polyprotein 1a
1b+27826–156282,600301ORF1b-encoded part of polyprotein 1ab
2a+215650–18352900103Spike protein
2b+115664–1630821424Nucleocapsid protein
3a+118394–1887015818Membrane protein
3b+218746–1909611614Membrane protein
4+219399–19548506Unknown
FIG 4

CAVV genome organization and subgenomic mRNA synthesis. (A) Positions and sizes of CAVV ORFs. (B) Positions of probes used for Northern blotting and placement of putative sgRNAs. Putative leader sequences are marked by red bars. Electropherograms shown next to mRNAs 2, 3, and 4 indicate typical leader-body fusion sites identified by RT-PCR. No clear leader-body fusions were identified for RNAs shown without leader symbols (see Fig. S2 in the supplemental material). (C) Detection of CAVV genome and sgRNAs by Northern blot analysis of intracellular viral RNA from infected C6/36 cells. Specific probes for the 5′ and 3′ prime ends, as well as for each ORF, were employed. The 3′-terminal probe is shown after short and long exposures of the blot. A molecular size marker (MWM) is shown in the left lane. Molecular size indicators on the right summarize estimated sizes of putative subgenomic mRNAs.

CAVV genome organization and subgenomic mRNA synthesis. (A) Positions and sizes of CAVV ORFs. (B) Positions of probes used for Northern blotting and placement of putative sgRNAs. Putative leader sequences are marked by red bars. Electropherograms shown next to mRNAs 2, 3, and 4 indicate typical leader-body fusion sites identified by RT-PCR. No clear leader-body fusions were identified for RNAs shown without leader symbols (see Fig. S2 in the supplemental material). (C) Detection of CAVV genome and sgRNAs by Northern blot analysis of intracellular viral RNA from infected C6/36 cells. Specific probes for the 5′ and 3′ prime ends, as well as for each ORF, were employed. The 3′-terminal probe is shown after short and long exposures of the blot. A molecular size marker (MWM) is shown in the left lane. Molecular size indicators on the right summarize estimated sizes of putative subgenomic mRNAs. Predicted genome organization of CAVV

Putative replicase polyprotein genes.

The replicase genes of nidoviruses share several domains and features conserved across the families Arteriviridae, Coronaviridae, and Roniviridae (25). These are (from the N to the C terminus) transmembrane domain 1 (TMD1), TMD2, a 3C-like protease (3CLpro) (31), TMD3, a ribosomal frameshift site (RFS) (32, 33), an RNA-dependent RNA polymerase (RdRp) (34), a zinc-binding domain (ZBD), an RNA helicase (HEL), and a uridylate-specific endoribonuclease (NendoU) (35, 36). CoV, ToV, and RoV also share a 3′-5′ exonuclease (ExoN) upstream of NendoU (37) and a ribose-2′-O-methyltransferase (MT) at the C terminus (38, 39). CoV and ToV also encode an ADP-ribose 1-phosphatase upstream of TMD1 (19, 40, 41). Using TMHMM v2.0, three hydrophobic regions comprising putative multiple membrane-spanning domains (TMD1 25L-L47, TMD2 1128I-Y1272, and TMD3 1727Y-M1780) were identified within the first predicted ORF of the CAVV genome. Three TMDs are also found in ORF1a of CoV and ToV, whereas RoV has four TMDs (18). However, the position of CAVV TMD1 was similar to that in RoV (18). Between TMD2 and TMD3, a putative 3CLpro domain was identified. Comparative sequence analysis suggested that the CAVV 3CLpro domain is a cysteine protease with a Cys-His-Asp catalytic triad (see Fig. S1a in the supplemental material). Putative functional motifs with similarity to CoV and ToV in the second ORF included an RdRp domain (631Y-V787, cd01699), a ZBD (1051C-L1113, UPF1, pfam09416, Fig. S1b), and a HEL domain (1466K-I1717, superfamily I DNA and RNA helicases, COG1112). Phylogenies of RdRp and HEL domains indicated equidistant basal relationships to CoV and RoV (see below). Furthermore, a putative NendoU was identified. Sequence alignments suggest that the active site of the CAVV NendoU involves residues His4670, His4685, and Lys4725 (Fig. S1c). Although short regions with low similarity suggest conservation of the ExoN and MT domains, reliable alignments could not be generated and biochemical evidence remains to be obtained to confirm the functionality of these domains.

Putative RFS.

Expression of the second major ORF of nidoviruses involves a programmed ribosomal frameshift into the −1 reading frame, occurring just upstream of the ORF1a stop codon (32, 33). The overlap region of ORF1a and -1b typically contains a slippery heptanucleotide sequence and a downstream RNA pseudoknot structure that together promote ribosomal frameshifting (32, 33). With minor variations, previously identified nidovirus slippery sequences conform to the XXXYYYZ consensus sequence conserved in many ribosomal slip sites on viral RNAs (for a review, see reference 42). Thus, for example, ArV, CoV, and ToV use 5′-(U/G)UUAAAC and RoV uses 5′-AAAUUUU as a slippery sequence (18, 42). The short (~35-nt) ORF1a/1b overlap region in CAVV does not contain heptanucleotide sequences related to those of other nidoviruses. The only XXXYYYZ-like sequence identified in this region of the CAVV genome is 7829CCCUUUG. However, previous systematic mutagenesis studies of ribosomal slip sites (43) revealed that the CCCUUUG heptanucleotide sequence does not promote efficient −1 ribosomal frameshifting in vitro and, to our knowledge, has not been reported to mediate efficient ribosomal frameshifting in viral or cellular systems. Further analyses revealed 7835GGAUUUU as a further candidate slip site. The GGAUUUU sequence conforms to the simultaneous-slippage model introduced by Jacks et al. (44). More importantly, data obtained for red clover necrotic mosaic (diantho) virus (RCNMV) (45, 46) have shown that this sequence mediates efficient −1 ribosomal frameshifting and thus expression of the downstream p57 polymerase ORF of RCNMV. Furthermore, the CAVV 7835GGAUUUU sequence is located 5 nt upstream of an energetically favorable RNA secondary structure. Both the length of the spacer element and the presence of a putative stem-loop structure adjacent to the proposed frameshift site support the idea that CAVV ORF1b expression is mediated by −1 ribosomal frameshifting at this site. Even though direct experimental evidence would be desirable to further corroborate this prediction, it seems reasonable to suggest that ORF1a/1b-encoded sequences are fused at a 2491Leu-Asp-Phe-Ser junction site.

Regulation of transcription.

Although all nidovirus subgenomic mRNAs are 3′ coterminal, different families and genera have different subgenomic mRNA 5′ ends. ArV and CoV subgenomic mRNAs contain a common 5′ leader sequence derived from the 5′ terminus of the genomic RNA. The leader template is fused to nascent minus strand mRNA templates via a copy choice-related template switching process called discontinuous extension of minus strands or discontinuous transcription (DT) (47, 48). Copy choice occurs at transcription-regulating sequences (TRS), comprising short conserved sequence motifs that follow the leader and precede each downstream gene’s ORF (48–50). In contrast, the subgenomic mRNAs of RoV do not contain leader sequences (24). Synthesis of subgenomic mRNA templates is believed to be mediated solely by attenuation of minus strand synthesis, involving mRNA transcription on minus-stranded replicative intermediates (nondiscontinuous transcription [NDT]) (26). ToV uses DT to express RNA2 and NDT for RNAs 3 through 5, respectively (51). To investigate the nature of subgenomic mRNAs in CAVV, total RNA was isolated from CAVV-infected cells and subjected to Northern blot analysis. RNA from noninfected C6/36 cells served as a control. To avoid detection of defective interfering RNAs, cells were infected at a low multiplicity of infection (MOI) with virus obtained from limiting dilution endpoints of early-passage supernatants. Northern blot probes were generated against the most 5′ 107 nt and the most 3′ 556 nt of the genome (Fig. 4B). Additional probes were generated for the major predicted ORFs (Fig. 4B). Apart from a band corresponding to the genome size, fragments of approximately 4.7, 2.7, and 1.8 kb were detected with the 5′ probe. These were also represented in a blot with the 3′ probe, suggesting that these RNAs are 5′ and 3′ coterminal with the genome (DT, Fig. 4C). Additional bands of ca. 1.4, 1.2, and 1.0 kb were detected with the 3′ probe but not with the 5′ probe, compatible with an NDT mechanism. The ORF2a probe detected the genome and two additional bands, one corresponding to a subgenomic RNA (sgRNA) starting upstream of ORF2a, the other within ORF2a (Fig. 4A). No band corresponding to a separate sgRNA for the predicted ORF2b was observed. A prominent band of 1.8 kb was seen with all probes in and downstream of ORF3a, suggesting an sgRNA starting upstream of ORF3a. A minor band of 1.4 kb was confirmed with the probe placed in ORF3b and all downstream probes. Two additional minor bands of ca. 1.2 and 1.0 kb were seen with the ORF4 probe and the 3′-end probe. According to estimated molecular weights, these corresponded to sgRNAs starting ca. 300 and 500 nt upstream of the initiation codon of ORF4 (19399AUG). Functional studies are required to determine these positions more precisely, as size estimates are based on RNA gel electrophoresis only. To identify potential fusion sites indicative of DT, one-step RT-PCRs were conducted for each sgRNA detected by Northern blotting. Sense primers were placed at intervals starting from the 5′ end of the genome approximately 300 nt into the genomic sequence (see Fig. S2 in the supplemental material). Antisense primers were selected according to the projected sizes of the sgRNAs. RT-PCRs targeting the 4.7-kb and 2.7-kb bands resulted in products with sense primers up to at least 126 and 152 nt, respectively, from the 5′ end of the genome. The 1.8-kb band was associated with PCR products up to 202 nt from the 5′ end of the genome. PCR products of various and unexpected sizes were observed for the smaller potential sgRNAs (i.e., all sgRNAs smaller than the 1.8-kb band in the Northern blot). Representative PCR products for all putative sgRNAs were cloned and sequenced. PCR products of unexpected sizes from small subgenomic mRNAs yielded sequences indicative of misprimed amplification in various positions of the genome, and it was concluded that these subgenomic mRNAs were generated by NDT, lacking a leader consistent with the Northern blotting results. The three sgRNAs that were clearly codetected with 5′ and 3′ probes revealed potential sites of fusions between genome leader and downstream mRNA sequences. Different sites were used for each sgRNA, as shown in Fig. 4B. Nevertheless, all sgRNAs had an A/C-rich region immediately downstream of the fusion region in common. It should be noted that Northern blot detection intensities of the 4.7-kb and 2.7-kb bands corresponded well between the 5′ and 3′ probes, whereas the detection intensity of the 1.8-kb band was much greater with the 3′ probe than with the 5′ probe (Fig. 4C). This matched the presence of a much smaller fusion region in the RT-PCR product of the 1.8-kb sgRNA. Given the high intensity of the 1.8-kb band, we suspected that a major fraction of the total amount of this sgRNA would not contain a fused leader element; nonetheless, several variant fusion sites were detected in parallel clones for all subgenomic mRNAs. Further studies are required to characterize in more detail the various CAVV RNA species and their functional relevance in CAVV genome expression.

Initial predictions on structural protein genes.

Sequence analyses of proteins predicted to be expressed from the 5 major ORFs in the 3′-proximal region of the CAVV genome revealed little (if any) similarity with other viral (and cellular) proteins, confirming that CAVV diverged profoundly from other nidoviruses and complicating functional assignments of these proteins. As summarized in Table 2, two proteins are expected to be expressed from sgRNA 2. ORF2a is predicted to encode a type I glycoprotein featuring a C-terminal membrane-spanning domain and multiple glycosylation sites. Based on these predictions, the protein likely represents a functional equivalent of the S protein of other nidoviruses, which remains to be confirmed in further studies. The predicted translation start codon of ORF2b is the second AUG on the subgenomic mRNA2, located just downstream of the ORF2a start codon, suggesting that ORF2b may be translated by a leaky scanning mechanism. ORF2b encodes a highly basic protein (pKa, 10.8) with a molecular mass of 24 kDa. Both its size and its charge suggest that this protein may be the viral nucleocapsid protein. Among nidoviruses, this upstream position of the (presumed) N protein gene in the CAVV genome is unusual but has its precedent in members of the family Roniviridae (52). ORF3a and -3b encode proteins with predicted molecular masses of 18 and 14 kDa, respectively. Database searches failed to reveal close homologs of these proteins. Protein analysis software predicts the presence of membrane-spanning domains in both proteins (residues 95 to 117 in ORF3a and residues 73 to 95 in ORF3b), suggesting that both proteins are integral membrane proteins. The ORF3a protein likely contains a signal peptidase cleavage site, 15Ala-Met-Ser|Ala-Glu, and is predicted to be glycosylated, further supporting a role as a membrane-spanning structural protein of the virus. The specific mechanism used to express the ORF3b gene product is unclear but may involve internal ribosomal entry, as shown previously for several downstream ORFs expressed from coronavirus subgenomic mRNAs (53–56). Further studies are required to establish if ORF3a and ORF3b proteins have functions related to those of the membrane-spanning M and E proteins of other nidoviruses. ORF4 encodes a small protein of 50 amino acid residues with unknown functions.

Phylogenetic analyses reveal a novel cluster of insect nidoviruses.

In order to identify the phylogenetic relationship of CAVV with other nidoviruses, we conducted phylogenetic analyses of protein alignments of conserved motifs within 3CLpro (ORF1a), RdRp, and HEL (ORF1b), as well as the putative structural S protein (Fig. 5). In all phylogenies, CAVV branched from a deep node in the nidovirus tree above the Roniviridae family. In the ORF1b and S genes, CAVV had a most recent ancestor in common with the subfamilies Torovirinae and Coronavirinae. In 3CLpro, it branched from an ancestor it has in common with the subfamily Torovirinae. The low bootstrap support at this root point suggested that CAVV might also branch from an ancestor it has in common with Torovirinae and Coronavirinae. Although formal classification criteria for nidoviruses are not established, this phylogenetic positioning suggested CAVV to be a profoundly separated cluster of nidoviruses that might constitute a new family of nidoviruses with a phylogenetic position between Roniviridae and Coronaviridae on one side and Arteriviridae on the other.
FIG 5

Phylogenetic relationship of CAVV to prototype nidovirus strains for selected genomic regions. Phylogenetic analyses were performed using the NJ algorithm, a BLOSUM62 substitution matrix, and no distance correction. Indels were fully deleted. Significance was tested by bootstrap analysis using 1,000 resampling steps as implemented in MEGA 5.0 (66). Bootstrap values are shown above nodes. Analyses were also performed using the maximum-likelihood algorithm in FastML with the same settings. Results of this analysis are not shown because of congruent topologies. Bootstrap support values (1,000 replicates) from maximum-likelihood analysis are shown in grey below nodes.

Phylogenetic relationship of CAVV to prototype nidovirus strains for selected genomic regions. Phylogenetic analyses were performed using the NJ algorithm, a BLOSUM62 substitution matrix, and no distance correction. Indels were fully deleted. Significance was tested by bootstrap analysis using 1,000 resampling steps as implemented in MEGA 5.0 (66). Bootstrap values are shown above nodes. Analyses were also performed using the maximum-likelihood algorithm in FastML with the same settings. Results of this analysis are not shown because of congruent topologies. Bootstrap support values (1,000 replicates) from maximum-likelihood analysis are shown in grey below nodes.

Conclusions

CAVV is the first mosquito nidovirus and represents the prototype species of a family in the order Nidovirales that includes features distinct from those established for the Arteriviridae, Roniviridae, and Coronaviridae. Based on morphology, conserved genome motifs, and phylogenetic relationship, CAVV cannot be assigned to one of the established nidovirus families. Further investigations are required to elucidate further details of the CAVV replication apparatus and structural protein functions. It is unknown whether CAVV infection is restricted to mosquitoes or if transmission to other hosts, potentially vertebrates, occurs. It is interesting that CoV has not been detected in insects, but that the typical reservoir hosts—bats for alpha- and beta-CoV and birds for gamma-CoV—are largely insect feeding. Common ancestors of CAVV and CoV may thus have been insect borne and have diverged after independent host switches to bats and birds. This is in contrast to earlier proposals suggesting acquisition of gamma-CoV by birds from bats via raptors (57) but in agreement with hypotheses that emphasize CoV phylogeny and ecological considerations (58). Moreover, it has been suspected from epidemiological observations that a link between ToV and insects may exist (59). Even though it has not been confirmed that these viruses are carried by insects, the epidemiological implications of insects suggest at least ecologically relevant contact between the virus and its host. An ancestral existence of nidoviruses in arthropods is also supported by the phylogeny of the Nidovirales CAVV included; phylogenetically basal RoV is hosted by crustaceans, which, like mosquitoes, belong to the phylum Arthropoda. An arthropod host at the root of the Nidovirales tree would provide a parsimonious explanation for host associations of several of the known Nidovirales. Our data on CAVV prevalence and divergence demonstrate how a virus may evolve through emergence from a pristine rainforest habitat into surrounding areas of less host biodiversity due to anthropogenic modification. Critically, while extending out of the primary forest habitat, the virus seemed to narrow its genetic diversity while increasing in prevalence (reverse dilution effect [13, 60, 61]). Further investigations are needed to untangle if the higher prevalence of CAVV in human settlements relates to a higher density of hosts (insects or vertebrates) or virus adaption. This principal understanding is necessary to develop experimental ecology models of virus-vector dynamics. This study underlines the importance of linking ecosystem biology and virus ecology to unravel the role of ecosystem modifications in the emergence of novel pathogens.

MATERIALS AND METHODS

Virus isolation and purification.

Virus isolation from 432 pools of 4,839 female mosquito heads was done with Aedes albopictus (C6/36) cells as described previously (15). For virus growth kinetics, C6/36 cells were infected with an MOI of 0.1, 0.01, or 0.001 and incubated for 1 h at 28°C. The inoculum was removed, and cells were washed with phosphate-buffered saline (PBS). L-15 medium was added, and cells were incubated for 48 h. Every 3 h, an aliquot of the cell culture supernatant was removed, RNA was extracted using the QIAamp Viral RNA Mini Kit (Qiagen, Hilden, Germany), cDNA was synthesized using the SuperScript III RT System (Invitrogen, Karlsruhe, Germany), and CAVV viral genome copy numbers were quantified by real-time RT-PCR (14). To obtain pure virus stocks, CAVV was titrated on insect cells and cell culture supernatant was harvested at 22 hpi to allow only single-round infections. Numbers of genome copies were measured by real-time RT-PCR, and the infectious supernatant of the highest dilution still showing virus replication was used for retitration of CAVV on insect cells. This procedure was repeated five times. For purification, CAVV was harvested by freeze-thawing of infected cells. Cell debris was removed by centrifugation at 3,000 rpm for 20 min, followed by ultracentrifugation through a 36% sucrose cushion at 35,000 rpm (SW40 rotor; Beckman) for 2 h at 4°C. The virus pellet was suspended in 150 µl PBS overnight at 4°C. Further purification was achieved on a continuous gradient of 1 to 2 M sucrose in 0.01 M Tris-HCl–4 mM Na-EDTA at 35,000 rpm (SW40 rotor; Beckman) for 22 h at 4°C. Virus-containing fractions were tested by real-time RT-PCR, and fractions with the highest sequence titers were concentrated through a 36% sucrose cushion at 35,000 rpm (SW40 rotor; Beckman) for 2 h at 4°C. The virus pellet was suspended in 150 µl PBS buffer overnight at 4°C.

Electron microscopy.

For electron microscopy, viral particles were purified through a cushion of 36% sucrose and the pellet was suspended in PBS (15, 16). Viral particles were fixed with 2% paraformaldehyde and analyzed by transmission electron microscopy after negative staining with 1% uranyl acetate (62, 63). For ultrathin sections, infected cells were fixed with 2.5% glutaraldehyde, enclosed in low-melting-point agar, embedded in resin, and evaluated by transmission electron microscopy after ultrathin sectioning.

Genome sequencing.

RNA extracted from purified virus preparations was used for unbiased high-throughput sequencing and for conventional sequencing approaches. Genome fragments were generated by adaptor-based random RT-PCR. Following protocols described previously (15, 64), in this specific study, random hexamers linked to a defined primer sequence tail or oligonucleotides that bind to the conserved TRS elements of CoV linked to an oligonucleotide anchor were applied (see Table S1 in the supplemental material).

Genome characterization and phylogenetic analyses.

The nucleotide sequence of the CAVV genome was analyzed for ORFs and translated. Nucleotide and amino acid sequences were compared with other sequences by BLASTn, BLASTx, tBLASTx, and psiBLAST with the GenBank database (), and protein motifs were identified by web-based comparison to the Pfam database (). Identification of cleavage sites for signal peptides was accomplished by using signalP-NN (). Prediction of the hydropathy profile was performed by TMHMM v2.0 (), and N-linked glycosylation sites were identified using the NetNGlyc 1.0 server (). RNA folding was modeled by using the Mfold server () (65). For phylogenetic analysis, CAVV amino acid sequences were aligned with representative sequences of other nidoviruses in MEGA v5.0 (66). Alignments were optimized according to published crystal structure predictions. Phylogenetic analysis of amino acid sequences was conducted by the neighbor-joining (NJ) algorithm with the BLOSUM62 substitution matrix for distance correction with 1,000 bootstrap replicates in MEGA v5.0. Maximum-likelihood analyses were done with Fasttree (67), and tree files were displayed in MEGA versus 5.0. Evolutionary divergence over sequence pairs was estimated in MEGA versus 5.0.

Identification of subgenomic mRNAs.

For Northern blotting, the Northern Blot Starter Kit (Roche, Mannheim, Germany) was used. Digoxigenin (DIG)-labeled probes were generated by PCR using the primers shown in Fig. 4B and listed in Table S1 in the supplemental material. Total RNA of isolate CAVV/C79/CI/2004 was extracted with the Qiagen RNeasy Kit (Qiagen, Hilden, Germany) from C6/36 cells at 24 hpi. RNA was separated on a 2% formaldehyde–1.5% agarose gel, blotted onto a nylon membrane (Roche, Mannheim, Germany), and hybridized with the CAVV-specific, DIG-labeled probes. RNAs were analyzed by chemiluminescence using 1:10,000 anti-DIG–alkaline phosphatase Fab fragments and 1:100 CDP-Star reagent (Roche, Mannheim, Germany). Multiple sequence alignments of putative CAVV replicative proteins. (A) Alignment of astro-, poty-, and nidovirus 3C-like protease domains. Putative catalytic Cys, His, and Asp/Glu residues are indicated by asterisks. Also indicated are conserved residues predicted to be part of the S1 specificity subsite (&). (B) Alignment of CAVV and coronavirus helicase-associated N-terminal ZBDs. Putative zinc-binding residues are indicated (#). (C) Alignment of nidoviral endoribonuclease (NendoU) domains. Putative catalytic residues (His, His, Lys) are indicated by asterisks. Partial pp1a/pp1ab sequences were aligned using ClustalX 2.0 (Larkin MA, et al., Bioinformatics 23:2947–2948, 2007) and rendered with ESPript 2.2 (http://espript.ibcp.fr/ESPript/cgi-bin/ESPript.cgi) (Gouet P, Courcelle E, Stuart DI, Metoz F, Bioinformatics 15:305–308, 1999). Abbreviations of virus names and accession numbers are as follows: BToV, bovine torovirus (strain Breda 1) (AY427798); EToV, equine torovirus (strain Berne) (X52374); WBV, white bream virus (strain DF24/00) (NC_008516); EAV, equine arteritis virus (strain Bucyrus) (NP_127506); PRRSV, porcine reproductive and respiratory syndrome virus (strain Lelystad) (Q04561); LDV, lactate dehydrogenase-elevating virus (neurovirulent type C) (Q06502); GAV, gill-associated virus (AF227196); TEV, tobacco etch virus (AAA47909); HAstV-1, human astrovirus 1 (AAW51880); HCoV-229E, human coronavirus 229E (NC_002645); HCoV-NL63, human coronavirus NL63 (NC_005831); TGEV, transmissible gastroenteritis virus (strain Purdue-115) (Z34093); MHV-A59, murine hepatitis virus (strain A59) (NC_001846); SARS-CoV, severe acute respiratory syndrome-related CoV (strain Frankfurt-1) (AY291315); HCoV-OC43, human CoV strain OC43 (AY585229); IBV, avian infectious peritonitis virus (strain Beaudette) (NC_001451). Download Figure S1, PDF file, 0.420 MB. Figure S1, PDF file, 0.420 MB CAVV leader-to-body fusion sites. Leader-to-body fusion sites of CAVV subgenomic mRNAs were identified using the SuperScript III One-Step RT-PCR System with Platinum Taq High Fidelity (Invitrogen, Karlsruhe, Germany) with leader-specific primers in combination with subgenomic-mRNA-specific reverse primers (primers are indicated by arrowheads) (see Table S1). Primers that yielded no PCR products are shown in grey. PCR products were sequenced and, in the case of unclear sequencing results, TA cloned into the pCR2.1 TOPO vector (Invitrogen, Karlsruhe, Germany) and resequenced. Download Figure S2, JPG file, 0.315 MB. Figure S2, JPG file, 0.315 MB CAVV primers used in this study. Table S1, PDF file, 0.204 MB.
  56 in total

Review 1.  Virus-encoded proteinases and proteolytic processing in the Nidovirales.

Authors:  J Ziebuhr; E J Snijder; A E Gorbalenya
Journal:  J Gen Virol       Date:  2000-04       Impact factor: 3.891

Review 2.  Coronaviruses use discontinuous extension for synthesis of subgenome-length negative strands.

Authors:  S G Sawicki; D L Sawicki
Journal:  Adv Exp Med Biol       Date:  1995       Impact factor: 2.622

3.  Diversity, decoys and the dilution effect: how ecological communities affect disease risk.

Authors:  P T J Johnson; D W Thieltges
Journal:  J Exp Biol       Date:  2010-03-15       Impact factor: 3.312

4.  Gill-associated nidovirus of Penaeus monodon prawns transcribes 3'-coterminal subgenomic mRNAs that do not possess 5'-leader sequences.

Authors:  Jeff A Cowley; Christine M Dimmock; Peter J Walker
Journal:  J Gen Virol       Date:  2002-04       Impact factor: 3.891

Review 5.  Biodiversity loss and the rise of zoonotic pathogens.

Authors:  R S Ostfeld
Journal:  Clin Microbiol Infect       Date:  2009-01       Impact factor: 8.067

6.  Major genetic marker of nidoviruses encodes a replicative endoribonuclease.

Authors:  Konstantin A Ivanov; Tobias Hertzig; Mikhail Rozanov; Sonja Bayer; Volker Thiel; Alexander E Gorbalenya; John Ziebuhr
Journal:  Proc Natl Acad Sci U S A       Date:  2004-08-10       Impact factor: 11.205

Review 7.  Coronavirus diversity, phylogeny and interspecies jumping.

Authors:  Patrick C Y Woo; Susanna K P Lau; Yi Huang; Kwok-Yung Yuen
Journal:  Exp Biol Med (Maywood)       Date:  2009-06-22

8.  Discontinuous and non-discontinuous subgenomic RNA transcription in a nidovirus.

Authors:  A L W van Vliet; S L Smits; P J M Rottier; R J de Groot
Journal:  EMBO J       Date:  2002-12-02       Impact factor: 11.598

Review 9.  The molecular biology of coronaviruses.

Authors:  M M Lai; D Cavanagh
Journal:  Adv Virus Res       Date:  1997       Impact factor: 9.937

10.  Mutational analysis of the "slippery-sequence" component of a coronavirus ribosomal frameshifting signal.

Authors:  I Brierley; A J Jenner; S C Inglis
Journal:  J Mol Biol       Date:  1992-09-20       Impact factor: 5.469

View more
  68 in total

Review 1.  Cell Walls and the Convergent Evolution of the Viral Envelope.

Authors:  Jan P Buchmann; Edward C Holmes
Journal:  Microbiol Mol Biol Rev       Date:  2015-12       Impact factor: 11.056

2.  Discovery of a Highly Divergent Coronavirus in the Asian House Shrew from China Illuminates the Origin of the Alphacoronaviruses.

Authors:  Wen Wang; Xian-Dan Lin; Yong Liao; Xiao-Qing Guan; Wen-Ping Guo; Jian-Guang Xing; Edward C Holmes; Yong-Zhen Zhang
Journal:  J Virol       Date:  2017-08-10       Impact factor: 5.103

3.  Structural basis and functional analysis of the SARS coronavirus nsp14-nsp10 complex.

Authors:  Yuanyuan Ma; Lijie Wu; Neil Shaw; Yan Gao; Jin Wang; Yuna Sun; Zhiyong Lou; Liming Yan; Rongguang Zhang; Zihe Rao
Journal:  Proc Natl Acad Sci U S A       Date:  2015-07-09       Impact factor: 11.205

4.  Negevirus: a proposed new taxon of insect-specific viruses with wide geographic distribution.

Authors:  Nikos Vasilakis; Naomi L Forrester; Gustavo Palacios; Farooq Nasar; Nazir Savji; Shannan L Rossi; Hilda Guzman; Thomas G Wood; Vsevolod Popov; Rodion Gorchakov; Ana Vázquez González; Andrew D Haddow; Douglas M Watts; Amelia P A Travassos da Rosa; Scott C Weaver; W Ian Lipkin; Robert B Tesh
Journal:  J Virol       Date:  2012-12-19       Impact factor: 5.103

Review 5.  Ribosomal frameshifting and transcriptional slippage: From genetic steganography and cryptography to adventitious use.

Authors:  John F Atkins; Gary Loughran; Pramod R Bhatt; Andrew E Firth; Pavel V Baranov
Journal:  Nucleic Acids Res       Date:  2016-07-19       Impact factor: 16.971

6.  Detection of novel and recognized RNA viruses in mosquitoes from the Yucatan Peninsula of Mexico using metagenomics and characterization of their in vitro host ranges.

Authors:  Jermilia Charles; Chandra S Tangudu; Stefanie L Hurt; Charlotte Tumescheit; Andrew E Firth; Julian E Garcia-Rejon; Carlos Machain-Williams; Bradley J Blitvich
Journal:  J Gen Virol       Date:  2018-11-09       Impact factor: 3.891

7.  Identification and characterization of genetically divergent members of the newly established family Mesoniviridae.

Authors:  Florian Zirkel; Hanna Roth; Andreas Kurth; Christian Drosten; John Ziebuhr; Sandra Junglen
Journal:  J Virol       Date:  2013-03-27       Impact factor: 5.103

8.  Evolutionary origins of hepatitis A virus in small mammals.

Authors:  Jan Felix Drexler; Victor M Corman; Alexander N Lukashev; Judith M A van den Brand; Anatoly P Gmyl; Sebastian Brünink; Andrea Rasche; Nicole Seggewiβ; Hui Feng; Lonneke M Leijten; Peter Vallo; Thijs Kuiken; Andreas Dotzauer; Rainer G Ulrich; Stanley M Lemon; Christian Drosten
Journal:  Proc Natl Acad Sci U S A       Date:  2015-11-02       Impact factor: 11.205

9.  Genomes of viral isolates derived from different mosquitos species.

Authors:  Mohammadreza Sadeghi; Vsevolod Popov; Hilda Guzman; Tung Gia Phan; Nikos Vasilakis; Robert Tesh; Eric Delwart
Journal:  Virus Res       Date:  2017-09-21       Impact factor: 3.303

Review 10.  Insect-specific viruses and their potential impact on arbovirus transmission.

Authors:  Nikos Vasilakis; Robert B Tesh
Journal:  Curr Opin Virol       Date:  2015-08-31       Impact factor: 7.090

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.