Literature DB >> 35527222

Genomics and proteomics of Apis mellifera filamentous virus isolated from honeybees in China.

Dahe Yang1, Jun Wang2, Xi Wang2, Fei Deng2, Qingyun Diao3, Manli Wang2, Zhihong Hu4, Chunsheng Hou5.   

Abstract

Apis mellifera filamentous virus (AmFV) is a large DNA virus that is endemic in honeybee colonies. The genome sequence of the AmFV Swiss isolate (AmFV CH-C05) has been reported, but so far very few molecular studies have been conducted on this virus. In this study, we isolated and purified AmFV (AmFV CN) from Chinese honeybee (Apis mellifera) colonies and elucidated its genomics and proteomics. Electron microscopy showed ovoid purified virions with dimensions of 300-500 ​× ​210-285 ​nm, wrapping a 3165 ​× ​40 ​nm filamentous nucleocapsid in three figure-eight loops. Unlike AmFV CH-C05, which was reported to have a circular genome, our data suggest that AmFV CN has a linear genome of approximately 493 ​kb. A total of 197 ORFs were identified, among which 36 putative genes including 18 baculoviral homologs were annotated. The overall nucleotide similarity between the CN and CH-C05 isolates was 96.9%. Several ORFs were newly annotated in AmFV CN, including homologs of per os infectivity factor 4 (PIF4) and a putative integrase. Phylogenomic analysis placed AmFVs on a separate branch within the newly proposed virus class Naldaviricetes. Proteomic analysis revealed 47 AmFV virion-associated proteins, of which 14 had over 50% sequence coverage, suggesting that they are likely to be main structural proteins. In addition, all six of the annotated PIFs (PIF-0-5) were identified by proteomics, suggesting that they may function as entry factors in AmFV infection. This study provides fundamental information regarding the molecular biology of AmFV.
Copyright © 2022 The Authors. Publishing services by Elsevier B.V. All rights reserved.

Entities:  

Keywords:  Apis mellifera filamentous Virus (AmFV); Genome sequence; Naldaviricetes; Proteomics; Structural proteins; per os infectivity factor 4 (PIF4)

Mesh:

Year:  2022        PMID: 35527222      PMCID: PMC9437511          DOI: 10.1016/j.virs.2022.02.007

Source DB:  PubMed          Journal:  Virol Sin        ISSN: 1995-820X            Impact factor:   6.947


Introduction

Honeybees are indispensable pollinators in natural ecosystems and contribute to the production of approximately 70% of the crops used for human consumption (Klein et al., 2007). However, multiple factors threaten honeybees, such as pesticides, and parasites including viruses (Brosi et al., 2017). To date, more than 30 honeybee viruses have been reported, most of which are RNA viruses (McMenamin and Genersch, 2015; Remnant et al., 2017; Beaurepaire et al., 2020). Apis mellifera filamentous virus (AmFV) is one of the few DNA viruses identified in honeybees and has been studied less than the RNA viruses. AmFV was initially reported as a honeybee pathogen in the United States in 1978 (Clark, 1978). Acutely infected bees become weak and gather at the hive entrance, while severely infected honeybees exhibit milky-white hemolymph due to tissue degradation (Clark, 1978). In general, AmFV appears to be a weak pathogen, but is endemic in honeybee colonies. For example, it is considered to be the most common and least harmful bee virus in Britain (Bailey, 1982). While AmFV appears to be a weak pathogen, it is not unreasonable to suggest that it may weaken the bee to an extent that makes it more susceptible to other pathogens. Initially, the presence of AmFV was diagnosed using electron microscopy. AmFV was first reported to be an ellipsoidal (400 ​× ​100 ​nm), enveloped virus with a long filamentous nucleocapsid (3060 ​× ​60 ​nm) (Clark, 1978). Later, it was characterized as a DNA virus of slightly different size (450 ​× ​150 ​nm and 3000 ​× ​40 ​nm for virion and nucleocapsid, respectively) (Bailey et al., 1981). The nucleocapsid morphology of AmFV is unique in that it forms three figure-eight loops inside the envelope (Sitaropoulouab et al., 1989). Partial sequences of AmFV were first derived from genome sequencing of Varroa destructor mites, but at the time they were referred to as baculovirus-related (Cornman et al., 2010). A breakthrough in AmFV research was made in 2015, when the complete viral genome was sequenced from infected worker honeybees collected in Switzerland (Gauthier et al., 2015); this led to the molecular detection of AmFV in practice. The prevalence of AmFV has been surveyed in the USA, Switzerland, France, Sweden, China, Syria, Czech Republic, and Argentina, showing that it is commonly found worldwide (Gauthier et al., 2015; Hartmann et al., 2015; Hou et al., 2016; Abou Kubaa, 2018; Prodelalova et al., 2019; Quintana, 2019). AmFV is detectable year-round, with higher viral copy numbers found in the Spring (Hartmann et al., 2015). Apart from honeybees, AmFVs have been detected in a wide range of solitary bee species (Ravoet et al., 2014) and even in honey (Bovo et al., 2018, 2020). The double-stranded DNA genome of the Swiss strain AmFV (AmFV CH–C05) is approximately 498 ​kb, encoding 247 open reading frames (ORFs). However, only a small portion (∼16%) of the predicted ORFs have been annotated, including 13 homologs of baculovirus genes, including per os infectivity factors (PIFs) and baculovirus repeated ORFs (BROs) (Gauthier et al., 2015). Apart from genome sequencing, very few molecular studies have been conducted on AmFV. An early biochemistry study revealed that there were 12 AmFV structural proteins (Bailey et al., 1981), but to our knowledge, no further relevant investigation has been performed since then. Molecular characterization of AmFV is indispensable to better understand its pathogenicity and its interaction with its host. In this study, we isolated and purified AmFV virions from honeybee colonies in China and conducted next-generation genomic sequencing and compared it to that of the Swiss strain. We performed proteomics to identify virion-associated proteins.

Materials and methods

Virus purification and viral DNA extraction

A. mellifera naturally infected with AmFV were collected from Henan Province, China. The AmFV infection was detected by PCR. Briefly, total DNA was extracted from bee workers by phenol-chloroform and ethanol precipitation. PCR amplification was performed using AmFV specific primers (5′-CAGAGAATTCGGTTTTTGTGAGTG-3′ and 5′-CATGGTGGCCAAGTCTTGCT-3′) (Gauthier et al., 2015). The identity of the PCR products was further confirmed by Sanger sequencing. Virions (AmFV CN) were purified from approximately 40 honeybee adults as previously described (Bailey et al., 1981; Laughton and Siva-Jothy, 2011) with slight modifications. Briefly, bees were homogenized in extraction buffer (0.01 ​mol/L ammonium acetate, 0.02% diethyldithiocarbamate, and 0.01% Triton X-100). The homogenate was filtered using 4-layer gauze, then centrifuged at 1000 ​g for 30 ​min. The supernatant was centrifuged on a sucrose density gradient (20%–60%) at 40000 ​g for 1 ​h. The band at 50% sucrose was collected and purified to remove sucrose. The purified virions were imaged by transmission electron microscopy (TEM) using a 100 ​kV Hitachi H-7000FA microscope. DNA extraction was performed as previously described (Gauthier et al., 2015).

Genome sequencing and bioinformatic analysis

Virion DNA was sequenced using the Illumina Hiseq 3000 System with shotgun strategy at the Sequencing Platform of the National Key Laboratory of Crop Genetic Improvement at Huazhong Agricultural University (Wuhan, China). The reads were quality controlled and preprocessed using Trimmomatic (version 0.32), then assembled with Trinity (version 2.5.1). Gaps and unreadable sequences were amplified by PCR and confirmed by Sanger sequencing. ORFs were identified using the FGENESV program (http://linux1.softberry.com/berry.phtml) and ORFfinder (http://www.ncbi.nlm.nih.gov/orffinder/), adopting the criteria of polypeptide length >100, standard ATG start codon, and minimal overlap. A genome map was constructed using an in-house Python script. Gene annotation and function prediction were performed using the NCBI BLASTP algorithm (https://blast.ncbi.nlm.nih.gov/Blast.cgi) against the nr protein and UniRef90 (https://www.uniprot.org/blast/) databases. Conserved domains were determined using RPS-BLAST with the Conserved Domain Database (https://www.ncbi.nlm.nih.gov/Structure/cdd/wrpsb.cgi) and the hmmscan to Pfam database, with a minimum e-value of 1.0e-3 in both cases. The annotated genome sequence data have been submitted to GenBank (http://www.ncbi.nlm.nih.gov/genbank) under accession number OK392616. The genome sequence of AmFV CH–C05 was used as a reference for comparison (GenBank accession number NC_027925.1, Gauthier et al., 2015).

Phylogenetic analysis

All PIF sequences were aligned using ClustalW with MEGA6 by using default settings. Phylogenetic analysis was conducted using the concatenated PIF amino-acid sequences. The phylogenetic tree was constructed using MEGA6 with the substitution model (LG ​+ ​G ​+ ​I) by using the maximum-likelihood method with 1000 bootstrap replicates.

Proteomics

Shotgun proteomics was used to identify AmFV virion-associated proteins. Briefly, purified AmFV virions were suspended, reduced, alkylated, and subjected to in-solution trypsin digestion. Digested peptides were subjected to liquid chromatography-tandem mass spectrometry (LC-MS/MS) using Q-Exactive Plus coupled to an Easy nLC 1200 (Thermo Fisher Scientific). Proteomics were performed by Bioprofile (Shanghai, China). Peptide sequences were analyzed using the UniProt Protein Database (https://www.uniprot.org/) and Conserved Domain Database (https://www.ncbi.nlm.nih.gov/Structure/cdd/cdd.shtml; Lu et al., 2000) with an E-value < 0.1.

Results

AmFV purification and morphology

AmFVs were purified from naturally infected honeybees collected in Henan Province, China. Transmission electron micrographs showed the virions were ovoid in shape with a size of approximately 300–500 ​× ​210–285 ​nm (Fig. 1A), wrapping a long filamentous nucleoprotein into three superimposed figure-eight loops (Fig. 1B). The size of the filamentous nucleoprotein was approximately 3165 ​× ​40 ​nm. The morphology we observed was similar to those reported in previous studies (Clark, 1978; Bailey et al., 1981; Sitaropoulouab et al., 1989).
Fig. 1

Transmission electron micrographs of purified AmFV. A. Whole virions. B. Representative filamentous nucleocapsid with three figure-eight loops. Scale bar, 200 ​nm.

Transmission electron micrographs of purified AmFV. A. Whole virions. B. Representative filamentous nucleocapsid with three figure-eight loops. Scale bar, 200 ​nm.

Genome overview

The AmFV CN genome was assembled from 37,330,706 high-quality reads with an average coverage of 7576 ​×. There were a few gaps (total of ∼2843 bp) that could not be filled by PCR and Sanger sequencing, likely due to the complexity of the DNA sequence and/or structure in those regions. Likewise, gaps of approximately 2500 bp also exist in the reported AmFV CH–C05 genome (Gauthier et al., 2015). The size of the assembled genome was 492,752 bp, which was 3644 bp shorter than that of CH–C05 (∼496,396 bp). The overall identity between AmFV CN and CH–C05 was 96.9%. The G ​+ ​C content of AmFV CN was 50.6%, similar to the 50.9% of CH-05 (Gauthier et al., 2015). However, unlike the CH–C05 genome, which was reported to be circular (Gauthier et al., 2015), our data indicate that the AmFV CN has a linear genome, with no evidence of overlap between the two ends during genome assembly. Initially, 157 methionine-initiated ORFs with a minimum length of 100 residues were predicted in AmFV CN. When the genome of AmFV CH-05 (Gauthier et al., 2015) was used as the reference, an additional 40 ORFs with lengths shorter than 100 residues were identified in AmFV CN, increasing the total ORF number to 197 (Fig. 2). Coding regions constituted 63% of the AmFV CN genome, similar to that observed in the CH-05 genome (65%). In comparison to CH–C05 (a total of 241 ORFs), 45 ORFs were missing in AmFV CN, of which 32 were shorter than 100 residues (Table 1). For consistency, the ORFs in AmFV CN were named from their homologs in CH-05 (NC_027925.1). Notably, ORF183 of CH-C05 was split into two separate ORFs in AmFV CN, which were designated AmFV_183 and AmFV_183a. The genome organization between CN and CH-C05 is highly conserved.
Fig. 2

Diagram of the AmFV CN genome. The linear genome of AmFV is shown with marked length. The arrows indicate predicted ORFs and direction of transcription. ORFs predicted to be related to DNA replication/metabolism, PIFs, and BROs are shown in red, green, and brown, respectively. ORFs identified by proteomics are displayed in a blue font.

Table 1

AmFV CH-05 ORFs not present in AmFV CN.

No.ORFProteinLength (aa)
1AmFV_014hypothetical protein57
2AmFV_026hypothetical protein105
3AmFV_032hypothetical protein61
4AmFV_033hypothetical protein52
5AmFV_035hypothetical protein68
6AmFV_036hypothetical protein72
7AmFV_038hypothetical protein59
8AmFV_039hypothetical protein93
9AmFV_040hypothetical protein105
10AmFV_044hypothetical protein56
11AmFV_046hypothetical protein57
12AmFV_047hypothetical protein33
13AmFV_061hypothetical protein66
14AmFV_090hypothetical protein114
15AmFV_094hypothetical protein55
16AmFV_121hypothetical protein50
17AmFV_131hypothetical protein37
18AmFV_153hypothetical protein58
19AmFV_155hypothetical protein83
20AmFV_160hypothetical protein52
21AmFV_163hypothetical protein58
22AmFV_167hypothetical protein69
23AmFV_176hypothetical protein35
24AmFV_179hypothetical protein26
25AmFV_186hypothetical protein98
26AmFV_190hypothetical protein108
27AmFV_191hypothetical protein56
28AmFV_192hypothetical protein50
29AmFV_194hypothetical protein84
30AmFV_196hypothetical protein173
31AmFV_197hypothetical protein116
32AmFV_199hypothetical protein49
33AmFV_202hypothetical protein346
34AmFV_204hypothetical protein28
35AmFV_205hypothetical protein44
36AmFV_208hypothetical protein58
37AmFV_209hypothetical protein136
38AmFV_217hypothetical protein143
39AmFV_222hypothetical protein75
40AmFV_227hypothetical protein37
41AmFV_236hypothetical protein648
42AmFV_238hypothetical protein80
43AmFV_239hypothetical protein206
44AmFV_240hypothetical protein278
45AmFV_241hypothetical protein118
Diagram of the AmFV CN genome. The linear genome of AmFV is shown with marked length. The arrows indicate predicted ORFs and direction of transcription. ORFs predicted to be related to DNA replication/metabolism, PIFs, and BROs are shown in red, green, and brown, respectively. ORFs identified by proteomics are displayed in a blue font. AmFV CH-05 ORFs not present in AmFV CN. The ORFs were annotated on the basis of homology. Thirty-six ORFs had homologs in the genomes of other species (from viruses, eukaryotes, and bacteria) in public sequence databases (Table 2). Based on predicted functions from databases, these ORFs were categorized as: 6 potential DNA replication and nucleotide metabolism, 6 PIFs, 9 BROs, and 15 with putative other or unknown functions. All 36 annotated ORFs in AmFV CN were present in AmFV CH-C05 with high protein sequence identities (>90%, Table 2).
Table 2

Putative genes in AmFV CN Genome.

Putative functionORFsize (aa)Putative proteinBest match with Pfam-A database
Best match with BLASTP
AmFV CH–C05 identity
Pfam code dmainE-valuePfam noSpecies/VirusScoreE-valueSimilarityaacession no
DNA Replication and nucleotide metabolismAmFV_027610Thymidylate synthaseThymidylat_synt1.5E-85PF00303.19Malassezia pachydermatis2843.0E-8445.5%XP_017993541.198.2%
AmFV_042556Integrasephage_intergrase2.5E-05PF00589.22Fibrobacter sp.55.12.0E-0427.2%NLD99342.198.5%
AmFV_0741955DNA PolDNA_pol_B4.5E-14PF00136.2199.1%
AmFV_0951603DNA ligaseDNA_ligase_A_N8.3E-09PF04675.14Athalia rosae999.0E-1724.0%XP_025602269.199.6%
AmFV_114879RR1Ribonuc_red_IgC0.0E+00PF02867.15Cytophagaceae bacterium7300.0E+0044.4%ODS80019.199.7%
AmFV_2162469RR2Ribonuc_red_sm2.1E-101PF00268.21Uncultured virus3912.0E-121100.0%ADD74390.1100.0%
PIFsAmFV_057334PIF-5Tipula oleracea nudivirus1704.8E-0620.0%YP_009116743.199.1%
AmFV_060830PIF-1PIF7.0E-16PF05092.12Cyclophragma undans nucleopolyhedrovirus914.0E-1530.0%YP_010086634.198.6%
AmFV_0771188PIF-0Baculo_p74PF04583.12Hyphantria cunea nucleopolyhedrovirus1566.4E-0729.4%YP_473207.198.6%
AmFV_088280PIF-3PIF34.4E-04PF05006.12Malacosoma sp. alphabaculovirus541.0E-0420.7%ANW12283.1100.0%
AmFV_100402PIF-2PIF26.0E-18PF04631.12Mauternbach virus922.0E-1631.2%AYP97928.199.8%
AmFV_157204PIF-4Baculo_193.2E-09PF04798.12100.0%
BROsAmFV_008182BRO-1Chrysodeixis chalcites nucleopolyhedrovirus531.0E-0433.3%YP_249718.1100.0%
AmFV_0161306BRO-2Bro-N2.9E-09PF02498.17Helicoverpa armigera nucleopolyhedrovirus1791.60E-629.1%AMN15974.294.0%
AmFV_069262BRO-3AmFV13090.0E+0098.9%YP_009165820.198.9%
AmFV_075434BRO-4Spodoptera litura granulovirus529.0E-0528.0%YP_001257066.198.9%
AmFV_106667BRO-5Bro-N3.2E-07PF02498.17Chrysodeixis includens nucleopolyhedrovirus773.0E-1225.0%AOL57177.195.8%
AmFV_108627BRO-6Bro-N8.6E-04PF02498.17Spodoptera frugiperda ascovirus 1a666.0E-0929.0%YP_762434.192.3%
AmFV_110499BRO-7Bro-N6.6E-29PF02498.17Chrysodeixis chalcites nucleopolyhedrovirus872.0E-1529.8%AGE61478.197.0%
AmFV_111437BRO-8AmFV2011.9E-1425.2%YP_009165857.198.4%
AmFV_133157BRO-9Bro-N4.1E-03PF02498.1798.7%
OthersAmFV_0061699Protein kinase (PK)Acanthamoeba polyphaga mimivirus1116.7E-0326.0%AKI8006999.0%
AmFV_009442PARPTrypan_PARP3.1E-04PF05887.11Paramecium bursaria Chlorella virus CviKI1792.4E-0928.4%AGE51657.199.8%
AmFV_023648ATPaseAAA domain4.0E-26PF00004.29Lasius niger1241.0E-2734.0%KMQ86848.199.2%
AmFV_034501Serpin-likePacifastin inhibitor (LCMII)4.6E-10PF05375.13Blattella germanica2696.5E-2434.9%PSN39366.197.0%
AmFV_0431633Myristoylated membraneMimivirus sp. SH1866.2E-1246.7%AZL89416.196.0%
AmFV_068326RING finger protein 413Rzf-C3HC4_32.1E-03PF13920.6100.0%
AmFV_080572RING finger proteinCollichthys lucidus521.0E-0333.3%TKS65457.198.6%
AmFV_082510HZV 115-likeDUF45802.3E-05PF15162.6Oryctes rhinoceros nudivirus853.0E-1531.1%YP_002321369.199.8%
AmFV_091534hypothetical proteinHirundo rustica rustica1445.0E-0440.2%RMB88007.198.2%
AmFV_1011982Gamma-glutamyltranspeptidaseG_glu_transpept1.6E-14PF01019.21Diachasmimorpha longicaudata entomopoxvirus1475.9E-0734.6%AKS26328.197.5%
AmFV_113583hypothetical proteinThalassiosira oceanica1452.4E-0832.5%EJK57244.190.5%
AmFV_123948hypothetical proteinHarpegnathos saltator1221.0E-2429.4%EFN83926.199.8%
AmFV_1681236MdSGHV 070Musca hytrosavirus1662.3E-0429.6%YP_001883398.197.3%
AmFV_193969Chitin-bindingLOMP_108.6E-36PF03067.15Apis mellifera1362.0E-3142.4%WP_180560000.196.0%
AmFV_235334PLCPI-PLC-X3.6E-11PF00388.19Taibaiella koreensis726.0E-1033.6%WP_118975952.1100.0%
Putative genes in AmFV CN Genome. As a large DNA virus, AmFV CN has several key factors that facilitate DNA replication. AmFV_074 encodes a hypothetical type-B DNA polymerase with a predicted length of 1960 residues. AmFV_042 and AmFV_095 are a putative integrase and DNA ligase, respectively, which are likely involved in viral DNA replication. Like many DNA viruses, AmFV also encodes enzymes involved in nucleotide metabolism. AmFV_027 is a homolog of thymidylate synthase, which catalyzes the conversion of deoxyuridine monophosphate (dUMP) to deoxythymidine monophosphate (dTMP). AmFV_114 and AmFV_216 are homologs of ribonucleoside reductase subunits 1 and 2 (RR1 and RR2), respectively, which catalyzes the formation of deoxyribonucleotides from ribonucleotides and provides precursors for viral DNA synthesis. Among these six ORFs, the putative integrase (AmFV_042) was newly annotated in AmFV CN. There are six ORFs in AmFV that are homologs of baculoviral PIFs: PIF0 (AmFV_077), PIF1 (AmFV_060), PIF2 (AmFV_100), PIF3 (AmFV_088), PIF4 (AmFV_157), and PIF5 (AmFV_057). In comparison to the reported AmFV CH–C05 genome sequence (Gauthier et al., 2015), PIF4 was newly annotated in this study. PIFs are essential for oral infection by baculoviruses; ten (PIF0–9) have been identified in baculoviruses, with all but PIF5 forming a PIF complex of ∼520 ​kDa to initiate midgut infection (Wang et al., 2019). PIFs are also found in other large arthropod DNA viruses, including nudiviruses, white spot syndrome virus (WSSV), and salivary-gland hypertrophy viruses (Escobedo-Bonilla et al., 2008; Wang and Jehle, 2009; Lietze et al., 2011; Bezier et al., 2015). It is not yet known whether AmFV also produces a functional PIF complex or it has other unidentified PIF homologs. Nine putative bro genes were found in the AmFV CN genome (Table 2), among which AmFV_069, AmFV_111, and AmFV_113 were not initially annotated as bro genes in the report of the AmFV CH–C05 genome (Gauthier et al., 2015), but are noted in the updated genome sequence in GenBank (NC_027925.1). Bros were first identified in the genome of Lymantria dispar multinucleocapsid nucleopolyhedrovirus (LdMNPV), which contains 16 repeated ORFs (Kuzio et al., 1999). The number of BROs in AmFV is in the same range of the numbers present in baculoviruses, 0 to 16 (Li et al., 2021). BROs constitute a superfamily identified in invertebrate dsDNA viruses, bacteriophages, and bacteria that contain a conserved N-terminal predicted DNA-binding motif (Jakob et al., 2001; Bideshi et al., 2003). Their function(s) is not clear, although several studies have shown that some have DNA-binding activities and interact with laminin or translation-associated proteins (Zemskov et al., 2000; Kang et al., 2003; Kotani et al., 2015). The multiple BROs found in AmFV support the hypothesis that BROs play important roles in viral interactions with invertebrates (Bideshi et al., 2003). In addition to the above genes, 15 putative genes were identified with homology to genes from other species (Table 2).

Phylogeny analysis based on PIFs

Of the 36 annotated genes of AmFV CN, 18 (including six PIFs, nine BROs, DNA polymerase, RR1, and RR2) have homologs in baculoviruses, suggesting that it is a baculo-like virus. Based on shared PIFs, the new virus class Naldaviricetes was recently proposed by the International Committee on Taxonomy of Viruses (ICTV) (https://ictv.global/ictv/proposals/2020.006D.R.Naldaviricetes.zip) (Walker et al., 2021). This class contains four families of nuclear arthropod large DNA viruses (NALDVs) including Baculoviridae, Nudiviridae, Hytrosaviridae, and Nimavirida, with AmFV as a free member. To place AmFV among the viruses, we conducted phylogeny analysis using concatenated protein sequences of the PIFs. The results showed that AmFV CN and AmFV CH–C05 formed a separate branch within Naldaviricetes (Fig. 3). Unlike the members of Baculoviridae, Nudiviridae, and Hytrosaviridae, which also encode four subunits of a DNA-directed RNA polymerase, the AmFV CN and CH–C05 genomes lack ORFs encoding these late expression factors and thus, do not belong to the proposed order Lefavirales (https://ictv.global/ictv/proposals/2020.006D.R.Naldaviricetes.zip) (Walker et al., 2021). The PIF phylogenetic tree shows that AmFVs are distinct from the members of Lefavirales, although the AmFV PIFs appeared to be closer to those of Hytrodaviridae with a bootstrap value of 69% (Fig. 3).
Fig. 3

Phylogenetic tree of members of Naldaviricetes derived from concatenated protein sequences of PIFs. The maximum-likelihood (ML) tree on substitution model (LG ​+ ​G ​+ ​I) is present. Numbers on the nodes indicate ML nonparametric bootstrap supports (1000 replicates). The tree is drawn to scale, with branch lengths measured in the number of substitutions per site. The classifications of viruses are indicated. Virus abbreviations and sequence accession numbers are: AcMNPV, Autographa californica multiple nucleopolyhedrovirus, NC_001623; CpGV, Cydia pomonella granulovirus, NC_002816; NeleNPV, Neodiprion lecontei nucleopolyhedrovirus, NC_005906; CuniNPV, Culex nigripalpus nucleopolyhedrovirus, NC_003084; GbNV, Gryllus bimaculatus nudivirus, NC_009240; HzNV, Helicoverpa zea nudivirus-2, NC_004156; MdSGHV, Musca domestica salivary gland hypertrophy virus, NC_010671; WSSV, White spot syndrome virus, NC_003225; AmFV CH–C05, NC_027925; AmFV CN, OK392616.

Phylogenetic tree of members of Naldaviricetes derived from concatenated protein sequences of PIFs. The maximum-likelihood (ML) tree on substitution model (LG ​+ ​G ​+ ​I) is present. Numbers on the nodes indicate ML nonparametric bootstrap supports (1000 replicates). The tree is drawn to scale, with branch lengths measured in the number of substitutions per site. The classifications of viruses are indicated. Virus abbreviations and sequence accession numbers are: AcMNPV, Autographa californica multiple nucleopolyhedrovirus, NC_001623; CpGV, Cydia pomonella granulovirus, NC_002816; NeleNPV, Neodiprion lecontei nucleopolyhedrovirus, NC_005906; CuniNPV, Culex nigripalpus nucleopolyhedrovirus, NC_003084; GbNV, Gryllus bimaculatus nudivirus, NC_009240; HzNV, Helicoverpa zea nudivirus-2, NC_004156; MdSGHV, Musca domestica salivary gland hypertrophy virus, NC_010671; WSSV, White spot syndrome virus, NC_003225; AmFV CH–C05, NC_027925; AmFV CN, OK392616.

The AmFV CN proteome

Since most of the ORFs of AmFV remain hypothetical, proteomics was performed to determine which ORFs are authentic proteins and whether they are structurally related. The proteins consistently identified in independent LC-MS/MS experiments are summarized in Table 3; 47 proteins were found to be associated with the AmFV virion (Table 3, Fig. 2).
Table 3

The AmFV CN virion associated proteins.

No.ORFsaProteinMol. weight (kDa)Protein length (aa)Test 1
Test 2
Unique peptidesSequence coverage (%)Unique peptidesSequence coverage (%)
1AmFV_002hypothetical protein108.096187.855.1
2AmFV_017hypothetical protein46.64091027.7514.5
3AmFV_021∗hypothetical protein36.83211866.91131.6
4AmFV_022∗hypothetical protein30.62711252.2726.7
5AmFV_023∗AAA ​+ ​ATPase69.26483557.12948.2
6AmFV_051∗hypothetical protein39.13421852.51445.2
7AmFV_053hypothetical protein15.1127212.717.1
8AmFV_054hypothetical protein111.510621417.71723
9AmFV_056hypothetical protein32.3286312.129.3
10AmFV_057∗PIF536.13342968.32158.9
11AmFV_058∗hypothetical protein148.713536461.94542.9
12AmFV_060PIF193.98302229.61621.4
13AmFV_062hypothetical protein77.667667.634.1
14AmFV_064hypothetical protein38.6350731.1626.9
15AmFV_073hypothetical protein63.362911.4410.2
16AmFV_077PIF0131.911882321.72225.8
17AmFV_078hypothetical protein37.23361229.6827.2
18AmFV_085hypothetical protein13.0118223.1223.1
19AmFV_088PIF330.8280629315.1
20AmFV_089hypothetical protein48.84501646.3819.8
21AmFV_092hypothetical protein40.03571344.71136.8
22AmFV_097hypothetical protein70.26282447.41832.4
23AmFV_099∗hypothetical protein42.63833290.92266.8
24AmFV_100PIF245.04021139.4930.2
25AmFV_104hypothetical protein19.7174536.1313.9
26AmFV_117hypothetical protein10.088218.419.2
27AmFV_122hypothetical protein56.7507410.324.5
28AmFV_128∗hypothetical protein56.64903161.62143.1
29AmFV_130∗hypothetical protein19.81731064.8951.1
30AmFV_138∗hypothetical protein53.445026531840.3
31AmFV_139hypothetical protein18.0168532.9219.8
32AmFV_140hypothetical protein164.814405544.23327.4
33AmFV_141hypothetical protein22.8196417.916.2
34AmFV_146∗hypothetical protein24.921816701151.2
35AmFV_148hypothetical protein38.9346414.8520.9
36AmFV_149∗hypothetical protein12.6127652.4221.4
37AmFV_151∗hypothetical protein44.03962052.21437.7
38AmFV_154hypothetical protein95.38501928.9912
39AmFV_156∗hypothetical protein29.32621663.51763.5
40AmFV_157PIF423.0204741.9312.3
41AmFV_161hypothetical protein36.333211.825.1
42AmFV_164hypothetical protein19.9182419.9213.3
43AmFV_182hypothetical protein102.48953948.92933.9
44AmFV_200hypothetical protein111.4101988.644.2
45AmFV_215hypothetical protein223.4196583.183.9
46AmFV_233hypothetical protein9.283442.7231.7
47AmFV_235hypothetical protein38.73341745.61334.8

The ORFs with sequence coverage over 50% in at least one test were marked with ∗.

The AmFV CN virion associated proteins. The ORFs with sequence coverage over 50% in at least one test were marked with ∗. All the annotated PIFs, PIF0 (AmFV_077), PIF1 (AmFV_060), PIF2 (AmFV_100), PIF3 (AmFV_088), PIF4 (AmFV_157), and PIF5 (AmFV_057), were detected by proteomics (Table 3), suggesting that they are virion structural proteins that likely function as entry factors. AmFV_023, encoding a putative AAA ​+ ​ATPase, was also identified (Table 2, Table 3). The functions of the remaining detected proteins are largely unknown (Table 3). Further more, we detected 14 proteins with predicted molecular weights of 12.6–148.7 ​kDa with over 50% sequence coverage in at least one of the LC-MS/MS runs: AmFV_021 (36.8 ​kDa), AmFV_022 (30.6 ​kDa), AmFV_023 (69.2 ​kDa), AmFV_051 (39.1 ​kDa), PIF5 (AmFV_057, 36.1 ​kDa), AmFV_058 (148.7 ​kDa), AmFV_099 (42.6 ​kDa), AmFV_128 (56.6 ​kDa), AmFV_130 (19.8 ​kDa), AmFV_138 (53.4 ​kDa), AmFV_146 (24.9 ​kDa), AmFV_149 (12.6 ​kDa), AmFV_151 (44.0 ​kDa), and AmFV_156 (29.3 ​kDa), suggesting they are likely major structural proteins (Table 3).

Discussion

AmFV is an endemic DNA virus in honeybee colonies; however, very little was known about its molecular biology. In China, the prevalence of AmFV in honeybee colonies varies from 10% to 85% (Hou et al., 2016, 2017). In this study, we have produced useful information on this mysterious, large DNA virus. The average size of the AmFV CN particle was 300–500 ​× ​200–290 ​nm (Fig. 1), which were similar but slightly wider comparing to those previous reported (Clark, 1978; Bailey et al., 1981; Sitaropoulouab et al., 1989). The initial reported size of the AmFV virion was 450 ​× ​150 ​nm (Bailey et al., 1981), but these authors also mentioned that after purification on sucrose gradients, the particles appeared irregularly ellipsoidal with varying dimensions of 250–500 ​× ​120–200. Therefore, the slight difference we observed was most likely a consequence of different purification methods. The size of the filamentous nucleocapsid we observed is consistent with previous reports (Bailey et al., 1981; Sitaropoulouab et al., 1989). Electron microscopy showed the unique wrapping of the long nucleocapsid within the AmFV virion (Fig. 1), supporting the earlier hypothesis that the nucleocapsid has to form three figure-eight loops to fit into the envelope (Sitaropoulouab et al., 1989). The overall high nucleotide identity (96.9%) between the CN and CH–C05 strains indicates that these two viruses are closely related. Thirty-two of the 45 ORFs missing in AmFV CN (Table 1) encoded ORFs of less than 100 residues, suggesting that they may not be authentic ORFs. The gene order of AmFV CN was similar to that of AmFV CH–C05. However, unlike circular genome reported for the CH–C05 strain (Gauthier et al., 2015), our sequence data suggested that AmFV CN has a linear genome, as no reads overlapped both ends. In fact, our data support an earlier hypothesis that AmFV contains a linear genome based on electron microscopic observations of AmFV DNA (Bailey et al., 1981). Given that the length of the nucleocapsid is about 3.1 ​μm (Clark, 1978; Bailey et al., 1981; Sitaropoulouab et al., 1989) and the measured DNA length is 5.8 ​± ​0.3 ​μm (Bailey et al., 1981), viral DNA appeared to be only slightly condensed during packaging. How AmFV DNA interacts with proteins to form nucleocapsids, and how the long neucleocapsid is wrapped in the virion envelope into three figure-eight loops, remain open and intriguing questions. Identifying structural proteins is fundamental for understanding the unique structure of AmFV and will provide useful information for the development of immunological AmFV diagnostic kits. By proteomic analyses, we identified 47 structural proteins that were present in at least two replicate runs (Table 3). As expected, all six PIFs (PIF 0–5) were present, suggesting that they are AmFV structural proteins and likely functional entry factors. In addition, AmFV_023, with homology to the ATPases associated with diverse cellular activities (AAA+) ATPase family, was identified. These ATPases are a large protein family that utilize energy from ATP hydrolysis to participate in multiple cellular functions. The presence of AmFV_023 in the AmFV virion suggests that it may be involved in providing required energy for nucleocapsid packaging. Among the 47 proteins, 14 with predicted molecular weight of 12.6–148.7 ​kDa appeared to be major structural proteins, as they were detected with over 50% sequence coverage (Table 3). Previous study has shown that 12 proteins, ranging in size from 13 to 70 ​kDa, had been associated with the AmFV virion by polyacrylamide gel electrophoresis (Bailey et al., 1981). Of these 12, P40 and P13 appeared to be major nucleocapsid proteins (although P13 was also present in the envelope), while P37 and P23 appeared to be major envelope proteins (Bailey et al., 1981). We speculate that AmFV_149 (predicted 12.7 ​kDa), AmFV_146 (24.9 ​kDa), PIF5 (36.1 ​kDa), and AmFV_51 (39.1 ​kDa) reflect the previously identified P13, P23, P37, and P40, respectively. However, this speculation is based solely on the rough matches in molecular weight and need verification in the future. One way to do this is to generate specific antibodies for probing western blots or performing immunoelectron microscopy. In summary, our study showed that AmFV CN contains a linear genome of ∼492,752 bp, encoding 197 ORFs. Forty-seven of the ORFs were associated with virions, including six PIFs, which likely function in viral entry. The results provide fundamental information for future molecular studies on the virus.

Data availability

The genome sequence of AmFV CN has been deposited in GenBank under accession number OK392616.

Ethics statement

This article does not contain any studies with human or animal subjects (except insects) performed by any of the authors.

Author contributions

Dahe Yang: investigation, data curation, conceptualization, formal analysis, writing-original draft, writing-review and editing. Jun Wang: data curation, formal analysis, investigation, writing-original draft, writing-review and editing. Xi Wang: investigation, methodology. Fei Deng: funding acquisition, resources, supervision. Qingyun Diao: funding acquisition, resources, supervision. Manli Wang: funding acquisition, resources, supervision. Zhihong Hu: conceptualization, formal analysis, funding acquisition, resources, supervision, writing-original draft, writing-review and editing. Chunsheng Hou: conceptualization, funding acquisition, resources, supervision, writing-review and editing.

Conflict of interest

The authors declare that they have no conflict of interest.
  27 in total

1.  Phylogenetic analysis and possible function of bro-like genes, a multigene family widespread among large double-stranded DNA viruses of invertebrates and bacteria.

Authors:  Dennis K Bideshi; Sylvaine Renault; Karine Stasiak; Brian A Federici; Yves Bigot
Journal:  J Gen Virol       Date:  2003-09       Impact factor: 3.891

2.  Sequence and analysis of the genome of a baculovirus pathogenic for Lymantria dispar.

Authors:  J Kuzio; M N Pearson; S H Harwood; C J Funk; J T Evans; J M Slavicek; G F Rohrmann
Journal:  Virology       Date:  1999-01-05       Impact factor: 3.616

3.  Properties of a filamentous virus of the honey bee (Apis mellifera).

Authors:  L Bailey; J M Carpenter; R D Woods
Journal:  Virology       Date:  1981-10-15       Impact factor: 3.616

4.  Widespread occurrence of honey bee pathogens in solitary bees.

Authors:  Jorgen Ravoet; Lina De Smet; Ivan Meeus; Guy Smagghe; Tom Wenseleers; Dirk C de Graaf
Journal:  J Invertebr Pathol       Date:  2014-09-06       Impact factor: 2.841

5.  Dynamics of Apis mellifera Filamentous Virus (AmFV) Infections in Honey Bees and Relationships with Other Parasites.

Authors:  Ulrike Hartmann; Eva Forsgren; Jean-Daniel Charrière; Peter Neumann; Laurent Gauthier
Journal:  Viruses       Date:  2015-05-22       Impact factor: 5.048

6.  Genome sequence and organization of the Mythimna (formerly Pseudaletia) unipuncta granulovirus Hawaiian strain.

Authors:  Yinü Li; Xingjian Liu; Ping Tang; Huan Zhang; Qilian Qin; Zhifang Zhang
Journal:  Sci Rep       Date:  2021-01-11       Impact factor: 4.379

7.  A Diverse Range of Novel RNA Viruses in Geographically Distinct Honey Bee Populations.

Authors:  Emily J Remnant; Mang Shi; Gabriele Buchmann; Tjeerd Blacquière; Edward C Holmes; Madeleine Beekman; Alyson Ashe
Journal:  J Virol       Date:  2017-07-27       Impact factor: 5.103

8.  Shotgun metagenomics of honey DNA: Evaluation of a methodological approach to describe a multi-kingdom honey bee derived environmental DNA signature.

Authors:  Samuele Bovo; Anisa Ribani; Valerio Joe Utzeri; Giuseppina Schiavo; Francesca Bertolini; Luca Fontanesi
Journal:  PLoS One       Date:  2018-10-31       Impact factor: 3.240

Review 9.  Diversity and Global Distribution of Viruses of the Western Honey Bee, Apis mellifera.

Authors:  Alexis Beaurepaire; Niels Piot; Vincent Doublet; Karina Antunez; Ewan Campbell; Panuwan Chantawannakul; Nor Chejanovsky; Anna Gajda; Matthew Heerman; Delphine Panziera; Guy Smagghe; Orlando Yañez; Joachim R de Miranda; Anne Dalmon
Journal:  Insects       Date:  2020-04-10       Impact factor: 2.769

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.