| Literature DB >> 35527222 |
Dahe Yang1, Jun Wang2, Xi Wang2, Fei Deng2, Qingyun Diao3, Manli Wang2, Zhihong Hu4, Chunsheng Hou5.
Abstract
Apis mellifera filamentous virus (AmFV) is a large DNA virus that is endemic in honeybee colonies. The genome sequence of the AmFV Swiss isolate (AmFV CH-C05) has been reported, but so far very few molecular studies have been conducted on this virus. In this study, we isolated and purified AmFV (AmFV CN) from Chinese honeybee (Apis mellifera) colonies and elucidated its genomics and proteomics. Electron microscopy showed ovoid purified virions with dimensions of 300-500 × 210-285 nm, wrapping a 3165 × 40 nm filamentous nucleocapsid in three figure-eight loops. Unlike AmFV CH-C05, which was reported to have a circular genome, our data suggest that AmFV CN has a linear genome of approximately 493 kb. A total of 197 ORFs were identified, among which 36 putative genes including 18 baculoviral homologs were annotated. The overall nucleotide similarity between the CN and CH-C05 isolates was 96.9%. Several ORFs were newly annotated in AmFV CN, including homologs of per os infectivity factor 4 (PIF4) and a putative integrase. Phylogenomic analysis placed AmFVs on a separate branch within the newly proposed virus class Naldaviricetes. Proteomic analysis revealed 47 AmFV virion-associated proteins, of which 14 had over 50% sequence coverage, suggesting that they are likely to be main structural proteins. In addition, all six of the annotated PIFs (PIF-0-5) were identified by proteomics, suggesting that they may function as entry factors in AmFV infection. This study provides fundamental information regarding the molecular biology of AmFV.Entities:
Keywords: Apis mellifera filamentous Virus (AmFV); Genome sequence; Naldaviricetes; Proteomics; Structural proteins; per os infectivity factor 4 (PIF4)
Mesh:
Year: 2022 PMID: 35527222 PMCID: PMC9437511 DOI: 10.1016/j.virs.2022.02.007
Source DB: PubMed Journal: Virol Sin ISSN: 1995-820X Impact factor: 6.947
Fig. 1Transmission electron micrographs of purified AmFV. A. Whole virions. B. Representative filamentous nucleocapsid with three figure-eight loops. Scale bar, 200 nm.
Fig. 2Diagram of the AmFV CN genome. The linear genome of AmFV is shown with marked length. The arrows indicate predicted ORFs and direction of transcription. ORFs predicted to be related to DNA replication/metabolism, PIFs, and BROs are shown in red, green, and brown, respectively. ORFs identified by proteomics are displayed in a blue font.
AmFV CH-05 ORFs not present in AmFV CN.
| No. | ORF | Protein | Length (aa) |
|---|---|---|---|
| 1 | AmFV_014 | hypothetical protein | 57 |
| 2 | AmFV_026 | hypothetical protein | 105 |
| 3 | AmFV_032 | hypothetical protein | 61 |
| 4 | AmFV_033 | hypothetical protein | 52 |
| 5 | AmFV_035 | hypothetical protein | 68 |
| 6 | AmFV_036 | hypothetical protein | 72 |
| 7 | AmFV_038 | hypothetical protein | 59 |
| 8 | AmFV_039 | hypothetical protein | 93 |
| 9 | AmFV_040 | hypothetical protein | 105 |
| 10 | AmFV_044 | hypothetical protein | 56 |
| 11 | AmFV_046 | hypothetical protein | 57 |
| 12 | AmFV_047 | hypothetical protein | 33 |
| 13 | AmFV_061 | hypothetical protein | 66 |
| 14 | AmFV_090 | hypothetical protein | 114 |
| 15 | AmFV_094 | hypothetical protein | 55 |
| 16 | AmFV_121 | hypothetical protein | 50 |
| 17 | AmFV_131 | hypothetical protein | 37 |
| 18 | AmFV_153 | hypothetical protein | 58 |
| 19 | AmFV_155 | hypothetical protein | 83 |
| 20 | AmFV_160 | hypothetical protein | 52 |
| 21 | AmFV_163 | hypothetical protein | 58 |
| 22 | AmFV_167 | hypothetical protein | 69 |
| 23 | AmFV_176 | hypothetical protein | 35 |
| 24 | AmFV_179 | hypothetical protein | 26 |
| 25 | AmFV_186 | hypothetical protein | 98 |
| 26 | AmFV_190 | hypothetical protein | 108 |
| 27 | AmFV_191 | hypothetical protein | 56 |
| 28 | AmFV_192 | hypothetical protein | 50 |
| 29 | AmFV_194 | hypothetical protein | 84 |
| 30 | AmFV_196 | hypothetical protein | 173 |
| 31 | AmFV_197 | hypothetical protein | 116 |
| 32 | AmFV_199 | hypothetical protein | 49 |
| 33 | AmFV_202 | hypothetical protein | 346 |
| 34 | AmFV_204 | hypothetical protein | 28 |
| 35 | AmFV_205 | hypothetical protein | 44 |
| 36 | AmFV_208 | hypothetical protein | 58 |
| 37 | AmFV_209 | hypothetical protein | 136 |
| 38 | AmFV_217 | hypothetical protein | 143 |
| 39 | AmFV_222 | hypothetical protein | 75 |
| 40 | AmFV_227 | hypothetical protein | 37 |
| 41 | AmFV_236 | hypothetical protein | 648 |
| 42 | AmFV_238 | hypothetical protein | 80 |
| 43 | AmFV_239 | hypothetical protein | 206 |
| 44 | AmFV_240 | hypothetical protein | 278 |
| 45 | AmFV_241 | hypothetical protein | 118 |
Putative genes in AmFV CN Genome.
| Putative function | ORF | size (aa) | Putative protein | Best match with Pfam-A database | Best match with BLASTP | AmFV CH–C05 identity | ||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Pfam code dmain | E-value | Pfam no | Species/Virus | Score | E-value | Similarity | aacession no | |||||
| AmFV_027 | 610 | Thymidylate synthase | Thymidylat_synt | 1.5E-85 | PF00303.19 | 284 | 3.0E-84 | 45.5% | XP_017993541.1 | 98.2% | ||
| AmFV_042 | 556 | Integrase | phage_intergrase | 2.5E-05 | PF00589.22 | 55.1 | 2.0E-04 | 27.2% | NLD99342.1 | 98.5% | ||
| AmFV_074 | 1955 | DNA Pol | DNA_pol_B | 4.5E-14 | PF00136.21 | – | – | – | – | – | 99.1% | |
| AmFV_095 | 1603 | DNA ligase | DNA_ligase_A_N | 8.3E-09 | PF04675.14 | 99 | 9.0E-17 | 24.0% | XP_025602269.1 | 99.6% | ||
| AmFV_114 | 879 | RR1 | Ribonuc_red_IgC | 0.0E+00 | PF02867.15 | Cytophagaceae bacterium | 730 | 0.0E+00 | 44.4% | ODS80019.1 | 99.7% | |
| AmFV_216 | 2469 | RR2 | Ribonuc_red_sm | 2.1E-101 | PF00268.21 | Uncultured virus | 391 | 2.0E-121 | 100.0% | ADD74390.1 | 100.0% | |
| AmFV_057 | 334 | PIF-5 | – | – | – | 170 | 4.8E-06 | 20.0% | YP_009116743.1 | 99.1% | ||
| AmFV_060 | 830 | PIF-1 | PIF | 7.0E-16 | PF05092.12 | 91 | 4.0E-15 | 30.0% | YP_010086634.1 | 98.6% | ||
| AmFV_077 | 1188 | PIF-0 | Baculo_p74 | PF04583.12 | 156 | 6.4E-07 | 29.4% | YP_473207.1 | 98.6% | |||
| AmFV_088 | 280 | PIF-3 | PIF3 | 4.4E-04 | PF05006.12 | Malacosoma sp. alphabaculovirus | 54 | 1.0E-04 | 20.7% | ANW12283.1 | 100.0% | |
| AmFV_100 | 402 | PIF-2 | PIF2 | 6.0E-18 | PF04631.12 | Mauternbach virus | 92 | 2.0E-16 | 31.2% | AYP97928.1 | 99.8% | |
| AmFV_157 | 204 | PIF-4 | Baculo_19 | 3.2E-09 | PF04798.12 | – | – | – | – | – | 100.0% | |
| AmFV_008 | 182 | BRO-1 | – | – | – | 53 | 1.0E-04 | 33.3% | YP_249718.1 | 100.0% | ||
| AmFV_016 | 1306 | BRO-2 | Bro-N | 2.9E-09 | PF02498.17 | 179 | 1.60E-6 | 29.1% | AMN15974.2 | 94.0% | ||
| AmFV_069 | 262 | BRO-3 | – | – | – | AmFV | 1309 | 0.0E+00 | 98.9% | YP_009165820.1 | 98.9% | |
| AmFV_075 | 434 | BRO-4 | – | – | – | 52 | 9.0E-05 | 28.0% | YP_001257066.1 | 98.9% | ||
| AmFV_106 | 667 | BRO-5 | Bro-N | 3.2E-07 | PF02498.17 | 77 | 3.0E-12 | 25.0% | AOL57177.1 | 95.8% | ||
| AmFV_108 | 627 | BRO-6 | Bro-N | 8.6E-04 | PF02498.17 | 66 | 6.0E-09 | 29.0% | YP_762434.1 | 92.3% | ||
| AmFV_110 | 499 | BRO-7 | Bro-N | 6.6E-29 | PF02498.17 | 87 | 2.0E-15 | 29.8% | AGE61478.1 | 97.0% | ||
| AmFV_111 | 437 | BRO-8 | – | – | – | AmFV | 201 | 1.9E-14 | 25.2% | YP_009165857.1 | 98.4% | |
| AmFV_133 | 157 | BRO-9 | Bro-N | 4.1E-03 | PF02498.17 | – | – | – | – | – | 98.7% | |
| AmFV_006 | 1699 | Protein kinase (PK) | – | – | – | 111 | 6.7E-03 | 26.0% | AKI80069 | 99.0% | ||
| AmFV_009 | 442 | PARP | Trypan_PARP | 3.1E-04 | PF05887.11 | Paramecium bursaria Chlorella virus | 179 | 2.4E-09 | 28.4% | AGE51657.1 | 99.8% | |
| AmFV_023 | 648 | ATPase | AAA domain | 4.0E-26 | PF00004.29 | 124 | 1.0E-27 | 34.0% | KMQ86848.1 | 99.2% | ||
| AmFV_034 | 501 | Serpin-like | Pacifastin inhibitor (LCMII) | 4.6E-10 | PF05375.13 | 269 | 6.5E-24 | 34.9% | PSN39366.1 | 97.0% | ||
| AmFV_043 | 1633 | Myristoylated membrane | – | – | – | Mimivirus sp. SH | 186 | 6.2E-12 | 46.7% | AZL89416.1 | 96.0% | |
| AmFV_068 | 326 | RING finger protein 413R | zf-C3HC4_3 | 2.1E-03 | PF13920.6 | – | – | – | – | – | 100.0% | |
| AmFV_080 | 572 | RING finger protein | – | – | – | 52 | 1.0E-03 | 33.3% | TKS65457.1 | 98.6% | ||
| AmFV_082 | 510 | HZV 115-like | DUF4580 | 2.3E-05 | PF15162.6 | 85 | 3.0E-15 | 31.1% | YP_002321369.1 | 99.8% | ||
| AmFV_091 | 534 | hypothetical protein | – | – | – | 144 | 5.0E-04 | 40.2% | RMB88007.1 | 98.2% | ||
| AmFV_101 | 1982 | Gamma-glutamyltranspeptidase | G_glu_transpept | 1.6E-14 | PF01019.21 | Diachasmimorpha longicaudata | 147 | 5.9E-07 | 34.6% | AKS26328.1 | 97.5% | |
| AmFV_113 | 583 | hypothetical protein | – | – | – | 145 | 2.4E-08 | 32.5% | EJK57244.1 | 90.5% | ||
| AmFV_123 | 948 | hypothetical protein | – | – | – | 122 | 1.0E-24 | 29.4% | EFN83926.1 | 99.8% | ||
| AmFV_168 | 1236 | MdSGHV 070 | – | – | – | 166 | 2.3E-04 | 29.6% | YP_001883398.1 | 97.3% | ||
| AmFV_193 | 969 | Chitin-binding | LOMP_10 | 8.6E-36 | PF03067.15 | 136 | 2.0E-31 | 42.4% | WP_180560000.1 | 96.0% | ||
| AmFV_235 | 334 | PLC | PI-PLC-X | 3.6E-11 | PF00388.19 | 72 | 6.0E-10 | 33.6% | WP_118975952.1 | 100.0% | ||
Fig. 3Phylogenetic tree of members of Naldaviricetes derived from concatenated protein sequences of PIFs. The maximum-likelihood (ML) tree on substitution model (LG + G + I) is present. Numbers on the nodes indicate ML nonparametric bootstrap supports (1000 replicates). The tree is drawn to scale, with branch lengths measured in the number of substitutions per site. The classifications of viruses are indicated. Virus abbreviations and sequence accession numbers are: AcMNPV, Autographa californica multiple nucleopolyhedrovirus, NC_001623; CpGV, Cydia pomonella granulovirus, NC_002816; NeleNPV, Neodiprion lecontei nucleopolyhedrovirus, NC_005906; CuniNPV, Culex nigripalpus nucleopolyhedrovirus, NC_003084; GbNV, Gryllus bimaculatus nudivirus, NC_009240; HzNV, Helicoverpa zea nudivirus-2, NC_004156; MdSGHV, Musca domestica salivary gland hypertrophy virus, NC_010671; WSSV, White spot syndrome virus, NC_003225; AmFV CH–C05, NC_027925; AmFV CN, OK392616.
The AmFV CN virion associated proteins.
| No. | ORFs | Protein | Mol. weight (kDa) | Protein length (aa) | Test 1 | Test 2 | ||
|---|---|---|---|---|---|---|---|---|
| Unique peptides | Sequence coverage (%) | Unique peptides | Sequence coverage (%) | |||||
| 1 | AmFV_002 | hypothetical protein | 108.0 | 961 | 8 | 7.8 | 5 | 5.1 |
| 2 | AmFV_017 | hypothetical protein | 46.6 | 409 | 10 | 27.7 | 5 | 14.5 |
| 3 | AmFV_021∗ | hypothetical protein | 36.8 | 321 | 18 | 66.9 | 11 | 31.6 |
| 4 | AmFV_022∗ | hypothetical protein | 30.6 | 271 | 12 | 52.2 | 7 | 26.7 |
| 5 | AmFV_023∗ | AAA + ATPase | 69.2 | 648 | 35 | 57.1 | 29 | 48.2 |
| 6 | AmFV_051∗ | hypothetical protein | 39.1 | 342 | 18 | 52.5 | 14 | 45.2 |
| 7 | AmFV_053 | hypothetical protein | 15.1 | 127 | 2 | 12.7 | 1 | 7.1 |
| 8 | AmFV_054 | hypothetical protein | 111.5 | 1062 | 14 | 17.7 | 17 | 23 |
| 9 | AmFV_056 | hypothetical protein | 32.3 | 286 | 3 | 12.1 | 2 | 9.3 |
| 10 | AmFV_057∗ | PIF5 | 36.1 | 334 | 29 | 68.3 | 21 | 58.9 |
| 11 | AmFV_058∗ | hypothetical protein | 148.7 | 1353 | 64 | 61.9 | 45 | 42.9 |
| 12 | AmFV_060 | PIF1 | 93.9 | 830 | 22 | 29.6 | 16 | 21.4 |
| 13 | AmFV_062 | hypothetical protein | 77.6 | 676 | 6 | 7.6 | 3 | 4.1 |
| 14 | AmFV_064 | hypothetical protein | 38.6 | 350 | 7 | 31.1 | 6 | 26.9 |
| 15 | AmFV_073 | hypothetical protein | 63.3 | 629 | 1 | 1.4 | 4 | 10.2 |
| 16 | AmFV_077 | PIF0 | 131.9 | 1188 | 23 | 21.7 | 22 | 25.8 |
| 17 | AmFV_078 | hypothetical protein | 37.2 | 336 | 12 | 29.6 | 8 | 27.2 |
| 18 | AmFV_085 | hypothetical protein | 13.0 | 118 | 2 | 23.1 | 2 | 23.1 |
| 19 | AmFV_088 | PIF3 | 30.8 | 280 | 6 | 29 | 3 | 15.1 |
| 20 | AmFV_089 | hypothetical protein | 48.8 | 450 | 16 | 46.3 | 8 | 19.8 |
| 21 | AmFV_092 | hypothetical protein | 40.0 | 357 | 13 | 44.7 | 11 | 36.8 |
| 22 | AmFV_097 | hypothetical protein | 70.2 | 628 | 24 | 47.4 | 18 | 32.4 |
| 23 | AmFV_099∗ | hypothetical protein | 42.6 | 383 | 32 | 90.9 | 22 | 66.8 |
| 24 | AmFV_100 | PIF2 | 45.0 | 402 | 11 | 39.4 | 9 | 30.2 |
| 25 | AmFV_104 | hypothetical protein | 19.7 | 174 | 5 | 36.1 | 3 | 13.9 |
| 26 | AmFV_117 | hypothetical protein | 10.0 | 88 | 2 | 18.4 | 1 | 9.2 |
| 27 | AmFV_122 | hypothetical protein | 56.7 | 507 | 4 | 10.3 | 2 | 4.5 |
| 28 | AmFV_128∗ | hypothetical protein | 56.6 | 490 | 31 | 61.6 | 21 | 43.1 |
| 29 | AmFV_130∗ | hypothetical protein | 19.8 | 173 | 10 | 64.8 | 9 | 51.1 |
| 30 | AmFV_138∗ | hypothetical protein | 53.4 | 450 | 26 | 53 | 18 | 40.3 |
| 31 | AmFV_139 | hypothetical protein | 18.0 | 168 | 5 | 32.9 | 2 | 19.8 |
| 32 | AmFV_140 | hypothetical protein | 164.8 | 1440 | 55 | 44.2 | 33 | 27.4 |
| 33 | AmFV_141 | hypothetical protein | 22.8 | 196 | 4 | 17.9 | 1 | 6.2 |
| 34 | AmFV_146∗ | hypothetical protein | 24.9 | 218 | 16 | 70 | 11 | 51.2 |
| 35 | AmFV_148 | hypothetical protein | 38.9 | 346 | 4 | 14.8 | 5 | 20.9 |
| 36 | AmFV_149∗ | hypothetical protein | 12.6 | 127 | 6 | 52.4 | 2 | 21.4 |
| 37 | AmFV_151∗ | hypothetical protein | 44.0 | 396 | 20 | 52.2 | 14 | 37.7 |
| 38 | AmFV_154 | hypothetical protein | 95.3 | 850 | 19 | 28.9 | 9 | 12 |
| 39 | AmFV_156∗ | hypothetical protein | 29.3 | 262 | 16 | 63.5 | 17 | 63.5 |
| 40 | AmFV_157 | PIF4 | 23.0 | 204 | 7 | 41.9 | 3 | 12.3 |
| 41 | AmFV_161 | hypothetical protein | 36.3 | 332 | 1 | 1.8 | 2 | 5.1 |
| 42 | AmFV_164 | hypothetical protein | 19.9 | 182 | 4 | 19.9 | 2 | 13.3 |
| 43 | AmFV_182 | hypothetical protein | 102.4 | 895 | 39 | 48.9 | 29 | 33.9 |
| 44 | AmFV_200 | hypothetical protein | 111.4 | 1019 | 8 | 8.6 | 4 | 4.2 |
| 45 | AmFV_215 | hypothetical protein | 223.4 | 1965 | 8 | 3.1 | 8 | 3.9 |
| 46 | AmFV_233 | hypothetical protein | 9.2 | 83 | 4 | 42.7 | 2 | 31.7 |
| 47 | AmFV_235 | hypothetical protein | 38.7 | 334 | 17 | 45.6 | 13 | 34.8 |
The ORFs with sequence coverage over 50% in at least one test were marked with ∗.