| Literature DB >> 34578276 |
Kelly S Bateman1, Rose Kerr1, Grant D Stentiford1, Tim P Bean2, Chantelle Hooper1, Benigna Van Eynde3,4, Daan Delbare4, Jamie Bojko5, Olivier Christiaens3, Clauvis N T Taning3, Guy Smagghe3, Monique M van Oers6, Ronny van Aerle1.
Abstract
Multiple enveloped viruses with rod-shaped nucleocapsids have been described, infecting the epithelial cell nuclei within the hepatopancreas tubules of crustaceans. These bacilliform viruses share the ultrastructural characteristics of nudiviruses, a specific clade of viruses infecting arthropods. Using histology, electron microscopy and high throughput sequencing, we characterise two further bacilliform viruses from aquatic hosts, the brown shrimp (Crangon crangon) and the European shore crab (Carcinus maenas). We assembled the full double stranded, circular DNA genome sequences of these viruses (~113 and 132 kbp, respectively). Comparative genomics and phylogenetic analyses confirm that both belong within the family Nudiviridae but in separate clades representing nudiviruses found in freshwater and marine environments. We show that the three thymidine kinase (tk) genes present in all sequenced nudivirus genomes, thus far, were absent in the Crangon crangon nudivirus, suggesting there are twenty-eight core genes shared by all nudiviruses. Furthermore, the phylogenetic data no longer support the subdivision of the family Nudiviridae into four genera (Alphanudivirus to Deltanudivirus), as recently adopted by the International Committee on Taxonomy of Viruses (ICTV), but rather shows two main branches of the family that are further subdivided. Our data support a recent proposal to create two subfamilies within the family Nudiviridae, each subdivided into several genera.Entities:
Keywords: Carcinus maenas; Carcinus maenas nudivirus (CmNV); Crangon crangon; Crangon crangon nudivirus (CcNV); bacilliform virus; brown shrimp; core genes; nudivirus; shore crab; taxonomy; virus classification
Mesh:
Year: 2021 PMID: 34578276 PMCID: PMC8472649 DOI: 10.3390/v13091694
Source DB: PubMed Journal: Viruses ISSN: 1999-4915 Impact factor: 5.048
Figure 1(A) Crangon crangon Nudivirus (CcNV) infected nuclei within hepatopancreas tubules. Infected nuclei (arrows) are enlarged with marginalised chromatin and contain an eosinophilic inclusion body. H&E stain. Scale bar = 50 µm. (B) CcNV virions within an infected nucleus. Virions contain a rod shaped, electron dense nucleocapsid surrounded by a trilaminar envelope (inset, scale bar = 100 nm). Virions have been caught in cross section (black arrow) and longitudinal section (white arrow) within the nucleus. TEM. Scale bar = 1 µm. (C) Carcinus maenas Nudivirus (CmNV) infected nuclei within hepatopancreas tubules. Infected nuclei (arrows) are enlarged with marginalised chromatin and contain an eosinophilic inclusion body. H&E stain. Scale bar = 50 µm. (D) CmNV virions within an infected nucleus. Virions contain a rod shaped, electron dense nucleocapsid surrounded by a trilaminar envelope. Virions have been caught in cross section (black arrow) and longitudinal section (white arrow) within the nucleus. Some of the nucleocapsids appear curved (inset, scale bar = 100 nm) or u shaped (line arrow) within the envelope. TEM. Scale bar = 2 µm.
Figure 2Circular genomic maps of (A) Crangon crangon nudivirus (CcNV) and (B) Carcinus maenas nudivirus (CmNV). The outermost to innermost tracks represent: (1) predicted protein-coding genes and their orientation, (2) predicted protein-coding genes on the forward strand, (3) protein-coding genes on the reverse strand, (4) tandem repeat regions, (5) GC content and (6) GC skew, (7) genome coordinates.
Identified open reading frames (ORFs) in the CcNV genome. The start and end positions of the ORFs are shown, as well as the strand it was found on, the deduced protein length (in amino acids), the best hits obtained using blastp similarity searches using the NCBI nr protein database (E-value < 0.001), and additional information obtained from annotated orthogroups (Table S5). Nudivirus core genes are highlighted in grey.
| Gene_ID (ORF) | Start | End | Strand | Protein Length (Amino Acids) | Best Hit (Description, Species, Accession Number) | Additional Annotations |
|---|---|---|---|---|---|---|
| CcNV_1 | 1 | 3111 | + | 1037 | DNA polymerase (Dikerogammarus haemobaphes nudivirus) (QLI62362.1) | |
| CcNV_2 | 3114 | 4613 | + | 500 | hypothetical protein (Apolygus lucorum) (KAF6208213.1) | Methyltransferase |
| CcNV_3 | 4617 | 4874 | - | 86 | KN57_gp007 (Dikerogammarus haemobaphes nudivirus) (QLI62363.1) | |
| CcNV_4 | 4877 | 5572 | - | 232 | Ac92 (Dikerogammarus haemobaphes nudivirus) (QLI62364.1) | P33 |
| CcNV_5 | 5544 | 7580 | - | 679 | Vp91 (Penaeus monodon nudivirus) (YP_009051847.1) | |
| CcNV_6 | 7843 | 9279 | + | 479 | ODV-E56 (Dikerogammarus haemobaphes nudivirus) (QLI62366.1) | PIF-5 |
| CcNV_7 | 9357 | 9833 | + | 159 | KN57_gp012 (Dikerogammarus haemobaphes nudivirus) (QLI62367.1) | |
| CcNV_8 | 9802 | 10995 | + | 398 | p47 (Dikerogammarus haemobaphes nudivirus) (QLI62368.1) | |
| CcNV_9 | 11,007 | 12,209 | - | 401 | Pif-2 (Homarus gammarus nudivirus) (YP_010087649.1) | |
| CcNV_10 | 12,317 | 13,708 | - | 464 | - | |
| CcNV_11 | 13,882 | 14,199 | + | 106 | - | |
| CcNV_12 | 14,208 | 15,503 | + | 432 | KN57_gp020 (Dikerogammarus haemobaphes nudivirus) (QLI62373.1) | FEN-1 |
| CcNV_13 | 15,500 | 15,985 | + | 162 | HgNV_014 (Dikerogammarus haemobaphes nudivirus) (QLI62374.1) | |
| CcNV_14 | 16,535 | 17,410 | - | 292 | hypothetical protein PmNV_022 (Penaeus monodon nudivirus) (YP_009051860.1) | VP39 |
| CcNV_15 | 17,589 | 20,660 | + | 1024 | LEF-8 (Dikerogammarus haemobaphes nudivirus) (QLI62376.1) | |
| CcNV_16 | 21,228 | 22,379 | - | 384 | - | |
| CcNV_17 | 22,523 | 23,827 | + | 435 | - | |
| CcNV_18 | 23,881 | 25,026 | + | 382 | - | |
| CcNV_19 | 25,154 | 25,723 | + | 190 | - | |
| CcNV_20 | 25,863 | 27,080 | + | 406 | p51 (Dikerogammarus haemobaphes nudivirus) (QLI62377.1) | |
| CcNV_21 | 27,305 | 27,478 | + | 58 | - | |
| CcNV_22 | 27,552 | 27,737 | + | 62 | - | |
| CcNV_23 | 28,064 | 29,332 | + | 423 | ODV-E66 (Penaeus monodon nudivirus) (YP_009051872.1) | |
| CcNV_24 | 29,595 | 34,061 | - | 1489 | DhNV_024 (Dikerogammarus haemobaphes nudivirus) (QLI62385.1) | |
| CcNV_25 | 34,158 | 34,835 | + | 226 | - | |
| CcNV_26 | 34,858 | 35,337 | + | 160 | - | dUTP pyrophosphatase/dUTPase |
| CcNV_27 | 36,459 | 36,614 | - | 52 | - | |
| CcNV_28 | 37,945 | 38,391 | + | 149 | - | |
| CcNV_29 | 38,420 | 40,342 | - | 641 | ODV_E66 (Dikerogammarus haemobaphes nudivirus) (QLI62384.1) | |
| CcNV_30 | 40,871 | 42,697 | - | 609 | ODV_E66 (Dikerogammarus haemobaphes nudivirus) (QLI62384.1) | |
| CcNV_31 | 42,958 | 44,778 | - | 607 | ODV_E66 (Dikerogammarus haemobaphes nudivirus) (QLI62384.1) | |
| CcNV_32 | 44,932 | 45,108 | - | 59 | - | |
| CcNV_33 | 45,218 | 47,050 | - | 611 | ODV_E66 (Dikerogammarus haemobaphes nudivirus) (QLI62384.1) | |
| CcNV_34 | 47,201 | 47,683 | - | 161 | - | |
| CcNV_35 | 47,675 | 49,270 | + | 532 | PIF-1 (Penaeus monodon nudivirus) (YP_009051877.1) | |
| CcNV_36 | 49,285 | 49,566 | - | 94 | - | |
| CcNV_37 | 49,556 | 51,547 | - | 664 | HgNV_030 (Dikerogammarus haemobaphes nudivirus) (QLI62398.1) | |
| CcNV_38 | 51,546 | 52,316 | + | 257 | DhNV_036 (Dikerogammarus haemobaphes nudivirus) (QLI62397.1) | |
| CcNV_39 | 52,382 | 53,083 | + | 234 | - | |
| CcNV_40 | 53,107 | 53,847 | + | 247 | HgNV_033 (Dikerogammarus haemobaphes nudivirus) (QLI62395.1) | PmV-like protein |
| CcNV_41 | 53,962 | 54,891 | + | 310 | hypothetical protein (Penaeus monodon nucleopolyhedrovirus) (ABX44696.1) | |
| CcNV_42 | 54,922 | 56,667 | + | 582 | KN57_gp048 (Dikerogammarus haemobaphes nudivirus) (QLI62392.1) | |
| CcNV_43 | 56,737 | 58,020 | + | 428 | - | |
| CcNV_44 | 58,774 | 59,850 | + | 359 | - | |
| CcNV_45 | 59,923 | 61,056 | + | 378 | - | |
| CcNV_46 | 60,983 | 61,204 | - | 74 | - | |
| CcNV_47 | 61,297 | 62,304 | + | 336 | - | |
| CcNV_48 | 62,323 | 63,135 | + | 271 | - | Polyhedrin |
| CcNV_49 | 63,152 | 66,742 | + | 1197 | - | |
| CcNV_50 | 66,844 | 70,668 | + | 1275 | - | |
| CcNV_51 | 70,945 | 71,277 | - | 111 | - | |
| CcNV_52 | 71,252 | 71,461 | + | 70 | Ac92 (Dikerogammarus haemobaphes nudivirus) (QLI62411.1) | |
| CcNV_53 | 71,517 | 71,873 | + | 119 | - | LEF-5 |
| CcNV_54 | 72,349 | 72,531 | + | 61 | hypothetical protein PmNV_053 (Penaeus monodon nudivirus) (YP_009051891.1) | |
| CcNV_55 | 72,528 | 73,169 | + | 214 | hypothetical protein, partial (Penaeus monodon nucleopolyhedrovirus) (ABX44701.1) | |
| CcNV_56 | 73,166 | 74,092 | + | 309 | Integrase (Dikerogammarus haemobaphes nudivirus) (QLI62415.1) | |
| CcNV_57 | 74,169 | 78,131 | + | 1321 | hypothetical protein KM727_gp62 (Homarus gammarus nudivirus) (YP_010087702.1) | |
| CcNV_58 | 78,592 | 79,830 | + | 413 | non-structural protein 1 (Penaeus monodon metallodensovirus) (QGX07563.1) | |
| CcNV_59 | 80,670 | 81,551 | + | 294 | VLF-1 (Dikerogammarus haemobaphes nudivirus) (QLI62416.1) | |
| CcNV_60 | 81,546 | 83,069 | - | 508 | LEF-9 (Dikerogammarus haemobaphes nudivirus) (QLI62417.1) | |
| CcNV_61 | 83,137 | 84,054 | + | 306 | 38K protein (Penaeus monodon nudivirus) (YP_009051897.1) | |
| CcNV_62 | 84,031 | 84,408 | - | 126 | - | |
| CcNV_63 | 84,420 | 85,166 | + | 249 | HgNV_049 (Dikerogammarus haemobaphes nudivirus) (QLI62423.1) | |
| CcNV_64 | 85,212 | 85,556 | + | 115 | - | GbNV_gp51-like |
| CcNV_65 | 85,560 | 85,988 | - | 143 | - | |
| CcNV_66 | 86,110 | 86,490 | - | 127 | - | |
| CcNV_67 | 86,589 | 87,866 | + | 426 | KN57gp_066 (Dikerogammarus haemobaphes nudivirus) (QLI62428.1) | |
| CcNV_68 | 87,891 | 89,111 | + | 407 | - | |
| CcNV_69 | 89,326 | 89,904 | - | 193 | - | |
| CcNV_70 | 90,070 | 93,882 | + | 1271 | - | Polyhedrin |
| CcNV_71 | 93,921 | 94,484 | - | 188 | KN57gp_068 (Dikerogammarus haemobaphes nudivirus) (QLI62436.1) | |
| CcNV_72 | 94,498 | 95,367 | - | 290 | KN57gp_069 (Dikerogammarus haemobaphes nudivirus) (QLI62435.1) | |
| CcNV_73 | 95,355 | 95,990 | - | 212 | KN57gp_070 (Dikerogammarus haemobaphes nudivirus) (QLI62434.1) | |
| CcNV_74 | 95,996 | 97,999 | - | 668 | P74 (Homarus gammarus nudivirus) (YP_010087703.1) | PIF-0 |
| CcNV_75 | 98,378 | 99,340 | + | 321 | - | |
| CcNV_76 | 99,315 | 100,034 | - | 240 | HgNV_068 (Dikerogammarus haemobaphes nudivirus) (QLI62449.1) | |
| CcNV_77 | 100,025 | 102,127 | + | 701 | Helicase 2 (Dikerogammarus haemobaphes nudivirus) (QLI62448.1) | |
| CcNV_78 | 102,130 | 103,560 | + | 477 | - | |
| CcNV_79 | 103,617 | 104,579 | + | 321 | DhNV_085 (Dikerogammarus haemobaphes nudivirus) (QLI62446.1) | |
| CcNV_80 | 104,623 | 105,300 | + | 226 | - | |
| CcNV_81 | 105,322 | 108,495 | - | 1058 | - | |
| CcNV_82 | 108,501 | 108,920 | - | 140 | HgNV_075 (Dikerogammarus haemobaphes nudivirus) (QLI62443.1) | |
| CcNV_83 | 108,991 | 109,458 | + | 156 | Ac81 (Dikerogammarus haemobaphes nudivirus) (QLI62442.1) | |
| CcNV_84 | 109,506 | 111,398 | + | 631 | hypothetical protein PmNV_087 (Penaeus monodon nudivirus) (YP_009051925.1) | |
| CcNV_85 | 111,385 | 111,831 | + | 149 | Ac68-like protein (Homarus gammarus nudivirus) (YP_010087718.1) | PIF-6 |
| CcNV_86 | 111,842 | 113,356 | - | 505 | DhNV_068 (Dikerogammarus haemobaphes nudivirus) (QLI62429.1) | |
| CcNV_87 | 113,499 | 114,260 | + | 254 | VLF-1 (Dikerogammarus haemobaphes nudivirus) (QLI62430.1) | |
| CcNV_88 | 114,264 | 115,541 | - | 426 | - | |
| CcNV_89 | 115,658 | 117,484 | - | 609 | Helicase 2 (Dikerogammarus haemobaphes nudivirus) (QLI62451.1) | |
| CcNV_90 | 117,522 | 118,901 | - | 460 | LEF-4 (Dikerogammarus haemobaphes nudivirus) (QLI62453.1) | |
| CcNV_91 | 118,976 | 119,311 | - | 112 | - | |
| CcNV_92 | 119,355 | 120,035 | - | 227 | PIF-3 (Homarus gammarus nudivirus) (YP_010087723.1) | |
| CcNV_93 | 120,035 | 123,916 | - | 1294 | Helicase (Dikerogammarus haemobaphes nudivirus) (QLI62456.1) | |
| CcNV_94 | 123,918 | 124,592 | + | 225 | ODV-E28 (Dikerogammarus haemobaphes nudivirus) (QLI62457.1) | PIF-4 |
| CcNV_95 | 124,681 | 125,151 | + | 157 | - | |
| CcNV_96 | 125,168 | 125,998 | - | 277 | - | |
| CcNV_97 | 126,024 | 126,755 | - | 244 | KN57gp_097 (Dikerogammarus haemobaphes nudivirus) (QLI62459.1) | |
| CcNV_98 | 126,754 | 127,662 | + | 303 | Esterase (Dikerogammarus haemobaphes nudivirus) (QLI62460.1) | GbNV_gp19-like |
| CcNV_99 | 127,679 | 128,800 | - | 374 | KN57gp_099 (Dikerogammarus haemobaphes nudivirus) (QLI62461.1) | GbNV_gp67-like |
| CcNV_100 | 128,897 | 129,250 | + | 118 | - | |
| CcNV_101 | 128,905 | 129,216 | - | 104 | 11K (Dikerogammarus haemobaphes nudivirus) (QLI62462.1) | |
| CcNV_102 | 129,348 | 129,707 | - | 120 | - | |
| CcNV_103 | 129,688 | 130,482 | + | 265 | KN57gp_102 (Dikerogammarus haemobaphes nudivirus) (QLI62464.1) | |
| CcNV_104 | 130,530 | 131,210 | - | 227 | KN57gp_107 (Dikerogammarus haemobaphes nudivirus) (QLI62465.1) | |
| CcNV_105 | 131,513 | 131,980 | + | 156 | - | |
| CcNV_106 | 72,007 | 72,231 | + | 75 | - | p6.9 |
Identified open reading frames (ORFs) in the CmNV genome. The start and end positions of the ORFs are shown, as well as the strand it was found on, the deduced protein length (in amino acids), the best hits obtained using blastp similarity searches using the NCBI nr protein database (E-value < 0.001), and additional information obtained from annotated orthogroups (Table S5). Nudivirus core genes are highlighted in grey.
| Gene_ID (ORF) | Start | End | Strand | Protein Length (Amino | Best Hit (Description, Species, Accession Number) | Additional Annotations |
|---|---|---|---|---|---|---|
| CmNV_1 | 1 | 3200 | + | 1066 | DNA polymerase (Homarus gammarus nudivirus) (YP_010087641.1) | |
| CmNV_2 | 3264 | 4663 | + | 466 | methyltransferase (Homarus gammarus nudivirus) (YP_010087642.1) | |
| CmNV_3 | 4669 | 4880 | - | 70 | hypothetical protein PmNV_007 (Penaeus monodon nudivirus) (YP_009051845.1) | |
| CmNV_4 | 4868 | 5535 | - | 222 | Ac92-like protein (Homarus gammarus nudivirus) (YP_010087644.1) | P33 |
| CmNV_5 | 5523 | 7579 | - | 685 | Vp91 (Homarus gammarus nudivirus) (YP_010087645.1) | |
| CmNV_6 | 7716 | 9013 | + | 432 | ODV-E56 (Homarus gammarus nudivirus) (YP_010087646.1) | PIF-5 |
| CmNV_7 | 9132 | 9526 | + | 131 | hypothetical protein KM727_gp07 (Homarus gammarus nudivirus) (YP_010087647.1) | |
| CmNV_8 | 9537 | 10,774 | + | 412 | P47 (Homarus gammarus nudivirus) (YP_010087648.1) | |
| CmNV_9 | 10,793 | 11,949 | - | 385 | Pif-2 (Homarus gammarus nudivirus) (YP_010087649.1) | |
| CmNV_10 | 11,991 | 12,712 | - | 240 | HZV 115-like protein (Homarus gammarus nudivirus) (YP_010087650.1) | |
| CmNV_11 | 12,750 | 13,996 | - | 415 | hypothetical protein KM727_gp11 (Homarus gammarus nudivirus) (YP_010087651.1) | |
| CmNV_12 | 14,119 | 14,495 | + | 125 | hypothetical protein KM727_gp12 (Homarus gammarus nudivirus) (YP_010087652.1) | |
| CmNV_13 | 14,469 | 15,715 | + | 415 | hypothetical protein KM727_gp13 (Homarus gammarus nudivirus) (YP_010087653.1) | FEN-1 |
| CmNV_14 | 15,706 | 16,145 | + | 146 | hypothetical protein KM727_gp14 (Homarus gammarus nudivirus) (YP_010087654.1) | |
| CmNV_15 | 16,174 | 17,063 | - | 296 | Vp39/31 k (Homarus gammarus nudivirus) (YP_010087655.1) | VP39 |
| CmNV_16 | 17,183 | 20,256 | + | 1024 | LEF-8 (Homarus gammarus nudivirus) (YP_010087656.1) | |
| CmNV_17 | 20,397 | 21,742 | + | 448 | P51 (Homarus gammarus nudivirus) (YP_010087657.1) | |
| CmNV_18 | 21,993 | 25,855 | - | 1287 | hypothetical protein KM727_gp18 (Homarus gammarus nudivirus) (YP_010087658.1) | |
| CmNV_19 | 26,020 | 27,005 | - | 328 | E3 ubiquitin-protein ligase TRIM39-like protein | |
| CmNV_20 | 29,001 | 29,953 | + | 317 | ||
| CmNV_21 | 30,175 | 31,274 | + | 366 | serine/threonine protein kinase (Homarus gammarus nudivirus) (YP_010087663.1) | |
| CmNV_22 | 31,337 | 31,461 | - | 41 | ||
| CmNV_23 | 31,567 | 31,706 | + | 46 | ||
| CmNV_24 | 31,838 | 32,829 | + | 330 | dihydroxy-acid dehydratase (Homarus gammarus nudivirus) (YP_010087666.1) | |
| CmNV_25 | 32,866 | 33,068 | - | 67 | ||
| CmNV_26 | 33,141 | 34,036 | + | 298 | guanosine monophosphate kinase (Homarus gammarus nudivirus) (YP_010087667.1) | TK2 |
| CmNV_27 | 34,073 | 35,664 | + | 530 | PIF-1 (Homarus gammarus nudivirus) (YP_010087668.1) | |
| CmNV_28 | 35,695 | 36,065 | - | 123 | hypothetical protein PmNV_040 (Penaeus monodon nudivirus) (YP_009051878.1) | |
| CmNV_29 | 36,056 | 38,163 | - | 702 | hypothetical protein KM727_gp30 (Homarus gammarus nudivirus) (YP_010087670.1) | |
| CmNV_30 | 38,180 | 38,934 | + | 251 | hypothetical protein KM727_gp31 (Homarus gammarus nudivirus) (YP_010087671.1) | |
| CmNV_31 | 39,020 | 39,714 | + | 231 | hypothetical protein KM727_gp32 (Homarus gammarus nudivirus) (YP_010087672.1) | |
| CmNV_32 | 39,705 | 40,444 | + | 246 | PmV-like protein (Homarus gammarus nudivirus) (YP_010087673.1) | |
| CmNV_33 | 40,531 | 41,633 | + | 367 | p-loop NTPase (Homarus gammarus nudivirus) (YP_010087674.1) | |
| CmNV_34 | 41,687 | 42,612 | + | 308 | hypothetical protein KM727_gp35 (Homarus gammarus nudivirus) (YP_010087675.1) | |
| CmNV_35 | 42,619 | 44,375 | + | 585 | hypothetical protein KM727_gp36 (Homarus gammarus nudivirus) (YP_010087676.1) | |
| CmNV_36 | 44,330 | 44,643 | - | 104 | hypothetical protein KM727_gp37 (Homarus gammarus nudivirus) (YP_010087677.1) | |
| CmNV_37 | 44,618 | 44,826 | + | 69 | hypothetical protein PmNV_051 (Penaeus monodon nudivirus) (YP_009051889.1) | Ac92-like protein |
| CmNV_38 | 44,847 | 45,364 | + | 172 | hypothetical protein KM727_gp39 (Homarus gammarus nudivirus) (YP_010087679.1) | LEF-5 |
| CmNV_39 | 46,080 | 46,729 | + | 216 | hypothetical protein KM727_gp41 (Homarus gammarus nudivirus) (YP_010087681.1) | |
| CmNV_40 | 46,737 | 47,650 | + | 304 | integrase (Homarus gammarus nudivirus) (YP_010087682.1) | |
| CmNV_41 | 47,682 | 48,532 | + | 283 | VLF-1 (Homarus gammarus nudivirus) (YP_010087683.1) | |
| CmNV_42 | 48,543 | 48,910 | - | 122 | ||
| CmNV_43 | 48,980 | 50,133 | - | 384 | hypothetical protein KM727_gp94 (Homarus gammarus nudivirus) (YP_010087734.1) | |
| CmNV_44 | 50,276 | 51,675 | - | 466 | ||
| CmNV_45 | 52,559 | 52,812 | - | 84 | ||
| CmNV_46 | 52,816 | 54,416 | - | 533 | LEF-9 (Homarus gammarus nudivirus) (YP_010087686.1) | |
| CmNV_47 | 54,427 | 55,265 | + | 279 | 38K protein (Homarus gammarus nudivirus) (YP_010087687.1) | |
| CmNV_48 | 55,271 | 55,533 | - | 87 | hypothetical protein KM727_gp48 (Homarus gammarus nudivirus) (YP_010087688.1) | |
| CmNV_49 | 55,538 | 56,244 | + | 235 | hypothetical protein KM727_gp49 (Homarus gammarus nudivirus) (YP_010087689.1) | |
| CmNV_50 | 56,229 | 56,626 | + | 132 | hypothetical protein KM727_gp50 (Homarus gammarus nudivirus) (YP_010087690.1) | GbNV_gp51-like |
| CmNV_51 | 56,625 | 56,974 | - | 116 | hypothetical protein PmNV_063 (Penaeus monodon nudivirus) (YP_009051901.1) | |
| CmNV_52 | 57,089 | 57,429 | - | 113 | ||
| CmNV_53 | 57,473 | 58,803 | - | 443 | p-loop NTPase (Homarus gammarus nudivirus) (YP_010087693.1) | TK1 |
| CmNV_54 | 58,872 | 60,178 | + | 435 | hypothetical protein KM727_gp54 (Homarus gammarus nudivirus) (YP_010087694.1) | |
| CmNV_55 | 60,317 | 60,600 | + | 94 | ||
| CmNV_56 | 61,344 | 63,214 | + | 623 | ODV-E66 (Penaeus monodon nudivirus) (YP_009051874.1) | |
| CmNV_57 | 63,746 | 64,575 | - | 276 | IAP | |
| CmNV_58 | 65,340 | 65,920 | - | 193 | hypothetical protein PmNV_067 (Penaeus monodon nudivirus) (YP_009051905.1) | |
| CmNV_59 | 66,051 | 66,622 | - | 190 | hypothetical protein KM727_gp58 (Homarus gammarus nudivirus) (YP_010087698.1) | |
| CmNV_60 | 66,657 | 67,597 | - | 313 | hypothetical protein KM727_gp59 (Homarus gammarus nudivirus) (YP_010087699.1) | |
| CmNV_61 | 67,580 | 68,187 | - | 202 | hypothetical protein KM727_gp60 (Homarus gammarus nudivirus) (YP_010087700.1) | |
| CmNV_62 | 68,259 | 69,631 | - | 457 | hypothetical protein KM727_gp61 (Homarus gammarus nudivirus) (YP_010087701.1) | |
| CmNV_63 | 69,724 | 71,798 | - | 691 | P74 (Homarus gammarus nudivirus) (YP_010087703.1) | PIF-0 |
| CmNV_64 | 71,846 | 72,693 | + | 282 | ||
| CmNV_65 | 72,750 | 73,375 | + | 208 | hypothetical protein KM727_gp65 (Homarus gammarus nudivirus) (YP_010087705.1) | |
| CmNV_66 | 73,336 | 75,113 | + | 592 | helicase 2 (Homarus gammarus nudivirus) (YP_010087706.1) | |
| CmNV_67 | 75,187 | 76,055 | + | 289 | hypothetical protein KM727_gp67 (Homarus gammarus nudivirus) (YP_010087707.1) | |
| CmNV_68 | 76,061 | 76,707 | - | 215 | hypothetical protein KM727_gp68 (Homarus gammarus nudivirus) (YP_010087708.1) | |
| CmNV_69 | 76,697 | 78,657 | + | 653 | helicase 2 (Homarus gammarus nudivirus) (YP_010087709.1) | |
| CmNV_70 | 78,639 | 84,124 | + | 1828 | hypothetical protein KM727_gp70 (Homarus gammarus nudivirus) (YP_010087710.1) | |
| CmNV_71 | 84,170 | 85,047 | + | 292 | hypothetical protein KM727_gp72 (Homarus gammarus nudivirus) (YP_010087712.1) | |
| CmNV_72 | 85,039 | 85,598 | + | 186 | hypothetical protein KM727_gp73 (Homarus gammarus nudivirus) (YP_010087713.1) | |
| CmNV_73 | 85,616 | 87,168 | - | 517 | hypothetical protein KM727_gp74 (Homarus gammarus nudivirus) (YP_010087714.1) | |
| CmNV_74 | 87,163 | 87,521 | - | 119 | hypothetical protein KM727_gp75 (Homarus gammarus nudivirus) (YP_010087715.1) | |
| CmNV_75 | 87,520 | 88,037 | + | 172 | Ac81-like protein (Homarus gammarus nudivirus) (YP_010087716.1) | |
| CmNV_76 | 88,025 | 89,913 | + | 629 | hypothetical protein KM727_gp77 (Homarus gammarus nudivirus) (YP_010087717.1) | |
| CmNV_77 | 89,921 | 90,351 | + | 143 | Ac68-like protein (Homarus gammarus nudivirus) (YP_010087718.1) | PIF-6 |
| CmNV_78 | 90,389 | 91,647 | - | 419 | hypothetical protein KM727_gp79 (Homarus gammarus nudivirus) (YP_010087719.1) | |
| CmNV_79 | 91,740 | 92,467 | + | 242 | VLF-1 (Homarus gammarus nudivirus) (YP_010087720.1) | |
| CmNV_80 | 92,474 | 93,867 | - | 464 | LEF-4 (Homarus gammarus nudivirus) (YP_010087721.1) | |
| CmNV_81 | 93,888 | 94,300 | - | 137 | hypothetical protein KM727_gp82 (Homarus gammarus nudivirus) (YP_010087722.1) | |
| CmNV_82 | 94,298 | 95,016 | - | 239 | PIF-3 (Homarus gammarus nudivirus) (YP_010087723.1) | |
| CmNV_83 | 95,017 | 98,831 | - | 1271 | helicase (Homarus gammarus nudivirus) (YP_010087724.1) | |
| CmNV_84 | 98,830 | 99,518 | + | 229 | ODV-E28 (Penaeus monodon nudivirus) (YP_009051934.1) | PIF-4 |
| CmNV_85 | 99,512 | 100,257 | - | 248 | hypothetical protein KM727_gp86 (Homarus gammarus nudivirus) (YP_010087726.1) | |
| CmNV_86 | 100,256 | 101,100 | + | 281 | esterase (Homarus gammarus nudivirus) (YP_010087727.1) | GbNV_gp19-like |
| CmNV_87 | 101,113 | 102,344 | - | 410 | hypothetical protein KM727_gp88 (Homarus gammarus nudivirus) (YP_010087728.1) | GbNV_gp67-like |
| CmNV_88 | 102,446 | 102,744 | - | 99 | 11K virion structural protein (Homarus gammarus nudivirus) (YP_010087729.1) | |
| CmNV_89 | 102,869 | 103,191 | - | 107 | hypothetical protein KM727_gp90 (Homarus gammarus nudivirus) (YP_010087730.1) | |
| CmNV_90 | 103,184 | 103,956 | + | 257 | hypothetical protein KM727_gp91 (Homarus gammarus nudivirus) (YP_010087731.1) | |
| CmNV_91 | 104,012 | 105,081 | + | 356 | death-associated inhibitor of apoptosis 1 (Homarus gammarus nudivirus) (YP_010087732.1) | IAP |
| CmNV_92 | 105,794 | 106,290 | - | 165 | ||
| CmNV_93 | 106,479 | 106,957 | - | 159 | ||
| CmNV_94 | 107,140 | 108,752 | - | 537 | polyhedrin, partial (Penaeus monodon nucleopolyhedrovirus) (AET06106.1) | |
| CmNV_95 | 109,297 | 110,303 | - | 335 | ||
| CmNV_96 | 111,058 | 111,419 | + | 120 | ||
| CmNV_97 | 112,295 | 113,013 | - | 239 | hypothetical protein KM727_gp96 (Homarus gammarus nudivirus) (YP_010087736.1) | |
| CmNV_98 | 113,157 | 113,803 | + | 215 | hypothetical protein KM727_gp97 (Homarus gammarus nudivirus) (YP_010087737.1) | |
| CmNV_99 | 45,460 | 45,651 | + | 64 | p6.9 |
Figure 3Phylogenetic trees based on concatenated nudivirus protein sequences. (A)—Maximum likelihood phylogeny based on a supermatrix of 19 concatenated pruned orthologous protein sequences (6961 amino acids; 38K protein, Ac81-like protein, DNA polymerase, FEN-1, GbNV_gp67-like, integrase, LEF-4, LEF-8, LEF-9, P33, PIF-0, PIF-1, PIF-2, PIF-3, PIF-4, PIF-6, TK3, VLF-1 and VP91) derived from 15 nudiviruses and Autographa californica multiple nucleopolyhedrovirus (AcMNPV; NC_001623.1), which was used as the outgroup. (B). Maximum likelihood phylogeny based on 27 concatenated nudivirus core genes (5244 amino acids in total) of 15 nudiviruses. Gene duplication events identified by OrthoFinder are indicated by orange triangles. hp = hypothetical protein. Node labels indicate bootstrap support expressed as a percentage and solid red dots indicate 100% support. CcNV = Crangon crangon nudivirus (MZ311577), CmNV = Carcinus maenas nudivirus (MZ311578), DhNV = Dikerogammarus haemobaphes nudivirus (MT488302.1), DiNV = Drosophila innubila nudivirus (NC_040699.1), ENV = Esparto virus (NC_040536.1), GbNV = Gryllus bimaculatus nudivirus (NC_009240.1), HgNV = Homarus gammarus nudivirus (MK439999.1), HzNV-1 = Heliothis nudivirus 1 (AF451898.1), HzNV-2 = Heliothis nudivirus 2 (NC_004156.2), KNV = Kallithea virus (NC_033829.1), MNV = Mauternbach virus (MG969167), OrNV = Oryctes rhinoceros nudivirus (NC_011588.1), PmNV = Penaeus monodon nudivirus (KJ184318), TNV = Tomelloso virus (NC_040789.1), ToNV = Tipula oleracea nudivirus (NC_026242.1).
List of identified core genes across 15 nudiviruses. Numbers represent open reading frame (ORF) numbers for each of the nudiviruses. Genes (proteins) that were not reported in the proteomes on NCBI, but that were subsequently found in the current or previous studies are indicated and referred to in the footnote.
| Gene | Description | CcNV | CmNV | DhNV | DiNV | ENV | GbNV | HgNV | HzNV-1 | HzNV-2 | KNV | MNV | OrNV | PmNV | TNV | ToNV |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 11K-like | Occlusion body component | 101 | 88 | 101 | 17 | 37 | 95 | 89 | 124 | 25 | 44 | 40 | 41 | 100 | 32 | 28 |
| 38K | Nucleocapsid protein | 61 | 47 | 60 | 46 | 14 | 1 | 47 | 10 | 108 | 20 | 14 | 87 | 59 | 59 | 63 |
| ac81 | Nucleocapsid envelopment | 83 | 75 | 81 | 62 | 4 | 14 | 76 | 33 | 96 | 7 | 4 | 4 | 86 | 4 | 123 |
| dnapol | DNA polymerase | 1 | 1 | 1 | 65 | 1 | 12 | 1 | 131 | 18 | 10 | 1 | 1 | 5 | 1 | 12 |
| fen-1 | FEN-1/FLAP endonuclease | 12 | 13 | 12 | 100 | 56 | 65 | 13 | 68 | 70 | 68 | 62 | 16 | 20 | 17 | 1 |
| GbNV_gp19-like | Unknown | 98 | 86 | 99 | 22 | 32 | 19 | 87 | 30 | 99 | 39 | 35 | 47 | 98 | 38 | 27 |
| GbNV_gp51-like | Unknown | 64 | 50 | 63 | 34 | 22 | 51 | 50 | a | 79 | 30 | 26 | 61 | 62 | 48 | 19 |
| GbNV_gp67-like | Unknown | 99 | 87 | 100 | 102 | 54 | 67 | 88 | 122 | 27 | 66 | 60 | 18 | 99 | 15 | 6 |
| helicase | DNA helicase | 93 | 83 | 95 | 12 | 42 | 88 | 84 | 104 | 38 | 49 | 45 | 34 | 94 | 26 | 118, 55 |
| helicase-2 | DNA helicase | 77, 89 | 66, 69 | 87, 90 | 90 | 65 | 46 | 66, 69 | 60 | 76 | 78 | 71 | 108 | 76, 79 | 74 | 105 |
| integrase | DNA processing | 56 | 40 | 54 | 39 | 18 | 57 | 42 | 144 | 8 | 25 | 21 | 75 | 55 | 54 | 43 |
| lef-4 | RNA polymerase subunit | 90 | 80 | 92 | 18 | 36 | 96 | 81 | 98 | 43 | 43 | 39 | 42 | 91 | 33 | 25 |
| lef-5 | Transcription initiation factor | 53 | 38 | 51 | 25 | c | 85 | 39 | 101 | 40 | c | c | 52 | 52 | 40 | 50, 66 |
| lef-8 | RNA polymerase subunit | 15 | 16 | 15 | 32 | 24 | 49 | 16 | 90 | 51 | 33 | 29 | 64 | 23 | 46 | 88 |
| lef-9 | RNA polymerase subunit | 60 | 46 | 56 | 57 | 9 | 24 | 46 | 75 a,* | 63 a,* | 4 | 8 | 96 | 58 | 65 | 131 |
| p33_ac92 | Sulfhydryl oxidase | 4 | 4 | 3 | 87 | 67 | 7 | 4 | 13 | 104 | 82 | 74 | 113 | 8 | 75 | 99 |
| p47 | RNA polymerase subunit | 8 | 8 | 7 | 104 | 53 | 69 | 8 | 75 a,* | 63 a,* | 64 | 58 | 20 | 14 | 13 | 115 |
| p6.9 | Nucleocapsid packaging/assembly | 106 b | 99 b | - | 1 | 51 | 72 | 40 | 142 a | a | 61 | 56 | 22 | a | 11 | 51 |
| pif-0/p74 | Per os infectivity factor | 74 | 63 | 71 | 74 | 80 | 45 | 63 | 11 | 106 | 95 | 88 | 126 | 72 | 88 | 45 |
| pif-1 | Per os infectivity factor | 35 | 27 | 39 | 35 | 21 | 52 | 28 | 55 | 82 | 29 | 25 | 60 | 39 | 49 | 69 |
| pif-2 | Per os infectivity factor | 9 | 9 | 8 | 101 | 55 | 66 | 9 | 123 | 26 | 67 | 61 | 17 | 15 | 16 | 7 |
| pif-3 | Per os infectivity factor | 92 | 82 | 94 | 91 | 64 | 3 | 83 | 88 | 53 | 77 | 70 | 107 | 93 | 73 | 13 |
| pif-4 | Per os infectivity factor | 94 | 84 | 96 | 11 | 43 | 87 | 85 | 103 | 39 | 51 | 46 | 33 | 96 | 25 | 119 |
| pif-5/odv-e56 | Per os infectivity factor | 6 | 6 | 5 | 6, 85 | 69 | 5 | 6 | 76 | 62 | 56, 84 | 51, 76 | 115 | 10 | 77 | 102, 74, 96 |
| pif-6 | Per os infectivity factor | 85 | 77 | 79 | 36 | 20 | 55 | 78 | 74 | 64 | 28 | 24 | 72 | 88 | 51 | 56 |
| vlf-1 | Very late gene expression factor | 59 | 41 | 55 | 9 | 45 | 80 | 43 | 121 | 28 | 53 | 48 | 30 | 56 | 24 | 65 |
| vp39 | Major capsid protein | 14 | 15 | 14 | 99 | 57 | 64 | 15 | 89 | 52 | 69 | 63 | 15 | 22 | 18 | 87 |
| vp91/p95 | Nucleocapsid protein | 5 | 5 | 4 | 92 | 63 | 2 | 5 | 46 | 89 | 76 | 69 | 106 | 9 | 72 | 16, 83 |
a Bezier et al., 2015, b manual blast search, c vgas gene prediction, * p47 and lef-9 genes are fused (Bezier et al., 2015; Holt et al., 2019).
Figure 4Genome map of 15 nudiviruses showing the position of 28 core genes. Open reading frames (ORFs) are shown as arrows, which indicate the gene position and orientation in the genome. Grey lines represent the full length (nt) of the genomes. The genome sequences were rearranged, such that all linear representations of the circular viral genomes start with the DNA polymerase gene. The genome map was created using the R package gggenes and combined with the phylogenetic tree shown in Figure 3B using Inkscape.