| Literature DB >> 35215990 |
Sijun Liu1, Thomas W Sappington2, Brad S Coates2, Bryony C Bonning3.
Abstract
Sequences derived from a novel toursvirus were identified from pooled genomic short read data from U.S. populations of southern corn rootworm (SCR, Diabrotica undecimpunctata howardi Barber) and northern corn rootworm (NCR, Diabrotica barberi Smith & Lawrence). Most viral sequences were identified from the SCR genomic dataset. As proteins encoded by toursvirus sequences from SCR and NCR were almost identical, the contig sets from SCR and NCR were combined to generate 26 contigs. A total of 108,176 bp were assembled from these contigs, with 120 putative toursviral ORFs identified indicating that most of the viral genome had been recovered. These ORFs included all 40 genes that are common to members of the Ascoviridae. Two genes typically present in Ascoviridae (ATP binding cassette transport system permeases and Baculovirus repeated open reading frame), were not detected. There was evidence for transposon insertion in viral sequences at different sites in the two host species. Phylogenetic analyses based on a concatenated set of 45 translated protein sequences clustered toursviruses into a distinct clade. Based on the combined evidence, we propose taxonomic separation of toursviruses from Ascoviridae.Entities:
Keywords: Ascoviridae; Diabrotica spp.; beetle; corn rootworm; toursvirust
Mesh:
Substances:
Year: 2022 PMID: 35215990 PMCID: PMC8879594 DOI: 10.3390/v14020397
Source DB: PubMed Journal: Viruses ISSN: 1999-4915 Impact factor: 5.048
Characteristics of ascovirus genomes.
| Virus | Abbr. | % G+C | Accession | Length (bp) | Putative CDS | Host | Ref |
|---|---|---|---|---|---|---|---|
|
| |||||||
|
| SfAV1a | 49.26 | NC_008361.1 | 156922 | 123 |
| [ |
|
| TNAV2c | 35.24 | NC_008518.1 | 174059 | 165 | [ | |
|
| HVAV3e | 45.88 | NC_009233.1 | 186262 | 178 | [ | |
|
| HVAV3f | 46 | NC_044938.1 | 198157 | 190 |
| |
|
| TnAV3g | 45.85 | JX491653.1 | 199721 | 194 |
| [ |
|
| HvAV3h | 45.5 | KU170628.1 | 190519 | 185 |
| [ |
|
| HvAV3i | 45.42 | MF781070.1 | 185650 | 181 |
| [ |
|
| HvAV3j | 45.62 | LC332918.1 | 191718 | 189 |
| |
|
| TnAV6b | 35.43 | KY434117.1 | 185664 | 178 |
| |
|
| |||||||
|
| DpTV1a | 49.16 | NC_011335.1 | 119343 | 119 |
| [ |
| Dasineura jujubifolia toursvirus 2a | DjTV2a | 45.97 | MK867691.1 | 142600 | 141 |
| [ |
* Previously named Diadromus pulchellus ascovirus 4a [27]. Virus names in italics are recognized by the International Committee on Virus Taxonomy.
Figure 1Map of the 26 DiTV3a genomic fragments. Fragments derived from both SCR and NCR were combined. Arrows indicate ORFs and ORF orientation. Dark green, ORFs that hit toursvirus genes (DjTV2a/DpTV1a); light green, similar to other viral genes; blue, ORFs that hit non-viral genes; grey, unknown ORFs. *, partial sequence. The fragments identified from SCR and NCR isolates are provided in Figures S1 and S2.
Sequence coverage and nucleotide content of DiTV3a genome fragments (F1 to F26).
| Fragment | Length (bp) | % G+C | No. Putative CDS | Total Reads Mapped | Average Base Coverage | ||
|---|---|---|---|---|---|---|---|
| SCR | NCR | SCR | NCR | ||||
| F1 | 9099 | 30.02 | 9 | 4775 | 2333 | 52.48 | 25.64 |
| F2 | 19,537 | 29.5 | 22 | 10,479 | 2763 | 53.64 | 14.14 |
| F3 | 10,800 | 29.88 | 12 | 5553 | 1494 | 51.42 | 13.83 |
| F4 | 9493 | 28.68 | 13 | 4941 | 1670 | 52.05 | 17.59 |
| F5 | 6898 | 31.01 | 4 | 3533 | 880 | 51.22 | 12.76 |
| F6 | 5330 | 30.17 | 5 | 2746 | 733 | 51.52 | 13.75 |
| F7 | 4680 | 28.44 | 7 | 2022 | 636 | 43.21 | 13.59 |
| F8 | 4514 | 30.24 | 4 | 2275 | 590 | 50.40 | 13.07 |
| F9 | 4062 | 30.66 | 4 | 2048 | 800 | 50.42 | 19.69 |
| F10 | 3943 | 30.97 | 2 | 2057 | 520 | 52.17 | 13.19 |
| F11 | 3641 | 30.82 | 5 | 1794 | 655 | 49.27 | 17.99 |
| F12 | 2725 | 27.78 | 4 | 1390 | 348 | 51.01 | 12.77 |
| F13 | 2590 | 28.3 | 4 | 1346 | 346 | 51.97 | 13.36 |
| F14 | 2473 | 26.97 | 3 | 1252 | 440 | 50.63 | 17.79 |
| F15 | 2445 | 30.35 | 1 | 1234 | 274 | 50.47 | 11.21 |
| F16 | 2273 | 30.23 | 2 | 1122 | 344 | 49.36 | 15.13 |
| F17 | 1970 | 28.63 | 3 | 994 | 196 | 50.46 | 9.95 |
| F18 | 1606 | 27.21 | 3 | 813 | 178 | 50.62 | 11.08 |
| F19 | 1586 | 31.97 | 2 | 725 | 164 | 45.71 | 10.34 |
| F20 | 1487 | 31.07 | 2 | 698 | 182 | 46.94 | 12.24 |
| F21 | 1327 | 24.86 | 2 | 714 | 220 | 53.81 | 16.58 |
| F22 | 1135 | 29.46 | 2 | 623 | 156 | 54.89 | 13.74 |
| F23 | 1305 | 32.43 | 1 | 604 | 274 | 46.28 | 21.00 |
| F24 | 1270 | 29.53 | 1 | 600 | 316 | 47.24 | 24.88 |
| F25 | 1135 | 29.53 | 2 | 510 | 242 | 44.93 | 21.32 |
| F26 | 826 | 40.56 | 1 | 357 | 266 | 43.22 | 32.20 |
| 108,150 | 29.9719231 | 120 | 55,205 | 17,020 | 51.94 | 15.74 | |
Putative genes of DiTV3a per genome fragment.
| ORF | Length (aa) | Mr (kDa) | Gene | Similar ORFs | Functional Category † |
|---|---|---|---|---|---|
| F1_ORF1 | 166 * | 18.7 | hypothetical protein | IIV6_404L (Invertebrate | |
| F1_ORF2 | 154 | 18.5 | hypothetical protein | N/A | |
| F1_ORF3 | 386 | 45 | ribonucleoside-diphosphate reductase subunit M2 | DjTV_ORF44/AV955_gp066 | |
| F1_ORF4 | 124 | 15.4 | hypothetical protein | N/A | |
| F1_ORF5 | 181 | 21.1 | casein kinase 1, delta | none from viruses | |
|
| 941 | 109.1 | DNA polymerase | DjTV_ORF1/AV955_gp001 | 1 |
| F1_ORF7 | 59 | 6.8 | hypothetical protein | N/A | |
| F1_ORF8 | 109 | 12.8 | mobilome: prophages, transposons | phage anti-repressor protein | |
| F1_ORF9 | 580 | 66.5 | hypothetical protein | DjTV_ORF30/AV955_gp054 | |
| F2_ORF1 | 85 | 9.9 | hypothetical protein | N/A | |
|
| 774 | 90.5 | lipopolysaccharide-modifying enzyme | DjTV_ORF114/AV955_gp105 | 7 |
| F2_ORF3 | 151 | 17.5 | hypothetical protein | DjTV_ORF2/AV955_gp039 | |
| F2_ORF4 | 197 | 22.4 | hypothetical protein | DjTV_ORF98/AV955_gp024 | |
| F2_ORF5 | 280 | 33.9 | hypothetical protein (hit CDD pfam08793, 2c_adapt [cl07414], PTZ00449 [cl33186]) | none from viruses | |
| F2_ORF6 | 136 | 15.6 | hypothetical protein | DjTV_ORF100 | |
| F2_ORF7 | 800 | 93.7 | hypothetical protein | DjTV_ORF57/AV955_gp094 | |
|
| 158 | 18.9 | putative zinc-finger DNA binding protein | DjTV_ORF129/AV955_gp108 | 7 |
| F2_ORF9 | 195 | 21.9 | hypothetical protein | N/A | |
|
| 871 | 101.1 | DEAD-like helicase | DjTV_ORF81/AV955_gp020 | 1 |
| F2_ORF11 | 226 | 26.9 | UMP-CMP kinase 2, mitochondrial-like (thymidylate kinase) | none from viruses | |
| F2_ORF12 | 196 | 22.5 | thymidylate kinase | ORF of | |
| F2_ORF13 | 181 | 20.5 | hypothetic protein histone-lysine N-methyltransferase 2C-like | none from viruses | |
| F2_ORF14 | 117 | 13.3 | hypothetical protein | N/A | |
| F2_ORF15 | 459 | 52.6 | DNA ligase | R303 ( | |
| F2_ORF16 | 50 | 6.1 | hypothetical protein | N/A | |
|
| 278 | 32 | hypothetical protein | DjTV_ORF90/AV955_gp035 | 7 |
|
| 250 | 29.4 | acetyltransferase | DjTV_ORF115/AV955_gp081 | 5 |
| F2_ORF19 | 108 | 13.2 | hypothetical protein | N/A | |
| F2_ORF20 | 146 | 17 | acyl-CoA-binding protein domain containing protein | none from viruses | |
| F2_ORF21 | 91 | 10.9 | hypothetical protein | N/A | |
| F2_ORF22 | 164 | 19.2 | hypothetical protein | DjTV_ORF58/AV955_gp095 | |
| F3_ORF1 | 135 | 16.1 | hypothetical protein | DjTV_ORF19/AV955_gp071 | |
|
| 328 | 37.8 | DNA repair exonuclease | DjTV_ORF18/AV955_gp026 | 1 |
| F3_ORF3 | 87 | 10.4 | hypothetical protein | N/A | |
| F3_ORF4 | 240 | 28.4 | hypothetical protein | DjTV_ORF28/AV955_gp013 | |
|
| 1031 | 122.2 | dynein-like beta chain protein | DjTV_ORF94/AV955_gp085 | 7 |
| F3_ORF6 | 464 | 54.6 | hypothetical protein | DjTV_ORF94/AV955_gp079 | |
|
| 263 | 31.1 | RNaseIII | DjTV_ORF22/AV955_gp003 | 2 |
| F3_ORF8 | 349 | 41.3 | flap structure-specific endonuclease | DH26_gp060 (Anopheles minimus irodovirus) | 1 |
| F3_ORF9 | 162 | 19.4 | hypothetical protein | N/A | |
| F3_ORF10 | 67 | 7.9 | hypothetical protein | N/A | |
| F3_ORF11 | 71 | 7.9 | hypothetical protein | N/A | |
| F3_ORF12 | 60 * | 7.1 | hypothetical protein | N/A | |
| F4_ORF1 | 117 | 13.6 | hypothetical protein | DjTV_ORF107/AV955_gp045 | |
| F4_ORF2 | 130 | 15.5 | hypothetical protein | DjTV_ORF118/AV955_gp072 | |
|
| 345 | 41.4 | DNA binding/packing protein | DjTV_ORF99/AV955_gp103 | 3 |
| F4_ORF4 | 123 | 14.9 | hypothetical protein | N/A | |
|
| 455 | 51.8 | major capsid protein | DjTV_ORF83/AV955_gp019 | 3 |
| F4_ORF6 | 132 | 15.5 | thioredoxin-like protein | DjTV_ORF116/AV955_gp104 | |
|
| 365 | 43.3 | immediate early protein ICP-46 | DjTV_ORF97/AV955_gp043 | 7 |
|
| 104 | 12.4 | yabby-like transcription factor | DjTV_ORF110/AV_955_gp022 | 2 |
| F4_ORF9 | 207 | 23.6 | hypothetical protein | DjTV_ORF92/AV955_gp023 | |
| F4_ORF10 | 146 | 17.3 | putative RING finger protein | IIV22_063R (Invertebrate | |
| F4_ORF11 | 98 | 11.4 | hypothetical protein | putative protein 4 (Dougjudy virga-like virus) | |
|
| 239 | 28.5 | casein kinase 1-like protein 5/major virion DNA-binding protein | DjTV_ORF54/AV955_gp115 | 3 |
| F4_ORF13 | 277 | 31.4 | hypothetical protein | N/A | |
|
| 1029 | 116.2 | DdRp II | DjTV_ORF111/AV955_gp073 | 2 |
| F5_ORF2 | 60 | 7.2 | hypothetical protein | N/A | |
| F5_ORF3 | 650 | 76.1 | hypothetical protein | DjTV_ORF105 | |
| F5_ORF4 | 287 | 33.4 | hypothetical protein | N/A | |
| F6_ORF1 | 66 * | 8.1 | hypothetical protein | N/A | |
| F6_ORF2 | 143 | 16.2 | hypothetical protein | N/A | |
| F6_ORF3 | 982 | 115 | RHS repeat protein | none from viruses | |
| F6_ORF4 | 122 | 14.5 | hypothetical protein | DjTV_ORF69 | |
| F6_ORF5 | 279 | 32.7 | hypothetical protein | DjTV_ORF46/AV955_gp109 | |
| F7_ORF1 | 50 | 5.8 | hypothetical protein | N/A | |
| F7_ORF2 | 61 | 7 | hypothetical protein | N/A | |
| F7_ORF3 | 176 | 21.1 | uyr/REP helicase | DjTV_ORF89/AV955_gp028 | |
|
| 622 | 72.8 | ATPase | DjTV_ORF4/AV955_gp033 | 1 |
| F7_ORF5 | 123 | 14.3 | hypothetical protein | N/A | |
| F7_ORF6 | 170 | 19.5 | hydrolase, NUDIX family | DjTV_ORF83/AV955_gp005 | |
| F7_ORF7 | 81 * | 9.7 | putative RING finger protein | MIV027R (Invertebrate iridescent virus 3) | 7 |
|
| 836 | 97.6 | ATPase | DjTV_ORF63/AV955_gp093 | 1 |
| F8_ORF2 | 84 | 9.6 | hypothetical protein | DjTV_ORF91/AV955_gp030 | |
|
| 189 | 22.6 | thymidine kinase | DjTV_ORF12/AV955_gp055 | 1 |
|
| 224 | 27.2 | fatty acids protein | DjTV_ORF127/AV955_gp015 | 5 |
|
| 408 | 46.9 | RNA polymerase II | DjTV_ORF10/AV955_gp070 | 2 |
| F9_ORF2 | 126 | 14.6 | thiredoxin-like | DjTV_ORF128/AV955_gp044 | |
|
| 290 | 33.8 | myristylated membrane protein-like protein | DjTV_ORF40/AV955_gp065 | 7 |
| F9_ORF4 | 353 * | 40 | dUTP diphosphatase | none from viruses | |
|
| 877 * | 116.2 | DdRp | DjTV_ORF7/AV955_gp089 | 2 |
| F10_ORF2 | 356 | 42.4 | hypothetical protein | IIV31_072L (Armadillidium | |
|
| 254 * | 29.6 | myristylated membrane protein-like protein | DjTV_ORF40/AV955_gp065 | 5 |
| F11_ORF2 | 126 | 14.6 | thiredoxin-like | DjTV_ORF128/AV955_gp044 | |
| F11_ORF3 | 289 | 33.3 | thymidylate synthase | none from viruses | |
| F11_ORF4 | 216 | 25 | hypothetical protein | DjTV_ORF77/AV955_gp031 | |
| F11_ORF5 | 133 | 15.6 | transcription elongation factor S-II | DjTV_ORF78/AV955_gp082 | |
|
| 152 | 18.5 | hypothetical protein | DjTV_ORF26/AV955_gp010 | |
|
| 519 | 60.7 | hypothetical protein | DjTV_ORF25/AV955_gp036 | 4 |
| F12_ORF3 | 57 | 6.5 | hypothetical protein | N/A | |
| F12_ORF4 | 83 * | 9.3 | hypothetical protein | N/A | |
| F13_ORF1 | 79 | 9.5 | hypothetical protein | N/A | |
| F13_ORF2 | 104 | 11.9 | IIV22A_144R-like | IIV22A_144R (Invertebrate | |
|
| 206 | 24.3 | zinc-dependent metalloprotease | DjTV_ORF119/AV955_gp029 | 7 |
|
| 360 | 40.8 | major virion DNA-binding protein | DjTV_ORF98/AV955_gp008 | 3 |
| F14_ORF1 | 117 | 136 | DNA-directed RNA polymerases I, II, and III | DjTV_ORF66/AV955_gp058 | |
| F14_ORF2 | 202 | 23.5 | hypothetical protein | DjTV_ORF103/AV955_gp051 | |
|
| 264 | 31.2 | hypothetical protein | DjTV_ORF59/AV955_gp009 | |
|
| 728 | 83.9 | serine/threonine protein kinase | DjTV_ORF55/AV955_gp046 | 4 |
| F16_ORF1 | 269 | 31 | hypothetical protein | DjTV_ORF87/AV955_gp025 | |
|
| 384 | 44.1 | hypothetical protein | DjTV_ORF86/AV955_gp064 | |
|
| 260 | 29.8 | patatin-like phospholipase | DjTV_ORF140/AV955_gp087 | 5 |
| F17_ORF2 | 118 | 13.4 | hypothetical protein | AV955_gp107 | |
|
| 174 | 20.4 | hypothetical protein | DjTV_ORF9/AV955_gp116 | |
| F18_ORF1 | 95 * | 11.4 | hypothetical protein | N/A | |
| F18_ORF2 | 214 | 25.9 | hypothetical protein | DjTV_ORF121/AV955_gp060 | |
| F18_ORF3 | 140 | 16.9 | hypothetical protein | N/A | |
| F19_ORF1 | 79 * | 9.3 | hypothetical protein | N/A | |
|
| 445 | 49 | lipid membrane protein | DjTV_ORF61/AV955_gp040 | |
|
| 348 | 39.9 | putative myristylated membrane protein | AV955_gp065 | 7 |
|
| 62 * | 6.9 | hypothetical protein | DjTV_ORF67/AV955_gp063 | |
|
| 170 | 20.5 | CDT phosphatase transcription factor | DjTV_ORF81/AV955_gp117 | 2 |
| F21_ORF2 | 148 | 17.2 | hypothetical protein | DjTV_ORF82/AV955_gp097 | |
|
| 104 | 12.3 | sulfhydry1 oxidase Erv1 like protein | DjTV_ORF62/AV955_gp041 | |
|
| 257 | 30 | ATPase 3 | DjTV_ORF137/AV955_gp086 | |
|
| 358 | 40.9 | cathepsin B | DjTV_ORF50/AV955_gp048 | 6 |
| F24_ORF1 | 406 | 47.8 | hypothetical protein | DjTV_ORF72/AV955_gp096 | |
|
| 193 * | 22.7 | iap-3 | DjTV_ORF108/AV955_gp007 | 6 |
| F25_ORF2 | 73 | 8.5 | hypothetical protein | N/A | |
| F26_ORF1 | 234 | 27.1 | hypothetical protein | MIV075R (Invertebrate iridescent virus 3) | |
Bold, genes shared by ascoviruses [25]; Italic, identified by Wang et al., [26]; * partial sequences; † Gene function: 1, DNA replication and repair; 2, transcription; 3 Structural protein; 4, protein modification; 5, lipid metabolism; 6, apoptosis; 7, other.
Figure 2DiTV3a fragments fused to retrotransposon elements. Green, DiTV3a ORFs; blue, host genes with retrotransposon-related genes labeled (endonuclease-reverse transcriptase, activating signal cointegrator 1 complex subunit, GATA zinc finger domain-containing protein 14-like); gray, hypothetical proteins.
Figure 3Similarity of toursvirus proteins to proteins from other virus species. The virus group associated with the top 10 non-redundant hits from 1 to 10 from BLASTp analysis against the NCBI nr database are shown for putative ORFs of DpTV1, DjTV2 and DiTV3a. The numbers of proteins with similarity to the different virus groups or to other proteins are shown.
Figure 4Phylogenetic tree based on the concatenated sequences of 45 proteins encoded by 26 toursviruses, iridoviruses and ascoviruses. Protein sequences were selected based on BLASTp results (E value ≤ 0.001) and downloaded from the NCBI protein database. Methods for phylogenetic tree construction are as described in materials and methods. Bootstrap values (percent) are indicated. Virus groups are shown at right. Corresponding protein accession numbers for each virus are provided in Table S1.