| Literature DB >> 28173110 |
David Lepetit1, Benjamin Gillet2, Sandrine Hughes2, Ken Kraaijeveld3, Julien Varaldi1.
Abstract
Parasites are sometimes able to manipulate the behavior of their hosts. However, the molecular cues underlying this phenomenon are poorly documented. We previously reported that the parasitoid wasp Leptopilina boulardi which develops from Drosophila larvae is often infected by an inherited DNA virus. In addition to being maternally transmitted, the virus benefits from horizontal transmission in superparasitized larvae (Drosophila that have been parasitized several times). Interestingly, the virus forces infected females to lay eggs in already parasitized larvae, thus increasing the chance of being horizontally transmitted. In a first step towards the identification of virus genes responsible for the behavioral manipulation, we present here the genome sequence of the virus, called LbFV. The sequencing revealed that its genome contains an homologous repeat sequence (hrs) found in eight regions in the genome. The presence of this hrs may explain the genomic plasticity that we observed for this genome. The genome of LbFV encodes 108 ORFs, most of them having no homologs in public databases. The virus is however related to Hytrosaviridae, although distantly. LbFV may thus represent a member of a new virus family. Several genes of LbFV were captured from eukaryotes, including two anti-apoptotic genes. More surprisingly, we found that LbFV captured from an ancestral wasp a protein with a Jumonji domain. This gene was afterwards duplicated in the virus genome. We hypothesized that this gene may be involved in manipulating the expression of wasp genes, and possibly in manipulating its behavior.Entities:
Mesh:
Substances:
Year: 2016 PMID: 28173110 PMCID: PMC5381508 DOI: 10.1093/gbe/evw277
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
Genomic features of representative dsDNA insect viruses
| Virus name | Family | Accession no. | Genome size (bp) | AT | Coding density (%) | # BLASTp hits with LbFV ORFs as queries | # best BLASTp hit with LbFV ORFs as queries | |
|---|---|---|---|---|---|---|---|---|
| Baculoviridae | NC_001973 | 161,046 | 164 | 42.5 | 87.5 | 8 | 2 | |
| Baculoviridae | NC_002816 | 123,500 | 143 | 54.7 | 90.1 | 7 | 2 | |
| Baculoviridae | NC_001623 | 133,894 | 156 | 59.3 | 97.2 | 5 | 0 | |
| Baculoviridae | NC_005905 | 86,462 | 90 | 66.2 | 84.5 | 7 | 0 | |
| Baculoviridae | NC_003084 | 108,252 | 109 | 49.1 | 91.2 | 4 | 0 | |
| Hytrosaviridae | NC_010356 | 190,032 | 160 | 72.0 | 86.5 | 16 | 9 | |
| Hytrosaviridae | EU522111 | 124,279 | 108 | 56.5 | 90.9 | 20 | 8 | |
| Nudiviridae | EF203088 | 96,944 | 98 | 72.0 | 93.6 | 6 | 2 | |
| Nudiviridae | KM610234 | 145,704 | 131 | 74.5 | 85.7 | 5 | 2 | |
| Nudiviridae | AF451898 | 228,089 | 154 | 58.2 | 69.4 | 8 | 2 | |
| Nudiviridae | EU747721 | 127,615 | 139 | 58.4 | 88.5 | 6 | 1 | |
| Nudiviridae | KJ184318 | 119,638 | 115 | 65.5 | 95.6 | 6 | 0 | |
| unassigned | KR819915 | 496,396 | 247 | 49.2 | 65.0 | 12 | 4 | |
Represents e-values <0.01
FSuperparasitism phenotype associated with LbFV infection: distribution of wasp eggs deposited by (A) nonsuperparasitizing uninfected females and by (B) LbFV-infected superparasitizing females within Drosophila larvae. Wasp were tested under controlled laboratory conditions (see “Methods” section for details). The number of female tested was 6 and 20, respectively, for NS and S lines.
FCharacterization of the viral nucleic acids. (A) Electrophoresis of viral nucleic acids extracted from Leptopilina boulardi wasps and (B) viral nucleic acids were treated with DNAse I or RNAse A. ND, not digested. The ladders are expressed in kilobases.
FSimplified De Bruijn graph showing the connexions among viral contigs after assembly of Illumina reads. Each circle, so called node in the Velvet terminology, represents a contig. Its diameter is proportional to its length. Each edge connection between two contigs indicates contiguity among contigs. Edge width is proportional to the number of connexions found among nodes. The dark grey circles represent the eight large contigs identified by Velvet assembler (13, 101, 65, 42, 99, 70, 67, 12).
FLbFV contigs identified after assembling Illumina and 454 reads are flanked by homologous regions (hrs). Alignment of homologous regions found in the LbFV genome. The contig name, start and end position of each sequence is indicated on the left. The last line of the alignment corresponds to the final consensus sequence of all contigs.
FConnexion among contigs identified by (A) mapping paired-end reads on the LbFV contigs masked for repeated sequence. The eight LbFV contigs are arbitrarily ranked according to their length and represented within a circular chromosome. Grey areas indicate genomic regions homologous to the hrs motif. The blue lines represent the mapping of paired-end reads on different contigs extremities. In total 96 blue lines are represented corresponding to 96 paired Illumina reads. (B) PCR amplification. The 16 PCR primers used in this experiment are symbolically represented by the arrows (not scale). 51 out of 120 primer combinations gave a positive PCR and are represented by a blue line connecting contig extremities. The red lines indicate two cases where the PCR was positive with a single primer. (C) The PCR amplifications often gave several bands. As an example, the PCR product obtained with the 7f and 2r primers (identified in fig. 5 by stars) is shown. The full set of gel images is provided in supplementary figure S2, Supplementary Material online. λ, molecular weight marker; 7f2r, PCR product.
FRepresentation of the gene and repeat content of the circular dsDNA genome of LbFV. The black line represents the coverage for Illumina reads and the grey one represents the coverage for the MinION 2D reads (left y axis). The green line indicates the GC in a sliding window of 2000 bp (right y axis). Unresolved sequences correspond to genomic regions where only MinION reads are available and where polymorphism is very likely.
ORF prediction in LbFV genome
| ORF # | O | Start | Stop | Length (aa) | Hits found | Top Hit accession number | Description | e-value | Top hit species | Kingdom | Accession | Clan | Description | Start | End | Ind-e-value | Cond-e-value | TM | SP | struct. MdSGHV | struct. GpSGHV |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| ORF1 | + | 214 | 1164 | 317 | 585 | W6JLD1_9POXV | Translation elongation factor eEF-3 like | 3.50E-34 | Anomala cuprea entomopoxvirus | Virus | PF14388.4 | n/a | Domain of unknown function (DUF4419) | 26 | 311 | 6.60E-45 | 4.00E-49 | ||||
| ORF2 | + | 1243 | 2247 | 335 | 0 | ||||||||||||||||
| ORF3 | + | 2314 | 2472 | 53 | 0 | ||||||||||||||||
| ORF4 | − | 2514 | 3530 | 339 | 0 | ||||||||||||||||
| ORF5 | + | 3767 | 5857 | 697 | 1 | A0A084WAW3_ANOSI | Nav1.4a | 5.70E-06 | Culex quinquefasciatus | Eukaryote | 2 | ||||||||||
| ORF6 | + | 5899 | 6450 | 184 | 0 | ||||||||||||||||
| ORF7 | + | 6455 | 7237 | 261 | 0 | ||||||||||||||||
| ORF8 | + | 7379 | 7633 | 85 | 0 | ||||||||||||||||
| ORF9 | + | 7721 | 8041 | 107 | 0 | ||||||||||||||||
| ORF10 | − | 8197 | 8475 | 93 | 2304 | C8AXE7_9CICH | TMO-4c4 (Fragment) | 1.30E-23 | Oreochromis niloticus | Eukaryote | |||||||||||
| ORF11 | − | 8838 | 11177 | 780 | 94 | B4MZD3_DROWI | Uncharacterized protein | 2.70E-17 | Lucilia cuprina | Eukaryote | PF02373.20 | CL0029 | JmjC domain, hydroxylase | 475 | 602 | 5.00E-09 | 3.10E-13 | ||||
| ORF12 | + | 11590 | 11769 | 60 | 0 | ||||||||||||||||
| ORF13 | − | 12479 | 14209 | 577 | 929 | A0A0P6K4V2_HETGA | V-type proton ATPase subunit a | 2.10E-20 | Fukomys damarensis | Eukaryote | PF02373.20 | CL0029 | JmjC domain, hydroxylase | 283 | 403 | 1.20E-10 | 7.40E-15 | ||||
| ORF14 | + | 15338 | 16237 | 300 | 169 | A0A0B5A143_9ABAC | Uncharacterized protein | 1.10E-21 | Pseudoplusia includens SNPV IE | Virus | PF02498.15 | n/a | BRO family, N-terminal domain | 97 | 195 | 5.50E-14 | 3.40E-18 | ||||
| ORF15 | + | 16558 | 17091 | 178 | 0 | ||||||||||||||||
| ORF16 | + | 17012 | 17185 | 58 | 0 | ||||||||||||||||
| ORF17 | + | 17615 | 17932 | 106 | 0 | ||||||||||||||||
| ORF18 | + | 17954 | 18655 | 234 | 0 | ||||||||||||||||
| ORF19 | + | 18662 | 19342 | 227 | 0 | ||||||||||||||||
| ORF20 | − | 19754 | 20902 | 383 | 0 | ||||||||||||||||
| ORF21 | − | 20974 | 21804 | 277 | 0 | ||||||||||||||||
| ORF22 | − | 21829 | 22842 | 338 | 0 | ||||||||||||||||
| ORF23 | + | 22941 | 23168 | 76 | 0 | ||||||||||||||||
| ORF24 | + | 23234 | 23683 | 150 | 0 | ||||||||||||||||
| ORF25 | − | 23838 | 24026 | 63 | 0 | ||||||||||||||||
| ORF26 | + | 24673 | 24882 | 70 | 0 | ||||||||||||||||
| ORF27 | + | 25506 | 26531 | 342 | 3883 | A0A182XPW4_ANOQN | Uncharacterized protein | 5.00E-100 | Anopheles christyi | Eukaryote | PF00653.19 | CL0417 | Inhibitor of Apoptosis domain | 11 | 78 | 3.50E-15 | 4.30E-19 | ||||
| PF00653.19 | CL0417 | Inhibitor of Apoptosis domain | 172 | 237 | 4.90E-16 | 6.00E-20 | |||||||||||||||
| PF13920.4 | CL0229 | Zinc finger, C3HC4 type (RING finger) | 290 | 334 | 1.10E-11 | 1.30E-15 | |||||||||||||||
| ORF28 | − | 26599 | 27075 | 159 | 0 | ||||||||||||||||
| ORF29 | − | 27068 | 27736 | 223 | 0 | ||||||||||||||||
| ORF30 | + | 27781 | 28251 | 157 | 0 | ||||||||||||||||
| ORF31 | − | 28266 | 28901 | 212 | 0 | ||||||||||||||||
| ORF32 | + | 29002 | 30687 | 562 | 0 | ||||||||||||||||
| ORF33 | + | 30976 | 32172 | 399 | 16 | Q9YVQ5_MSEPV | ORF MSV187 putative late transcription factor VLTF-2 homolog (Vaccinia A1L), similar to SW:P33814 | 9.20E-13 | Melanoplus sanguinipes entomopoxvirus | Virus | |||||||||||
| ORF34 | + | 32165 | 32380 | 72 | 0 | ||||||||||||||||
| ORF35 | + | 32481 | 33695 | 405 | 7 | Q9YVQ5_MSEPV | ORF MSV187 putative late transcription factor VLTF-2 homolog (Vaccinia A1L), similar to SW:P33814 | 6.50E-09 | Melanoplus sanguinipes entomopoxvirus | Virus | |||||||||||
| ORF36 | − | 33675 | 38513 | 1613 | 0 | ||||||||||||||||
| ORF37 | − | 38599 | 39297 | 233 | 106 | Q7T5L7_GVCL | Dbp | 3.20E-15 | Cryptophlebia leucotreta granulosis virus | Virus | PF00293.26 | CL0261 | NUDIX domain | 23 | 165 | 6.20E-09 | 3.80E-13 | ||||
| ORF38 | + | 39418 | 39630 | 71 | 0 | ||||||||||||||||
| ORF39 | − | 39643 | 39909 | 89 | 0 | ||||||||||||||||
| ORF40 | + | 39929 | 40141 | 71 | 0 | ||||||||||||||||
| ORF41 | − | 40381 | 40533 | 51 | 0 | ||||||||||||||||
| ORF42 | − | 40611 | 41324 | 238 | 0 | ||||||||||||||||
| ORF43 | + | 41497 | 42507 | 337 | 0 | ||||||||||||||||
| ORF44 | + | 43692 | 43880 | 63 | 0 | ||||||||||||||||
| ORF45 | + | 44031 | 44186 | 52 | 1 | A6YEY0_9NEOP | NADH- ubiquinone oxidoreductase chain 1 (Fragment) | 0.00018 | Operophtera brumata | Eukaryote | |||||||||||
| ORF46 | + | 44418 | 44576 | 53 | 0 | ||||||||||||||||
| ORF47 | + | 44612 | 44770 | 53 | 0 | ||||||||||||||||
| ORF48 | + | 45204 | 45365 | 54 | 0 | ||||||||||||||||
| ORF49 | − | 45626 | 45889 | 88 | 166 | W4XJ30_STRPU | Uncharacterized protein | 2.20E-06 | Strongylocentrotus purpuratus | Eukaryote | PF01712.17 | CL0023 | Deoxynucleoside kinase | 1 | 86 | 1.90E-11 | 1.20E-15 | ||||
| ORF50 | − | 45946 | 46284 | 113 | 120 | D9YXA4_9CRUS | NADH-ubiquinone oxidoreductase chain 2 (Fragment) | 4.60E-10 | Daphnia magna | Eukaryote | PF01712.17 | CL0023 | Deoxynucleoside kinase | 5 | 111 | 2.70E-16 | 1.70E-20 | ||||
| ORF51 | − | 46894 | 48393 | 500 | 0 | ||||||||||||||||
| ORF52 | − | 48380 | 49501 | 374 | 37 | A0A120HYA7_GHVS | Per-os infectivity factor 2-like protein | 5.30E-13 | Glossina hytrovirus | Virus | 1 | x | |||||||||
| ORF53 | − | 49589 | 49843 | 85 | 0 | ||||||||||||||||
| ORF54 | − | 49905 | 50240 | 112 | 2 | A0A0A1HB82_LINLA | Cytochrome c oxidase subunit 1 (Fragment) | 6.00E-13 | Strongylocentrotus purpuratus | Eukaryote | 2 | ||||||||||
| ORF55 | + | 50261 | 50857 | 199 | 0 | ||||||||||||||||
| ORF56 | − | 50854 | 51411 | 186 | 0 | ||||||||||||||||
| ORF57 | + | 51395 | 51547 | 51 | 0 | ||||||||||||||||
| ORF58 | + | 51528 | 55454 | 1309 | 3 | A0A0K1L634_9VIRU | ORF_007L | 6.30E-05 | Scale drop disease virus Iridoviridae | Virus | PF00136.19 | CL0194 | DNA polymerase family B | 599 | 880 | 2.30E-20 | 1.40E-24 | 2 | |||
| ORF59 | − | 55493 | 56143 | 217 | 0 | ||||||||||||||||
| ORF60 | − | 56177 | 57265 | 363 | 4 | R9XIV9_ASHAC | AaceriAGL139Wp | 4.70E-09 | Glossina hytrovirus | Virus | PF02450.13 | CL0028 | Lecithin:cholesterol acyltransferase | 57 | 351 | 1.60E-07 | 9.60E-12 | ||||
| ORF61 | − | 57490 | 59226 | 579 | 476 | B4JWM1_DROGR | GH22716 | 4.80E-78 | Drosophila melanogaster | Eukaryote | mucin | 1 | |||||||||
| ORF62 | − | 59414 | 59950 | 179 | 0 | ||||||||||||||||
| ORF63 | − | 60093 | 62282 | 730 | 153 | B0YLJ3_GHVS | p74 protein-like protein | 1.30E-68 | Glossina hytrovirus | Virus | x | x | |||||||||
| ORF64 | − | 62255 | 62407 | 51 | 0 | ||||||||||||||||
| ORF65 | + | 63240 | 63398 | 53 | 0 | ||||||||||||||||
| ORF66 | + | 63602 | 65116 | 505 | 1892 | B4NG60_DROWI | Uncharacterized protein | 1.30E-50 | Lucilia cuprina | Eukaryote | PF00653.19 | CL0417 | Inhibitor of Apoptosis domain | 123 | 189 | 2.40E-12 | 2.90E-16 | ||||
| PF00653.19 | CL0417 | Inhibitor of Apoptosis domain | 290 | 355 | 1.60E-14 | 2.00E-18 | |||||||||||||||
| PF13920.4 | CL0229 | Zinc finger, C3HC4 type (RING finger) | 396 | 442 | 2.30E-08 | 2.80E-12 | |||||||||||||||
| PF13920.4 | CL0229 | Zinc finger, C3HC4 type (RING finger) | 452 | 495 | 1.60E-06 | 2.00E-10 | |||||||||||||||
| ORF67 | − | 65213 | 66142 | 310 | 4 | A0A0U4AXZ3_9VIRU | Putative helicase | 2.30E-06 | Pithovirus sibericum | Virus | |||||||||||
| ORF68 | − | 66316 | 68253 | 646 | 0 | ||||||||||||||||
| ORF69 | + | 68261 | 68521 | 87 | 0 | ||||||||||||||||
| ORF70 | + | 68697 | 68873 | 59 | 0 | ||||||||||||||||
| ORF71 | − | 68975 | 69184 | 70 | 0 | ||||||||||||||||
| ORF72 | + | 69345 | 69665 | 107 | 0 | ||||||||||||||||
| ORF73 | + | 69662 | 70030 | 123 | 0 | ||||||||||||||||
| ORF74 | + | 70085 | 70495 | 137 | 0 | ||||||||||||||||
| ORF75 | + | 70641 | 70805 | 55 | 0 | ||||||||||||||||
| ORF76 | − | 70836 | 71072 | 79 | 0 | ||||||||||||||||
| ORF77 | − | 71685 | 71921 | 79 | 0 | ||||||||||||||||
| ORF78 | + | 72249 | 74279 | 677 | 5 | A0A0R3WJU6_HYDTA | Uncharacterized protein | 1.60E-09 | Glossina hytrovirus | Virus | 1 | x | |||||||||
| ORF79 | + | 74511 | 75440 | 310 | 0 | ||||||||||||||||
| ORF80 | + | 75437 | 76492 | 352 | 0 | ||||||||||||||||
| ORF81 | − | 76495 | 78105 | 537 | 5912 | B0YLU3_GHVS | Uncharacterized protein | 2.20E-18 | Glossina hytrovirus | Virus | PF00004.27 | CL0023 | ATPase associated with various activities (AAA) | 202 | 331 | 4.40E-14 | 2.70E-18 | x | x | ||
| ORF82 | + | 78323 | 78643 | 107 | 0 | ||||||||||||||||
| ORF83 | + | 78694 | 79995 | 434 | 0 | ||||||||||||||||
| ORF84 | − | 80036 | 80317 | 94 | 0 | ||||||||||||||||
| ORF85 | + | 81459 | 82106 | 216 | 77 | B2YG85_MHVB | Ac81-like protein | 6.70E-10 | Musca hytrovirus | Virus | PF05820.9 | n/a | Baculovirus protein of unknown function (DUF845) | 29 | 147 | 0.00015 | 9.10E-09 | 2 | x | ||
| ORF86 | + | 82084 | 82293 | 70 | 0 | ||||||||||||||||
| ORF87 | + | 82378 | 82908 | 177 | 0 | ||||||||||||||||
| ORF88 | + | 82908 | 83156 | 83 | 0 | ||||||||||||||||
| ORF89 | - | 83275 | 83892 | 206 | 2 | B6DZA6_9VIRU | Uncharacterized protein | 4.40E-107 | Leptopilina boulardi filamentous virus | Virus | 1 | ||||||||||
| ORF90 | + | 84156 | 85292 | 379 | 0 | ||||||||||||||||
| ORF91 | + | 85375 | 85536 | 54 | 0 | ||||||||||||||||
| ORF92 | + | 85846 | 90627 | 1594 | 2 | B0YLK7_GHVS | Per-os infectivity factor 2-like protein | 3.30E-18 | Glossina hytrovirus | Virus | 4 | x | |||||||||
| ORF93 | + | 90767 | 90979 | 71 | 0 | ||||||||||||||||
| ORF94 | + | 91062 | 91610 | 183 | 0 | ||||||||||||||||
| ORF95 | + | 91631 | 91834 | 68 | 1 | A0A061DCU0_BABBI | Uncharacterized protein | 0.00012 | Theileria annulata | Eukaryote | 1 | ||||||||||
| ORF96 | − | 91717 | 94863 | 1049 | 2 | B0YLT7_GHVS | Putative uncharacterized protein | 5.50E-28 | Glossina hytrovirus | Virus | x | ||||||||||
| ORF97 | + | 94936 | 95808 | 291 | 0 | x | x | ||||||||||||||
| ORF98 | + | 95820 | 96437 | 206 | 0 | ||||||||||||||||
| ORF99 | + | 96467 | 97387 | 307 | 0 | ||||||||||||||||
| ORF100 | − | 97479 | 97787 | 103 | 0 | ||||||||||||||||
| ORF101 | − | 97813 | 98703 | 297 | 1 | R9X889_ASHAC | Vacuolar ATPase assembly integral membrane protein VMA21 | 6.70E-06 | Musca hytrovirus | Virus | PF10553.7 | n/a | MSV199 domain | 23 | 101 | 5.40E-05 | 3.30E-09 | 1 | |||
| ORF102 | − | 99010 | 100092 | 361 | 0 | ||||||||||||||||
| ORF103 | − | 100137 | 100715 | 193 | 0 | ||||||||||||||||
| ORF104 | + | 100910 | 101581 | 224 | 0 | ||||||||||||||||
| ORF105 | + | 103681 | 104427 | 249 | 1 | A0A016TCB2_9BILA | Hexosyltransferase | 2.40E-06 | Caenorhabditis brenneri | Eukaryote | |||||||||||
| ORF106 | − | 104449 | 106506 | 686 | 138 | A0A0B7JV03_BIOOC | Occlusion-derived virus envelope protein | 1.10E-50 | Musca hytrovirus | Virus | PF08124.9 | CL0372 | Polysaccharide lyase family 8, N terminal alpha-helical domain | 56 | 283 | 1.30E-10 | 8.20E-15 | 2 | x | ||
| ORF107 | + | 106977 | 108854 | 626 | 1 | A0A158R6B8_9BILA | Uncharacterized protein | 0.00023 | Helobdella robusta | Eukaryote | |||||||||||
| ORF108 | + | 108899 | 110278 | 460 | 28 | G5AXB1_HETGA | Protein CASP | 2.10E-22 | Fukomys damarensis | Eukaryote |
Note.—The annotation was done using hmmer algorithm for protein (phmmer) and domain (hmmscan) searches.
O: Orientation; SP, signal peptide; TM, transmembrane domain; Struct. MdSGHV: homolog has been identified as a structural protein in MdSGHV; Struct. GpSGHV: homolog has been identified as a structural protein in GpSGHV.
FThe phylogeny of two putative anti-apoptotic genes found in LbFV genome suggest that they have been acquired from eukaryotes. (A) ORF27 and (B) ORF66. The branch supports were estimated by approximate Likelihood ratio tests (aLRT). Only aLRT > 0.70 are represented. Eukaryotic lineages are represented in blue and virus lineages are in red.
FThe phylogeny of LbFV ORFs 11 and 13 suggests that they have been horizontally acquired from eukaryotes and evolved subsequently through a duplication in the genome of LbFV. The branch supports were estimated by approximate Likelihood ratio tests (aLRT). Only aLRT > 0.70 are represented. Eukaryotic lineages are represented in blue and virus lineages are in red.
FPhylogenetic reconstruction based on the concatenation of 6 genes (ORF37, ORF52, ORF58, ORF85 and ORF106). The conserved blocks were concatenated and the phylogeny was built using PhyML. The phylogenetic signal was congruent among the six genes. The branch supports were estimated by approximate Likelihood ratio tests (aLRT). Only aLRT > 0.70 are represented.