Gaelen R Burke, Kimberly K O Walden, James B Whitfield, Hugh M Robertson, Michael R Strand.
Abstract
Microplitis demolitor (Hymenoptera: Braconidae) is a parasitoid used as a biological control agent against larval-stage Lepidoptera and serves as a model for studying the function and evolution of symbiotic viruses in the genus Bracovirus. Here we present the M. demolitor genome (assembly version 2.0), with a genome size of 241 Mb and N50 scaffold and contig sizes of 1.1 Mb and 14 kb, respectively. Using RNA-Seq data and manual annotation of genes of viral origin, we produced a high-quality gene set that includes 18,586 eukaryotic and 171 virus-derived protein-coding genes. Bracoviruses are dsDNA viruses with unusual genome architecture, in which the viral genome is integrated into the wasp genome and comprises two distinct components: proviral segments that are amplified, circularized, and packaged into virions for export into the wasp's host via oviposition; and replication genes. This genome assembly revealed that at least two scaffolds contain both nudivirus-like genes and proviral segments, demonstrating that at least some of these components are near each other in the genome on a single chromosome. The updated assembly and annotation are available in several publicly accessible databases, including the National Center for Biotechnology Information and the Ag Data Commons. In addition, all raw sequence data available for M. demolitor have been consolidated and are available for visualization at the i5k Workspace. This whole-genome assembly and annotation represent the only genome-scale, annotated assembly from the lineage of parasitoid wasps that has associations with bracoviruses (the 'microgastroid complex'), providing important baseline knowledge about the architecture of co-opted virus symbiont genomes.
Keywords: Braconidae; Hymenoptera; Microplitis demolitor bracovirus (MdBV); Polydnaviridae; symbiosis
Year: 2018 PMID: 30018085 PMCID: PMC6118312 DOI: 10.1534/g3.118.200308
Source DB: PubMed Journal: G3 (Bethesda) ISSN: 2160-1836 Impact factor: 3.154
Raw reads generated for assembly
| SRA | Library type | Read pairs | Base pairs | Material |
|---|---|---|---|---|
| SRX610757 | 180 bp | 138.9 M | 17.5 Gb | Single adult male |
| SRX641381 | 1.5 kb | 164.3 M | 20.9 Gb | 40 pooled adult male wasps |
| SRX641400 | 5 kb | 53.6 M | 7.1 Gb | 40 pooled adult male wasps |
| SRX641403 | 10 kb | 174.3 M | 22.1 Gb | 100 pooled adult male wasps |
| SRX641469 | 20 kb | 230.2 M | 28.3 Gb | 100 pooled adult male wasps |
| SRX981480 | 5 kb | 37.8 M | 5.1 Gb | 120 pooled adult male wasps |
Summary statistics for the Mdem2 assembly, the previous Mdem1 assembly, and select other parasitoid genomes
| Species | Assembly | NCBI BioProject | Contig count (N50 kb) | Scaffold count (N50 Mb) | Total length (Mb) | GC (%) |
|---|---|---|---|---|---|---|
| M. demolitor | Mdem2 | PRJNA251518 | 27,508 (14.12) | 1,794 (1.14) | 241.2 | 33.1 |
| M. demolitor | Mdem1 | PRJNA251518/PRJNA195937 | 36,718 (13.54) | 5,174 (0.32) | 250.5 | 33.1 |
| | ASM80636v1 | PRJNA258104 | 8,510 (51.90) | 1,042 (0.98) | 153.6 | 39.4 |
| | Dall1.0 | PRJNA306876 | 25,534 (44.93) | 3,968 (0.65) | 388.8 | 39.1 |
| | ASM95615v1 | PRJNA271135 | 9,156 (46.06) | — | 186.1 | 30.6 |
| | Nvit_2.1 | PRJNA13660 | 25,484 (18.84) | 6,169 (0.71) | 295.8 | 40.6 |
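For readers comparing the N50 columns above: N50 is the length of the shortest sequence in the smallest set of longest sequences that together cover at least half of the total assembly length. The following is a minimal Python sketch of that definition. The function name `n50` and the toy lengths are illustrative assumptions; they are not data from any assembly in the table, and this is not necessarily the tool used to produce the reported values.

```python
def n50(lengths):
    """Return the N50 of a collection of contig or scaffold lengths.

    Sequences are visited from longest to shortest; the N50 is the length
    of the sequence at which the running total first reaches half of the
    summed length of all sequences.
    """
    if not lengths:
        raise ValueError("need at least one sequence length")
    total = sum(lengths)
    running = 0
    for length in sorted(lengths, reverse=True):
        running += length
        # Compare with integers only to avoid floating-point rounding.
        if 2 * running >= total:
            return length


# Toy example (hypothetical lengths, arbitrary units).
toy_lengths = [2, 2, 2, 3, 3, 4, 8, 8]
print(n50(toy_lengths))
```

Because N50 depends on the total length as well as the length distribution, values are most comparable between assemblies of the same genome, such as Mdem1 and Mdem2.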
Gene annotation summary statistics
| Feature | Count | Mean length (bp) | Median length (bp) | Min length (bp) | Max length (bp) |
|---|---|---|---|---|---|
| Genes | 12,670 | 10,107 | 3,317 | 71 | 539,413 |
| All transcripts | 19,597 | 2,356 | 1,840 | 71 | 44,309 |
| mRNA | 18,586 | 2,417 | 1,897 | 189 | 44,309 |
| misc_RNA | 224 | 3,134 | 2,043 | 106 | 19,088 |
| tRNA | 183 | 74 | 73 | 71 | 84 |
| lncRNA | 604 | 911 | 671 | 91 | 5,617 |
| CDSs | 18,586 | 1,916 | 1,401 | 105 | 43,512 |
| Exons | 78,299 | 365 | 211 | 2 | 14,552 |
| Introns | 64,033 | 1,911 | 183 | 30 | 353,735 |
Raw reads from M. demolitor tissues or proviral segments used for annotation
| Track name | Bioproject/ Sample ID | Nucleic acid | Number of reads | Percent aligned (quality-filtered) reads | Reference |
|---|---|---|---|---|---|
| Venom glands | PRJNA214515/SAMN02319525 | RNA | 136.4 M | 95% | |
| Larvae | PRJNA214515/SAMN02319526 | RNA | 95.7 M | 93% | |
| Teratocytes | PRJNA214515/SAMN02319527 | RNA | 99.0 M | 88% | |
| Ovaries | PRJNA214515/SAMN02319528 | RNA | 103.3 M | 90% | Burke and Strand 2012 |
| Infected | PRJNA285771/SAMN03758721 | RNA | 73.7 M | 2% | |
| Infected | PRJNA437008/SAMN08637637-SAMN08637639; SAMN08637643-SAMN08637645 | RNA | 22.0 M | 5% | |
| Infected | PRJNA437008/SAMN08637630, SAMN08637631, SAMN08637636, SAMN08637640-SAMN08637642 | RNA | 37.5 M | 4% | |
| Bracovirus DNA | PRJNA319039/SAMN04875661 | DNA | 49.5 M | 99% | |
| Ovary DNA | PRJNA319039/SAMN04875652 | DNA | 17.6 M | 99% | |
BUSCO analysis of parasitoid wasp genomes
| Species | CDS count | NCBI Refseq annotation release | Complete (% of total BUSCOs) | Fragmented (% of total BUSCOs) | Missing (% of total BUSCOs) |
|---|---|---|---|---|---|
| M. demolitor | 18,586 | 101 | 2621 (97) | 34 (1.2) | 20 (0.7) |
| | 18,906 | 100 | 2605 (97) | 37 (1.3) | 33 (1.2) |
| | 19,692 | 100 | 2622 (98) | 31 (1.1) | 22 (0.8) |
| | 24,846 | 102 | 2585 (96) | 34 (1.2) | 50 (1.8) |
Figure 1. Schematic showing genome architecture and replication of bracoviruses. Proviral segments (shown as blue chevrons) reside in dispersed locations in the wasp genome and contain virulence genes. Proviral segments specifically amplify in the nuclei of calyx cells as Replication Units prior to segment excision and circularization. Nudivirus-like replication genes also reside in dispersed locations in the wasp genome (as indicated by hash marks) and encode proteins required to make virions. Expression of nudivirus-like replication genes in calyx cell nuclei results in formation of virions that package circularized proviral segments. Mature virions are released into the calyx region of wasp ovaries by lysis of calyx cells. Mature virions are replication-defective because the encapsidated proviral segments do not contain nudivirus-like replication genes. Wasps inject mature virions into host insects, which infect different cell types and express virulence genes that affect host immune defenses and growth.
Coordinates for boundaries of proviral segments and amplified replication units
| Segment | Locus | Orientation | Accession number | Segment start | Segment end | Amplification start | Amplification end |
|---|---|---|---|---|---|---|---|
| P | 1 | — | NW_014464280.1 | 204368 | 217086 | 202948 | |
| K1 | 1 | — | NW_014464280.1 | 217246 | 232492 | ||
| K | 1 | — | NW_014464280.1 | >235072 | 245355 | ||
| Q | 1 | — | NW_014464280.1 | 245514 | 261072 | ||
| D | 1 | — | NW_014464280.1 | 261199 | >274917 | | >274917 |
| D | 1 | + | NW_014464327.1 | 325333 | >327729 | | >327729 |
| B | 1 | + | NW_014464327.1 | 315630 | 321681 | ||
| A | 1 | + | NW_014464327.1 | 286830 | 312116 | ||
| L | 1 | + | NW_014464327.1 | 270462 | 285675 | ||
| F | 1 | — | NW_014464327.1 | 259195 | >268430 | ||
| I | 1 | — | NW_014464327.1 | 245419 | >258067* | ||
| M | 1 | + | NW_014464327.1 | 228747 | 244908 | ||
| G | 1 | — | NW_014464327.1 | 215667 | 227866 | 214815 | |
| O | 1 | — | NW_014464327.1 | >209595 | 214086 | 186757 | 214590 |
| V | 2 | + | NW_014463818.1 | >1914991* | 1929656 | | 1929927 |
| W | 2 | + | NW_014463818.1 | 1899259 | >1901856* | ||
| E | 2 | — | NW_014463818.1 | 1890266 | 1898510 | ||
| C | 2 | + | NW_014463818.1 | 1882990 | 1890217 | ||
| X | 2 | — | NW_014463818.1 | 1867002 | 1881299 | 1866484 | |
| N | 3 | — | NW_014463791.1 | 1514968 | 1532949 | 1511956 | 1533620 |
| J | 3 | — | NW_014463791.1 | 1535308 | 1549002 | 1534400 | 1550957 |
| H | 4 | + | NW_014463921.1 | 383509 | 394750 | 382579 | 395765 |
| R | 5 | — | NW_014464373.1 | 36111 | 42534 | 32781 | 43423 |
| S | 6 | — | NW_014463823.1 | 1794688 | 1806656 | 1780975 | 1810556 |
| T | 7 | + | NW_014464188.1 | 421495 | 427196 | 421290 | 429489 |
| U | 8 | + | NW_014463797.1 | 3414939 | 3421513 | 3414736 | 3423057 |
Each proviral segment and its associated locus are listed in a row along with the M. demolitor genome scaffold where the segment is located. Scaffold accession numbers are indicated along with the coordinates for the boundaries of each proviral segment. Amplification start and end coordinates are listed for each replication unit (RU) that contains one segment. For multi-segment RUs, the amplification start and end coordinates correspond to the outermost segments. “>” signs indicate that gaps in scaffolds or scaffold termini prevent determination of segment or replication unit ends. “*” is similar to “>”, but segment ends are detected in smaller contigs that were not incorporated into scaffolds (e.g., ends of Segments V and W are in NW_014463725.1, Mdem_contig_4120015, while the end of Segment I is in NW_014463324.1, Mdem_contig_4046930).
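The “>” and “*” markers described above mean that coordinate cells in this table cannot be read as plain integers. The following is a minimal Python sketch of one way to parse them. The names `Coord`, `parse_coord`, and `span_length` are hypothetical, and the length calculation assumes 1-based, inclusive coordinates, which the table does not state explicitly.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Coord:
    position: int
    undetermined: bool        # ">" marker: true boundary lies beyond a gap or scaffold end
    in_unplaced_contig: bool  # "*" marker: boundary found in a contig outside the scaffold


def parse_coord(cell: str) -> Optional[Coord]:
    """Parse one coordinate cell such as '1514968', '>274917' or '>1914991*'."""
    text = cell.strip()
    if not text:
        return None  # empty cell: no coordinate reported
    undetermined = text.startswith(">")
    starred = text.endswith("*")
    digits = text.lstrip(">").rstrip("*")
    return Coord(int(digits), undetermined, starred)


def span_length(start: Coord, end: Coord) -> int:
    """Length of a span, ASSUMING 1-based inclusive coordinates.

    If either boundary is undetermined, the result is only the portion
    present on the scaffold, not the full segment length.
    """
    return end.position - start.position + 1


# Segment N (locus 3), using the segment coordinates from the table.
n_start = parse_coord("1514968")
n_end = parse_coord("1532949")
print(span_length(n_start, n_end))  # 17982 under the inclusive-coordinate assumption

# Segment V start carries both markers.
print(parse_coord(">1914991*"))
```

Keeping the markers as separate flags, rather than discarding them, lets downstream code exclude or annotate segments whose boundaries are incomplete.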