| Literature DB >> 32859687 |
Thomas W R Harrop1, Joseph Guhlin1, Gemma M McLaughlin1, Elizabeth Permina1, Peter Stockwell2, Josh Gilligan1, Marissa F Le Lec1, Monica A M Gruber3, Oliver Quinn3, Mackenzie Lovegrove1, Elizabeth J Duncan4, Emily J Remnant5, Jens Van Eeckhoven4, Brittany Graham4, Rosemary A Knapp4, Kyle W Langford6, Zev Kronenberg6, Maximilian O Press6, Stephen M Eacker6, Erin E Wilson-Rankin7, Jessica Purcell7, Philip J Lester3, Peter K Dearden8.
Abstract
Social wasps of the genus Vespula have spread to nearly all landmasses worldwide and have become significant pests in their introduced ranges, affecting economies and biodiversity. Comprehensive genome assemblies and annotations for these species are required to develop the next generation of control strategies and monitor existing chemical control. We sequenced and annotated the genomes of the common wasp (Vespula vulgaris), German wasp (Vespula germanica), and the western yellowjacket (Vespula pensylvanica). Our chromosome-level Vespula assemblies each contain 176-179 Mb of total sequence assembled into 25 scaffolds, with 10-200 unanchored scaffolds, and 16,566-18,948 genes. We annotated gene sets relevant to the applied management of invasive wasp populations, including genes associated with spermatogenesis and development, pesticide resistance, olfactory receptors, immunity and venom. These genomes provide evidence for active DNA methylation in Vespidae and tandem duplications of venom genes. Our genomic resources will contribute to the development of next-generation control strategies, and monitoring potential resistance to chemical control.Entities:
Keywords: Hymenoptera; Vespula germanica; Vespula pensylvanica; Vespula vulgaris; genomes; social insects
Mesh:
Year: 2020 PMID: 32859687 PMCID: PMC7534447 DOI: 10.1534/g3.120.401579
Source DB: PubMed Journal: G3 (Bethesda) ISSN: 2160-1836 Impact factor: 3.154
Figure 1Vespula species are major invasive pests. A–C. Examples of (A) Vespula germanica, (B) Vespula pensylvanica (Jon Sullivan, public domain), and (C) Vespula vulgaris (Sid Mosdell, CC BY 2.0). D. Native and invaded ranges of the sequenced Vespula species. E. Rooted phylogenetic species tree of Vespula species and other sequenced Hymenoptera, with Drosophila melanogaster as an outspecies, derived from 415 orthogroups using OrthoFinder.
Figure 2A. Assembled chromosome lengths of the three Vespula species. Chromosomes were named in order of scaffold length in V. pensylvanica. B. Synteny between selected chromosomes of Vespula species. Chromosome 1 has minor translocations and inversions but is syntenic overall. Chromosome 2 has a larger inversion between V. pensylvanica and V. germanica that is not present in V. vulgaris. Chromosome 5 has a large translocation and inversion between V. pensylvanica and V. germanica. Chromosome lengths are scaled to chromosome 1. C. Shared orthogroups between the three Vespula species. D. The pan and core genomes of Hymenoptera. The core genome (the set of orthogroups present in all genomes sampled) decreases as more genomes are sampled, to a final core genome size of 3,092 orthogroups when all 26 hymenopteran genomes are compared. The pan genome (the total set of orthogroups present in one or more of the sampled genomes) does not continue to grow as more genomes are sampled, indicating a closed hymenopteran pan genome. We predicted more orthogroups (13,141–14,022 orthogroups) in Vespula species than in non-Vespula Hymenoptera (median 9,193 orthogroups), which resulted in a larger pan genome in comparisons that included Vespula species. We only analyzed genes that were assigned to an orthogroup, and we plotted a random subset of 1,000 comparisons for genome numbers that resulted in more than 1000 comparisons.
Comparison of Vespidae and honeybee genome assembles. 1. Apart from our Vespula assemblies, repeat content was counted from lowercase nucleotides in published assemblies
| Species | Sequencing Strategy | Total Sequence | Largest | Scaffolds/ contigs | Ns | Gaps | Repeats1 (%) | |
|---|---|---|---|---|---|---|---|---|
| Illumina + Hi-C | 176,275,134 | 19,426,332 | 35 | 8,304,510 | 4,147,610 | 49,679 | 17.12 | |
| Illumina + Hi-C | 178,312,246 | 19,524,135 | 133 | 8,396,154 | 1,783,864 | 18,963 | 18.89 | |
| Illumina + Hi-C | 179,379,562 | 19,704,315 | 225 | 8,532,720 | 444,951 | 4,987 | 19.15 | |
| Illumina | 211,202,212 | 3,185,661 | 3,836 | 521,566 | 14,106,256 | 15,755 | 41.60 | |
| Illumina | 208,026,220 | 7,126,315 | 1,483 | 1,625,592 | 7,426,626 | 14,286 | 44.50 | |
| PacBio Sequel + Illumina + Dovetail | 219,116,742 | 19,629,704 | 187 | 9,116,088 | 4,436,170 | 1,204 | 41.79 | |
| PacBio Sequel + Illumina | 219,838,961 | 15,979,625 | 216 | 4,634,047 | 1,082,619 | 459 | 44.48 | |
| 10x Genomics | 209,288,276 | 20,305,868 | 5,129 | 5,372,633 | 4,771,923 | 8,816 | 42.35 | |
| Sanger + SOLiD + 454 | 250,287,000 | 29,893,408 | 5,321 | 13,219,345 | 21,165,099 | 12,690 | 5.28 | |
| PacBio + 10x Chromium + BioNano + Hi-C | 225,250,884 | 27,754,200 | 177 | 13,619,445 | 1,313,614 | 51 | 44.63 |
Gene content and orthogroup representation for selected hymenopteran genomes
| Species | Genes | Orthogroups represented | Orthogroups represented (%) | Species-specific orthogroups | Genes in species-specific orthogroups (%) |
|---|---|---|---|---|---|
| 16,751 | 13,141 | 58.9% | 5 | 0.1% | |
| 17,854 | 13,739 | 61.6% | 23 | 0.4% | |
| 19,142 | 14,022 | 62.8% | 11 | 0.1% | |
| 10,518 | 9,086 | 40.7% | 12 | 0.2% | |
| 11,069 | 9,193 | 41.2% | 27 | 0.5% | |
| 14,064 | 8,949 | 40.1% | 35 | 0.6% | |
| 14,647 | 8,901 | 39.9% | 616 | 11.1% |
Figure 3A. Whole-genome alignments of five Polistes genomes and genomes of V. germanica and V. pensylvanica against Chr11 of the V. vulgaris genome, showing the Dnmt3 locus (Vvulg11g01820) and the surrounding 20 kb. None of the Polistes assemblies have regions that are homologous to the Dnmt3 locus, and annotations for P. canadensis and P. dominula do not include a Dnmt3 gene (Standage ; Patalano ; Bewick ). All three Vespula genomes encode an ortholog of Dnmt3. B. Frequency histograms of CpG[o/e] observed in coding sequences of Drosophila melanogaster, Apis mellifera, Polistes canadensis, Polistes dominula, Vespula germanica, Vespula vulgaris and Vespula pensylvanica. The y-axis depicts the number of genes (Frequency) with CpG[o/e] values given on the x-axis. The distribution of CpG[o/e] in Vespid wasps is a trimodal distribution, with a low-CpG[o/e] peak consistent with the presence of historical DNA methylation in all five Vespid species.
Figure 4Ultrametric trees of orthogroups encoding Vespid wasp venom components in insect genomes. A. Venom hyaluronidase phylogram showing duplication of this gene in Vespid genomes but not Polistes. B. Venom phospholipase phylogram indicating independent duplications in Polistes and V. germanica from a single gene ancestor in related Hymenoptera. C. Antigen 5 phylogram, indicating two genes encode this venom component in Vespid wasps and Polisties, one of which is missing from the V. vulgaris genome.