| Literature DB >> 32792343 |
Kim B Ferguson1, Tore Kursch-Metz2,3, Eveline C Verhulst4, Bart A Pannebakker5.
Abstract
Trichogramma brassicae (Bezdenko) are egg parasitoids that are used throughout the world as biological control agents and in laboratories as model species. Despite this ubiquity, few genetic resources exist beyond COI, ITS2, and RAPD markers. Aided by a Wolbachia infection, a wild-caught strain from Germany was reared for low heterozygosity and sequenced in a hybrid de novo strategy, after which several assembling strategies were evaluated. The best assembly, derived from a DBG2OLC-based pipeline, yielded a genome of 235 Mbp made up of 1,572 contigs with an N50 of 556,663 bp. Following a rigorous ab initio-, homology-, and evidence-based annotation, 16,905 genes were annotated and functionally described. As an example of the utility of the genome, a simple ortholog cluster analysis was performed with sister species T. pretiosum, revealing over 6000 shared clusters and under 400 clusters unique to each species. The genome and transcriptome presented here provides an essential resource for comparative genomics of the commercially relevant genus Trichogramma, but also for research into molecular evolution, ecology, and breeding of T. brassicae.Entities:
Keywords: Hymenoptera; Wolbachia; biocontrol agent; parasitoid
Mesh:
Substances:
Year: 2020 PMID: 32792343 PMCID: PMC7534424 DOI: 10.1534/g3.120.401344
Source DB: PubMed Journal: G3 (Bethesda) ISSN: 2160-1836 Impact factor: 3.154
Statistics for five assemblies of Trichogramma brassicae. The first strategy was PacBio-only in Canu, while three hybrid assembly strategies were based on SPAdes and modulating k-mer sizes, and an additional hybrid assembly was based on an adapted DBG2OLC+Racon+Pilon protocol. BUSCO score is based on the insect_db09 dataset (Simão )
| Assembler | Version | Size (bp) | Contigs | Longest contig (bp) | N50 (bp) | BUSCO (Complete %) |
|---|---|---|---|---|---|---|
| Canu | v1.0 | 69,522,446 | 3,007 | 126,800 | 27,303 | 18.7 |
| SPAdes (k = 21, 33, 55) | v2.1 | 227,096,967 | 282,988 | 474,998 | 36,870 | 96.8 |
| SPAdes (k = 21, 33, 55, 77) | v2.2 | 226,864,253 | 189,696 | 548,753 | 49,096 | 97.1 |
| SPAdes (k = 127) | v2.3 | 211,402,326 | 73,567 | 537,817 | 63,558 | 96.4 |
| DBG2OLC+ Racon+Pilon | v3.0 | 235,413,774 | 1,572 | 2,953,580 | 556,663 | 95.5 |
Heterozygosity and repetitive content analysis of Trichogramma brassicae (thelytokous), Trichogramma pretiosum (thelytokous), and T. pretiosum (arrhenotokous) lines based on sequence data
| Heterozygosity (%) | Repetitive content (%) | Source of sequence data | |
|---|---|---|---|
| 0.0332 | 0.608 | This publication | |
| 0.0289 | 0.482 | ||
| 0.863 | 2.64 |
Output of OrthoVenn2 ortholog cluster analysis of Trichogramma brassicae and Trichogramma pretiosum.
| Species | Proteins | Clusters | Singletons | Source of gene set |
|---|---|---|---|---|
| 16,905 | 6,562 | 5,268 | This work (S1.5) | |
| 13,200 | 6,507 | 5,177 |
Figure 1Ortholog cluster analysis between Trichogramma brassicae and Trichogramma pretiosum using OrthoVenn2 (Xu ). The number of clusters shared between the two organisms is in bold, with the number of proteins within each cluster grouping underneath in parentheses.