| Literature DB >> 28584080 |
Scott M Geib1, Guang Hong Liang2, Terence D Murphy3, Sheina B Sim4.
Abstract
The braconid wasp Fopius arisanus (Sonan) is an important biological control agent of tropical and subtropical pest fruit flies, including two important global pests, the Mediterranean fruit fly (Ceratitis capitata), and the oriental fruit fly (Bactrocera dorsalis). The goal of this study was to develop foundational genomic resources for this species to provide tools that can be used to answer questions exploring the multitrophic interactions between the host and parasitoid in this important research system. Here, we present a whole genome assembly of F. arisanus, derived from a pool of haploid offspring from a single unmated female. The genome is ∼154 Mb in size, with a N50 contig and scaffold size of 51,867 bp and 0.98 Mb, respectively. Utilizing existing RNA-Seq data for this species, as well as publicly available peptide sequences from related Hymenoptera, a high quality gene annotation set, which includes 10,991 protein coding genes, was generated. Prior to this assembly submission, no RefSeq proteins were present for this species. Parasitic wasps play an important role in a diverse ecosystem as well as a role in biological control of agricultural pests. This whole genome assembly and annotation data represents the first genome-scale assembly for this species or any closely related Opiine, and are publicly available in the National Center for Biotechnology Information Genome and RefSeq databases, providing a much needed genomic resource for this hymenopteran group.Entities:
Keywords: Genome Report; biocontrol; braconid wasp; tephritid fruit fly; whole genome sequencing
Mesh:
Year: 2017 PMID: 28584080 PMCID: PMC5555450 DOI: 10.1534/g3.117.040741
Source DB: PubMed Journal: G3 (Bethesda) ISSN: 2160-1836 Impact factor: 3.154
Figure 1Collinear gene blocks between F. arisanus and N. vitripennis. Scaffolds from the F. arisanus assembly containing collinear orthologous gene blocks which consist of three or more genes in the same order in as the chromosome assembly of the N. vitripennis genome. The assembled chromosomes of N. vitripennis are represented as turquoise bars and the F. arisanus scaffolds are represented by orange bars. The links between the collinear blocks between the F. arisanus and N. vitripennis assemblies are colored by the chromosome in which they are located in the N. vitripennis genome (links to the N. vitripennis chromosome 1 are in orange, links to chromosome 2 are in blue, links to chromosome 3 are in yellow, links to chromosome 4 are in green, and links to chromosome 5 are in red).
Raw reads generated for assembly
| SRA | Library Type | Read Pairs | Base Pairs | Coverage |
|---|---|---|---|---|
| SRX689044 | 180 bp | 106.1 M | 21.2 Gb | 137× |
| SRX689045 | 3 kb | 89.7 M | 17.9 Gb | 116× |
| SRX689047 | 8 kb | 21.0 M | 4.2 Gb | 17.5× |
Assembly summary statistics compared to other parasitoid genomes
| Species | NCBI Bio Project (PR-JNA#) | Contig Count (N50 kb) | Scaffold Count (N50 Mb) | Total Length (Mb) | GC (%) |
|---|---|---|---|---|---|
| 258104 | 8510 (51.90) | 1042 (0.98) | 153.6 | 39.4 | |
| 13660 | 25484 (18.84) | 6169 (0.71) | 295.8 | 40.6 | |
| 306876 | 25534 (44.93) | 3968 (0.65) | 388.8 | 39.1 | |
| 195937 | 27508 (14.12) | 1794 (1.14) | 241.2 | 33.1 | |
| 271135 | 9156 (46.06) | — | 186.1 | 30.6 |
Gene annotation summary statistics
| Feature | Count | Mean Length (bp) | Median Length (bp) | Minimum Length (bp) | Maximum Length (bp) |
|---|---|---|---|---|---|
| Genes | 11,661 | 8569 | 3152 | 71 | 490,550 |
| All transcripts | 20,216 | 2844 | 2143 | 71 | 53,694 |
| mRNA | 18,906 | 2947 | 2228 | 248 | 53,694 |
| misc RNA | 367 | 2687 | 2071 | 174 | 13,135 |
| tRNA | 159 | 74 | 73 | 71 | 84 |
| lncRNA | 784 | 996 | 785 | 106 | 7,102 |
| CDSs | 18,906 | 1964 | 1419 | 105 | 52,947 |
| Exons | 71,080 | 442 | 216 | 2 | 14,501 |
| Introns | 57,960 | 1625 | 214 | 30 | 332,337 |
BUSCO analysis on assembly and annotations
| Species | CDS Count | BUSCO Mode | Complete | Fragmented | Missing |
|---|---|---|---|---|---|
| 18,906 | OGS | 2605 (97) | 37 (1.3) | 33 (1.2) | |
| — | Genome | 2355 (88) | 232 (8.6) | 88 (3.2) | |
| 15,346 | Trans | 1803 (67) | 152 (5.6) | 720 (26) | |
| 24,846 | OGS | 2585 (96) | 40 (1.4) | 50 (1.8) | |
| 19,692 | OGS | 2622 (98) | 31 (1.1) | 22 (0.8) | |
| 18,586 | OGS | 2621 (97) | 34 (1.2) | 20 (0.7) |
Number of BUSCO proteins (percent of total BUSCOs).
From NCBI TSA PRJNA259570.
From NCBI RefSeq v201 annotation release.
From NCBI RefSeq v100 annotation release.
From NCBI RefSeq v101 annotation release.