| Literature DB >> 29776407 |
Amelia R I Lindsey1,2, Yogeshwar D Kelkar3, Xin Wu4, Dan Sun4, Ellen O Martinson3,5, Zhichao Yan3,6, Paul F Rugman-Jones7, Daniel S T Hughes8, Shwetha C Murali8, Jiaxin Qu8, Shannon Dugan8, Sandra L Lee8, Hsu Chao8, Huyen Dinh8, Yi Han8, Harsha Vardhan Doddapaneni8, Kim C Worley8, Donna M Muzny8, Gongyin Ye6, Richard A Gibbs8, Stephen Richards8, Soojin V Yi4, Richard Stouthamer9, John H Werren10.
Abstract
BACKGROUND: Trichogrammatids are minute parasitoid wasps that develop within other insect eggs. They are less than half a millimeter long, smaller than some protozoans. The Trichogrammatidae are one of the earliest branching families of Chalcidoidea: a diverse superfamily of approximately half a million species of parasitoid wasps, proposed to have evolved from a miniaturized ancestor. Trichogramma are frequently used in agriculture, released as biological control agents against major moth and butterfly pests. Additionally, Trichogramma are well known for their symbiotic bacteria that induce asexual reproduction in infected females. Knowledge of the genome sequence of Trichogramma is a major step towards further understanding its biology and potential applications in pest control.Entities:
Keywords: Biological control; Chalcidoidea; Comparative genomics; Methylation; Miniaturization; Parthenogenesis; Symbiosis; Wolbachia
Mesh:
Year: 2018 PMID: 29776407 PMCID: PMC5960102 DOI: 10.1186/s12915-018-0520-9
Source DB: PubMed Journal: BMC Biol ISSN: 1741-7007 Impact factor: 7.431
Genome assembly statistics for an asexual line of Trichogramma pretiosum
| Statistic |
|
|---|---|
| Scaffolds | 357 |
| Total length of scaffolds | 195,087,592 |
| Total ungapped length | 180,028,424 |
| Scaffold N50 | 3,706,225 |
| Contigs | 7879 |
| Contig N50 | 78,655 |
| Predicted genes | 12,928 |
| BUSCO scorea | C:91.8%[D:11.6%],F:3.5%,M:4.5%,n:2675 |
aBenchmarking Universal Single-Copy Orthologs (BUSCO) score in standard BUSCO notation (C complete, D duplicated, F fragmented, M missing, n number of genes used)
Fig. 1Hymenopteran phylogeny. Taxa in blue were used in further comparative analyses. Trichogramma pretiosum is in boldface. Bootstrap values are shown at each node. Major clades of interest are shown in boxes. Apocrita is a suborder that is distinguished from the earlier branching symphytans (sawflies) by their narrow “wasp-waist.” The ancestral state of Apocrita is thought to be that of parasitism, and the narrow waist an adaptation to that lifestyle. Within chalcids the majority of genomic data available are for species in the Pteromalidae, which is a large polyphyletic group of parasitoids
Species used in comparative analyses
| Species | Common name | Family | Superfamily | Assembly | Body Size |
|---|---|---|---|---|---|
|
| European honey bee | Apidae | Apoidea | AADG00000000 | ~ 10 mm [ |
|
| Sawfly | Tenthredinidae | Tenthredinoidea | AOFN00000000 | ~ 7 mm [ |
|
| Fig wasp | Agaonidae | Chalcidoidea | ATAC00000000 | ~ 3 mm [ |
|
| Polyembryonic wasp | Encyrtidae | Chalcidoidea | JBOX00000000 | ~ 1 mm [ |
|
| Braconid parasitoid wasp | Braconidae | Ichneumonoidea | AZMT00000000 | ~ 4 mm [ |
|
| Jewel wasp | Pteromalidae | Chalcidoidea | AAZX00000000 | ~ 2 mm [ |
|
| Parasitic wood wasp | Orussidae | Orussoidea | AZGP00000000 | ~ 11 mm [ |
|
| Trichogrammatidae | Chalcidoidea | JARR00000000 | ~ 0.3 mm [ |
Fig. 2Trichogramma pretiosum comparative genomics. a Trimmed phylogeny representing relationships of species used in genomic comparisons. b For each species, we determined the number of genes in the single-copy core clusters, variable-copy number core clusters, dispensable clusters (non-core, non-species specific), species-specific clusters, and singleton categories, as delineated by OrthoMCL. c Numbers of gene family clusters that have significantly expanded or contracted at branches across the phylogeny. Lowercase letters above pairs of bars refer to branches leading to internal nodes of the tree and corresponding points on the phylogeny in red. d Proportion of Trichogramma pretiosum lineage-specific genes (delimited by OrthoMCL) recovered in other hymenopteran genomes, using blastp and a range of e-value and length thresholds. e Female Trichogramma pretiosum from the asexual Insectary line. Her body is approximately three tenths of a millimeter long
Total numbers of species-specific genes and missing genes for each genome
| Genome | Species-specific genesa,c | Missing gene familiesb,c |
|---|---|---|
|
| 2933 | 267 |
|
| 1148 | 139 |
|
| 2932 | 172 |
|
| 4085 | 169 |
|
| 4203 (1090)d | 403 (48)d |
|
| 2704 | 125 |
|
| 1225 | 110 |
|
| 11,809 | 51 |
aSpecies-specific genes include singletons and genes within species-specific clusters
bNumbers of gene family clusters for which the species in question is the only species to not have at least one representative gene for that family
cDesignation of “missing” and “specific” categories is as determined by OrthoMCL
dCorrected counts of species-specific and missing genes following manual curation in Trichogramma pretiosum (see section on “Species-specific and missing genes in Trichogramma pretiosum”)
Numbers of chalcid proteins with 1st–4th longest branch lengths
| Placea |
|
|
|
|
|---|---|---|---|---|
| 1st | 580 | 353 | 341 | 37 |
| 2nd | 344 | 509 | 339 | 123 |
| 3rd | 263 | 335 | 398 | 315 |
| 4th | 124 | 114 | 233 | 836 |
a1st place means a protein had the longest branch length within an orthologous group, whereas 4th place is the shortest branch length. Placements are based on branch length from the chalcid ancestor to each chalcid species for the 1311 core, single-copy hymenopteran proteins
Numbers of hymenopteran proteins with 1st–7th longest branch lengths
| Placea |
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|
| 1st | 490 | 290 | 274 | 33 | 162 | 54 | 8 |
| 2nd | 311 | 402 | 266 | 78 | 160 | 66 | 29 |
| 3rd | 231 | 318 | 313 | 187 | 174 | 64 | 24 |
| 4th | 148 | 181 | 294 | 381 | 189 | 84 | 39 |
| 5th | 87 | 81 | 123 | 364 | 391 | 171 | 88 |
| 6th | 34 | 30 | 39 | 207 | 179 | 538 | 284 |
| 7th | 10 | 9 | 2 | 61 | 56 | 334 | 839 |
a1st place means a protein had the longest branch length within an orthologous group, whereas 7th place is the shortest branch length. Placements are based on branch length from the hymenopteran ancestor to each species for the 1311 core, single-copy hymenopteran proteins. Athalia rosae was excluded due to its position as the sister to the other Hymenoptera in our analyses
Fig. 3Evolution of core hymenopteran proteins. Branch lengths of proteins from Trichogramma to the chalcid ancestor, relative to the branch length from the chalcid ancestor to the hymenopteran ancestor highlighting (a) proteins with the longest branch leading to Trichogramma (TPRE) as compared to other chalcids (1st place) and (b) 1st place proteins with the 100 longest branch lengths after normalization. In each panel, blue dots represent proteins that are contributing to the overrepresentation of GO terms within either the 1st place (a) or longest 100 of the 1st place sets of proteins (b). The differences in within-genome rankings of the chalcids compared to ranks of proteins based on their base distance (from the chalcid ancestor to the hymenopteran ancestor) (c), Trichogramma within-genome rankings compared to the base rank (d), and Trichogramma within-genome rankings compared to the median within-genome ranking of the other three chalcids (e). In c, d, and e, higher values on the x-axis indicate a longer branch length relative to other proteins. Positive values on the y-axis indicate an increase in rank for the clade in question, indicating a relatively longer branch length. Red dots indicate the top 10% of proteins, per bin, with the largest positive discrepancy in within-genome rank. Blue dots indicate the bottom 10% of proteins, per bin, with the largest negative discrepancy in within-genome rank. f The overlaps of the “fast” proteins (red dots) in panels c, d, and e
Fig. 4Re-sequencing of a sexual Trichogramma pretiosum. a Location of variants identified in the sexual Trichogramma pretiosum genome with respect to the asexual reference. b GO terms identified as diverging in the re-sequencing analyses, and the overlap with GO terms overrepresented in the singleton and rapid protein evolution categories
Fig. 5Putative DNA methyltransferases in Trichogramma pretiosum. Bootstrap values > 70 are shown
Fig. 6Evidence for methylation in Trichogramma pretiosum. a Coding sequence (CDS) CpG and GpC observed/expected (O/E) densities. b Fractional CpG methylation of Trichogramma pretiosum genes (log transformed) plotted against gene CDS CpG O/E. Blue regression line fit for genes with > 0.01 fractional methylation, r = − 0.49; red regression line fit for genes with < 0.01 fractional methylation, r = 0.051. c Average fractional methylation in the first and last four exons and introns of methylated genes and unmethylated genes
Fig. 7Conservation of methylation across hymenopteran species. a Venn diagram depicting methylated orthologs between T. pretiosum, N. vitripennis, and A. mellifera. b, c T. pretiosum lineage-specific methylation gain and loss compared to (b) N. vitripennis and (c) A. mellifera orthologs
Methylation status of gene body methylation across 3180 single-copy orthologs
| Methylation status | Number of orthologsa | Interpretation | ||
|---|---|---|---|---|
|
|
|
| ||
| Methylated | Methylated | Methylated | 1897 | Conserved methylation status |
| Methylated | Unmethylated | Unmethylated | 49 | |
| Unmethylated | Methylated | Methylated | 182 | |
| Unmethylated | Unmethylated | Unmethylated | 669 | Conserved methylation status |
aThe 383 genes not included in the table are those which have Apis mellifera- or Nasonia vitripennis-specific gains or losses
Fig. 8Average Apis mellifera, Nasonia vitripennis, and Trichogramma pretiosum exon and intron lengths. For a all genes, b 3180 single-copy orthologs