| Literature DB >> 31289835 |
Stephan Holger Drukewitz1,2, Lukas Bokelmann3, Eivind A B Undheim4,5, Björn M von Reumont2,6,7.
Abstract
BACKGROUND: Venoms and the toxins they contain represent molecular adaptations that have evolved on numerous occasions throughout the animal kingdom. However, the processes that shape venom protein evolution are poorly understood because of the scarcity of whole-genome data available for comparative analyses of venomous species.Entities:
Keywords: comparative venom-genomics; gene duplication; orphan genes; single gene co-option; toxin gene evolution; venom evolution
Mesh:
Substances:
Year: 2019 PMID: 31289835 PMCID: PMC6615979 DOI: 10.1093/gigascience/giz081
Source DB: PubMed Journal: Gigascience ISSN: 2047-217X Impact factor: 6.524
Fig. 1:The 3D reconstructed venom delivery system of female and male Dasypogon diadema. The general anatomy of D. diadema is similar between both sexes and to the structures described for Eutolmus rufibarbis.A pair of elongated sac-like glands located in the first and second thoracic segments (right and left glands coloured red and orange, respectively) open separately into ducts (coloured green), which fuse just before entering the head capsule and continue to the tip of the proboscis. Compared with the glands of E. rufibarbis, the glands of D. diadema are more elongated, featuring a larger volume and sub-compartmentalization. The labial glands (coloured blue) are located in the middle part of the proboscis and open into the lumen between theca and the labium at the tip of the proboscis.
Fig. 2:Relative expression of putative toxin families in Dasypogon diadema (male and female), compared to Eutolmus rufibarbis and Machimus arthriticus. The expression levels of protein families secreted in the venom glands are given in percent. Only sequences with matches from proteomics and a threshold >1 transcripts per million (TPM) are included. Protein classes with an expression value <1% of the depicted TPM are summarized in the category “others.”
Fig. 4:The evolutionary pattern and the origin of the top 30 putative toxins. The node numbering refers to the nodes in Fig. 3a. Putative toxins present in Dasypogon diadema but missing in Eutolmus rufibarbis or Machimus arthriticus are coloured red. Single-copy genes: putative toxins with only 1 copy on the protein-coding genome of D. diadema; multi-copy genes*: protein-coding genes that belong to orthogroups assembled of ≥2 protein-coding genes in D. diadema. Only 1 member of the orthogroup is present in the venom; multi-copy genes**: protein-coding genes that belong to orthogroups assembled of ≥2 protein-coding genes in D. diadema. Two or more members of the same orthogroup are present in the venom.
Overview of all analysed genomes and their gene completeness
| Order | Species | No. of analysed CDSs | BUSCO completeness (%) | |
|---|---|---|---|---|
| Lepidoptera |
| 14,623 | 84.5 |
|
|
| 15,128 | 94.8 |
| |
|
| 19,032 | 89.9 |
| |
|
| 17,158 | 95.5 |
| |
|
| 14,916 | 98.6 |
| |
|
| 10,519 | 90.1 |
| |
|
| 22,410 | 86.7 |
| |
| Diptera |
| 15,480 | 91.1 |
|
|
| 10,942 | 96.7 |
| |
|
| 19,429 | 99.4 |
| |
|
| 30,429 | 99.7 |
| |
|
| 24,119 | 99.2 |
| |
|
| 16,570 | 68.9 |
| |
|
| 14,452 | 91.7 |
|
To infer the quality of the annotation, a BUSCO analysis was performed using the transcriptome mode and the holometabolous dataset.
*Genome was sequenced and annotated for this study.
**Genome from Dikow et al. [16] was reannotated.
***Protein dataset from ENSEMBL. The order of the species in this table matches the species order in the cladogram in Fig. 3a.
Fig. 3:(a) Phylogenetic relationships of the included taxa. Dasypogon diadema was used as the focal species for the analyses of the orthogroups. Boxes on the split show the number of orthogroups shared by D. diadema and the respective clade of the split (upper number: number of shared orthogroups; middle number: number of orthogroups with putative toxins; lower number: number of orthogroups associated with the 30 predominant putative toxins). (b) Heat map showing the expression level (TPM) in the 3 tissues of the putative toxins of both sexes. The white numbers in the black circle refer to the affiliated orthogroups and splits in 3a (Vg-♂: venom gland male; Vg-♀: venom gland female; Pb-♂: proboscis male; Pb-♀: proboscis female; Bt-♂: body tissue male; Bt-♀: body tissue female). (c) Summarized expression level (TPM) of the putative toxin transcripts in the venom gland of both sexes. The white numbers in the black circle refer to the affiliated orthogroups and splits in 3a (number of putative toxins for all nodes: Node 1: 130; Node 2: 3; Node 3: 0; Node 4: 5; Node 5: 18; Node 6: 1; *no orthogroup: 4).
Overview of DNA libraries generated for the Dasypogon diadema genome assembly
| Library name | Fragment length (nt) | No. of sequenced read pairs | Theoretical genome coverage (fold) |
|---|---|---|---|
| D1130 | 200 | 9,119,970 | 5 |
| D1131 | 400 | 167,137,385 | 89 |
Number of read pairs and fragment size of the libraries used for the genome assembly are shown. The theoretical genome coverage was calculated with a genome size estimate of 450 Mb and a read length of 120 nt (nucleotides) after processing.