| Literature DB >> 34450656 |
Kirsten I Verster1, Rebecca L Tarnopol2, Saron M Akalu1, Noah K Whiteman1,3.
Abstract
A growing body of evidence has underscored the role of horizontal gene transfer (HGT) in animal evolution. Previously, we discovered the horizontal transfer of the gene encoding the eukaryotic genotoxin cytolethal distending toxin B (cdtB) from the pea aphid Acyrthosiphon pisum secondary endosymbiont (APSE) phages to drosophilid and aphid nuclear genomes. Here, we report cdtB in the nuclear genome of the gall-forming "swede midge" Contarinia nasturtii (Diptera: Cecidomyiidae) via HGT. We searched all available gall midge genome sequences for evidence of APSE-to-insect HGT events and found five toxin genes (aip56, cdtB, lysozyme, rhs, and sltxB) transferred horizontally to cecidomyiid nuclear genomes. Surprisingly, phylogenetic analyses of HGT candidates indicated APSE phages were often not the ancestral donor lineage of the toxin gene to cecidomyiids. We used a phylogenetic signal statistic to test a transfer-by-proximity hypothesis for animal HGT, which suggested that microbe-to-insect HGT was more likely between taxa that share environments than those from different environments. Many of the toxins we found in midge genomes target eukaryotic cells, and catalytic residues important for toxin function are conserved in insect copies. This class of horizontally transferred, eukaryotic cell-targeting genes is potentially important in insect adaptation.Entities:
Keywords: Diptera; cdtB; horizontal gene transfer; lysozyme; shiga toxin; toxins
Mesh:
Year: 2021 PMID: 34450656 PMCID: PMC8455502 DOI: 10.1093/gbe/evab202
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
Final List of HGT Candidate Genes from Sequenced Cecidomyiid Nuclear Genomes, Including Information About Criteria used to Distinguish HGT from Bacterial Contamination
| Species | Protein Name | Scaffold Size | Eukaryotic Genes on Scaffold | PCR Linking Gene with | PCR of Gene | Transcr. | Introns |
|---|---|---|---|---|---|---|---|
|
| AIP56 | 10,587,749 | Yes (annotated) | Yes | Yes | Yes | 1 |
| 661,609 | Yes (annotated) | Yes | Yes | Yes | 1 | ||
| CdtB | 10,587,749 | Yes (annotated) | Yes | Yes | Yes | 2 | |
| Lysozyme | 3904986 | Yes (annotated) | Yes | Yes | Yes | 1 | |
| 3,904,986 | Yes (annotated) | Yes | Yes | Yes | 0 | ||
| 3,904,986 | Yes (annotated) | Yes | Yes | Yes | 1 | ||
| SltxB | 6,229,930 | Yes (annotated) | Yes | Yes | Yes | 2 | |
| 6,229,930 | Yes (annotated) | Yes | Yes | Yes | 2 | ||
|
| Hypothetical protein | 4,914,483 | Yes | NA | NA | No | NA |
| RHS | 4,914,483 | Yes | NA | NA | No | 3 | |
| SltxB | 1,200,421 | Yes | NA | NA | No | NA | |
| 1,200,421 | Yes | NA | NA | No | NA | ||
| 1,407,356 | Yes | NA | NA | Yes | NA | ||
| 1,407,356 | Yes | NA | NA | No | NA | ||
| 1,407,356 | Yes | NA | NA | No | NA | ||
| 1,407,356 | Yes | NA | NA | No | NA | ||
| 5,150,188 | Yes | NA | NA | No | NA | ||
|
| AIP56 | 3,779,354 | Yes (annotated) | Yes | Yes | No | NA |
| Lysozyme | 586,442 | Yes (annotated) | Yes | Yes | Yes | 0 | |
| RHS | 360,288 | Yes (annotated) | Yes | Yes | Yes | 2 |
Note.—N/A indicates that there was no sample, nearby proximal gene, or predicted gene. For further details, see supplementary file S1, Supplementary Material online.
Fig. 1.Maximum-likelihood Cecidomyiidae species phylogeny shows the approximate history of each HGT event. Filled boxes indicate presence of the toxin, and the numbers indicate copy number of the gene on bona fide eukaryotic scaffolds if >1 (see supplementary file S1, Supplementary Material online for additional copies on scaffolds <10 kb). Bootstrap values are reported out of n = 1,000 bootstraps, and scale bar is substitutions per site. Tick marks on the phylogeny indicate approximate timing of the HGT event based on a parsimony approach incorporating presence/absence of the HGT candidate, individual gene phylogenies, and synteny data. Dashed ticks indicate HGT events for which synteny data were inconclusive.
Fig. 2.Phylogenies of horizontally transferred genes show they are nested within diverse possible donor clades, including viruses, Proteobacteria, and fungi. Indicated co-associated species suggest physical proximity that could facilitate HGT. Black cartoon organisms indicate orthologs encoded in host nuclear genomes, while grey cartoon organisms indicate orthologs encoded by taxa co-associated with the cartoon hosts. Cecidomyiid species are highlighted yellow. Possible contaminants are indicated with an asterisk. Scale bars indicate substitutions per site. For full phylogenies, see supplementary figure S1, Supplementary Material online.
δ Values for Gene Phylogenies Demonstrate That There Is a Relationship between Ecological Niche and Horizontal Gene Transfer
| Real Phylogeny | HGT-Only Phylogeny | |||||||
|---|---|---|---|---|---|---|---|---|
| Tips |
| Shuffled | Tips |
| Shuffled | |||
| AIP56 | 90 | 7.41 |
std. dev. = 0.218 | <0.01* | 52 | 4.197 |
std. dev. = 0.727 | <0.01* |
| CdtB | 76 | 7.12 |
std. dev. = 0.274 | <0.01* | 27 | 5.674 |
std. dev. = 0.819 | <0.01* |
| Lysozyme | 172 | 7.581 |
std. dev. = 0.320 | <0.01* | 117 | 3.640 |
std. dev. = 0.297 | <0.01* |
| RHS | 188 | 8.37 |
std. dev. = 0.134 | <0.01* | 76 | 3.215 |
std. dev. = 0.421 | <0.01* |
| SltxB | 23 | 2.72 |
std. dev. = 0.508 | 0.01* | 8 | 4.500 |
std. dev. = 2.401 | 0.11 |
Note.—δ values for both complete trees and trees for which we removed vertical descendance (“HGT-Only”) are shown. P-value is calculated as the number of simulations (n = 100) in which the shuffled δ is equal to or higher than the realized δ, with an asterisk (*) indicating statistical significance (P<0.05). The mean and standard deviation of the shuffled δ values are also shown.