| Literature DB >> 25483352 |
Robert P Hirt1, Cecilia Alsmark2, T Martin Embley3.
Abstract
Our knowledge of the extent and functional impact of lateral gene transfer (LGT) from prokaryotes to eukaryotes, outside of endosymbiosis, is still rather limited. Here we review the recent literature, focusing mainly on microbial parasites, indicating that LGT from diverse prokaryotes has played a significant role in the evolution of a number of lineages, and by extension throughout eukaryotic evolution. As might be expected, taxonomic biases for donor prokaryotes indicate that shared habitat is a major factor driving transfers. The LGTs identified predominantly affect enzymes from metabolic pathways, but over a third of LGT are genes for putative proteins of unknown function. Finally, we discuss the difficulties in analysing LGT among eukaryotes and suggest that high-throughput methodologies integrating different approaches are needed to achieve a more global understanding of the importance of LGT in eukaryotic evolution.Entities:
Mesh:
Substances:
Year: 2014 PMID: 25483352 PMCID: PMC4728198 DOI: 10.1016/j.mib.2014.11.018
Source DB: PubMed Journal: Curr Opin Microbiol ISSN: 1369-5274 Impact factor: 7.934
Variation of reported cases of LGT between species in a given study or between different studies for a given species for a selection of microbial parasites.
| Species name | Higher rank taxonomy | Total LGT count (%Proteome) | P → E LGT | E → E LGT | Other LGT | Methodology | Reference |
|---|---|---|---|---|---|---|---|
| Amoebozoa (Archamoebae) | 199 (2.1% – 9090?) | 197 | NR | 2 (virus) | Blast & Phylogeny | [ | |
| Amoebozoa (Archamoebae) | 63 (0.68% – 9090) | 51 | 12 | NR | Blast & Phylogeny | [ | |
| Amoebozoa (Archamoebae) | 195 (1.90% – 10,262?) | 194 | NR | 1 (virus) | Blast & Phylogeny | [ | |
| Excavata (Metamonada) | 149 (0.24% – 59,681) | 134 | 15 | NR | Blast & Phylogeny | [ | |
| Excavata (Metamonada) | 21 (0.36% – 6394) | 15 | 6 | NR | Blast & Phylogeny | [ | |
| Excavata (Discoba) | 68 (0.96% – 7111) | 63 | 5 | NR | Blast & Phylogeny | [ | |
| Excavata (Discoba) | 46 (0.47% – 9750) | 45 | 1 | NR | Blast & Phylogeny | [ | |
| SAR (Alveolata) | 19 (0.36% – 5258) | 18 | 1 | NR | Blast & Phylogeny | [ | |
| Opisthokonta (Nucletmycea) | 3 (0.16 – 1918) | 1 | 2 | NR | Blast & Phylogeny | [ |
According to [15]. The two highest taxonomic ranks are indicated. SAR stands for the Stramenopiles, Alveolata and Rhizaria group.
Values in brackets represent the fraction of LGT in % of the number of annotated protein coding genes, total is indicated after the dash. A question mark indicates the ambiguity about the exact dataset analysed as different annotations exist for a given genome.
Candidate prokaryote to eukaryote LGTs. The great majority of candidates LGTs are from Bacteria.
Candidate Eukaryote to Eukaryote LGTs.
Additional sources of LGT investigated.
Different criteria (BlastP and phylogenies) were used to select candidate LGT.
Same dataset analysed in different publications — only two recent publications for one species were considered here. See [12] for additional examples.
NR: none reported.
Figure 1Candidate LGT among parasitic microbial eukaryotes. (a) Taxonomy of donor lineages for candidate LGTs. Comparison of the prokaryotic lineages inferred to be donating genes to the extracellular mucosal parasites Entamoeba histolytica, Trichomonas vaginalis, and Giardia lamblia compared with the inferred donor lineages for the insect-transmitted blood parasites Trypanosoma brucei, T. cruzi, Plasmodium falciparum, P. vivax and P. yoelii yoelii (top panel). Comparison of the prokaryotic lineages inferred to be donating genes to the parasite E. histolytica and its free-living amoebozoan relative D. discoideum (bottom panel). ‘Other bacteria’ comprise the Actinobacteria, Aquificae, Fusobacteria, Plantomycetes, Spirochaetes, or Tenericutes. Fisher's exact test was performed to test the null hypothesis that the taxonomy of the donors is distributed equally between the compared taxa. The P-values for the tests are indicated; they both reject the null hypothesis. The numbers of LGTs considered for each set of taxa are indicated between brackets. (b) Assessment of gains and losses of lateral gene transfer (LGTs) during parasite speciation. Maximum parsimony was used to map candidate LGTs on the species trees for taxa among (a) Trypanosomatidae and (b) Apicomplexa. Gains and losses are indicated as green and orange bars respectively. Characters were analysed using Dollo parsimony, so each character is allowed to have only a single gain, but may have multiple losses. It is inferred that 45 LGTs occurred (over 75 genes affected by LGT) before the divergence of the three parasitic Trypanosomatidae lineages. Interestingly, we detected 26 of the same LGTs in the genome of the free-living kinetoplastid Bodo saltans [45] using Blast similarity scores, suggesting these transfers may predate the transition to parasitism. Figures in panel (a) and (b) are derived from [12]. (c) The mapping of annotated genes (red and blue genes indicate the differential orientation of the inferred open reading frames) on the scaffold DS113827 from the genome sequence data of T. vaginalis strain G3 [46]. A 32 kbp fragment (orange bar) was shown to be highly similar to the Firmicutes Peptoniphilus harei and encode 27 annotated genes. A matching gene cluster was found in all four additional investigated strains of the parasite [24]. Entries labelled with RG in their locus tags correspond to highly repetitive gene families, which are known to litter the genome of T. vaginalis [46]. The figure in panel c was generated using TrichDB [47].
Transcription level of selected candidate LGTs of bacterial origins in Trichomonas vaginalis.
| Locus tag | Annotation | Mean transcription | Median transcription | Standard deviation |
|---|---|---|---|---|
| A recent LGT from a Firmicutes — [ | ||||
| TVAG_243570 to TVAG_243830 | Various — bacterial genomic segment with 27 annotated genes | 6.7 ( | 3.3 ( | 9.0 ( |
| Bacterial LGT encoding candidate glycan degradation enzymes [ | ||||
| TVAG_010780 | beta-N-acetylhexosaminidase — EC 3.2.1.52 | 585.6 | 607.2 | 326.1 |
| TVAG_044970 | N-acetylneuraminate lyase — EC 4.1.3.3 | 471.3 | 461.0 | 157.1 |
| TVAG_123020 | alpha-mannosidase — EC 3.2.1.24 | 0.0 | 0.0 | 0.0 |
| TVAG_371840 | alpha-mannosidase — EC 3.2.1.24 | 796.8 | 787.7 | 514.5 |
| TVAG_270790 | Acylglucosamine 2-epimerase — EC 5.1.3.8 | 1343.8 | 1400.7 | 465.1 |
| TVAG_365600 | beta-galactosidase — EC 3.2.1.23 | 626.6 | 620.9 | 251.1 |
| TVAG_443530 | alpha-fucosidase — EC 3.2.1.51 | 37.0 | 29.8 | 32.7 |
| TVAG_483760 | Beta-mannosidase — EC 3.2.1.25 | 178.3 | 132.7 | 167.7 |
| TVAG_499550 | Exo-alpha-sialidse — EC 3.2.1.18 | 687.5 | 528.7 | 483.3 |
| TVAG_593180 | Glucosylceramidase — EC 3.2.1.45 | 0.0 | 0.0 | 0.0 |
| Overall expression level [ | ||||
| All 59681 annotated genes | NA | 254.1 | 0.15 | 2662.6 |
| All 33157 expressed genes >0 reads | NA | 457.3 | 26.2 | 3559.2 |
| All 20304 expressed genes ≥10 reads | NA | 745.5 | 90.55 | 4524.8 |
Transcriptomics data are from Gould et al. (2013) [30] with shown values being the mean normalised 3′end reads from the 11 distinct growth conditions investigated.
Probable pseudogene: TVAG_123020 and TVAG_123030 are both annotated as alpha-mannosidase and a BlastX using the genomic sequence overlapping both coding sequences (locus DS113221, 216.7 kbp: 205,844–209,198) recovers one likely full-length T. vaginalis protein (TVAG_371840) and bacterial protein hits with two distinct reading frames (+2 and +3).
Likely functional homologue to TVAG_123020 (55%ID to TVAG_123020). The result of a BlastP search is consistent with TVAG_123020 representing a bacterial LGT.
Encoded by a small scaffold, hence it could represent a contaminant — locus DS145301, 1043 bp.
Figure 2Evidence for transcription of LGT in Trichomonas vaginalis. The level of transcription of different gene sets was contrasted through histograms with 11 bins reflecting no read (bin 1, zero mean 3′-normalised reads across 11 conditions in [28]) to the highest level of transcription (bin 11, >100,000 to 300,000 mean reads). The inset in the top panel shows all bins with different levels of transcription expressed as the mean of 3-‘normalised reads across 11 tested growth conditions. The top panel illustrates the variation in transcription level between all annotated protein coding genes from the T. vaginalis G3 genome sequence data [46]. The middle panel contrasts protein-coding genes annotated as ‘hypotheticals’ (blue bars, with no BlastP hits in databases) versus ‘hypothetical conserved’ (green bars, with BlastP hits in databases). The bottom panel indicates the level of transcription of all LGT cases identified in [12]. The LGT genes are notably skewed towards the right hand side of the histogram (higher level of transcription) compared to ‘hypotheticals’. This suggests that the majority of LGTs are likely to be functionally integrated into the biology of the parasite whereas the great majority of ‘hypotheticals’ are not and might represent pseudogenes or miss-annotations of spurious genes.