| Literature DB >> 19204812 |
Francisca C Almeida1, Magdalena Leszczyniecka, Paul B Fisher, Rob Desalle.
Abstract
Details of the genomic changes that occurred in the ancestors of Eukarya, Archaea and Bacteria are elusive. Ancient interdomain horizontal gene transfer (IDHGT) amongst the ancestors of these three domains has been difficult to detect and analyze because of the extreme degree of divergence of genes in these three domains and because most evidence for such events are poorly supported. In addition, many researchers have suggested that the prevalence of IDHGT events early in the evolution of life would most likely obscure the patterns of divergence of major groups of organisms let alone allow the tracking of horizontal transfer at this level. In order to approach this problem, we mined the E. coli genome for genes with distinct paralogs. Using the 1,268 E. coli K-12 genes with 40% or higher similarity level to a paralog elsewhere in the E. coli genome we detected 95 genes found exclusively in Bacteria and Archaea and 86 genes found in Bacteria and Eukarya. These genes form the basis for our analysis of IDHGT. We also applied a newly developed statistical test (the node height test), to examine the robustness of these inferences and to corroborate the phylogenetically identified cases of ancient IDHGT. Our results suggest that ancient inter domain HGT is restricted to special cases, mostly involving symbiosis in eukaryotes and specific adaptations in prokaryotes. Only three genes in the Bacteria + Eukarya class (Deoxyxylulose-5-phosphate synthase (DXPS), fructose 1,6-phosphate aldolase class II protein and glucosamine-6-phosphate deaminase) and three genes-in the Bacteria + Archaea class (ABC-type FE3+-siderophore transport system, ferrous iron transport protein B, and dipeptide transport protein) showed evidence of ancient IDHGT. However, we conclude that robust estimates of IDHGT will be very difficult to obtain due to the methodological limitations and the extreme sequence saturation of the genes suspected of being involved in IDHGT.Entities:
Keywords: blast; eschericia coli; horizontal gene transfer; node height test
Year: 2008 PMID: 19204812 PMCID: PMC2614185
Source DB: PubMed Journal: Evol Bioinform Online ISSN: 1176-9343 Impact factor: 1.625
Figure 1Distributional profile method. A first screening for genes involved in ancient IHGT was done using the distributional profile method. Based on the most accepted hypothesis of phylogenetic relationships between the three domains of life, genes that occur in Eukarya and Bacteria, but not in Archaea, or genes that occur in Archaea and Bacteria, but not in Eukarya were potential candidates to have been horizontally transferred between domains. Genes found in one of those two phyletic distributional categories were further tested for HGT (see text).
Figure 3The node height test. (a) If an Euk-Bac distribution was caused by gene loss, than the E1-B1 (Eukarya to Bacteria) average distance is expected to be higher than the average B1-B1 (Bacteria to Bacteria) distance. (b) However, if eukaryotic genes (E1) were gained by transfer from a particular group of bacteria (B1), than the average E1-B1 distance should not be higher than the average B1-B1 distance. These predictions were done based on the assumption that substitution rates are homogeneous across taxa.
Summary of results for genes with a Bacteria—Archaea exclusive pattern. Results of the relative rate test, rate test using only conserved gene regions, node height test, and support of HGT given by phylogenetic analyses.
| Gene name | Rate test | Rate test cons. | Node height test | Trees |
|---|---|---|---|---|
| 2-octaprenyl-6-methoxyphenol | k.s. | k.s. | - | no |
| ATP-dependent specificity component of clpP serine protease, chaperone | k.s. | k.s. | - | no |
| glucosamine-6-phosphate deaminase | k.s. | k.s. | - | yes |
| GTP-binding elongation factor, may be inner membrane protein | k.s. | k.s. | - | no |
| poly(A) polymerase I | k.s. | k.s. | - | no |
| putative GTP-binding factor | k.s. | k.s. | - | no |
| guanylate kinase | k.s. | - | no | |
| 3-demethylubiquinone-9 3-methyltransferase | k.s. | n.s. | no | |
| biosynthetic arginine decarboxylase | k.s. | n.s. | no | |
| esterase D | k.s. | n.s. | no | |
| glyoxylate-induced protein | k.s. | n.s. | no | |
| heat shock protein hslVU, ATPase subunit, homologous to chaperones | k.s. | n.s. | no | |
| probable protein-tyrosine-phosphatase | k.s. | n.s. | no | |
| pyridoxal kinase 2/pyridoxine kinase | k.s. | n.s. | no | |
| enoyl-[acyl-carrier-protein] reductase (NADH) | k.s. | n.s. | no | |
| glutathionine S-transferase | n.s. | no | ||
| polynucleotide phosphorylase | n.s. | no | ||
| probable 6-phospho-beta-glucosidase | n.s. | no | ||
| 1-deoxyxylulose-5-phosphate synthase | n.s. | n.s. | yes | |
| fructose-bisphosphate aldolase, class II | n.s. | n.s. | yes |
Notes: Significant at α 0.05,
Significant at α 0.01.
Abbreviations: k.s.: substitution saturation; n.s.: non-significant at α 0.05.
Figure 2Maximum parsimony analysis of the gene Glucosamine-6-phosphate (, COG0363). The tree depicts the strict consensus of 6 most parsimonious reconstructions. Bootstrap values above 75 are shown on nodes. Rooting was done with the paralog gene Phosphogluconate. E stands for Eukarya, B for Bacteria, and A for Archaea. The number 1 refers to sequences of the gene under analysis, and the number 2 to sequences of the paralog used for rooting.
Summary of results for genes with a Bacteria—Eukarya exclusive pattern. Results of the relative rate test, rate test using only conserved gene regions, node height test, and support of HGT given by phylogenetic analyses.
| Gene name | Rate test | Rate test cons. | Node height test | Phylogenetic analysis |
|---|---|---|---|---|
| ABC-type FE3+-siderophore transport system, permease component | k.s. | ks. | - | yes |
| ATPase of high-affinity potassium transport system, B chain | k.s. | k.s. | - | no |
| excision nuclease subunit A | k.s. | k.s. | - | no |
| ferrous iron transport protein B | k.s. | k.s. | - | yes |
| glucose-1-phosphate thymidylyltransferase | k.s. | k.s. | - | yes |
| phosphoheptose isomerase | k.s. | k.s. | - | no |
| pleiotrophic effects on 3 hydrogenase isozymes (HypD) | k.s. | k.s. | - | no |
| protein secretion, membrane protein | k.s. | k.s. | - | no |
| suppresses inhibitory activity of CsrA | k.s. | k.s. | - | no |
| thiamin-monophosphate kinase | k.s. | k.s. | - | no |
| transcriptional regulatory protein (HypAF) | k.s. | k.s. | - | no |
| transport of potassium | k.s. | k.s. | - | no |
| FKBP-type peptidyl-prolyl cis-trans isomerase | k.s. | k.s. | - | no |
| plays structural role in maturation of all 3 hydrogenases (HypE) | k.s. | k.s. | - | no |
| site-specific recombinase, acts on cer sequence of ColE1, effects chromosome segregation at cell division | k.s. | - | no | |
| 4-aminobutyrate aminotransferase | k.s. | n.s. | no | |
| dipeptide transport protein | k.s. | n.s. | n.s | yes |
| formate dehydrogenase, cytochrome B556 (FDO) subunit | k.s. | n.s. | yes | |
| glucose-1-phosphate uridylyltransferase | k.s. | n.s. | no | |
| high-affinity phosphate-specific transport system | k.s. | n.s. | no | |
| part of maltose permease, inner membrane | k.s. | n.s. | no | |
| phosphotransacetylase | k.s. | n.s. | no | |
| regulator for asnA, asnC and gidA | k.s. | n.s. | no | |
| spermidine/putrescine transport system permease | k.s. | n.s. | no | |
| UDP-N-acetyl-D-mannosaminuronic acid dehydrogenase | k.s. | n.s. | no | |
| probable adenine deaminase (synthesis xanthine) | k.s. | n.s. | n.s. | yes |
Notes: significant at α 0.05
significant at α 0.01.
Abbreviations: k.s.: substitution saturation; n.s.: non-significant at α 0.05.