| Literature DB >> 28894459 |
Hyeonsoo Jeong1, Arshan Nasir2,3.
Abstract
Genome-wide global detection of genes involved in horizontal gene transfer (HGT) remains an active area of research in medical microbiology and evolutionary genomics. Utilizing the explicit evolutionary method of comparing topologies of a total of 154,805 orthologous gene trees against corresponding 16S rRNA "reference" trees, we previously detected a total of 660,894 candidate HGT events in 2,472 completely-sequenced prokaryotic genomes. Here, we report an HGT-index for each individual gene-reference tree pair reconciliation, representing the total number of detected HGT events on the gene tree divided by the total number of genomes (taxa) member of that tree. HGT-index is thus a simple measure indicating the sensitivity of prokaryotic genes to participate (or not participate) in HGT. Our preliminary list provides HGT-indices for a total of 69,365 genes (detected in >10 and <50% available prokaryotic genomes) that are involved in a wide range of biological processes such as metabolism, information, and bacterial response to environment. Identification of horizontally-derived genes is important to combat antibiotic resistance and is a step forward toward reconstructions of improved phylogenies describing the history of life. Our effort is thus expected to benefit ongoing research in the fields of clinical microbiology and evolutionary biology.Entities:
Keywords: bacterial evolution; horizontal gene transfer; phylogenetics; tree reconciliation; tree reconstruction
Year: 2017 PMID: 28894459 PMCID: PMC5581361 DOI: 10.3389/fgene.2017.00112
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
Pfam Ids and descriptions for top-20 HGT-genes with the highest calculated HGT-index.
| PF01165 | Ribosomal protein S21 | 191 | 153 | 0.801 | Proteobacteria |
| PF00589, PF13356 | Phage integrase family, Domain of unknown function (DUF4102) | 28 | 20 | 0.714 | Proteobacteria |
| PF02371, PF01548 | Transposase IS116/IS110/IS902 family, Transposase | 28 | 20 | 0.714 | Proteobacteria |
| PF00137 | ATP synthase subunit C | 187 | 127 | 0.679 | Proteobacteria |
| PF00126, PF03466 | Bacterial regulatory helix-turn-helix protein, lysR family, LysR substrate binding domain | 63 | 42 | 0.667 | Proteobacteria |
| PF04655 | Aminoglycoside/hydroxyurea antibiotic resistance kinase | 32 | 21 | 0.656 | Actinobacteria, Proteobacteria |
| PF04432, PF12838 | Coenzyme F420 hydrogenase/dehydrogenase, beta subunit C terminus, 4Fe-4S dicluster domain | 29 | 19 | 0.655 | Bacteroidetes, Euryarchaeota, Firmicutes, Proteobacteria, Spirochaetes |
| PF00584 | SecE/Sec61-gamma subunits of protein translocation complex | 17 | 11 | 0.647 | Proteobacteria |
| PF04397, PF00072 | LytTr DNA-binding domain, Response regulator receiver domain | 14 | 9 | 0.643 | Bacteroidetes |
| PF09035 | Excisionase from transposon Tn916 | 28 | 18 | 0.643 | Firmicutes |
| PF13175 | AAA ATPase domain | 50 | 32 | 0.640 | Actinobacteria, Bacteroidetes, Euryarchaeota, Firmicutes, Fusobacteria, Planctomycetes, Proteobacteria |
| PF01322 | Cytochrome C' | 11 | 7 | 0.636 | Proteobacteria |
| PF12645 | Helix-turn-helix domain | 22 | 14 | 0.636 | Firmicutes |
| PF09278, PF00376 | MerR, DNA binding, MerR family regulatory protein | 52 | 33 | 0.635 | Nitrospirae, Proteobacteria |
| PF11990 | Protein of unknown function (DUF3487) | 38 | 24 | 0.632 | Proteobacteria |
| PF01872 | RibD C-terminal domain | 43 | 27 | 0.628 | Actinobacteria, Chloroflexi, Proteobacteria |
| PF11190 | Protein of unknown function (DUF2976) | 40 | 25 | 0.625 | Proteobacteria |
| PF03795 | YCII-related domain | 64 | 40 | 0.625 | Actinobacteria |
| PF07508, PF13408, PF00239 | Recombinase, Recombinase zinc beta ribbon domain, Resolvase, N terminal domain | 80 | 50 | 0.625 | Actinobacteria, Chloroflexi, Euryarchaeota, Firmicutes, Spirochaetes, Tenericutes, Thermotogae |
| PF10502 | Signal peptidase, peptidase S26 | 34 | 21 | 0.618 | Proteobacteria |
Multiple protein family ids and descriptions are separated by comma. The top-20 list does not include genes for which no Pfam assignment was detected. See Dataset .
Pfam Ids and descriptions are given for top-twenty HGT-genes with the highest number of genomes.
| PF01136 | Peptidase family U32 | 941 | 178 | 0.189 | Actinobacteria, Aquificae, Chlorobi, Deferribacteres, Elusimicrobia, Fibrobacteres, Firmicutes, Fusobacteria, Proteobacteria, Spirochaetes, Tenericutes, Thermotogae |
| PF00416 | Ribosomal protein S13/S18 | 915 | 245 | 0.268 | Actinobacteria, Chloroflexi, Deinococcus-Thermus, Dictyoglomi, Firmicutes Gemmatimonadetes, Nitrospirae, Proteobacteria, Spirochaetes, Synergistetes, Tenericutes, Thermobaculum |
| PF01256, PF03853 | Carbohydrate kinase, YjeF-related protein N-terminus | 825 | 210 | 0.255 | Acidobacteria, Actinobacteria, Aquificae, Bacteroidetes, Caldiserica, Chlamydiae, Chloroflexi, Chrysiogenetes, Crenarchaeota, Deferribacteres, Dictyoglomi, Euryarchaeota Firmicutes, Fusobacteria, Ignavibacteriae, Korarchaeota, Nitrospirae, Proteobacteria, Spirochaetes, Thermobaculum., Thermodesulfobacteria, Thermotogae, candidate division NC10 |
| PF01396, PF01131, PF01751 | Topoisomerase DNA binding C4 zinc finger, DNA topoisomerase, Toprim domain | 823 | 128 | 0.156 | Acidobacteria, Aquificae, Chlamydiae, Chlorobi, Chloroflexi, Deinococcus-Thermus, Elusimicrobia, Firmicutes, Fusobacteria, Nitrospirae, Planctomycetes, Proteobacteria, Tenericutes, Thermobaculum., Thermotogae, candidate division WWE3 |
| PF13344, PF13242 | Haloacid dehalogenase-like hydrolase, HAD-hyrolase-like | 808 | 170 | 0.210 | Actinobacteria, Bacteroidetes, Chloroflexi, Deinococcus-Thermus, Dictyoglomi, Euryarchaeota, Firmicutes, Fusobacteria, Planctomycetes, Proteobacteria, Spirochaetes, Thermobaculum |
| PF00815 | Histidinol dehydrogenase | 802 | 188 | 0.234 | Actinobacteria, Aquificae, Bacteroidetes, Chlorobi, Chrysiogenetes, Cyanobacteria, Deferribacteres, Firmicutes, Fusobacteria, Nitrospirae, Proteobacteria, Spirochaetes, Thermodesulfobacteria, Thermotogae, candidate division NC10 |
| PF00290 | Tryptophan synthase alpha chain | 781 | 199 | 0.255 | Acidobacteria, Actinobacteria, Bacteroidetes, Chloroflexi, Cyanobacteria, Elusimicrobia, Euryarchaeota, Firmicutes, Gemmatimonadetes, Nitrospirae, Planctomycetes, Proteobacteria, Spirochaetes, Synergistetes, Thaumarchaeota, Thermobaculum., Thermodesulfobacteria, Thermotogae, Verrucomicrobia, candidate division NC10 |
| PF07991, PF01450 | Acetohydroxy acid isomeroreductase, catalytic domain, Acetohydroxy acid isomeroreductase, catalytic domain | 779 | 231 | 0.297 | Acidobacteria, Actinobacteria, Aquificae, Chloroflexi, Chrysiogenetes, Cyanobacteria, Deferribacteres, Deinococcus-Thermus, Dictyoglomi, Firmicutes, Nitrospirae, Planctomycetes, Proteobacteria, Spirochaetes, Thermobaculum, Thermodesulfobacteria, Thermotogae, candidate division NC10 |
| PF00334 | Nucleoside diphosphate kinase | 772 | 160 | 0.207 | Chlamydiae, Proteobacteria |
| PF00478, PF00571 | IMP dehydrogenase/GMP reductase domain, CBS domain | 762 | 156 | 0.205 | Acidobacteria, Actinobacteria, Aquificae, Bacteroidetes, Chlorobi, Chloroflexi, Cloacimonetes, Deinococcus-Thermus, Elusimicrobia, Firmicutes, Fusobacteria, Ignavibacteriae, Nitrospirae, Planctomycetes, Proteobacteria, Synergistetes, Tenericutes, Thermobaculum., Thermotogae |
| PF02696 | Uncharacterized ACR, YdiU/UPF0061 family | 761 | 200 | 0.263 | Acidobacteria, Actinobacteria, Bacteroidetes, Chrysiogenetes, Deinococcus-Thermus, Euryarchaeota, Firmicutes, Gemmatimonadetes, Nitrospirae, Planctomycetes, Proteobacteria, Spirochaetes, Verrucomicrobia |
| PF01817, PF00800 | Chorismate mutase type II, Prephenate dehydratase | 755 | 167 | 0.221 | Actinobacteria, Aquificae, Bacteroidetes, Chloroflexi, Chrysiogenetes, Crenarchaeota, Deferribacteres, Deinococcus-Thermus, Dictyoglomi, Elusimicrobia, Firmicutes, Fusobacteria, Nitrospirae, Planctomycetes, Proteobacteria, Spirochaetes, Synergistetes, Thermodesulfobacteria, Verrucomicrobia, candidate division NC10 |
| PF00316 | Fructose-1-6-bisphosphatase | 750 | 204 | 0.272 | Acidobacteria, Actinobacteria, Aquificae, Bacteroidetes, Chlorobi, Chloroflexi, Chrysiogenetes, Cyanobacteria, Deferribacteres, Firmicutes, Gemmatimonadetes, Ignavibacteriae, Nitrospirae, Planctomycetes, Proteobacteria, Spirochaetes, Verrucomicrobia, candidate division NC10 |
| PF02421, PF07664, PF07670 | Ferrous iron transport protein B, Ferrous iron transport protein B C terminus, Nucleoside recognition | 747 | 166 | 0.222 | Acidobacteria, Actinobacteria, Aquificae, Bacteroidetes, Chlorobi, Cloacimonetes, Cyanobacteria, Deinococcus-Thermus, Euryarchaeota, Firmicutes, Fusobacteria, Gemmatimonadetes, Ignavibacteriae, Planctomycetes, Proteobacteria, Spirochaetes, Thermodesulfobacteria, Verrucomicrobia |
| PF01070 | FMN-dependent dehydrogenase | 747 | 152 | 0.203 | Actinobacteria, Chlamydiae, Chlorobi, Chloroflexi, Crenarchaeota, Cyanobacteria, Deinococcus-Thermus, Euryarchaeota, Firmicutes, Fusobacteria, Ignavibacteriae, Korarchaeota, Proteobacteria, Spirochaetes, Tenericutes, Thaumarchaeota, Thermobaculum. |
| PF03462, PF00472 | PCRF domain, RF-1 domain | 733 | 151 | 0.206 | Acidobacteria, Actinobacteria, Candidatus Saccharibacteria, Candidatus Saccharibacteria., Chlamydiae, Chloroflexi, Chrysiogenetes, Deferribacteres, Dictyoglomi, Elusimicrobia, Firmicutes, Fusobacteria, Gemmatimonadetes, Nitrospirae, Planctomycetes, Proteobacteria, Spirochaetes, Synergistetes, Tenericutes, Thermobaculum., Thermodesulfobacteria, Verrucomicrobia |
| PF01676, PF06415 | Metalloenzyme superfamily, BPG-independent PGAM N-terminus (iPGM_N) | 731 | 134 | 0.183 | Bacteroidetes, Chloroflexi, Euryarchaeota, Firmicutes, Fusobacteria, Proteobacteria, Thermobaculum. |
| PF00132 | Bacterial transferase hexapeptide (six repeats) | 728 | 188 | 0.258 | Acidobacteria, Actinobacteria, Aquificae, Chloroflexi, Chrysiogenetes, Cyanobacteria, Deferribacteres, Dictyoglomi, Euryarchaeota, Firmicutes, Nitrospirae, Proteobacteria, Spirochaetes, Synergistetes, candidate division NC10 |
| PF03144, PF00009 | Elongation factor Tu domain 2, Elongation factor Tu GTP binding domain | 722 | 159 | 0.220 | Bacteroidetes, Chlorobi, Chloroflexi, Cyanobacteria, Deferribacteres, Deinococcus-Thermus, Fibrobacteres, Firmicutes, Gemmatimonadetes, Nitrospirae, Planctomycetes, Proteobacteria, Spirochaetes, Verrucomicrobia |
| PF02233 | NAD(P) transhydrogenase beta subunit | 720 | 189 | 0.263 | Actinobacteria, Cyanobacteria, Firmicutes, Proteobacteria |
Multiple protein family ids and descriptions are separated by comma.