Literature DB >> 28426805

Evidence of horizontal gene transfer by transposase gene analyses in Fervidobacterium species.

Alba Cuecas1, Wirojne Kanoksilapatham2, Juan M Gonzalez1.   

Abstract

Horizontal Gene Transfer (HGT) plays an important role in the physiology and evolution of microorganisms above all thermophilic prokaryotes. Some members of the Phylum Thermotogae (i.e., Thermotoga spp.) have been reported to present genomes constituted by a mosaic of genes from a variety of origins. This study presents a novel approach to search on the potential plasticity of Fervidobacterium genomes using putative transposase-encoding genes as the target of analysis. Transposases are key proteins involved in genomic DNA rearrangements. A comprehensive comparative analysis, including phylogeny, non-metric multidimensional scaling analysis of tetranucleotide frequencies, repetitive flanking sequences and divergence estimates, was performed on the transposase genes detected in four Fervidobacterium genomes: F. nodosum, F. pennivorans, F. islandicum and a new isolate (Fervidobacterium sp. FC2004). Transposase sequences were classified in different groups by their degree of similarity. The different methods used in this study pointed that over half of the transposase genes represented putative HGT events with closest relative sequences within the phylum Firmicutes, being Caldicellulosiruptor the genus showing highest gene sequence proximity. These results confirmed a direct evolutionary relationship through HGT between specific Fervidobacterium species and thermophilic Firmicutes leading to potential gene sequence and functionality sharing to thrive under similar environmental conditions. Transposase-encoding genes represent suitable targets to approach the plasticity and potential mosaicism of bacterial genomes.

Entities:  

Mesh:

Substances:

Year:  2017        PMID: 28426805      PMCID: PMC5398504          DOI: 10.1371/journal.pone.0173961

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

The microbial world presents an astonishing high diversity [1, 2]. Natural habitats contain complex microbial communities where different microorganisms interact with the environment representing a driving force to get adapted to the available conditions. The microbial cells live in communities and they continuously interact with others both within and between species [3]. This interactive scenario generates opportunities to exchange functional capabilities and accelerate the pace of evolutionary adaptation. Horizontal gene transfer (HGT) represents the exchange of DNA between species and, this is even more interesting when it occurs among scarcely related cells, for example, between different phyla. HGT events occur frequently in the prokaryotic world and they are thought to have enormous relevance in the evolution of prokaryotes [4, 5, 6]. However, the mechanisms governing these events are poorly understood. Phylogenetic studies are providing valuable information on potential DNA mobility events during microbial evolution. In this respect, the involvement of mobile genetic elements is a major factor involved in the genomic evolution of prokaryotes [6, 7]. The activities of different mobile genetic elements, both within a single genome and across genomes of different microorganisms, have serious impacts on genome structure and function. For instance, mobile genetic elements are able to generate transpositions of DNA fragments and as a result gene inactivation/activation and gene deletions and insertions [8]. Among the mobile elements, the insertion sequences (IS) are considered the simplest autonomous transposition elements. These ISs are constituted by a transposase gene flanked up- and down-stream by repetitive flanking sequences. Consequently, transposase genes are expected to show genome rearrangements more frequently than other genes [9]. The mechanism of action of transposases consists of replicative and non-replicative (conservative) modes that facilitate movement of the transposition elements to a different location. As a consequence, gene disruption caused by the scission of an IS and its insertion into the genome are usually traced back to understand these evolutionary events [10]. The repetitive flanking sequences, or inverted repeat (IR) sequences, are targets of the transposases and are generally included in the transposed DNA. The involvement of transposases in the mobility of DNA within and likely between genomes [6, 7] suggests that these genes can be used as a proxy to better understand the potential of HGT on bacterial evolution and genome plasticity. Extreme conditions, such as high temperatures, represent environments fostering rapid adaptation mechanisms. HGT has proven to represent a major mechanism allowing the adaptation of microorganisms to these environments [11, 12, 13]. For instance, the phylum Thermotogae, mostly represented by thermophiles, has been reported to present a high frequency of HGT events, such as some DNA fragments putatively coming from members of the Phylum Firmicutes and the Domain Archaea. This has been pointed out in the genome of Thermotoga maritima [6, 14], genomes of Thermotoga and Thermosipho species and Fervidobacterium nodosum [13, 15]. In this study, we analyzed transposase genes in Fervidobacterium genomes as potential targets to infer the putative occurrence of HGT events in this genus and their liaison to distantly and closely related taxa.

Materials and methods

Genomes, genes and phylogenies

The genomes of F. nodosum Rt17-B1 (NC_009718) [13, 16], F. pennivorans DSM9070 (NC_017095) [17], F. islandicum AW-1 (NZ_CP014334) [18, 19] and a new isolate [20], Fervidobacterium thailandense strain FC2004 (LWAF01000000) were used in this analysis. Annotated transposases in these four genomes and homology searching (using blast algorithms) [21] resulted in the detection of a number of transposases. They were classified based on their similarity to transposase families as proposed by Siguier et al. [8] at the ISFinder database (http://www-is.biotoul.fr). Blastp was used to determine the closest relatives to the transposase genes detected in Fervidobacterium genomes. Besides the closest relative sequences detected in GenBank, the closest sequences to those transposase families within the Phylum Thermotogae were also incorporated into the analyses as a comparative threshold for the detection of HGT events between different phyla. Sequence alignments were performed by ClustalW [22]. Phylogenetic trees based on the amino acid sequences encoded by the detected genes were constructed using MEGA [23] by the Neighbor-joining method with a bootstrap value of 1000.

Multivariate analyses of tetranucleotide frequencies

Non-metric MultiDimensional Scaling (NMDS) analyses were performed to obtain graphical distributions of the transposase gene sequences corresponding to each transposase family detected in the Fervidobacterium genomes and their related genes. The frequencies of tetranucleotides [24] were used in NMDS analyses. NMDS plots were constructed using R with the Vegan Package [25].

Repetitive flanking sequences

The conservation of insertion sequence endings were studied by searching through visual inspection the sequence alignments corresponding to related transposase genes plus additional 1000 nucleotides up- and down-stream. When present, the detection of the inverted repeats or palindromic sequences (depending on the transposase family) was guided by the information available at the ISFinder database. The percentage of identity between aligned, conserved endings and the distance to the annotated start or stop codons were noted.

K-L divergence estimates

The Kullback-Leiber (K-L) divergence metric [26, 27] was used as an approach to estimate the divergence between related transposase genes clustered in a transposase family. K-L divergence (DKL) was calculated on tetranucleotide frequencies for pairwise comparisons as where g is the parameter used (e.g., frequency of a particular tetranucleotide, i) for the analyzed gene and G the same parameter for a reference transposase gene or whole genome.

Results

Classification of transposases in Fervidobacterium genomes

The number of putative transposase genes detected in the Fervidobacterium (Phylum Thermotogae) genomes ranged between 39 and 48 per genome (Table 1). These transposase genes were classified in six IS families (IS6, IS110, IS200/IS605, ISL3, IS3 and IS4) plus four undefined groups (types A, B, C and D) that were unrelated to transposase families listed in ISFinder [28]. In Table 1, ISCpe7-like transposases (IS6 family) were identified as the most frequent IS element family in the genomes of Fervidobacterium: F. nodosum Rt17-B1 (23 copies), F. pennivorans DSM 9078 (24 copies), Fervidobacterium sp. FC2004 (12 copies) and F. islandicum AW-1 (34 copies). Based on their degree of similarity and phylogenetic relationships, the IS110 (IS110_I–IS110_III), IS200 (IS200_I-IS200_III), IS605 (IS605_I–IS605_III) and IS3 (IS3_I—IS3_III) were further classified into three subgroups. Evidences of HGT Fervidobacterium genomes that might have had occurred across phyla (i.e., other than Thermotoga) were characterized. The results revealed that ISCpe7, IS110, IS605_I, IS605_II, ISL3, IS3_II, IS3_III and the types B, C, and D might be actively involved in the HGT events rendered by other species belonging to other phyla. The IS605_I, ISL3, type C and type D were chosen as four representative models which were described below.
Table 1

Classification of insertion sequences detected in four Fervidobacterium genomes.

IS FamilyTransposase groupsF. nodosum Rt17-B1F. pennivorans DSM 9078Fervidobacterium sp. FC 2004F. islandicum AW-1HGT1
IS6ISCpe7-like23241234y
IS110IS116/IS110/IS902-like
IS110_I6y
IS110_II211n
IS110_III16n
IS200/IS605IS200-like
IS200_I1n
IS200_II3n
IS200_III2n
IS605 OrfB-like
IS605_I1y
IS605_II16y
IS605_III12n
ISL3IS204/IS1001/IS1096/IS1165-like8y
IS3IS3/IS911-like
IS3_I2n
IS3_II2y
IS3_III1y
IS4IS4-like6n
Type AType A2n
Type BType B4y
Type CType C31y
Type DType D5y
Total transposases43413943

1 Putative acquisition of these sequences through interphylum HGT. Yes, y; No, n.

1 Putative acquisition of these sequences through interphylum HGT. Yes, y; No, n.

IS605_I transposases

Phylogenetic analysis using transposase sequences revealed a copy of IS605_I in F. nodosum which shares the highest similarity to a clade formed by Caldicellulosiruptor (Phylum Firmicutes; amino acid similarity 77%; Fig 1). Substantial transposase amino acid sequence similarities of 60–65% and 51% were observed in bacteria belonging to the phyla Aquificae, Synergistetes and Deinococcus-Thermus and a member of the Domain Archaea (Ferroplasma acidarmanus), respectively. In contrast, the closest Thermotogae representatives (Thermotoga spp.) for this transposase group formed a different cluster (Fig 1) showing much lower similarity (32–43%). Thermotogae and Firmicutes are too different phyla represented in phylogenetic dendrograms as independent low-branching bifurcations [29]. This suggests that the potential origin of these transposases in F. nodosum might have been the Phylum Firmicutes rather than the closest relatives in the Phylum Thermotogae.
Fig 1

Phylogenetic relationship among Fervidobacterium nodosum and related gene sequences of the IS605_I transposase family and their closest relatives within the Phylum Thermotogae.

The percentage of similarity related to the F. nodosum amino acidic sequence is shown on the right column.

Phylogenetic relationship among Fervidobacterium nodosum and related gene sequences of the IS605_I transposase family and their closest relatives within the Phylum Thermotogae.

The percentage of similarity related to the F. nodosum amino acidic sequence is shown on the right column. NMDS analysis of tetranucleotide frequencies also showed a clustering of the IS605_I transposase from F. nodosum (1381_Fn) in the vicinity of the closest relatives (Fig 2) mainly from the genus Caldicellulosiruptor (Phylum Firmicutes). The results imply that the IS605 from F. nodosum is the closest relative to this type of transposition elements from Caldicellulosiruptor.
Fig 2

Non-metric MultiDimensional Scaling (NMDS) analysis of tetranucleotide frequencies for F. nodosum and related gene sequences of the IS_605_I transposase family.

Identification of transposases follows the IDs used in the dendrogram for these sequences (Fig 1). Dashed black, F. Nodosum; dashed grey, diverse phyla related to the transposase in F. Nodosum; continuous grey, other Thermotogae.

Non-metric MultiDimensional Scaling (NMDS) analysis of tetranucleotide frequencies for F. nodosum and related gene sequences of the IS_605_I transposase family.

Identification of transposases follows the IDs used in the dendrogram for these sequences (Fig 1). Dashed black, F. Nodosum; dashed grey, diverse phyla related to the transposase in F. Nodosum; continuous grey, other Thermotogae. Analysis of palindromic IR sequences flanking the transposase gene sequences revealed that the left and the right IR sequences of F. nodosum show remarkably high nucleotide identity, 67–70% and 71–75%, to sequences from species of the genus Caldicellulosiruptor (Phylum Firmicutes), and to other Firmicutes, 55 and 60% to Thermacetogenium phaeum and 73 and 78% to Ammonifex degensii. Lower similarities to the left and right IR sequences from members belonging to the Phylum Aquificae (Hydrogenivirga, 42% and 57%, and Desulfurobacterium, 44–46% and 76–80%) were observed. In contrast, lowest similarities to the left and right IRs were obtained from members of the Phylum Thermotogae (42–43% and 55–60%). Besides, these IR sequences were highly conserved at the genus level and the high identity observed between these related sequences (i.e., Fervidobacterium and Caldicellulosiruptor) suggested interphyla relationships (Table 2).
Table 2

Repetitive flanking sequences detected in F. nodosum and related sequences within the IS605_I transposase family.

The palindromic inverted repeats upstream (left end) and down-stream (right end) and their percentage of identity with respect to F. nodosum sequences are presented. Identification numbers correspond to those used in the figures for this transposase group.

IDAccession No.OrganismLeft endDistance1Identity1Distance1Right endIdentity1
1381_FnYP_001410911Fervidobacterium nodosum RT17-B1TTACG–TAAATCTACCT–GAAAGGACATGTACTT–CAAGGTATTGGGCTGAAC600AGCA–GGAAGCCAAT–CCCTTA–A–AAGGGG––TAGGAGGAGGTCAA
T12YP_003841390Caldicellulosiruptor obsidiansis OB47....AAA.G...AG..A–.....C....CCC...T..G..G....T.A....T9467.322.––––....T..T..C.....C––––.....AT.G............72.7
T15YP_001180031Caldicellulosiruptor saccharolyticus DSM 8903....AAAG....AG..A– .....C....C.C...T–.G..G....T.A....T7867.385.CATG....T..T..C.....C––––.....AT.G............71.1
T13YP_004022970Caldicellulosiruptor kronotskiensis 2002....AGAG....AG..A–.....C....CCC...–..G..G....T.A....T9468.622..A––....T..T..C.....C––––.....AT.G............75.0
T14YP_002574271Caldecellulosiruptor bescii DSM 6725....A–AG....AG..A–.....C....TCC...–..G..G....T.A....T9370.022..A––....T..T..C.....C––.....AT.G............75.0
T20YP_003239632Ammonifex degensii KC4.....AG.GGC.GC.GG–.....CC..––CT...TT..A.G––....A....A5755.71..G––.......C.C..T..C––––.A...––.G...........C73.1
T19YP_006919201Thermacetogenium phaeum DSM 12270.....A.CCCC.GC.GG–.....CC..––CC...T.....G––....A....A6759.61..A––.......C.C.....T––––.....––.G...........C78.0
T17WP_008287510Hydrogenivirga sp. 128-5-R1-1.....AA...A–....A–––G.ATG..CCCGT..TA..ACG––G.T.A....T7642.3-21....–......TCGGCTT...T––––.AA.C––CGA.....T....C57.1
T18DYP_004281281Desulfurobacterium thermolithotrophum DSM 11699....AG......–CAGC–..G.AA..TCTGCT..T...A––GCA...A.A..T5546.1-26....–.......T.C.A..–––.–...T..––........T....C80.4
T18CYP_004281398Desulfurobacterium thermolithotrophum DSM 11699....AGA...C.GTTGC–..G.AAG.CCTGCT..TT..A––GCA...A....T5244.2-20....–.......CC..A...T––––..T.A––GG......T....C70.7
T18AYP_004281722Desulfurobacterium thermolithotrophum DSM 11699....AGA...A..TTGC–..G.AAG.CCTGCT..TT..A––GCA...A.A..T5144.20....–.......C.C.A...T––––..T..––.G......T....C75.6
T18BYP_004281103Desulfurobacterium thermolithotrophum DSM 11699....AGA...–..TTGC–..G.AAG.CCTGCT..TT..A––GCA...A.A..T5144.2-20....–.......T.C.A...T––––..T..––........T....C78.0
T6YP_007977397Thermotoga maritima MSB8...TCGA.T.AAAC.GCAA.G.A....CCATT..CT.TAAGG.G...A....T12743.30...G–.........GA...CT.T...TCA––.G.T..––T...C60.4
T7ANP_228850Thermotoga maritima MSB8...TCGA.T.AAAC.GCAA.G.A....CCATT..CT.TAAGG.G...A....T12743.30...G–.........GA... CT.T...TCA––.G.T..––T...C60.4
T8AYP_001739778Thermotoga sp. RQ2...TCGA.T.AAAC.GCAA.G.A....CCATT..CT.TAAGG.G...A....T12743.30...G–.........GA...CT.T...TCA––.G.T..––T...C60.4
T8BYP_001739648Thermotoga sp. RQ2...TCGA.C.AAGT–GCAA.G.A....CCATT..CT.TAAGG.G...A....T12741.50...G–....A...CAT...CTGT...GT..–––G.T..––T...C54.5
T9AYP_001245146Thermotoga petrophila RKU-1...TCGA.C.AAGT–GCAA.G.A....CCATT..CT.TAAGG.G...A....T12741.50...G–........CAT...CTGT...GT..–––G.T..––T...C56.8

1 Distance (bp) to the transposase gene start (left) or stop (right) codons. Palindromic sequences are underlined. Identity as percentage (%).

Repetitive flanking sequences detected in F. nodosum and related sequences within the IS605_I transposase family.

The palindromic inverted repeats upstream (left end) and down-stream (right end) and their percentage of identity with respect to F. nodosum sequences are presented. Identification numbers correspond to those used in the figures for this transposase group. 1 Distance (bp) to the transposase gene start (left) or stop (right) codons. Palindromic sequences are underlined. Identity as percentage (%). The K-L divergence estimator (Fig 3) also confirmed the proximity of F. nodosum and Caldicellulosiruptor transposases within IS605_I subgroup. An increase in divergence estimations was in agreement to phylogenetic distance (Fig 1). The other related transposase genes from other phyla showed slightly increasing divergence (Fig 3). The results from different methods confirmed a close relationship between F. nodosum IS605_I transposase and members of the Firmicutes and other phyla rather that to the closest Thermotogae sequences.
Fig 3

Ranking plot of K-L Divergence estimations for F. nododum and related gene sequences within the IS605_I transposase family.

Squares represent pairwise divergence estimates between transposase genes; the line indicates the divergence estimates for transposase to whole genome comparisons. Identification of transposases follows the IDs used in the dendrogram for these sequences (Fig 1).

Ranking plot of K-L Divergence estimations for F. nododum and related gene sequences within the IS605_I transposase family.

Squares represent pairwise divergence estimates between transposase genes; the line indicates the divergence estimates for transposase to whole genome comparisons. Identification of transposases follows the IDs used in the dendrogram for these sequences (Fig 1).

ISL3 transposases

Eight copies of the transposase genes detected in F. nodosum genome correspond to the ISL3 family (Table 1). The phylogenetic analysis showed that these sequences clustered together with transposases present in Caldicellulosiruptor (Firmicutes; 91–93% amino acid similarity) (Fig 4). Other Firmicutes, such as members of the genera Caldanaerobacter and Thermoanaerobacter also presented related sequences (73–75% amino acid similarity). Transposase genes from other Firmicutes genera, such as Clostridium, Caloramator and Thermobrachium, showed a bit lower similarity (66–71%). The closest sequences within the Phylum Thermotogae represented divergent clusters and much lower similarity to F. nodosum (37–38%) than the previous Firmicutes genera (Fig 4).
Fig 4

Phylogenetic relationship among Fervidobacterium nodosum and related gene sequences of the ISL3 transposase family and their closest relatives within the Phylum Thermotogae.

The percentage of similarity related to the F. nodosum amino acidic sequence is shown on the right column.

Phylogenetic relationship among Fervidobacterium nodosum and related gene sequences of the ISL3 transposase family and their closest relatives within the Phylum Thermotogae.

The percentage of similarity related to the F. nodosum amino acidic sequence is shown on the right column. The clustering organization observed in Fig 4 was in agreement to their distributions on NMDS analysis of tetranucleotide frequencies (Fig 5). Thus, the major lineages differentiated on the phylogenetic dendrogram were also differentiated in the NMDS plot (Fig 5). F. nodosum and Caldicellulosiruptor sequences represented a single cluster, clearly differentiated to the other related sequences with decreasing similarity.
Fig 5

Non-metric MultiDimensional Scaling (NMDS) analysis of tetranucleotide frequencies for F. nodosum and related gene sequences of the ISL3 transposase family.

Identification of transposases follows the IDs used in the dendrogram for these sequences (Fig 4). Dashed black, F. Nodosum; dashed grey, Caldicellulosiruptor (Firmicutes); continuous grey, other Thermotogae.

Non-metric MultiDimensional Scaling (NMDS) analysis of tetranucleotide frequencies for F. nodosum and related gene sequences of the ISL3 transposase family.

Identification of transposases follows the IDs used in the dendrogram for these sequences (Fig 4). Dashed black, F. Nodosum; dashed grey, Caldicellulosiruptor (Firmicutes); continuous grey, other Thermotogae. The IR sequences surrounding the ISL3 transposase genes in F. nodosum were practically identical among them (Table 3). Only minor differences (89–92% identity) were observed with their corresponding sequences in Caldicellulosiruptor (Table 3). Other Firmicutes ISL3 transposase genes showed close but slightly more distant IR sequences (75–86% identity) which paralleled to their increasing phylogenetic distance (Fig 4). The closest Thermotogae ISL3 transposase sequences were distant from those in F. nodosum with identities below 35% which also corresponded to the divergent phylogeny shown for this group (Fig 4).
Table 3

Repetitive flanking sequences detected in F. nodosum and related sequences within the ISL3 transposase family.

The inverted repeats (IR) upstream (left end) and down-stream (right end) and their percentage of identity with respect to F. nodosum sequences are presented. Identification numbers correspond to those used in the figures for this transposase group.

IDAccession No.OrganismLeft end1Distance2Identity2Distance2Right endIdentity2
243_fnYP_001409773Fervidobacterium nodosum RT17-B1GGCTCTAT–AA–––T–AAATA–––TTGGGGTGGGTA–AA––AAG–GA–––––AAAAA–––––––TATAGCACTTATTCTGAATGCC65129GGCATTCAGAA–––––––––TA––––––––––––––––––––––––––––––CCCACCCCA–C––TACTTGA–––––CAAAGAGCC
343_FnYP_001409873Fervidobacterium nodosum RT17-B1........–..–––.–.....–––............–..––...–..–––––.....–––––––......................65100.0129...........–––––––––..––––––––––––––––––––––––––––––.........–.––.......–––––.........100.0
999_FnYP_001410529Fervidobacterium nodosum RT17-B1........–..–––.–.....–––............–..––...–..–––––.....–––––––......................65100.0129...........–––––––––..––––––––––––––––––––––––––––––.........–.––.......–––––.........100.0
1160_FnYP_001410690Fervidobacterium nodosum RT17-B1........–..–––.–.....–––............–..––...–..–––––.....–––––––......................65100.0129...........–––––––––..––––––––––––––––––––––––––––––.........–.––.......–––––.........100.0
1322_FnYP_001410852Fervidobacterium nodosum RT17-B1........–..–––.–.....–––............–..––...–..–––––.....––––––––......................65100.0129...........–––––––––..––––––––––––––––––––––––––––––.........–.––.......–––––.........100.0
1405_FnYP_001410935Fervidobacterium nodosum RT17-B1........–..–––.–.....–––........T...–..––...–..––––––.....–––––––......................6598.3129...........–––––––––..––––––––––––––––––––––––––––––.........–.––.......–––––.........100.0
1408_FnYP_001410938Fervidobacterium nodosum RT17-B1........–..–––.–.....–––............–..––...–..–––––.....–––––––......................65100.0129...........–––––––––..––––––––––––––––––––––––––––––.........–.––.......–––––.........100.0
1532_FnYP_001411062Fervidobacterium nodosum RT17-B1........–..–––.–.....–––............–..––...–..–––––.....–––––––......................65100.0129...........–––––––––..––––––––––––––––––––––––––––––.........–.––.......–––––.........100.0
T2YP_004003257Caldicellulosiruptor owensensis OL........–..–––.–.....–––.........A..G.TAG––.–..–––––.....–––––––......................6489.2126...........–––––––––..––––––––––––––––––––––––––––––.........A.––.–.....–––––......A..92.5
T1AYP_004024786Caldicellulosiruptor kronotskiensis 2002........–..–––.–.....–––.........–..–.––G..AA..–––––.....–––––––......................6592.1128..........G–––––––––..––––––––––––––––––––––––––––––.........A.––.–.....–––––.........92.5
T1BYP_004023466Caldicellulosiruptor kronotskiensis 2002........–..–––.–.....–––.........–..–.––G..AA..–––––.....–––––––......................6692.1128..........G–––––––––..––––––––––––––––––––––––––––––.........A.––.–.....–––––.........92.5
T1CYP_004022990Caldicellulosiruptor kronotskiensis 2002........–..–––.–.....–––.........–..–.––G..AA..–––––.....–––––––......................6592.1128...........–––––––––..––––––––––––––––––––––––––––––T........A.––.–.....–––––.........92.5
T3ERM93111Caldanaerobacter subterraneus subsp. yonseiensis KB-1..T.....–..–––.–....–––G..........G.A..AG..–ATCT–ATA.....–––––––G.G...................5376.0nd3...........AAA–––––A..––––––––––––––––––––––––––––––.........A.––..–....–––––.........86.3
T4EEB76197Caldanaerobacter subterraneus subsp. pacificus DSM 12653..T.....–..–––.–....–––G..........G.A..AG..–ATCT–ATA.....–––––––G.G...................5376.0nd3...........AAA–––––A..––––––––––––––––––––––––––––––.........A.––..–....–––––.........86.3
T7YP_004819509Thermoanaerobacter wiegelii Rt8.B1..T.....–..–––.–....–––G........AA..A..AG––.ATC–––TA.....–––––––C.G...................4975.3152...........AAA–––––A..––––––––––––––––––––––––––––––.........A.––..–....–––––.........86.3
T5AYP_001665730Thermoanaerobacter pseudethanolicus ATCC 33223..T.....–..–––.–....–––G..........G.A..AG..–ATCT–AT–.....–––––––G.G...................5277.1164...........AAA–––––A..––––––––––––––––––––––––––––––.........A.––..–....–––––.........86.3
T6EGD51473Thermoanaerobacter ethanolicus JW200..T.....–..–––.–....–––G........TAGGGG.–G..AA..––AT–.....–––––––..G...................5077.9153...........AAAA––––A..––––––––––––––––––––––––––––––.........A.––..–....–––––......A..82.2
T10EEU61982Thermoanaerobacter ethanolicus CCSD1..T.....–..–––.–....–––G..........G.A..AG..–ATCT–ATA.....–––––––G.G...................5276.0
T5BYP_001665374Thermoanaerobacter pseudethanolicus ATCC 33223..T.....–..–––.–....–––G..........G.A..AG..–ATCT–ATA.....–––––––G.G...................5276.0
T13BYP_001568581Petrotoga mobilis SJ95.....–––G..T––.C..–.–––––.–––––––––––––––..––TC––A––––––––––––––CTGC.–.–..–..–––––...A27334.88AA..–––––..AAA–T–CGC.GTATTACCTTTTTCATCATCTCTTC–AGTCTATTCTAT..A.CA..T....–––––.........29.4
T13CYP_001568325Petrotoga mobilis SJ95...–.–T.G..AGG.C..–.–––––C–––––––––––––––...–T–––AT–––––––––––––C–G–.–.–.––..––––...GGnd331.88AA..–––––..AAA–T–CTC.GTAATACCTTTTTCATCATCTCTTC–AGTCTATTCT–T..A.CA..T....–––––...G.....29.4

1 Underlined characters indicate a putative transposase start codon.

2 Distance (bp) to the transposase gene start (left) or stop (right) codons. Identity as percentage (%).

3 Not determined, nd.

Repetitive flanking sequences detected in F. nodosum and related sequences within the ISL3 transposase family.

The inverted repeats (IR) upstream (left end) and down-stream (right end) and their percentage of identity with respect to F. nodosum sequences are presented. Identification numbers correspond to those used in the figures for this transposase group. 1 Underlined characters indicate a putative transposase start codon. 2 Distance (bp) to the transposase gene start (left) or stop (right) codons. Identity as percentage (%). 3 Not determined, nd. Similarly, results from K-L divergence analyses (Fig 6) showed increasing divergence at increasing phylogenetic distance, in agreement to the results from Figs 4 and 5 and Table 3. The clusters observed at the phylogeny (Fig 4) were related to stepping up divergence and showed Caldicellulosiruptor as the genus with the lowest divergence from F. nodosum ISL3 transposase genes.
Fig 6

Ranking plot of K-L Divergence estimations for F. nodosum and related gene sequences within the ISL3 transposase family.

Squares represent pairwise divergence estimates between transposase genes; the line indicates the divergence estimates for transposase to whole genome comparisons. Identification of transposases follows the IDs used in the dendrogram for these sequences (Fig 4).

Ranking plot of K-L Divergence estimations for F. nodosum and related gene sequences within the ISL3 transposase family.

Squares represent pairwise divergence estimates between transposase genes; the line indicates the divergence estimates for transposase to whole genome comparisons. Identification of transposases follows the IDs used in the dendrogram for these sequences (Fig 4).

Type C transposases

Three genes classified within the type C transposase group were found in F. pennivorans genome and one in F. islandicum genome. The closest sequences related to Fervidobacterium type C transposase genes belonged to the genus Caldicellulosiruptor (Firmicutes; 98–99% amino acid similarity) which formed a single clade together with the Fervidobacterium sequences (Fig 7). Other type C related transposases included transposases from Firmicutes such as Thermoanaerobacter, Thermobrachium and Clostridium (70–82% similarity). The closest Thermotogae transposase sequences belonged to Petrotoga mobilis (53% similarity). Other more distant sequences from Thermotogae forming an independent cluster were also detected but they showed levels of similarity below 43% (35–43%).
Fig 7

Phylogenetic relationship among Fervidobacterium and related gene sequences of the type C transposase group and their closest relatives within the Phylum Thermotogae.

The percentage of similarity related to the Fervidobacterium amino acidic sequence is shown on the right column.

Phylogenetic relationship among Fervidobacterium and related gene sequences of the type C transposase group and their closest relatives within the Phylum Thermotogae.

The percentage of similarity related to the Fervidobacterium amino acidic sequence is shown on the right column. NMDS analysis of tetranucleotide frequencies (Fig 8) differentiated the major phylogenetic clade formed by Fervidobacterium and Caldicelulosiruptor type C transposases. Additionally, other related phylogenetic clades observed within the Firmicutes clustered on the NMDS plot. Most Thermotogae sequences were clearly separated in a different and distant group (Fig 8).
Fig 8

Non-metric MultiDimensional Scaling (NMDS) analysis of tetranucleotide frequencies for Fervidobacterium (Ferv.) and related sequences of the type C transposase group.

Identification of transposases follows the IDs used in the dendrogram for these sequences (Fig 7). Dashed black, Fervidobacterium (Ferv.); dashed grey, Caldicellulosiruptor (Firmicutes); continuous grey, other Thermotogae.

Non-metric MultiDimensional Scaling (NMDS) analysis of tetranucleotide frequencies for Fervidobacterium (Ferv.) and related sequences of the type C transposase group.

Identification of transposases follows the IDs used in the dendrogram for these sequences (Fig 7). Dashed black, Fervidobacterium (Ferv.); dashed grey, Caldicellulosiruptor (Firmicutes); continuous grey, other Thermotogae. F. pennivorans and F. islandicum showed identical IR sequences (Table 4). The IR sequence down-stream of F. islandicum type C transposase remained undetected. Caldicellulosiruptor type C transposases presented practically identical IR sequences (97–100% identity) than Fervidobacterium. Other related Firmicutes showed 74–86% identity to Fervidobacterium type C IR sequences. For the different lineage of Thermotogae, the IR sequence identity was much lower than the related Firmicutes, down to 42–56% identity.
Table 4

Repetitive flanking sequences detected in Fervidobacterium and related sequences of the type C transposase group.

The inverted repeats (IR) upstream (left end) and down-stream (right end) and their percentage of identity with respect to Fervidobacterium sequences are presented. Identification numbers correspond to those used in the figures for this transposase group.

IDAccession No.OrganismLeft endDistance1Identity1Distance1Right endIdentity1
646_FpYP_005470928Fervidobacterium pennivorans DSM 9078–––––––TGTTAATGATAAAATA–––AAAATGT–ACTGAAAGTGGAATATAAGGATGTACAAAAT–TGTAC–––410–––––––––––––GTTTATTTTTTGTACATTTTTATTTTCGAT–TTTTTGTTCATTTTGA–TATTGACATTTACA–
1012_FpYP_005471294Fervidobacterium pennivorans DSM 9078–––––––................–––.......–...............................–.....–––41100.00–––––––––––––..............................–................–..............–100.0
687_FpYP_005470969Fervidobacterium pennivorans DSM 9078–––––––................–––.......–...............................–.....–––41100.00–––––––––––––..............................–................–..............–100.0
43_FiKHG31461Fervidobacterium islandicum AW-1–––––––................–––.......–...............................–.....–––41100.0nd2––––CAGAAATTTTGCGTCC.GG.T..TGC.G..G.GC.TT..–....–...–....AT.T....–T.–.AG–..–50.8
T1YP_004799686Caldecellulosiruptor lactoaceticus 6A–––––––................–––.......–...............................–.....–––41100.00–––––––––––––..............................–................–..............–100.0
T2YP_002572060Caldecellulosiruptor lactoaceticus 6A–––––––G...............–––.......–..................T............–.....–––4196.60–––––––––––––..............................–................–..............–100.0
T3YP_003840928Caldecellulosiruptor lactoaceticus 6A–––––––....G...........–––.......–...................A...........–.....–––4196.60–––––––––––––..............................–................–..............–100.0
T7YP_003675818Thermoanaerobacter matharanii A3––––––C...A...A........–––.G.....–..GA.........A....AA.......G.T.–.....–––8281.7nd2–––––––––––––..AC............AAA....A...C.C–.......A..AAAAT.–.......T......G73.7
T6CCQ92868Clostridium ultunense Esp––––––––..A...........T–––.G.....–..AATT.A.....A.....A..........G–..––––––3680.3nd2––––––––––––––––––––..........AA........C.C–......A.....C.T.–.............––86.5
T12BYP_002941199Kosmotoga olearia TBF 19.5.1GTGTCGACA.CG..T........GTA....C..CGG.C...AAT..GC.A..AC..CGGT..G.AT..AGTAGT3842.4nd2ACTACGCAAATCT.A.CGA.GAAAC...GCAA..T.G......G.....A.G.T....TG––G.C..TG..G...–55.7
T12AYP_002940250Kosmotoga olearia TBF 19.5.1GTGTCGACA.CG..T........GTA....C..CGG.C...AAT..GC.A..AC..CGGT..G.AT..AGTAGT3842.4nd2ACTACGCAAATCT.AGCGA.GAAAT...GCAA..T.GA.....G.....T.G......TG––G....TG..G...–55.7
T12CYP_002941123Kosmotoga olearia TBF 19.5.1GTGTCGACA.CG..T........GTA....C..CGG.C...AAT..GC.A..AC..CGGT..G.AT..AGTAGT3842.4nd2ACTACGCAAATCT.A.CGA.GAAAC...GCAA..T.G......G.....A.G.T....TG––G.C..TG..G...–55.7

1 Distance (bp) to the transposase gene start (left) or stop (right) codons. Identity as percentage (%).

2 Not determined, nd.

Repetitive flanking sequences detected in Fervidobacterium and related sequences of the type C transposase group.

The inverted repeats (IR) upstream (left end) and down-stream (right end) and their percentage of identity with respect to Fervidobacterium sequences are presented. Identification numbers correspond to those used in the figures for this transposase group. 1 Distance (bp) to the transposase gene start (left) or stop (right) codons. Identity as percentage (%). 2 Not determined, nd. K-L divergence analyses of tetranucleotide frequencies also confirmed the proximity of Caldicellulosiruptor type C sequences to Fervidobacterium (Fig 9). Low divergence estimations were observed between Fervidobacterium and Caldicellulosiruptor type C transposase genes. Other Firmicutes and Thermotogae presented much higher K-L Divergence values.
Fig 9

Ranking plot of K-L Divergence estimations for Fervidobacterium and related gene sequences of the type C transposase group.

Symbols represent pairwise divergence estimates between transposase genes; the line indicates the divergence estimates for transposase to whole genome comparisons. Divergence estimates were calculated with respect to sequences 43_Fi (triangules), 646_Fp (squares), 687_Fp (circles) and 1012_Fp (diamonds). Identification of transposases follows the IDs used in the dendrogram for these sequences (Fig 7).

Ranking plot of K-L Divergence estimations for Fervidobacterium and related gene sequences of the type C transposase group.

Symbols represent pairwise divergence estimates between transposase genes; the line indicates the divergence estimates for transposase to whole genome comparisons. Divergence estimates were calculated with respect to sequences 43_Fi (triangules), 646_Fp (squares), 687_Fp (circles) and 1012_Fp (diamonds). Identification of transposases follows the IDs used in the dendrogram for these sequences (Fig 7).

Type D transposases

Transposase type D was only detected in Fervidobacterium sp. strain FC2004 with five copies found in its genome. These Fervidobacterium transposase genes presented as closest relatives genes from Firmicutes (68–72% similarity; Fig 10), specifically from the genera Caldicellulosiruptor and some sequences from Caldanaerobacter, Thermoanaerobacter and Clostridium which formed a single cluster to the Fervidobacterium type D transposases. A relatively close transposase gene was found in Thermotoga thermarum but it formed a divergent clade and presented lower similarity (65% similarity) that those Firmicutes genes. Other related genes from the Thermotogae presented much lower similarities (42–44%) and represented a divergent clade. Transposase genes forming related lineages with decreasing similarity to Fervidobacterium type D transposases were detected in Aquificae (63–67%), Proteobacteria (58%), Thermodesulfobacteria (59%) and Archaea (49%) (Fig 10).
Fig 10

Phylogenetic relationship among Fervidobacterium sp. FC2004 and related gene sequences of the type D transposase group and their closest relatives within the Phylum Thermotogae.

The percentage of similarity related to the Fervidobacterium amino acidic sequence is shown on the right column.

Phylogenetic relationship among Fervidobacterium sp. FC2004 and related gene sequences of the type D transposase group and their closest relatives within the Phylum Thermotogae.

The percentage of similarity related to the Fervidobacterium amino acidic sequence is shown on the right column. A similar pattern to that observed from the phylogeny was observed through NMDS analysis of tetranucleotide frequencies (Fig 11). The Fervidobacterium type D transposase genes clustered closed to the most related Firmicutes genes and clearly differentiated of more distant transposase genes from other taxonomic groups, including Thermotogae.
Fig 11

Non-metric MultiDimensional Scaling (NMDS) analysis of tetranucleotide frequencies for Fervidobacterium sp. FC2004 (Ferv.) and related sequences of the type D transposase group.

Identification of transposases follows the IDs used in the dendrogram for these sequences (Fig 10). Continuous black, Fervidobacterium sp. FC2004 (Ferv.); dashed grey, Firmicutes; continuous grey, other Thermotogae.

Non-metric MultiDimensional Scaling (NMDS) analysis of tetranucleotide frequencies for Fervidobacterium sp. FC2004 (Ferv.) and related sequences of the type D transposase group.

Identification of transposases follows the IDs used in the dendrogram for these sequences (Fig 10). Continuous black, Fervidobacterium sp. FC2004 (Ferv.); dashed grey, Firmicutes; continuous grey, other Thermotogae. Palindromic IR sequences were detected surrounding the transposase type D genes (Table 5). The IRs within the Fervidobacterium FC2004 genome were highly conserved (95–100% identity). Sequence identity between the IRs of Fervidobacterium and some Caldicellulosiruptor species were 59% and 67% up- and down-stream, respectively. In other Firmicutes (Caldicellulosiruptor, Thermoanaerobacter, Clostridium) identities between 50% to 69% were observed revealing proximity between IR sequences from Fervidobacterium and some Firmicutes. The closest Thermotogae transposase genes to these type D transposases presented IR nucleotide identities between 30% to 43% in agreement to a much higher phylogenetic distance than the pointed Firmicutes to Fervidobacterium type D transposase genes.
Table 5

Repetitive flanking sequences detected in Fervidobacterium sp. FC2004 and related sequences of the type D transposase group.

The palindromic inverted repeats upstream (left end) and down-stream (right end) and their percentage of identity with respect to Fervidobacterium sp. FC2004 sequences are presented. Identification numbers correspond to those used in the figures for this transposase group.

IDAccession No.OrganismLeft endDistance1Identity1Distance1Right endIdentity1
10_FtLWAF01000000Fervidobacterium thailandense FC 2004GGGAGTACAATAAATTTTGTGTCTGGAAAATTTTAGCTTGCTAA4071GTGTTTGTTCTCTTTTCTTCGGACACAAAATTCTTGGACA–GTCTC
11_FtLWAF01000000Fervidobacterium thailandense FC 2004...............................C............4097.770...G.................................T..–.....95.5
29_FtLWAF01000000Fervidobacterium thailandense FC 2004..........G.................................4097.7nd2...G.................................T..–.....95.5
3_FtLWAF01000000Fervidobacterium thailandense FC 2004............................................40100.071...G...............A....................–.....95.5
T5BYP_004798513Caldecellulosiruptor lactoaceticus 6A...T..TA..AT..AA.........A.GT..GA..AAA.CGC..1559.0-33CA...ACGCTA.–...ACC.A...............–...A.....67.3
T4CYP_004026612Caldecellulosiruptor kristjanssonii I77R1B...T..TA..AT..AA.........A.GT..GA..AAA.CGC..1559.0-33CA...ACGCTA.–...ACC.A...............–...A.....67.3
T4BYP_004025224Caldecellulosiruptor kristjanssonii I77R1B...T..TA..AT..AA.........A.GT..GA..AAA.CGC..1559.0-33CA...ACGCTA.–...ACC.A...............–...A.....67.3
T9AYP_004003046Caldecellulosiruptor owensensis OL...T..TA..AT..AG.........AT.T..AA..AAA.CAC..5059.0-33CA...ACGCTA.–...ACC.A...............–...A.....67.3
T9BYP_004003334Caldicellulosiruptor owensensis OL...T..TA..AT..AG.........AT.T..AA..AAA.CAC..5159.0-33CA...ACGCTA.–...ACC.A...............–...A.....67.3
T20EEB76117Caldanaerobacter subterraneus DSM 12653..AT..TA..AT..AA.........ATGT..AA.GAAA.TG...33254.5-33CA...ACG.TA.–...AAC.A...............–...A.....69.5
T19YP_001662008Thermoanaerobacter sp. X514..AT..TA..AT..AA.........ATGT..AA.GAAA.TG...33354.5-33CA...ACG.TA.–...AAC.A...............–...A.....69.5
T7AYP_007372016Clostridium stercorarium DSM 8532...T..TA..AT..AA..........TGT..AA..AAA.CAC..1559.0-33CA...ACG.TA.–...AA..A...............–...AC....69.5
T7BYP_007373409Clostridium stercorarium DSM 8532...T..TA..AT..AA..........TGT..AA..AAA.CAC..1559.016CA...ACG.TA.–...AA..A...............–...AC....69.5
T6AYP_001179002Caldicellulosiruptor saccharolyticus DSM 8903......TA..AT..AA.......CACTGT..AA.TAG.GTTG..2752.2-33AATA.ACGCTAT–GAAA.GGAA..............–...A.....52.1
T6BYP_001178898Caldicellulosiruptor saccharolyticus DSM 8903......TA..AT..AA.......CACTGT..AA.TAG.GTTG..2752.2-33AATA.ACGCTAT–GAAA.GGAA..............–...A.....52.1
T6CYP_001181500Caldicellulosiruptor saccharolyticus DSM 8903.AA...TA..AT..AA.......CATTGT.GAA.TAG.ATTG..7845.4-12AATA.ACGCTAT.GA.G.GGAA..............–...A.....56.5
T5AYP_004797950Caldicellulosiruptor lactoaceticus 6A......TA..AT..AA.......CACTGT..AA.TAG.ATTG..2652.2-33AATA.GCGCTAT–GAAA.GGAA..............–...A.....52.1
T4AYP_004025286Caldicellulosiruptor kristjanssonii I77R1B......TA..AT..AA.......CACTGT..AA.TAG.ATTG..2652.2-33AATA.GCGCTAT–GAAA.GGAA..............–...A.....52.1
T5CYP_004798536Caldicellulosiruptor lactoaceticus 6A......TA..AT..AA........ACTGT..AA.TAG.ATTG.G2552.2-33AAC..ACG.TAT–AAAT.GGAA..............–...A.....56.5
T1YP_003839631Caldicellulosiruptor obsidiansis OB47......TA..AT..AA.......CACTGT..AA.TAG.ATTG..2652.2-33AAT..ACG.TAT–AAAT.GGAA..............–...A.....56.5
T3YP_002572774Caldicellulosiruptor bescii DSM 6725......TA..AT..AA.......CACTGT..AA.TAG.ATTG..2652.2-33AAC..GCG.TAT–AAAT.GGAA..............–...A.....56.5
T2AYP_003478109Thermoanaerobacter italicus Ab9......TA..AT..AA.......CACTGT..AA.TAGAATTG..2650.0-33AAC..ACG.TAT–AAGT.GGAA..............–...A.....56.5
T2BYP_003476860Thermoanaerobacter italicus Ab9......TA..AT..AA.......CACTGT..AA.TAGAATTG..2650.0-33AAC..ACG.TAT–AAGT.GGAA..............–...A.....56.5
T12YP_004660785Thermotoga thermarum DSM 5069......T.G.AT..............TGGTAAGATTA...TAC.1759.0-35AAC.G–AGGTAT–...G.C.A............ACC–CT.A.....52.1
T17YP_007646834Hydrogenobaculum sp. SN......TA..AT.T..A.......TA..CT.AAACA...TA.C.2059.0-33AACAGGAGAT.T–.G.TA.GA...........A...–...A.....56.5
T10YP_001930578Sulfurihydrogenibium sp. YO3AOP1......TA..AT...G........AA.TT.AA..TATAA.GAGG754.5-33AATAAAAAGT.T–.CAA.C.A...........A...–.T.A.....54.3
T23YP_004339944Hippea maritima DSM 10411....C.TA..AT.T..........CCCCC.CAAACCTACTAA..6947.77––––..TGGTATGA.GAGAGA.........A.TAG.–GG.T.....43.4
T18YP_004625801Thermodesulfatator indicus DSM 15286.A.G.CGGG.A...........AA..GT..AA..TAAGAAACTC1947.74AACAGGT.ATCAGC.AACCTTT..........TC.T–.T.G.C...41.3
T25YP_002830539Sulfolobus islandicus M4.25.AAGC.CGGGGCT.CCAGT.A.AGTTGTGG.AG.CAAAG.TG..522.78––––––––.TA.A.A.T...TT......CTA.ACCC–.GTA.C.C.39.1
T13AYP_002940754Kosmotoga olearia TBF 19.5...TC.GTT.GC..........AAACTCCC.CCG.AA.ATGAG.2943.1nd2TCT...CCGGAG...GGGGTTT.........ATG..–...C.C.CG47.8
T13BYP_002940481Kosmotoga olearia TBF 19.5...TC.GTT.GT..........AAACTCCC.CCG.AA.ATGAG.2943.1nd2TCT...CCGGAG...GGAGTTT.........ATG..–...T.C.CG47.8
T15AYP_006346499Mesotoga prima MesG1.Ag.4.2.A.TA.GT..AG..GA......AAACCCTC.AGA.TGGAAGAG.1936.37ACCCA.TACAAAAAAGGGGTTT........AA–GA.–...GAA.C–30.4
T15BYP_006345816Mesotoga prima MesG1.Ag.4.2.A.TA.GT..AG..GA......AAACCCTC.AGA.TGGAAGAG.1936.37ACCCA.TACAAAAAAGGGGTTT........AAAGA.–...GAA.C–30.4

1 Distance (bp) to the transposase gene start (left) or stop (right) codons. Identity as percentage (%).

2 Not determined, nd.

Repetitive flanking sequences detected in Fervidobacterium sp. FC2004 and related sequences of the type D transposase group.

The palindromic inverted repeats upstream (left end) and down-stream (right end) and their percentage of identity with respect to Fervidobacterium sp. FC2004 sequences are presented. Identification numbers correspond to those used in the figures for this transposase group. 1 Distance (bp) to the transposase gene start (left) or stop (right) codons. Identity as percentage (%). 2 Not determined, nd. Gene sequences corresponding to type D transposases presented lowest K-L Divergence estimates when comparing Fervidobacterium and the Firmicutes indicated above. Increasing phylogenetic distance (Fig 10) resulted in a progressive increase of K-L Divergence estimates (Fig 12) suggesting parallel results between the different methods of analysis of HGT used in this study.
Fig 12

Ranking plot of K-L Divergence estimations for Fervidobacterium sp. FC2004 and related gene sequences of the type D transposase group.

Symbols represent pairwise divergence estimates between transposase genes; the line indicates the divergence estimates for transposase to whole genome comparisons. Divergence estimates were calculared with respect to the sequences 3_Ft (triangules), 10_Ft (squares), 11_Ft (circles) and 19_Ft (diamonds). Identification of transposases follows the IDs used in the dendrogram for these sequences (Fig 10).

Ranking plot of K-L Divergence estimations for Fervidobacterium sp. FC2004 and related gene sequences of the type D transposase group.

Symbols represent pairwise divergence estimates between transposase genes; the line indicates the divergence estimates for transposase to whole genome comparisons. Divergence estimates were calculared with respect to the sequences 3_Ft (triangules), 10_Ft (squares), 11_Ft (circles) and 19_Ft (diamonds). Identification of transposases follows the IDs used in the dendrogram for these sequences (Fig 10).

Discussion

Transposases are highly related to the mobility of DNA within and likely between genomes [6, 7] which suggests that they represent a proxy to detect and analyze HGT events and its influence on bacterial evolution and genome plasticity. In this case, we approached the study of transposases in Fervidobacterium genomes. Using different methods to test for potential HGT events of these sequences, the results indicated the presence of a number of transposases reflecting HGT events. Due to the generally observed IS sequence conservation over a broad range of prokaryotic taxa [9, 30] and their functional trait promoting their own duplication, transposase genes are likely to show relatively homogeneous sequences in comparison to other genes forming the pan genome of a taxon. Wagner [9] proposed that ISs may go periodically extinct in prokaryotic populations and become reintroduced through HGT events alleviating the potential negative long-term effects of their massive expansion in the prokaryote genomes. Nevertheless, ISs are suggested to represent evolutionary benefits in a short run which justifies their implication in species adaptation and genome plasticity [5, 6, 31]. Thus, the detection of transposase genes within a genome represents a snapshot of a prokaryote genome and provides indications of the potential genetic flow affecting the relatively recent history of specific taxa. Using the variability range from a reference IS sequence within a Fervidobacterium genome and the closest relatives and clades within other Thermotogae allows to approach the distinction of those ISs putatively incorporated through interphylum HGT events. Thus, these phylogenetically distant HGT events are easily discriminated from potential vertical transferences of DNA within a taxon genomic context. The approach involved different strategies to discriminate IS sequences including phylogeny and tree topology, multidimensional analysis and divergence metrics on tetranucleotide frequencies, and comparative IR sequence conservation. These methods were mostly in agreement on suggesting HGT events of IS sequences between Fervidobacterium and other phyla. The use of transposase genes as target sequences and different approaches to detect interphylum HGT events has shown to represent valuable tools for HGT analyses. Phylogeny showed clear clustering of similar sequences. Besides, multidimensional analyses (i.e., NMDS) of tetranucleotide frequencies also confirmed clustering of the distribution of transposase sequences from Fervidobacterium and related transposase genes. Divergence metrics of transposase genes confirmed increased values in parallel to increasing phylogenetic distance. In addition, the detection of IR sequences flanking those transposase genes is consistent with the idea of them being functional and likely their relatively recent incorporation [9] into these genomes even if the actual mechanism of transference is hardly understood. This study clearly shows the relationship on gene sharing between Fervidobacterium genomes and other prokaryotic phyla. The most frequently detected interphylum HGT relationship was observed between Fervidobacterium and Firmicutes genomes. Specifically, Caldicellulosiruptor genomes shared multiple common ISs with Fervidobacterium genomes and some of them presented up to 99% amino acid similarity in their transposases and 100% IR nucleotide sequence identities. These results were compared to the closely detected IS sequence within each IS family and other Thermotogae (beyond the Fervidobacterium genus) which showed amino acid similarities and IR nucleotide sequence identities generally below 50%. These data confirmed the occurrence of clear interphylum HGT events between those two genera. Other members of the Firmicutes (i.e., Thermoanaerobacter, Caldanaerobacter, Caloramator, Thermobracchium and Clostridium among others) also showed putative HGT events with the genus Fervidobacterium. As well, bacteria from other phyla (i.e., Aquificae, Synergistetes, Deinococcus-Thermus, Thermodesulfobacteria, Proteobacteria) and some Archaea were also detected as putatively involved in relatively recent HGT events with Fervidobacterium. Interestingly, most of these representatives presenting HGT relationship to Fervidobacterium were also thermophiles which confirmed the requirement of potentially sharing a habitat to make DNA exchanges possible [30].

Conclusions

The diversity of transposase genes detected within the genus Fervidobacterium suggests that these sequences have a major contribution to the adaptation and evolution of these bacteria and their genomes to the environment. The actual implication of transposases, their role in HGT, the rate of HGT events and transposase duplication and exchange remain to be fully understood. This study shows the relevance of transposases on HGT using the Fervidobacterium genus as a model of study and shows that IS sequences can be used to detect the occurrence of HGT events between phylogenetically distant prokaryotes and to identify the taxa involved in potential exchanges of genomic DNA.
  25 in total

Review 1.  Phylogenetic classification and the universal tree.

Authors:  W F Doolittle
Journal:  Science       Date:  1999-06-25       Impact factor: 47.728

Review 2.  Lateral gene transfer and the nature of bacterial innovation.

Authors:  H Ochman; J G Lawrence; E A Groisman
Journal:  Nature       Date:  2000-05-18       Impact factor: 49.962

3.  MEGA6: Molecular Evolutionary Genetics Analysis version 6.0.

Authors:  Koichiro Tamura; Glen Stecher; Daniel Peterson; Alan Filipski; Sudhir Kumar
Journal:  Mol Biol Evol       Date:  2013-10-16       Impact factor: 16.240

Review 4.  Horizontal gene transfer in evolution: facts and challenges.

Authors:  Luis Boto
Journal:  Proc Biol Sci       Date:  2009-10-28       Impact factor: 5.349

5.  On the chimeric nature, thermophilic origin, and phylogenetic placement of the Thermotogales.

Authors:  Olga Zhaxybayeva; Kristen S Swithers; Pascal Lapierre; Gregory P Fournier; Derek M Bickhart; Robert T DeBoy; Karen E Nelson; Camilla L Nesbø; W Ford Doolittle; J Peter Gogarten; Kenneth M Noll
Journal:  Proc Natl Acad Sci U S A       Date:  2009-03-23       Impact factor: 11.205

6.  Evidence for horizontal gene transfer of anaerobic carbon monoxide dehydrogenases.

Authors:  Stephen M Techtmann; Alexander V Lebedinsky; Albert S Colman; Tatyana G Sokolova; Tanja Woyke; Lynne Goodwin; Frank T Robb
Journal:  Front Microbiol       Date:  2012-04-17       Impact factor: 5.640

7.  ISfinder: the reference centre for bacterial insertion sequences.

Authors:  P Siguier; J Perochon; L Lestrade; J Mahillon; M Chandler
Journal:  Nucleic Acids Res       Date:  2006-01-01       Impact factor: 16.971

8.  Detection and characterization of horizontal transfers in prokaryotes using genomic signature.

Authors:  Christine Dufraigne; Bernard Fertil; Sylvain Lespinats; Alain Giron; Patrick Deschavanne
Journal:  Nucleic Acids Res       Date:  2005-01-13       Impact factor: 16.971

9.  Genome sequence of a native-feather degrading extremely thermophilic Eubacterium, Fervidobacterium islandicum AW-1.

Authors:  Yong-Jik Lee; Haeyoung Jeong; Gun-Seok Park; Yunyoung Kwak; Sang-Jae Lee; Sang Jun Lee; Min-Kyu Park; Ji-Yeon Kim; Hwan Ku Kang; Jae-Ho Shin; Dong-Woo Lee
Journal:  Stand Genomic Sci       Date:  2015-09-29

Review 10.  Bacterial insertion sequences: their genomic impact and diversity.

Authors:  Patricia Siguier; Edith Gourbeyre; Mick Chandler
Journal:  FEMS Microbiol Rev       Date:  2014-02-26       Impact factor: 16.408

View more
  6 in total

1.  Horizontal Gene Transfer Building Prokaryote Genomes: Genes Related to Exchange Between Cell and Environment are Frequently Transferred.

Authors:  Apuã C M Paquola; Huma Asif; Carlos Alberto de Bragança Pereira; Bruno César Feltes; Diego Bonatto; Wanessa Cristina Lima; Carlos Frederico Martins Menck
Journal:  J Mol Evol       Date:  2018-03-19       Impact factor: 2.395

2.  Understanding Horizontal Gene Transfer network in human gut microbiota.

Authors:  Chen Li; Jiaxing Chen; Shuai Cheng Li
Journal:  Gut Pathog       Date:  2020-07-09       Impact factor: 4.181

3.  Identification, overexpression, purification, and biochemical characterization of a novel hyperthermostable keratinase from Geoglobus acetivorans.

Authors:  Duangjai Sittipol; Sudarat Rodpan; Ya'u S Ajingi; Tassanee Lohnoo; Tassanee Lerksuthirat; Yothin Kumsang; Wanta Yingyong; Pongsak Khunrae; Triwit Rattanarojpong; Kovit Pattanapanyasat; Nujarin Jongruja
Journal:  3 Biotech       Date:  2020-11-27       Impact factor: 2.406

4.  Nitric Oxide Accumulation: The Evolutionary Trigger for Phytopathogenesis.

Authors:  Margarida M Santana; Juan M Gonzalez; Cristina Cruz
Journal:  Front Microbiol       Date:  2017-10-10       Impact factor: 5.640

5.  A comparative analysis of drinking water employing metagenomics.

Authors:  Kyle D Brumfield; Nur A Hasan; Menu B Leddy; Joseph A Cotruvo; Shah M Rashed; Rita R Colwell; Anwar Huq
Journal:  PLoS One       Date:  2020-04-09       Impact factor: 3.240

6.  Virioplankton assemblages from challenger deep, the deepest place in the oceans.

Authors:  Chen Gao; Yantao Liang; Yong Jiang; David Paez-Espino; Meiaoxue Han; Chengxiang Gu; Meiwen Wang; Yumei Yang; Fengjiao Liu; Qingwei Yang; Zheng Gong; Xinran Zhang; Zhixiang Luo; Hui He; Cui Guo; Hongbing Shao; Chun Zhou; Yang Shi; Yu Xin; Jinyan Xing; Xuexi Tang; Qilong Qin; Yu-Zhong Zhang; Jianfeng He; Nianzhi Jiao; Andrew McMinn; Jiwei Tian; Curtis A Suttle; Min Wang
Journal:  iScience       Date:  2022-06-27
  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.