| Literature DB >> 28426805 |
Alba Cuecas1, Wirojne Kanoksilapatham2, Juan M Gonzalez1.
Abstract
Horizontal Gene Transfer (HGT) plays an important role in the physiology and evolution of microorganisms above all thermophilic prokaryotes. Some members of the Phylum Thermotogae (i.e., Thermotoga spp.) have been reported to present genomes constituted by a mosaic of genes from a variety of origins. This study presents a novel approach to search on the potential plasticity of Fervidobacterium genomes using putative transposase-encoding genes as the target of analysis. Transposases are key proteins involved in genomic DNA rearrangements. A comprehensive comparative analysis, including phylogeny, non-metric multidimensional scaling analysis of tetranucleotide frequencies, repetitive flanking sequences and divergence estimates, was performed on the transposase genes detected in four Fervidobacterium genomes: F. nodosum, F. pennivorans, F. islandicum and a new isolate (Fervidobacterium sp. FC2004). Transposase sequences were classified in different groups by their degree of similarity. The different methods used in this study pointed that over half of the transposase genes represented putative HGT events with closest relative sequences within the phylum Firmicutes, being Caldicellulosiruptor the genus showing highest gene sequence proximity. These results confirmed a direct evolutionary relationship through HGT between specific Fervidobacterium species and thermophilic Firmicutes leading to potential gene sequence and functionality sharing to thrive under similar environmental conditions. Transposase-encoding genes represent suitable targets to approach the plasticity and potential mosaicism of bacterial genomes.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28426805 PMCID: PMC5398504 DOI: 10.1371/journal.pone.0173961
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Classification of insertion sequences detected in four Fervidobacterium genomes.
| IS Family | Transposase groups | HGT | ||||
|---|---|---|---|---|---|---|
| IS6 | ISCpe7-like | 23 | 24 | 12 | 34 | y |
| IS110 | IS116/IS110/IS902-like | |||||
| IS110_I | 6 | y | ||||
| IS110_II | 2 | 1 | 1 | n | ||
| IS110_III | 1 | 6 | n | |||
| IS200/IS605 | IS200-like | |||||
| IS200_I | 1 | n | ||||
| IS200_II | 3 | n | ||||
| IS200_III | 2 | n | ||||
| IS605 OrfB-like | ||||||
| IS605_I | 1 | y | ||||
| IS605_II | 1 | 6 | y | |||
| IS605_III | 1 | 2 | n | |||
| ISL3 | IS204/IS1001/IS1096/IS1165-like | 8 | y | |||
| IS3 | IS3/IS911-like | |||||
| IS3_I | 2 | n | ||||
| IS3_II | 2 | y | ||||
| IS3_III | 1 | y | ||||
| IS4 | IS4-like | 6 | n | |||
| Type A | Type A | 2 | n | |||
| Type B | Type B | 4 | y | |||
| Type C | Type C | 3 | 1 | y | ||
| Type D | Type D | 5 | y | |||
| Total transposases | 43 | 41 | 39 | 43 | ||
1 Putative acquisition of these sequences through interphylum HGT. Yes, y; No, n.
Fig 1Phylogenetic relationship among Fervidobacterium nodosum and related gene sequences of the IS605_I transposase family and their closest relatives within the Phylum Thermotogae.
The percentage of similarity related to the F. nodosum amino acidic sequence is shown on the right column.
Fig 2Non-metric MultiDimensional Scaling (NMDS) analysis of tetranucleotide frequencies for F. nodosum and related gene sequences of the IS_605_I transposase family.
Identification of transposases follows the IDs used in the dendrogram for these sequences (Fig 1). Dashed black, F. Nodosum; dashed grey, diverse phyla related to the transposase in F. Nodosum; continuous grey, other Thermotogae.
Repetitive flanking sequences detected in F. nodosum and related sequences within the IS605_I transposase family.
The palindromic inverted repeats upstream (left end) and down-stream (right end) and their percentage of identity with respect to F. nodosum sequences are presented. Identification numbers correspond to those used in the figures for this transposase group.
| ID | Accession No. | Organism | Left end | Distance | Identity | Distance | Right end | Identity |
|---|---|---|---|---|---|---|---|---|
| 1381_Fn | YP_001410911 | 60 | 0 | |||||
| T12 | YP_003841390 | 94 | 67.3 | 22 | 72.7 | |||
| T15 | YP_001180031 | 78 | 67.3 | 85 | 71.1 | |||
| T13 | YP_004022970 | 94 | 68.6 | 22 | 75.0 | |||
| T14 | YP_002574271 | 93 | 70.0 | 22 | 75.0 | |||
| T20 | YP_003239632 | 57 | 55.7 | 1 | 73.1 | |||
| T19 | YP_006919201 | 67 | 59.6 | 1 | 78.0 | |||
| T17 | WP_008287510 | 76 | 42.3 | -21 | 57.1 | |||
| T18D | YP_004281281 | 55 | 46.1 | -26 | 80.4 | |||
| T18C | YP_004281398 | 52 | 44.2 | -20 | 70.7 | |||
| T18A | YP_004281722 | 51 | 44.2 | 0 | 75.6 | |||
| T18B | YP_004281103 | 51 | 44.2 | -20 | 78.0 | |||
| T6 | YP_007977397 | 127 | 43.3 | 0 | 60.4 | |||
| T7A | NP_228850 | 127 | 43.3 | 0 | 60.4 | |||
| T8A | YP_001739778 | 127 | 43.3 | 0 | 60.4 | |||
| T8B | YP_001739648 | 127 | 41.5 | 0 | 54.5 | |||
| T9A | YP_001245146 | 127 | 41.5 | 0 | 56.8 |
1 Distance (bp) to the transposase gene start (left) or stop (right) codons. Palindromic sequences are underlined. Identity as percentage (%).
Fig 3Ranking plot of K-L Divergence estimations for F. nododum and related gene sequences within the IS605_I transposase family.
Squares represent pairwise divergence estimates between transposase genes; the line indicates the divergence estimates for transposase to whole genome comparisons. Identification of transposases follows the IDs used in the dendrogram for these sequences (Fig 1).
Fig 4Phylogenetic relationship among Fervidobacterium nodosum and related gene sequences of the ISL3 transposase family and their closest relatives within the Phylum Thermotogae.
The percentage of similarity related to the F. nodosum amino acidic sequence is shown on the right column.
Fig 5Non-metric MultiDimensional Scaling (NMDS) analysis of tetranucleotide frequencies for F. nodosum and related gene sequences of the ISL3 transposase family.
Identification of transposases follows the IDs used in the dendrogram for these sequences (Fig 4). Dashed black, F. Nodosum; dashed grey, Caldicellulosiruptor (Firmicutes); continuous grey, other Thermotogae.
Repetitive flanking sequences detected in F. nodosum and related sequences within the ISL3 transposase family.
The inverted repeats (IR) upstream (left end) and down-stream (right end) and their percentage of identity with respect to F. nodosum sequences are presented. Identification numbers correspond to those used in the figures for this transposase group.
| ID | Accession No. | Organism | Left end | Distance | Identity | Distance | Right end | Identity |
|---|---|---|---|---|---|---|---|---|
| 243_fn | YP_001409773 | 65 | 129 | |||||
| 343_Fn | YP_001409873 | Fervidobacterium nodosum RT17-B1 | 65 | 100.0 | 129 | 100.0 | ||
| 999_Fn | YP_001410529 | Fervidobacterium nodosum RT17-B1 | 65 | 100.0 | 129 | 100.0 | ||
| 1160_Fn | YP_001410690 | Fervidobacterium nodosum RT17-B1 | 65 | 100.0 | 129 | 100.0 | ||
| 1322_Fn | YP_001410852 | 65 | 100.0 | 129 | 100.0 | |||
| 1405_Fn | YP_001410935 | Fervidobacterium nodosum RT17-B1 | 65 | 98.3 | 129 | 100.0 | ||
| 1408_Fn | YP_001410938 | Fervidobacterium nodosum RT17-B1 | 65 | 100.0 | 129 | 100.0 | ||
| 1532_Fn | YP_001411062 | Fervidobacterium nodosum RT17-B1 | 65 | 100.0 | 129 | 100.0 | ||
| T2 | YP_004003257 | 64 | 89.2 | 126 | 92.5 | |||
| T1A | YP_004024786 | 65 | 92.1 | 128 | 92.5 | |||
| T1B | YP_004023466 | 66 | 92.1 | 128 | 92.5 | |||
| T1C | YP_004022990 | 65 | 92.1 | 128 | 92.5 | |||
| T3 | ERM93111 | 53 | 76.0 | nd | 86.3 | |||
| T4 | EEB76197 | 53 | 76.0 | nd | 86.3 | |||
| T7 | YP_004819509 | 49 | 75.3 | 152 | 86.3 | |||
| T5A | YP_001665730 | 52 | 77.1 | 164 | 86.3 | |||
| T6 | EGD51473 | 50 | 77.9 | 153 | 82.2 | |||
| T10 | EEU61982 | 52 | 76.0 | |||||
| T5B | YP_001665374 | 52 | 76.0 | |||||
| T13B | YP_001568581 | 273 | 34.8 | 8 | 29.4 | |||
| T13C | YP_001568325 | nd | 31.8 | 8 | 29.4 |
1 Underlined characters indicate a putative transposase start codon.
2 Distance (bp) to the transposase gene start (left) or stop (right) codons. Identity as percentage (%).
3 Not determined, nd.
Fig 6Ranking plot of K-L Divergence estimations for F. nodosum and related gene sequences within the ISL3 transposase family.
Squares represent pairwise divergence estimates between transposase genes; the line indicates the divergence estimates for transposase to whole genome comparisons. Identification of transposases follows the IDs used in the dendrogram for these sequences (Fig 4).
Fig 7Phylogenetic relationship among Fervidobacterium and related gene sequences of the type C transposase group and their closest relatives within the Phylum Thermotogae.
The percentage of similarity related to the Fervidobacterium amino acidic sequence is shown on the right column.
Fig 8Non-metric MultiDimensional Scaling (NMDS) analysis of tetranucleotide frequencies for Fervidobacterium (Ferv.) and related sequences of the type C transposase group.
Identification of transposases follows the IDs used in the dendrogram for these sequences (Fig 7). Dashed black, Fervidobacterium (Ferv.); dashed grey, Caldicellulosiruptor (Firmicutes); continuous grey, other Thermotogae.
Repetitive flanking sequences detected in Fervidobacterium and related sequences of the type C transposase group.
The inverted repeats (IR) upstream (left end) and down-stream (right end) and their percentage of identity with respect to Fervidobacterium sequences are presented. Identification numbers correspond to those used in the figures for this transposase group.
| ID | Accession No. | Organism | Left end | Distance | Identity | Distance | Right end | Identity |
|---|---|---|---|---|---|---|---|---|
| 646_Fp | YP_005470928 | 41 | 0 | |||||
| 1012_Fp | YP_005471294 | 41 | 100.0 | 0 | 100.0 | |||
| 687_Fp | YP_005470969 | 41 | 100.0 | 0 | 100.0 | |||
| 43_Fi | KHG31461 | 41 | 100.0 | nd | 50.8 | |||
| T1 | YP_004799686 | 41 | 100.0 | 0 | 100.0 | |||
| T2 | YP_002572060 | Caldecellulosiruptor lactoaceticus 6A | 41 | 96.6 | 0 | 100.0 | ||
| T3 | YP_003840928 | Caldecellulosiruptor lactoaceticus 6A | 41 | 96.6 | 0 | 100.0 | ||
| T7 | YP_003675818 | 82 | 81.7 | nd | 73.7 | |||
| T6 | CCQ92868 | 36 | 80.3 | nd | 86.5 | |||
| T12B | YP_002941199 | 38 | 42.4 | nd | 55.7 | |||
| T12A | YP_002940250 | 38 | 42.4 | nd | 55.7 | |||
| T12C | YP_002941123 | 38 | 42.4 | nd | 55.7 |
1 Distance (bp) to the transposase gene start (left) or stop (right) codons. Identity as percentage (%).
2 Not determined, nd.
Fig 9Ranking plot of K-L Divergence estimations for Fervidobacterium and related gene sequences of the type C transposase group.
Symbols represent pairwise divergence estimates between transposase genes; the line indicates the divergence estimates for transposase to whole genome comparisons. Divergence estimates were calculated with respect to sequences 43_Fi (triangules), 646_Fp (squares), 687_Fp (circles) and 1012_Fp (diamonds). Identification of transposases follows the IDs used in the dendrogram for these sequences (Fig 7).
Fig 10Phylogenetic relationship among Fervidobacterium sp. FC2004 and related gene sequences of the type D transposase group and their closest relatives within the Phylum Thermotogae.
The percentage of similarity related to the Fervidobacterium amino acidic sequence is shown on the right column.
Fig 11Non-metric MultiDimensional Scaling (NMDS) analysis of tetranucleotide frequencies for Fervidobacterium sp. FC2004 (Ferv.) and related sequences of the type D transposase group.
Identification of transposases follows the IDs used in the dendrogram for these sequences (Fig 10). Continuous black, Fervidobacterium sp. FC2004 (Ferv.); dashed grey, Firmicutes; continuous grey, other Thermotogae.
Repetitive flanking sequences detected in Fervidobacterium sp. FC2004 and related sequences of the type D transposase group.
The palindromic inverted repeats upstream (left end) and down-stream (right end) and their percentage of identity with respect to Fervidobacterium sp. FC2004 sequences are presented. Identification numbers correspond to those used in the figures for this transposase group.
| ID | Accession No. | Organism | Left end | Distance | Identity | Distance | Right end | Identity |
|---|---|---|---|---|---|---|---|---|
| 10_Ft | LWAF01000000 | 40 | 71 | |||||
| 11_Ft | LWAF01000000 | 40 | 97.7 | 70 | 95.5 | |||
| 29_Ft | LWAF01000000 | 40 | 97.7 | nd2 | 95.5 | |||
| 3_Ft | LWAF01000000 | 40 | 100.0 | 71 | 95.5 | |||
| T5B | YP_004798513 | 15 | 59.0 | -33 | 67.3 | |||
| T4C | YP_004026612 | 15 | 59.0 | -33 | 67.3 | |||
| T4B | YP_004025224 | 15 | 59.0 | -33 | 67.3 | |||
| T9A | YP_004003046 | 50 | 59.0 | -33 | 67.3 | |||
| T9B | YP_004003334 | 51 | 59.0 | -33 | 67.3 | |||
| T20 | EEB76117 | 332 | 54.5 | -33 | 69.5 | |||
| T19 | YP_001662008 | 333 | 54.5 | -33 | 69.5 | |||
| T7A | YP_007372016 | 15 | 59.0 | -33 | 69.5 | |||
| T7B | YP_007373409 | 15 | 59.0 | 16 | 69.5 | |||
| T6A | YP_001179002 | 27 | 52.2 | -33 | 52.1 | |||
| T6B | YP_001178898 | 27 | 52.2 | -33 | 52.1 | |||
| T6C | YP_001181500 | 78 | 45.4 | -12 | 56.5 | |||
| T5A | YP_004797950 | 26 | 52.2 | -33 | 52.1 | |||
| T4A | YP_004025286 | 26 | 52.2 | -33 | 52.1 | |||
| T5C | YP_004798536 | 25 | 52.2 | -33 | 56.5 | |||
| T1 | YP_003839631 | 26 | 52.2 | -33 | 56.5 | |||
| T3 | YP_002572774 | 26 | 52.2 | -33 | 56.5 | |||
| T2A | YP_003478109 | 26 | 50.0 | -33 | 56.5 | |||
| T2B | YP_003476860 | 26 | 50.0 | -33 | 56.5 | |||
| T12 | YP_004660785 | 17 | 59.0 | -35 | 52.1 | |||
| T17 | YP_007646834 | 20 | 59.0 | -33 | 56.5 | |||
| T10 | YP_001930578 | 7 | 54.5 | -33 | 54.3 | |||
| T23 | YP_004339944 | 69 | 47.7 | 7 | 43.4 | |||
| T18 | YP_004625801 | 19 | 47.7 | 4 | 41.3 | |||
| T25 | YP_002830539 | 5 | 22.7 | 8 | 39.1 | |||
| T13A | YP_002940754 | 29 | 43.1 | nd | 47.8 | |||
| T13B | YP_002940481 | 29 | 43.1 | nd | 47.8 | |||
| T15A | YP_006346499 | 19 | 36.3 | 7 | 30.4 | |||
| T15B | YP_006345816 | 19 | 36.3 | 7 | 30.4 |
1 Distance (bp) to the transposase gene start (left) or stop (right) codons. Identity as percentage (%).
2 Not determined, nd.
Fig 12Ranking plot of K-L Divergence estimations for Fervidobacterium sp. FC2004 and related gene sequences of the type D transposase group.
Symbols represent pairwise divergence estimates between transposase genes; the line indicates the divergence estimates for transposase to whole genome comparisons. Divergence estimates were calculared with respect to the sequences 3_Ft (triangules), 10_Ft (squares), 11_Ft (circles) and 19_Ft (diamonds). Identification of transposases follows the IDs used in the dendrogram for these sequences (Fig 10).