| Literature DB >> 25018750 |
Shannon M Soucy1, Matthew S Fullmer1, R Thane Papke1, Johann Peter Gogarten1.
Abstract
This research uses inteins, a type of mobile genetic element, to infer patterns of gene transfer within the Halobacteria. We surveyed 118 genomes representing 26 genera of Halobacteria for intein sequences. We then used the presence-absence profile, sequence similarity and phylogenies from the inteins recovered to explore how intein distribution can provide insight on the dynamics of gene flow between closely related and divergent organisms. We identified 24 proteins in the Halobacteria that have been invaded by inteins at some point in their evolutionary history, including two proteins not previously reported to contain an intein. Furthermore, the size of an intein is used as a heuristic for the phase of the intein's life cycle. Larger size inteins are assumed to be the canonical two domain inteins, consisting of self-splicing and homing endonuclease domains (HEN); smaller sizes are assumed to have lost the HEN domain. For many halobacterial groups the consensus phylogenetic signal derived from intein sequences is compatible with vertical inheritance or with a strong gene transfer bias creating these clusters. Regardless, the coexistence of intein-free and intein-containing alleles reveal ongoing transfer and loss of inteins within these groups. Inteins were frequently shared with other Euryarchaeota and among the Bacteria, with members of the Cyanobacteria (Cyanothece, Anabaena), Bacteriodetes (Salinibacter), Betaproteobacteria (Delftia, Acidovorax), Firmicutes (Halanaerobium), Actinobacteria (Longispora), and Deinococcus-Thermus-group.Entities:
Keywords: gene flow; gene symbiosis; genome as an ecosystem; halobacteria; horizontal gene transfer; inteins; mobile genetic elements
Year: 2014 PMID: 25018750 PMCID: PMC4071816 DOI: 10.3389/fmicb.2014.00299
Source DB: PubMed Journal: Front Microbiol ISSN: 1664-302X Impact factor: 5.640
Exteins in the halobacteria.
| Cell division control protein 21 | |
| DNA polymerase B1 | |
| DNA polymerase II large subunit | |
| Deoxycytadine triphosphate deaminase | |
| DNA gyrase subunit B | |
| ATP-dependent helicase | |
| ATP-dependent DNA ligase I | |
| Replication factor C small subunit | |
| Ribonucleoside-diphosphate reductase | |
| DNA-directed RNA polymerase subunit A | |
| UDP-glucose 6-dehydrogenase | |
| DNA topoisomerase I | |
| DNA topoisomerase VI subunit B |
Denotes intein alleles discovered in this work.
Denotes extein sequences not previously reported to be invaded by an intein.
Figure 1Intein Invasion Pattern in the Halobacteria. Intein pattern of presence-absence is mapped onto the tips of a ribosomal reference tree, teal boxes indicate the presence of a full size intein, yellow boxes indicate the presence of a mini-intein, black boxes indicate the absence of an intein, and white boxes indicate missing data. Purple shaded boxes indicate the genera with more than five species represented on the tree. Nodes with bootstrap support <70 are in gray.
Figure 2Relationships among intein alleles in the Halobacteria. This tree depicts the phylogenetic relationships among intein alleles in the Halobacteria. Inteins that clustered in concordance with the allele were collapsed to a single node, and labeled with the name of the intein allele. Two polB-d sequences did not group with the polB-d allele, and instead are located amongst the polB-c alleles these are indicated in red. Nodes with bootstrap support <70 are colored gray.
Figure 3Clustering of Halobacteria based on intein sequences and distribution. Halobacteria were clustered based on intein sequences and the distribution in each genome. Clusters with posterior probability >95% are shaded purple.
Taxonomic distribution in each intein allele.
| Monophyletic | 55 | 4 | 16 | |
| Monophyletic | 1 | 0 | 0 | |
| Monophyletic | 6 | 0 | 0 | |
| Monophyletic | 6 | 19 | 1 | |
| Monophyletic | 1 | 2 | 1 | |
| Monophyletic | 1 | 0 | 0 | |
| Monophyletic | 9 | 0 | 1 | |
| Monophyletic | 6 | 0 | 1 | |
| Monophyletic | 16 | 0 | 13 | |
| Monophyletic | 5 | 0 | 0 | |
| Monophyletic | 15 | 55 | 5 | |
| Monophyletic | 4 | 15 | 0 | |
| Monophyletic | 5 | 1 | 0 | |
| Monophyletic | 3 | 3 | 0 | |
| Monophyletic | 10 | 0 | 0 | |
| Monophyletic | 8 | 0 | 0 | |
| Monophyletic | 4 | 0 | 1 | |
| Monophyletic | 7 | 2 | 6 | |
| Monophyletic | 1 | 4 | 0 | |
| Monophyletic | 20 | 1 | 1 | |
| Polyphyletic-bacteria | 16 | 2 | 1 | |
| Polyphyletic-bacteria | 38 | 3 | 0 | |
| Polyphyletic-Euryarchaeota | 75 | 0 | 16 | |
| Polyphyletic-Euryarchaeota | 51 | 1 | 3 |
Denotes intein alleles discovered in this work.
Denotes exteins discovered in this work.
Figure 4Phylogenetic diversity in halobacterial intein alleles. A stacked column graph depicts the representation of the Halobacteria (in purple), the Bacteria (in blue), and other Euryarchaeota (in green). Intein alleles are ordered by the number of intein sequences recovered for each allele, which is reported in parenthesis after the intein allele name on the x-axis. The number of genera for each intein allele is indicated by the number of breaks in the column (white lines) and the height of each of the fragments that make up a column indicate the proportion of sequences in that allele found in a particular genus.
Figure 5Intein size distributions in the . The size of inteins in the Haloarcula (A), Haloferax (B), and Halorubrum (C) are indicated in the column corresponding to the intein allele. Mini-inteins are colored yellow, large inteins are colored teal, black boxes indicate no intein, and white boxes indicate missing data, clusters from Figure 3 are indicated by numbered orange boxes. The cdc21-a, and b sequences for Halorubrum sp. J07HR59, though smaller than the rest, cannot be considered mini-inteins, as the intein sequences in these positions are not complete.
Protein sequence identifiers for intein sequences.
| YP_004340760.1 | Euryarchaeota | ||
| YP_003400528.1 | Euryarchaeota | ||
| YP_008072558.1 | Euryarchaeota | ||
| WP_021836378.1 | Cyanobacteria | ||
| YP_003435419.1 | Euryarchaeota | ||
| WP_020220725.1 | Halobacteria | ||
| WP_020504136.1 | Gammaproteobacteria | ||
| WP_019178416.1 | Euryarchaeota | ||
| YP_004576471.1 | Euryarchaeota | ||
| GAD83132.1 | Actinobacteria | ||
| WP_020380316.1 | Actinobacteria | ||
| NP_127115.1 | Euryarchaeota | ||
| NP_578211.1 | Euryarchaeota | ||
| NP_142122.1 | Euryarchaeota | ||
| YP_004424138.1 | Euryarchaeota | ||
| YP_008429717.1 | Euryarchaeota | ||
| YP_002306424.1 | Euryarchaeota | ||
| YP_002994932.1 | Euryarchaeota | ||
| YP_002582218.1 | Euryarchaeota | ||
| YP_006424652.1 | Euryarchaeota | ||
| WP_010479121.1 | Euryarchaeota | ||
| KJ_865687.1 | Halobacteria | ||
| KJ_865689.1 | Halobacteria | ||
| YP_003887897.1 | Cyanobacteria | ||
| WP_020220725.1 | Halobacteria | ||
| YP_008072558.1 | Euryarchaeota | ||
| WP_019178416.1 | Euryarchaeota | ||
| YP_004070279.1 | Euryarchaeota | ||
| KJ_865687.1 | Halobacteria | ||
| KJ_865688.1 | Halobacteria | ||
| KJ_865689.1 | Halobacteria | ||
| YP_003400528.1 | Euryarchaeota | ||
| YP_003572085.1 | Bacteroidetes | ||
| YP_446104.1 | Bacteroidetes | ||
| WP_020678478.1 | Halobacteria | ||
| YP_006544623.1 | Euryarchaeota | ||
| WP_006885382.1 | Halobacteria | ||
| YP_003572085.1 | Bacteroidetes | ||
| YP_446104.1 | Bacteroidetes | ||
| WP_005489097.1 | Firmicutes | ||
| WP_020678478.1 | Halobacteria | ||
| YP_004202875.1 | Deinococcus-Thermus | ||
| YP_004483799.1 | Euryarchaeota | ||
| KJ_865686.1 | Halobacteria | ||
| YP_004341738.1 | Euryarchaeota | ||
| WP_006882195.1 | Halobacteria | ||
| YP_003616947.1 | Euryarchaeota | ||
| ABU41683.1 | Euryarchaeota | ||
| YP_006544019.1 | Euryarchaeota | ||
| YP_001048029.1 | Euryarchaeota | ||
| WP_004037227.1 | Euryarchaeota | ||
| WP_007314808.1 | Euryarchaeota | ||
| WP_004076782.1 | Euryarchaeota | ||
| YP_003893638.1 | Euryarchaeota | ||
| YP_001403293.1 | Euryarchaeota | ||
| YP_007242862.1 | Euryarchaeota | ||
| YP_002467270.1 | Euryarchaeota | ||
| YP_503855.1 | Euryarchaeota | ||
| NP_142130.1 | Euryarchaeota | ||
| YP_002958492.1 | Euryarchaeota | ||
| YP_002994988.1 | Euryarchaeota | ||
| uncultured haloarchaeon | ABQ75865.1 | Halobacteria | |
| KJ_865692.1 | Halobacteria | ||
| KJ_865690.1 | Halobacteria | ||
| KJ_564691.1 | Halobacteria | ||
| WP_006882195.1 | Halobacteria | ||
| YP_004624494.1 | Euryarchaeota | ||
| uncultured haloarchaeon | ABQ75865.1 | Halobacteria | |
| YP_003443943.1 | Gammaproteobacteria | ||
| YP_006997726 | Cyanobacteria | ||
| WP_016950132.1 | Cyanobacteria | ||
| BAM51471.1 | Firmicutes | ||
| WP_019489451.1 | Cyanobacteria | ||
| WP_006099284.1 | Cyanobacteria | ||
| WP_006276716.1 | Cyanobacteria | ||
| YP_007173052.1 | Cyanobacteria | ||
| WP_021780646.1 | Halobacteria | ||
| WP_019178436.1 | Euryarchaeota | ||
| WP_002774451.1 | Cyanobacteria | ||
| WP_008190351.1 | Cyanobacteria | ||
| WP_017715151.1 | Cyanobacteria | ||
| WP_019509077.1 | Cyanobacteria | ||
| WP_017710941.1 | Cyanobacteria | ||
| WP_009342634.1 | Cyanobacteria | ||
| YP_007054134.1 | Cyanobacteria | ||
| YP_007037469.1 | Actinobacteria | ||
| NP_441040.1 | Cyanobacteria | ||
| YP_723459.1 | Cyanobacteria | ||
| uncultured bacterium | EKD46222.1 | ||
| YP_005540906.1 | Firmicutes | ||
| WP_017696872.1 | Firmicutes | ||
| Nanoarchaeota archaeon SCGC AAA011-L22 | WP_018204386.1 | ||
| NP_248426.1 | Euryarchaeota | ||
| YP_003458055.1 | Euryarchaeota | ||
| YP_004576337.1 | Euryarchaeota | ||
| WP_007044297.1 | Euryarchaeota | ||
| NP_125803.1 | Euryarchaeota | ||
| NP_577822.1 | Euryarchaeota | ||
| NP_142122.1 | Euryarchaeota | ||
| YP_006353924.1 | Euryarchaeota | ||
| YP_184631.1 | Euryarchaeota | ||
| YP_008428897.1 | Euryarchaeota | ||
| YP_004763272.1 | Euryarchaeota | ||
| YP_002582171.1 | Euryarchaeota | ||
| YP_006425306.1 | Euryarchaeota | ||
| KJ_865684.1 | Halobacteria | ||
| KJ_865685.1 | Halobacteria | ||
| YP_001995975.1 | Chlorobi | ||
| YP_007273179.1 | Firmicutes | ||
| uncultured Chloroflexi bacterium | BAL53207.1 | Chloroflexi | |
| YP_007181218.1 | Deinococcus-Thermus | ||
| YP_004233126.1 | Betaproteobacteria | ||
| WP_007856012.1 | Betaproteobacteria | ||
| WP_008903130.1 | Betaproteobacteria | ||
| WP_019631066.1 | Actinobacteria | ||
| WP_018131875.1 | Firmicutes | ||
| WP_006300529.1 | Synergistetes | ||
| WP_006300529.1 | Firmicutes | ||
| WP_018718131.1 | Gammaproteobacteria | ||
| WP_016885361.1 | Firmicutes | ||
| WP_017697104.1 | Firmicutes | ||
| YP_007136749.1 | Cyanobacteria | ||
| YP_004863563.1 | Acidobacteria | ||
| YP_001717412.1 | Firmicutes | ||
| WP_006314960.1 | Firmicutes | ||
| NP_296095.1 | Deinococcus-Thermus | ||
| WP_016451949.1 | Betaproteobacteria | ||
| YP_004490724.1 | Betaproteobacteria | ||
| WP_005810476.1 | Firmicutes | ||
| YP_002955841.1 | Deltaproteobacteria | ||
| WP_009106508.1 | Deltaproteobacteria | ||
| YP_008141532.1 | Euryarchaeota | ||
| WP_021787573.1 | Euryarchaeota | ||
| WP_016418429.1 | Gammaproteobacteria | ||
| WP_017429019.1 | Gammaproteobacteria | ||
| WP_016854101.1 | Gammaproteobacteria | ||
| YP_004462974.1 | Firmicutes | ||
| WP_018405479.1 | Gammaproteobacteria | ||
| WP_004040239.1 | Euryarchaeota | ||
| WP_020160338.1 | Gammaproteobacteria | ||
| WP_017366201.1 | Gammaproteobacteria | ||
| WP_017841702.1 | Gammaproteobacteria | ||
| nanoarchaeote Nst1 | WP_004578017.1 | ||
| WP_017572347.1 | Actinobacteria | ||
| CAJ57177.1 | Cyanobacteria | ||
| WP_019499030.1 | Cyanobacteria | ||
| YP_007101092.1 | Cyanobacteria | ||
| WP_007082010.1 | Gammaproteobacteria | ||
| YP_007588821.1 | Gammaproteobacteria | ||
| WP_008437232.1 | Gammaproteobacteria | ||
| YP_004824118.1 | Bacteroidetes | ||
| WP_016187732.1 | Firmicutes | ||
| CAJ57178.1 | Cyanobacteria | ||
| YP_400626.1 | Cyanobacteria | ||
| YP_007060778.1 | Cyanobacteria | ||
| YP_006391581.1 | Firmicutes | ||
| YP_003851043.1 | Firmicutes | ||
| WP_018663796.1 | Firmicutes | ||
| YP_184312.1 | Euryarchaeota | ||
| YP_004625205.1 | Thermodesulfobacteria | ||
| YP_004932130.1 | Deinococcus-Thermus | ||
| WP_018110436.1 | Deinococcus-Thermus | ||
| CAJ57170.1 | Deinococcus-Thermus | ||
| WP_019570879.1 | Gammaproteobacteria | ||
| WP_018881426.1 | Gammaproteobacteria | ||
| WP_017926201.1 | Gammaproteobacteria | ||
| YP_003459507.1 | Gammaproteobacteria | ||
| uncultured bacterium | EKE25755.1 | ||
| WP_017907463.1 | Gammaproteobacteria | ||
| WP_017915139.1 | Gammaproteobacteria | ||
| zeta proteobacterium SCGC AB-604-B04 | WP_018280466.1 | Zetaproteobacteria | |
| YP_001995975.1 | Chlorobi | ||
| WP_019011777.1 | Deinococcus-Thermus | ||
| YP_007166732.1 | Cyanobacteria | ||
| WP_021313783.1 | Gammaproteobacteria | ||
| YP_003681238.1 | Actinobacteria | ||
| WP_019609645.1 | Actinobacteria | ||
| YP_004826277.1 | Bacteroidetes | ||
| YP_007273179.1 | Firmicutes | ||
| YP_003299200.1 | Actinobacteria | ||
| YP_005899.1 | Deinococcus-Thermus | ||
| CAJ57173.1 | Deinococcus-Thermus | ||
| YP_006059430.1 | Deinococcus-Thermus | ||
| YP_005639869.1 | Deinococcus-Thermus | ||
| YP_720358.1 | Cyanobacteria | ||
| uncultured Chloroflexi bacterium | BAL53207.1 | Chloroflexi | |
| WP_003044118.1 | Deinococcus-Thermus | ||
| CAJ57173.1 | Deinococcus-Thermus | ||
| YP_005639869.1 | Deinococcus-Thermus | ||
| uncultured Chloroflexi bacterium | BAL53207.1 | Chloroflexi | |
| WP_020250137.1 | |||
| YP_002250310.1 | Dictyglomi | ||
| NP_248048.1 | Euryarchaeota | ||
| YP_003246412.1 | Euryarchaeota | ||
| YP_001324612.1 | Euryarchaeota | ||
| YP_004575831.1 | Euryarchaeota | ||
| WP_007044255.1 | Euryarchaeota | ||
| YP_002960518.1 | Euryarchaeota | ||
| WP_007044255.1 | Euryarchaeota | ||
| WP_021780130.1 | Halobacterium |
Indicates the intein detected is a mini-intein.
Indicates taxa that grouped within the halobacterial intein sequences.