| Literature DB >> 30285626 |
Ismael L Hernández-González1, Gabriel Moreno-Hagelsieb2, Gabriela Olmedo-Álvarez3.
Abstract
BACKGROUND: Members of the Bacillus genus have been isolated from a variety of environments. However, the relationship between potential metabolism and the niche from which bacteria of this genus have been isolated has not been extensively studied. The existence of a monophyletic aquatic Bacillus group, composed of members isolated from both marine and fresh water has been proposed. Here, we present a phylogenetic/phylogenomic analysis to investigate the potential relationship between the environment from which group members have been isolated and their evolutionary origin. We also carried out hierarchical clustering based on functional content to test for potential environmental effects on the genetic content of these bacteria.Entities:
Keywords: Bacillus evolution; Comparative genomics; Evolutionary genomics; Functional content; Homoplasy; Phylogenomics
Mesh:
Year: 2018 PMID: 30285626 PMCID: PMC6171248 DOI: 10.1186/s12862-018-1261-7
Source DB: PubMed Journal: BMC Evol Biol ISSN: 1471-2148 Impact factor: 3.260
Fig. 1Overview of the methodology used in this work
Fig. 3Comparison of Genomic Similarity Score and Core 70 trees. a Neighbor-joining tree based on GSS (Genomic Similarity Score) distance [10]. b Core 70 tree based on the alignment of 437 genes present in at least 70% of the species. In both trees, strain names in blue correspond to Bacillus species isolated from aquatic environments. Branch colors indicate clearly delineated groups. The dots indicate support based on p≤0.05 (GSS tree) and bootstrap values ≥80 (Core 70 tree). Indicated nodes and dashed rectangles show that aquatic Bacillus cluster better in the GSS tree. The figure was drawn using the web-based tool iTOL [68]
Number of nodes with bootstrap values ≥80
| 16S rRNA | AMPHORA markers | Core genome | Core 70 | ||
|---|---|---|---|---|---|
| Num. Nodes ≥80 | 22 | 44 | 69 | 74 | 46 |
*The GSS node quality is based on maximum likelihood (p≤0.05), not bootstrap analysis
Symmetric differences between trees
| 16S rRNA | AMPHORA markers | Core genome | Core 70 | ||
|---|---|---|---|---|---|
| 16S rRNA | 0 | - | - | - | - |
| AMPHORA markers | 112 | 0 | - | - | - |
| Core genome | 102 | 64 | 0 | - | - |
| Core 70 | 100 | 70 | 10 | 0 | - |
| 102 | 72 | 32 | 24 | 0 |
Fig. 2Phylogenetic reconstruction based on 196 protein orthologous groups comprising the Core Genome. Note the well supported nodes. Branch colors indicate the nine main groups discussed in the text. The names of the species corresponding to Bacillus isolated from aquatic environments are shown in light blue. Bootstrap values ≥80 are indicated by dots. The figure was constructed using the web-based tool iTOL [68]
Fig. 4Hierarchical clustering using COGs-based Jaccard’s distance (Jd). The dendrogram was obtained using the COGs-based Jaccard’s distance and the Ward clustering method. The silhouette values are shown as blue and pink bars for k=3 and k=10, respectively. The blue and pink lines indicate the heights where the dendrogram was cut (k=3 and k=10, respectively). The names of the strains shown in light blue correspond to Bacillus isolated from aquatic environments
Fig. 5Cluster analysis using Jaccard’s distance (Jd) based on Figfams. The dendrogram was obtained using the Ward clustering method. Silhouette values are shown as blue and pink bars for k=3 and k=6, respectively. The blue and pink lines indicate the heights where the dendrogram was cut (k=3 and k=6, respectively). The names of the strains shown in light blue correspond to Bacillus isolated from aquatic environments
Fig. 6Significant association between the natural environment and the COG-based clustering groups. The p-values were calculated based on the hypergeometric distribution of the groups obtained from the hierarchical clustering with COGs. The significant associations are indicated by black squares. Facultative corrected P=1.7×10−03, Aquatic corrected P=5.6×10−04
Fig. 7Significant association between the natural environment and the Figfam-based clustering groups. The p-values were calculated based on the hypergeometric distribution of the groups obtained from the hierarchical clustering with Figfams. The significant associations are indicated by black squares. Facultative corrected P=1.8×10−03, aquatic corrected P=9.0×10−05
Putative adaptive COGs found in the Aquatic Bacillus group
| COG Category | COG Id | Description |
|---|---|---|
| EP | COG0601 | ABC-type dipeptideoligopeptidenickel transport systems, permease components |
| E | COG0624 | Acetylornithine deacetylaseSuccinyl-diaminopimelate desuccinylase and related deacylases |
| E | COG0747 | ABC-type dipeptide transport system, periplasmic component |
| E | COG1703 | Putative periplasmic protein kinase ArgK and related GTPases of G3E family |
| G | COG0395 | ABC-type sugar transport system, permease component |
| G | COG1175 | ABC-type sugar transport systems, permease components |
| G | COG1653 | ABC-type sugar transport system, periplasmic component |
| HE | COG0111 | Phosphoglycerate dehydrogenase and related dehydrogenases |
| I | COG0183 | Acetyl-CoA acetyltransferase |
| I | COG1250 | 3-hydroxyacyl-CoA dehydrogenase |
| I | COG1884 | Methylmalonyl-CoA mutase, N-terminal domainsubunit |
| I | COG1960 | Acyl-CoA dehydrogenases |
| I | COG2185 | Methylmalonyl-CoA mutase, C-terminal domainsubunit (cobalamin-binding) |
| J | COG4108 | Peptide chain release factor RF-3 |
| M | COG1215 | Glycosyltransferases, probably involved in cell wall biogenesis |
| O | COG0695 | Glutaredoxin and related proteins |
| O | COG1765 | Predicted redox protein, regulator of disulfide bond formation |
| P | COG0607 | Rhodanese-related sulfurtransferase |
| Q | COG0179 | 2-keto-4-pentenoate hydratase2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway) |
| R | COG0388 | Predicted amidohydrolase |
| R | COG0673 | Predicted dehydrogenases and related proteins |
| R | COG1647 | Esteraselipase |
| S | COG2966 | Uncharacterized conserved protein |
| S | COG3610 | Uncharacterized conserved protein |
| T | COG0784 | FOG: CheY-like receiver |
| T | COG2199 | FOG: GGDEF domain |
| T | COG2200 | FOG: EAL domain |
| T | COG2202 | FOG: PASPAC domain |
| V | COG0841 | Cationmultidrug efflux pump |