| Literature DB >> 30816235 |
Laetitia G E Wilkins1,2, Cassandra L Ettinger3, Guillaume Jospin3, Jonathan A Eisen3,4,5.
Abstract
Culture-independent methods have contributed substantially to our understanding of global microbial diversity. Recently developed algorithms to construct whole genomes from environmental samples have further refined, corrected and revolutionized understanding of the tree of life. Here, we assembled draft metagenome-assembled genomes (MAGs) from environmental DNA extracted from two hot springs within an active volcanic ecosystem on the Kamchatka peninsula, Russia. This hydrothermal system has been intensively studied previously with regard to geochemistry, chemoautotrophy, microbial isolation, and microbial diversity. We assembled genomes of bacteria and archaea using DNA that had previously been characterized via 16S rRNA gene clone libraries. We recovered 36 MAGs, 29 of medium to high quality, and inferred their placement in a phylogenetic tree consisting of 3,240 publicly available microbial genomes. We highlight MAGs that were taxonomically assigned to groups previously underrepresented in available genome data. This includes several archaea (Korarchaeota, Bathyarchaeota and Aciduliprofundum) and one potentially new species within the bacterial genus Sulfurihydrogenibium. Putative functions in both pools were compared and are discussed in the context of their diverging geochemistry. This study adds comprehensive information about phylogenetic diversity and functional potential within two hot springs in the caldera of Kamchatka.Entities:
Year: 2019 PMID: 30816235 PMCID: PMC6395817 DOI: 10.1038/s41598-019-39576-6
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Sampling locations in the Uzon Caldera, Kamchatka Russia. DNA had been extracted in 2009 by Burgess et al.[9] from sediment samples of two active thermal pools Arkashin Schurf (b) and Zavarzin Spring (c). Photos were taken by Dr. Russell Neches (ORCID: 0000-0002-2055-8381) during an expedition in 2012 and he granted permission through a CC-BY license 4.0. Maps were plotted in R v. 3.4.0 with the package ‘ggmap’ v. 2.6.1[101].
Genomic feature summary for metagenome-assembled genomes identified in Arkashin Schurf (ARK) and Zavarzin Spring (ZAV).
| Draft Quality | Length (mbp) | Number Contigs | N50 | GC Content | Percent Complet. | Percent Contam. | Putative Taxonomy | |
|---|---|---|---|---|---|---|---|---|
| ARK-15 | High | 1.37 | 181 | 12768 | 40.31% | 98.39% | 2.42% | Aciduliprofundum |
| ZAV-10 | High | 1.61 | 185 | 13861 | 31.81% | 97.95% | 4.03% | Desulfurella |
| ARK-08 | High | 1.80 | 342 | 8349 | 31.77% | 97.58% | 4.78% | Desulfurella |
| ARK-05 | High | 1.50 | 153 | 17303 | 34.88% | 95.83% | 2.44% | Hydrogenobaculum |
| ARK-13 | High | 1.28 | 123 | 14898 | 34.23% | 94.31% | 0% | Sulfurihydrogenibium |
| ARK-11 | High | 1.77 | 126 | 29603 | 39.25% | 94.07% | 1.69% | Mesoaciditoga |
| ARK-03 | High | 3.79 | 367 | 18161 | 44.50% | 94.05% | 1.67% | Mucilaginibacter |
| ZAV-08 | High | 1.37 | 107 | 18405 | 31.37% | 92.81% | 1.67% | Thermodesulfobacterium |
| ARK-02 | High | 2.55 | 187 | 20870 | 43.10% | 92.08% | 3.42% | Aminicenantes |
| ZAV-16 | Medium | 1.31 | 105 | 19753 | 34.32% | 91.87% | 6.10% | Sulfurihydrogenibium |
| ZAV-01 | High | 4.75 | 1224 | 5081 | 60.56% | 90.90% | 0.92% | Roseiflexus |
| ZAV-15 | Medium | 1.30 | 103 | 17588 | 37.05% | 89.3% | 2.67% | Caldimicrobium |
| ZAV-18 | Medium | 1.25 | 358 | 4481 | 44.24% | 88.25% | 0.93% | Korarchaeota |
| ZAV-05 | Medium | 1.42 | 79 | 23586 | 35.82% | 83.72% | 0.88% | Calditerrivibrio |
| ZAV-02 | Medium | 3.19 | 1166 | 3244 | 55.15% | 83.22% | 1.89% | Chloroflexus |
| ZAV-14 | Medium | 1.74 | 99 | 24404 | 33.40% | 82.76% | 0% | Dictyoglomus |
| ARK-12 | Medium | 1.35 | 159 | 22660 | 36.27% | 82.09% | 4.41% | Fervidiococcus |
| ARK-14 | Medium | 1.46 | 312 | 8641 | 44.53% | 81.94% | 2.53% | Fervidiococcus |
| ZAV-04 | Medium | 1.29 | 61 | 31704 | 34.82% | 76.31% | 0% | Thermodesulfovibrio |
| ARK-04 | Medium | 1.23 | 483 | 2868 | 31.05% | 75.37% | 2.92% | Thermodesulfobacterium |
| ARK-16 | Medium | 2.06 | 748 | 3438 | 49.66% | 74.62% | 2.80% | Korarchaeota |
| ZAV-19 | Medium | 0.94 | 24 | 38811 | 30.96% | 71.31% | 3.80% | Caldisphaera |
| ARK-09 | Medium | 1.34 | 169 | 11257 | 32.80% | 65.67% | 0% | Thermodesulfobium |
| ZAV-07 | Medium | 0.86 | 64 | 20494 | 34.54% | 62.50% | 0% | Caldisericum |
| ZAV-13 | Medium | 0.60 | 30 | 27058 | 42.39% | 61.99% | 2.80% | Bathyarchaeota |
| ZAV-06 | Medium | 0.66 | 15 | 45310 | 35.16% | 59.26% | 1.90% | Fervidicoccus |
| ZAV-03 | Medium | 0.85 | 20 | 45162 | 40.77% | 58.47% | 1.69% | Mesoaciditoga |
| ARK-06 | Medium | 1.41 | 105 | 21803 | 30.21% | 52.83% | 1.89% | Caldisphaera |
| ARK-07 | Medium | 0.82 | 92 | 21052 | 31.42% | 52.69% | 3.80% | Caldisphaera |
| ARK-01 | Low | 0.65 | 344 | 1984 | 59.14% | 49.78% | 0% | Nitrosphaera |
| ARK-10 | Low | 1.04 | 354 | 3951 | 34.51% | 45.63% | 0% | Caldisericum |
| ZAV-20 | Low | 0.46 | 13 | 40289 | 31.23% | 34.39% | 3.38% | Caldisphaera |
| ZAV-11 | Low | 0.42 | 84 | 8297 | 42.05% | 30.42% | 1.94% | Bathyarchaeota |
| ZAV-17 | Low | 0.30 | 29 | 12906 | 43.88% | 29.28% | 0% | Bathyarchaeota |
| ZAV-12 | Low | 0.45 | 12 | 37424 | 35.63% | 28.86% | 2.44% | Hydrogenobaculum |
| ZAV-09 | Low | 1.35 | 34 | 39463 | 42.63% | 28.57% | 4.29% | Mucilaginibacter |
Genomic features are summarised below for each metagenome-assembled genome (MAG) including length (mbp), number of contigs, N50, percent GC content, putative taxonomic identity and completion and contamination estimates as generated by CheckM. MAGs are sorted by percent completion and their draft-quality is indicated.
Figure 2Placement of the MAGs into their phylogenetic context. Taxonomy of the MAGs (metagenome-assembled genomes) was refined by placing them into a phylogenetic tree using PhyloSift v. 1.0.1 with its updated markers database for the alignment and RAxML v. 8.2.10 on the CIPRES web server for the tree inference. This tree includes the 36 MAGs (red dots), all taxa previously identified by Burgess et al. (2012) with complete genomes available on NCBI (n = 148)[82], and 3,102 archaeal (yellow) and bacterial (grey) genomes previously used in Hug et al. (2016)[79–81]. The complete tree in Newick format and its alignment of 37 concatenated marker genes can be found on Figshare[75,100]. Branches with MAGs found in Arkashin Schurf (ARK) and Zavarzin Spring (ZAV) are enlarged (orange nodes). Blue: taxa from Burgess et al. (2012), black: taxa from Hug et al. (2016). GCA IDs from NCBI are shown for the closest neighbours of the MAGs. (a) Microbial tree of life, reconstructed with genomes representing taxa reported in Burgess et al. highlighting the placement of MAGs in this study; (b) Dictyoglomales, Thermoanaerobacteriales, Caldisericales, and Mesoaciditogales; (c) Nitrosphaera, Bathyarchaeota, Korarchaeota, and Crenarchaeota; (d) Chloroflexales; (e) Euryarchaeota; and (f) Deferribacteriales, Desulfobacteriales, and Aquificales. ARK-02, ARK-03, ARK-04, ZAV-04, ZAV-08, ZAV-09, and ZAV-15 can be found in Supplementary Figure S1.
Taxonomic identification of MAGs in ARK.
| BIN ID | Phylum | Class | Order | Family | Genus | Species | Proportion of clones |
|---|---|---|---|---|---|---|---|
| ARK-05 | Aquificae | Aquificae | Aquificales | Aquificaceae | — | 9.7% | |
| ARK-13 | Aquificae | Aquificae | Aquificales | Hydrogenothermaceae | — | 1.5% | |
| ARK-03 | Bacteroidetes | Sphingobacteriia | Sphingobacteriales | Sphingobacteriaceae | — | 38.3% | |
| ARK-10 | Caldiserica | Caldisericia | Caldisericales | Caldisericaceae | — | ||
| ARK-02 | Candidatus Aminicenantes | — | — | — | — | — | 3.4% |
| ARK-09 | Firmicutes | Clostridia | Thermoanaerobacterales | Thermodesulfobiaceae | 5.8% | ||
| ARK-08 | Proteobacteria | Deltaproteobacteria | Desulfurellales | Desulfurellaceae | — | 7.8% | |
| ARK-04 | Thermodesulfobacteria | Thermodesulfobacteria | Thermodesulfobacteriales | Thermodesulfobacteriaceae | — | ||
| ARK-11 | Thermotogae | Thermotogae | Mesoaciditogales | Mesoaciditogaceae | — | 13.1% | |
| ARK-16 | Candidatus Korarchaeota | — | — | — | — | — | NA |
| ARK-12 | Crenarchaeota | Thermoprotei | Fervidicoccales | Fervidicoccaceae | NA | ||
| ARK-14 | Crenarchaeota | Thermoprotei | Fervidicoccales | Fervidicoccaceae | — | NA | |
| ARK-06 | Crenarchaeota | Thermoprotei | Acidilobales | Caldisphaeraceae | — | NA | |
| ARK-07 | Crenarchaeota | Thermoprotei | Acidilobales | Caldisphaeraceae | — | NA | |
| ARK-15 | Euryarchaeota | — | — | — | — | NA | |
| ARK-01 | Thaumarchaeota | Nitrososphaeria | Nitrososphaerales | Nitrososphaeraceae | — | NA |
Here we report putative taxonomies for metagenome-assembled genomes (MAGs) identified in Arkashin Schurf (ARK) and indicate their relative abundance in the re-analysed bacterial clone libraries constructed in Burgess et al.[9]. They were unable to amplify archaeal sequences from ARK which is indicated in this table using ‘NA’.
Taxonomic identification of MAGs in ZAV.
| BIN ID | Phylum | Class | Order | Family | Genus | Species | Proportion of clones |
|---|---|---|---|---|---|---|---|
| ZAV-16 | Aquificae | Aquificae | Aquificales | Hydrogenothermaceae | — | 3% | |
| ZAV-12 | Aquificae | Aquificae | Aquificales | Aquificaceae | — | — | |
| ZAV-09 | Bacteroidetes | Sphingobacteriia | Sphingobacteriales | Sphingobacteriaceae | — | — | |
| ZAV-07 | Caldiserica | Caldisericia | Caldisericales | Caldisericaceae | 1% | ||
| ZAV-01 | Chloroflexi | Chloroflexia | Chloroflexales | Roseiflexaceae | 31.3% | ||
| ZAV-02 | Chloroflexi | Chloroflexia | Chloroflexales | Chloroflexaceae | 7% | ||
| ZAV-05 | Deferribacteres | Deferribacteres | Deferribacterales | Deferribacteraceae | 7.7% | ||
| ZAV-14 | Dictyoglomi | Dictyoglomia | Dictyoglomales | Dictyoglomaceae | 2% | ||
| ZAV-04 | Nitrospirae | Nitrospira | Nitrospirales | Nitrospiraceae | 2.7% | ||
| ZAV-10 | Proteobacteria | Deltaproteobacteria | Desulfurellales | Desulfurellaceae | 5% | ||
| ZAV-08 | Thermodesulfobacteria | Thermodesulfobacteria | Thermodesulfobacteriales | Thermodesulfobacteriaceae | — | ||
| ZAV-15 | Thermodesulfobacteria | Thermodesulfobacteria | Thermodesulfobacteriales | Thermodesulfobacteriaceae | — | ||
| ZAV-03 | Thermotogae | Thermotogae | Mesoaciditogales | Mesoaciditogaceae | — | — | |
| ZAV-13 | Candidatus Bathyarchaeota | — | — | — | — | — | — |
| ZAV-11 | Candidatus Bathyarchaeota | — | — | — | — | — | — |
| ZAV-17 | Candidatus Bathyarchaeota | — | — | — | — | — | — |
| ZAV-18 | Candidatus Korarchaeota | — | — | — | — | — | 20.9% |
| ZAV-19 | Crenarchaeota | Thermoprotei | Acidilobales | Caldisphaeraceae | — | — | |
| ZAV-20 | Crenarchaeota | Thermoprotei | Acidilobales | Caldisphaeraceae | — | — | |
| ZAV-06 | Crenarchaeota | Thermoprotei | Fervidicoccales | Fervidicoccaceae | 5.5% |
Here we report the putative taxonomies for metagenome-assembled genomes (MAGs) identified in Zavarzin Spring (ZAV) and indicate their relative abundance in the re-analysed bacterial and archaeal clone libraries constructed in Burgess et al.[9].
Figure 3Shared phyla between MAGs and different sequencing methods. Venn diagrams depict the number of shared phyla observed between metagenome assembled genomes (MAGs) and different methods of sequencing and taxonomic assignment for (a) Arkashin Schurf (ARK) and (b) Zavarzin Spring (ZAV). The different methods include the Ribosomal Database Project v. 11.5 inferred taxonomy of the 16S rRNA gene Sanger clone libraries prepared by Burgess et al.[9], the Kaiju v. 1.6.2 inferred taxonomy for the Sanger metagenomes prepared by TIGR and the Kaiju v. 1.6.2 inferred taxonomy for the Solexa reads which were later assembled to bin the MAGs. The different circles represent the 16S rRNA genes (blue), the MAGs (yellow), the Sanger metagenomes (orange) and the Solexa reads (magenta).
Figure 4Pangenomic comparison of shared genera between pools. Desulfurella genera (a) and Sulfurihydrogenibium genera (b) identified in both Arkashin Schurf (ARK) and Zavarzin Spring (ZAV) are visualized respectively in anvi’o against reference genomes downloaded from NCBI. ARK-08 and ZAV-10 were compared to three representative Desulfurella genomes including D. multipotens (GCA_900101285.1), D. acetivorans (GCA_000517565.1) and D. amilsii (GCA_002119425.1), while bins ARK-13 and ZAV-16 were compared to four representative Sulfurihydrogenibium genomes including S. subterraneum (GCA_000619805.1), S. yellowstonense (GCA_000173615.1), S. azorense (GCA_000021545.1) and S. sp. YO3AOP1 (GCA_000020325.1). Genomes are arranged based on a phylogenetic tree of shared single-copy core genes produced in anvi’o using FastTree v. 2.1. Gene clusters have been grouped into categories based on presence/absence including: ‘Single-copy core genes’ (gene clusters representing # genes from Campbell et al.)[64], ‘Universally shared’ (gene clusters present in all genomes), ‘Often Shared’ (gene clusters present in two or more genomes) and ‘Unique’ (gene clusters present in only one genome). Gene calls were annotated in anvi’o using NCBI’s Clusters of Orthologous Groups (COG’s). Gene clusters with an assigned NCBI COG are indicated in black.
Complete energy metabolism KEGG pathways that were predicted to be present in both pools based on the recovery of putatively homologous genes.
| KEGG-ID | Pathway Name | Module |
|---|---|---|
| M00168 | CAM (Crassulacean acid metabolism), dark | Carbon fixation |
| M00169 | CAM (Crassulacean acid metabolism), light | Carbon fixation |
| M00579 | Phosphate acetyltransferase-acetate kinase pathway, acetyl-CoA = >acetate | Carbon fixation |
| M00377 | Reductive acetyl-CoA pathway (Wood-Ljungdahl pathway) | Carbon fixation |
| M00173 | Reductive citrate cycle (Arnon-Buchanan cycle) | Carbon fixation |
| M00166 | Reductive pentose phosphate cycle, ribulose-5P = >glyceraldehyde-3P | Carbon fixation |
| M00422 | Acetyl-CoA pathway, CO2 = >acetyl-CoA | Methane metabolism |
| M00378 | F420 biosynthesis | Methane metabolism |
| M00345 | Formaldehyde assimilation, ribulose monophosphate pathway | Methane metabolism |
| M00356 | Methanogenesis, methanol = >methane | Methane metabolism |
| M00531 | Assimilatory nitrate reduction, nitrate = >ammonia | Nitrogen metabolism |
| M00530 | Dissimilatory nitrate reduction, nitrate = >ammonia | Nitrogen metabolism |
| M00175 | Nitrogen fixation, nitrogen = >ammonia | Nitrogen metabolism |
| M00176 | Assimilatory sulfate reduction, sulfate = >H2S | Sulfur metabolism |
| M00596 | Dissimilatory sulfate reduction, sulfate = >H2S | Sulfur metabolism |
| M00595 | Thiosulfate oxidation by SOX complex, thiosulfate = >sulfate | Sulfur metabolism |
Shown are all complete KEGG pathways; i.e., gene pathways of which all genes (blocks) were represented (n > 5 to 1,860) in both pools (Arkashin Schurf and Zavarzin Spring). For each pathway its KEGG-ID, name and pathway module are given. Presence of pathways was predicted based on the retrieval of homologous genes.